The document describes a course on data analysis using Python. The course is worth 3 credits and comprises 5 modules. Module 1 covers Python fundamentals like data types, operators, and control statements. Module 2 focuses on functions and data structures like lists, tuples, dictionaries, and sets. Module 3 discusses object-oriented concepts in Python including classes, inheritance, and exceptions. Module 4 introduces numerical analysis in Python using the NumPy library for multidimensional arrays and array operations. Module 5 covers data manipulation and visualization using pandas and Matplotlib. The course aims to teach Python programming, use of numerical libraries, and basic data science tasks like data retrieval, processing, and visualization.
Pre-requisites: Basics of Data Structures and Mathematics
Course objectives: The student will learn how to apply
1. Fundamentals and data structures of the Python programming language.
2. Object-oriented concepts in the Python programming language.
3. Retrieving, processing, storing and visualizing data using Python.
MODULE-1: INTRODUCTION TO PYTHON
Brief history of Python, Data types – built-in, sequence, sets, strings. Literals, constants, keywords, variables, naming conventions. Operators – types, precedence & associativity. Input, output, file handling, control statements.
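As an informal illustration of three Module 1 topics (operator precedence and associativity, file handling, and control statements), a minimal sketch could look like the following. The file name `module1_demo.txt` and the sample words are made up for this example and are not part of the syllabus.

```python
import os
import tempfile

# Precedence and associativity.
a = 2 + 3 * 4      # * binds tighter than +, so this is 2 + 12
b = -2 ** 2        # ** binds tighter than unary minus, so this is -(2 ** 2)
c = 2 ** 3 ** 2    # ** is right-associative, so this is 2 ** (3 ** 2)
d = 10 - 4 - 3     # - is left-associative, so this is (10 - 4) - 3

# File handling: write a small text file, then read it back.
# The file name is hypothetical; a temporary directory keeps the example tidy.
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "module1_demo.txt")
    with open(path, "w", encoding="utf-8") as fh:
        fh.write("alpha\nbeta\ngamma\n")
    with open(path, encoding="utf-8") as fh:
        words = [line.strip() for line in fh]

# Control statements: a for loop with an if test.
long_words = []
for word in words:
    if len(word) > 4:
        long_words.append(word)

print(a, b, c, d)
print(long_words)
```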
MODULE-2: FUNCTIONS AND DATA STRUCTURES IN PYTHON
Functions – basics of functions, functions as objects, recursive functions. Lists – methods to process lists, shallow & deep copy, nested lists, lists as matrices, lists as stacks, queues, deques. Tuples – basic operations on tuples, nested tuples. Dictionaries – operations on dictionaries, ordered dictionary, iteration over dictionaries, conversion of lists & strings into dictionaries. Sets & frozen sets, looping techniques on lists & dictionaries, lambda, filter, reduce, map, list comprehension, iterators and generators.
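To make a few of the Module 2 topics concrete, the sketch below contrasts shallow and deep copies of a nested list, shows `map`/`filter`/`reduce` next to the equivalent list comprehension, and defines a small generator. The data and the `countdown` function are illustrative assumptions, not material taken from the course.

```python
import copy
from functools import reduce

# A nested list used as a 2x2 matrix (illustrative data).
matrix = [[1, 2], [3, 4]]

shallow = copy.copy(matrix)     # new outer list, inner lists are shared
deep = copy.deepcopy(matrix)    # inner lists are copied as well

matrix[0][0] = 99
# The change is visible through the shallow copy but not the deep copy.
print(shallow[0][0], deep[0][0])

# lambda with filter, map and reduce, next to a list comprehension.
numbers = [1, 2, 3, 4, 5, 6]
even_squares = list(map(lambda n: n * n, filter(lambda n: n % 2 == 0, numbers)))
even_squares_lc = [n * n for n in numbers if n % 2 == 0]
total = reduce(lambda x, y: x + y, even_squares)
print(even_squares, even_squares_lc, total)


def countdown(start):
    """Generator that yields start, start - 1, ..., 1 (hypothetical helper)."""
    while start > 0:
        yield start
        start -= 1


print(list(countdown(3)))
```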
MODULE-3: OBJECTS IN PYTHON
Class and instance attributes, inheritance, multiple inheritance, method resolution order, magic methods and operator overloading, metaclasses, abstract and inner classes, exception handling, modular programs and packages.
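The following sketch touches on three Module 3 topics: operator overloading through magic methods, the method resolution order under multiple inheritance, and exception handling. The class names (`Vector`, `Base`, `Left`, `Right`, `Child`) are hypothetical and chosen only for this example.

```python
class Vector:
    """A 2D vector that overloads + and == through magic methods."""

    def __init__(self, x, y):
        self.x = x
        self.y = y

    def __add__(self, other):
        if not isinstance(other, Vector):
            return NotImplemented   # lets Python raise TypeError for bad operands
        return Vector(self.x + other.x, self.y + other.y)

    def __eq__(self, other):
        return isinstance(other, Vector) and (self.x, self.y) == (other.x, other.y)

    def __repr__(self):
        return f"Vector({self.x}, {self.y})"


class Base:
    def who(self):
        return "Base"


class Left(Base):
    def who(self):
        return "Left"


class Right(Base):
    def who(self):
        return "Right"


class Child(Left, Right):
    """Inherits who() from the first class in its method resolution order."""


# Method resolution order: Child, Left, Right, Base, object.
mro_names = [cls.__name__ for cls in Child.__mro__]
print(mro_names, Child().who())

# Exception handling: adding a Vector and an int is not supported.
try:
    Vector(1, 2) + 3
    caught = False
except TypeError:
    caught = True

print(Vector(1, 2) + Vector(3, 4), caught)
```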
MODULE-4: NUMERICAL ANALYSIS IN PYTHON
Introduction to NumPy, NumPy array object, creating a multidimensional array. NumPy numerical types – data type objects, character codes, dtype constructors, dtype attributes. One-dimensional slicing and indexing. Manipulating array shapes – stacking arrays, splitting NumPy arrays, NumPy array attributes, converting arrays, creating array views and copies. Indexing with a list of locations. Indexing NumPy arrays with Booleans. Broadcasting NumPy arrays.
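Several of the Module 4 topics can be seen together in one short sketch: creating a multidimensional array, Boolean indexing, indexing with a list of locations, stacking, splitting, broadcasting, and the difference between a view and a copy. It assumes NumPy is installed; the array contents and variable names are illustrative only.

```python
import numpy as np

# A 3x4 array holding 0..11 with an explicit dtype.
a = np.arange(12, dtype=np.int64).reshape(3, 4)
print(a.shape, a.ndim, a.dtype.name)

# Indexing with Booleans: keep the even elements.
evens = a[a % 2 == 0]

# Indexing with a list of locations: elements (0, 1) and (2, 3).
picked = a[[0, 2], [1, 3]]

# Stacking and splitting.
stacked = np.vstack((a, a))     # two copies of a, one above the other
left, right = np.hsplit(a, 2)   # two halves, split column-wise

# Broadcasting: the 1-D array of length 4 is applied to every row of a.
col_offsets = np.array([10, 20, 30, 40])
shifted = a + col_offsets

# Views share memory with the original array; copies do not.
first_col_view = a[:, 0]
first_col_copy = a[:, 0].copy()
a[0, 0] = 100

print(evens, picked)
print(stacked.shape, left.shape, right.shape)
print(shifted[0])
print(first_col_view[0], first_col_copy[0])
```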
MODULE-5: DATA MANIPULATION AND VISUALIZATION IN PYTHON
Data frames in pandas, creating data frames from .csv and Excel files, lists of tuples, data frame aggregation and concatenation, plotting data using Matplotlib & pandas.
Course Outcomes: After completion of the course, students would be able to:
1. Write programs efficiently in Python.
2. Effectively use numerical analysis libraries of Python.
3. Carry out basic data science operations like retrieving, processing and visualizing data using Python.
REFERENCES:
1. Wesley J. Chun, Core Python Programming, Prentice Hall, Second Edition, 2006.
2. Ivan Idris, Python Data Analysis, Packt Publishing, UK, 2014 (freely available online).
3. Wes McKinney, Python for Data Analysis, O'Reilly, 2013.