
Vadlamudi Bala Siva Seetha Ramaiah

Data Science Using Python


Agenda

Introduction

Python Data Structures

Python Libraries

Some Additional Libraries

Data Science vs Data Analytics

Summary


Introduction

What is Python

• Python is an open-source, interpreted, high-level language with strong support for object-oriented
programming. It is one of the languages most widely used by data scientists for data science
projects and applications.

• For data science applications, Python offers an extensive set of libraries to work with, on top of being an open-source,
interpreted, high-level tool.

• Python is a general-purpose programming language that is becoming more and more popular for data science.

• Other programming languages are also used in data science.

• Python is extensively used in the scientific and research community because it is easy to use and has a very simple
syntax, which makes it easy to adopt for people who don't have a software engineering background.

• Exact data sure of that as well.

Python Data Structures

• Python provides the following built-in data structures: lists, tuples, dictionaries, and sets. Sets have been built in
since the Python 2 series. A list is like a 1-D array, but you can also create a list of other lists to get a
multidimensional array. A data structure is fundamentally a structure that holds data together; in other words, it is a
collection of related data. The built-in data structures covered here are the tuple, dictionary, list, set, and string,
and we will learn how to use each of them.
• Tuple
• Dictionary
• List
• Set
• String
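
As a minimal sketch of the five structures listed above, the snippet below creates one of each and performs a basic operation on it. All variable names and values are made up for illustration.

```python
# Tuple: an ordered, immutable sequence
point = (3, 4)

# List: an ordered, mutable sequence
scores = [88, 92, 75]
scores.append(60)

# A list of lists can stand in for a simple two-dimensional array
grid = [[1, 2, 3], [4, 5, 6]]
middle_of_second_row = grid[1][1]

# Dictionary: maps keys to values
ages = {"asha": 31, "ravi": 27}
ages["meena"] = 45

# Set: an unordered collection that keeps only unique items
unique_tags = set(["python", "data", "python"])

# String: an immutable sequence of characters
title = "data science"
capitalized = title.title()
```

Tuples and strings cannot be changed after creation, while lists, dictionaries, and sets can, which is usually the first thing to consider when choosing between them.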

Python Libraries For Data Science
• NumPy
• Keras
• SciPy
• Pandas
• scikit-learn
• Matplotlib
• Seaborn
• TensorFlow

NumPy
NumPy (Numerical Python) is the fundamental package for numerical computation in Python; it contains a powerful N-
dimensional array object. It has around 18,000 comments on GitHub and an active community of 700 contributors. It’s a
general-purpose array-processing package that provides high-performance multidimensional objects called arrays and tools for
working with them. NumPy also partly addresses the slowness of pure-Python loops by providing these multidimensional
arrays together with functions and operators that operate efficiently on them.

Features:
• Provides fast, precompiled functions for numerical routines
• Array-oriented computing for better efficiency
• Supports an object-oriented approach
• Compact and faster computations with vectorization

Applications:
• Extensively used in data analysis
• Creates powerful N-dimensional arrays
• Forms the base of other libraries, such as SciPy and scikit-learn
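
To illustrate the vectorization and N-dimensional array points above, here is a small sketch that compares an explicit Python loop with the equivalent array expression. The array contents are arbitrary example values.

```python
import numpy as np

# A one-dimensional array holding 1, 2, 3, 4, 5
values = np.arange(1, 6)

# Element-by-element squaring with a plain Python loop
squares_loop = [v * v for v in values.tolist()]

# The same computation as a single vectorized expression
squares_vec = values ** 2

# A two-dimensional array (2 rows, 3 columns) built by reshaping
matrix = np.arange(6).reshape(2, 3)

# Sum down each column without writing a loop
column_sums = matrix.sum(axis=0)
```

The vectorized form is shorter, and on large arrays it typically runs much faster because the loop happens inside NumPy's compiled code.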
SciPy
SciPy (Scientific Python) is another free and open-source Python library for data science that is extensively used for high-
level computations. SciPy has around 19,000 comments on GitHub and an active community of about 600 contributors. It’s
extensively used for scientific and technical computations, because it extends NumPy and provides many user-friendly and
efficient routines for scientific calculations.

Features:
• Collection of algorithms and functions built on the NumPy extension of Python
• High-level commands for data manipulation and visualization
• Multidimensional image processing with the SciPy ndimage submodule
• Includes built-in functions for solving differential equations

Applications:
• Multidimensional image operations
• Solving differential equations and the Fourier transform
• Optimization algorithms
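
The applications above can be sketched with three small calls. This is only an illustration on toy problems chosen here (a decay equation, a one-variable quadratic, and a constant signal), not a recommended workflow.

```python
import numpy as np
from scipy.fft import fft
from scipy.integrate import solve_ivp
from scipy.optimize import minimize_scalar


def decay(t, y):
    """Right-hand side of the toy equation dy/dt = -2y."""
    return -2.0 * y


# Differential equation: integrate from t=0 to t=1 starting at y(0)=1.
# The exact answer at t=1 is exp(-2).
solution = solve_ivp(decay, (0.0, 1.0), [1.0], rtol=1e-8, atol=1e-10)
y_at_1 = solution.y[0, -1]

# Optimization: find the x that minimizes (x - 3)^2
result = minimize_scalar(lambda x: (x - 3.0) ** 2)
best_x = result.x

# Fourier transform: a constant signal has all its energy at frequency zero
spectrum = fft(np.ones(4))
```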
TensorFlow

TensorFlow is another Python library for data science. It is a library for high-performance
numerical computations with around 35,000 comments and a vibrant community of around 1,500 contributors. It’s used
across various scientific fields. TensorFlow is basically a framework for defining and running computations that involve
tensors, which are partially defined computational objects that eventually produce a value.

Features:
• Better computational graph visualizations
• Reduces error by 50 to 60 percent in neural machine learning
• Parallel computing to execute complex models
• Seamless library management backed by Google
• Quicker updates and frequent new releases to provide you with the latest features

Some Additional Libraries
• Data exploration: finding out more about the data we hold.

• Data munging: cleaning the data and reshaping it so that it better suits statistical modelling.

• Predictive modelling: running the actual algorithms and having fun.
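
The three stages above can be sketched on a tiny table. The data below is hypothetical and invented purely for illustration, and the straight-line fit with NumPy is only a stand-in for a real predictive model.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: study hours and exam scores, with one missing value
frame = pd.DataFrame({
    "hours": [1.0, 2.0, np.nan, 4.0, 5.0],
    "score": [52.0, 54.0, 56.0, 58.0, 60.0],
})

# Data exploration: summary statistics and a count of missing values
summary = frame.describe()
missing_per_column = frame.isna().sum()

# Data munging: fill the missing hours with the column mean
clean = frame.fillna({"hours": frame["hours"].mean()})

# Predictive modelling: fit a straight line and predict the score for 6 hours
slope, intercept = np.polyfit(
    clean["hours"].to_numpy(), clean["score"].to_numpy(), deg=1
)
predicted_score = slope * 6.0 + intercept
```

Filling with the mean is just one possible munging choice; dropping the incomplete row would be another.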

Data Science vs Data Analytics

Conclusion

• Python is an excellent tool for data science and is becoming an increasingly popular language among
data scientists. The reason is that it is easy to read and integrates well with databases and with tools like
Spark and Hadoop. Above all, it offers strong computational power and a rich set of data analytics
libraries.

Thank you

Submitted by: Vadlamudi Bala Siva Seetha Ramaiah

20AR1A0592
