You are on page 1of 17

DEBRE TABOR UNVERISITY

DEPARTMENT OF INFORIMATION TECHNOLOGY


ASSIGNMENT OF AI
TITLE OF DATA SCIENCE
Data Science
• Data science is the study of data to extract meaningful insights for
business. It is a multidisciplinary approach that combines principles
and practices from the fields of mathematics, statistics, artificial
intelligence, and computer engineering to analyze large amounts of
data.
• Data Science is known to use data analytics software for this process.
Data Analytics refers to transforming, inspecting, cleaning, and
modeling data. Data scientists must teach them self about cloud
computing.
Continue…
• .Data scientists must teach them self about cloud computing. This is
important before cloud computing will provide the field of data
science with the ability to utilize various platforms and tools, to help
store and analyze extensive data.
• Data scientists can use tools such as MS SQL, Business Objects, and
Microsoft Azure to help them understand cloud computing better.
• The main goal of data analytics is to find any helpful information that
will further help in any decision-making conclusion made by the
company.
Data analytics is a necessity for businesses and organizations as it helps
in:
• Reducing Costs by locating and identifying unnecessary operations or
processes
• Understand the preferences of consumers to avail of customized
services or products. This leads to a better competitive advantage.
• Making effective and faster decisions based on the information
provided
Cloud computing
• Cloud Computing is a method of hosting a network of remote servers on the
Internet.
• It also refers to on the demand availability of computing resource over the
internet.
• The term cloud is referred to as a metaphor for the internet.
• The main purpose of cloud computing is to store and process data in cloud or
availing remote IT services without physically installing any IT resources.
• These servers are primarily responsible for data storage, management, and
processing. However, a local server or a personal computer does not perform
this.
• The cloud is characterized as a service provided by hardware and software
resources.
• Cloud Service Provider (CSP) is known to offer cloud services. Microsoft Azure, 
Amazon Web Series, IBM, Google etc….
Continue..
Clouds have numerous advantages that make them the best choice for
any organization, large or small. There are many advantages of using
cloud computing in organizations. These include:

• Scalability, Robustness, Reliability, and Accessibility.


• Flexible and Cost-Effective
• Agility and Increased Business Value
• Operation Efficiency Improvement
• Better security
Continue….

Cloud Services are considered deployment and service models.


A service type of model includes:
• Infrastructure as a Service (IaaS)
• Platform as a Service (PaaS)
• Software as a Service (SaaS)
Continue…
• Infrastructure as a service (IAAS):
It means complete infrastructure will be provided to you. Maintenance
related tasks will be done by cloud provider and you can use it as per your
requirement. It can be used as public and private both.
• Examples of IaaS are virtual machines, load balancers, and network
attached storage.
• Platform as a service (PAAS): Here we have object storage, queuing,
databases, runtime etc. All these we can get directly from the cloud
provider. It’s our responsibility to configure and use that. Providers will
give us the resources but connectivity to our database and other similar
activities are our responsibility.
• Examples of PaaS are Windows Azure and Google App Engine (GAE).
Continue…
• Applications or software as a service (SAAS)
example. Salesforce.com, drop box, google drive etc. Here we do not
have any responsibility. We are using the application that is running on
the cloud. All infrastructure setup is the responsibility. of the service
provider.
Deployment type models include
• Private Clouds
• .Public Clouds
• Hybrid Clouds
Continue..
Continue…
• Private Clouds
In this model the cloud infrastructure is used exclusively by a single organization
The organization may run private cloud or outsourced it to a hosting company .
The service and infrastructure are maintained on a private network
This type of model is expensive with an excellent level of security.
Example AWS, VMWare
• Public Clouds
This model refers to a more cost-efficient model generally available on the internet , it can be used
by general public.
All hardware, software and other supporting infrastructure is owned and managed by the cloud
providers
Some public clouds include Google Drive, Google Gmail, and so on. In this model, the data is not
100% secure.
• Hybrid Clouds
This model refers to a mix of both private clouds and public clouds types. In this case, there is a
higher risk of a security breach. Example NASA
Big data
• Big data refers to the data which is huge in size and also increasing
rapidly with respect to time. Big data includes structured data,
unstructured data as well as semi-structured data.
• Big data can not be stored and processed in traditional data
management tools it needs specialized big data management tools.
• It refers to complex and large data sets having 5 V’s information
assets. It includes data storage, data analysis, data mining and data
visualization.
Examples of big data includes social media data, e-commerce data,
weather station data, IoT Sensor data etc.
Characteristic of big data
• Variety of Big data – means that different types of data Structured,
unstructured, and semi structured data.
• Velocity of Big data – Speed of data generation or motion of data
• Volume of Big data – Huge volumes or size of data that is being
generated
• Value of Big data – Extracting useful information and making it valuable
• Variability of Big data – Inconsistency which can be shown by the data at
times.
• 
• Advantages of Big Data :
• Cost Savings
• Better decision-making
• Better Sales insights
• Increased Productivity
• Improved customer service
• Disadvantages of Big Data :
• Incompatible tools
• Security and Privacy Concerns
• Need for cultural change
• Rapid change in technology
• Specific hardware needs
data science and big data relationship
• Big Data is a vast resource of information collected in structured and
unstructured forms but needs additional steps and processes to
uncover the underlying information. 
• Hence, big data cannot be processed without data science for
business decision-making.
• Data science handles big data by transforming, analyzing, and
visualizing it to bring meaningful insights. As a result, both are distinct
yet complementary and have their importance and significance.
THANK YOU!!

You might also like