ASSIGNMENT OF AI TITLE OF DATA SCIENCE

Data Science
• Data science is the study of data to extract meaningful insights for business. It is a multidisciplinary approach that combines principles and practices from the fields of mathematics, statistics, artificial intelligence, and computer engineering to analyze large amounts of data.
• Data science uses data analytics software for this process. Data analytics refers to transforming, inspecting, cleaning, and modeling data.
• Data scientists must teach themselves about cloud computing. This is important because cloud computing gives the field of data science access to various platforms and tools that help store and analyze extensive data.
• Data scientists can use tools such as MS SQL, Business Objects, and Microsoft Azure to help them understand cloud computing better.
• The main goal of data analytics is to find helpful information that supports the company's decision-making. Data analytics is a necessity for businesses and organizations as it helps in:
  • Reducing costs by locating and identifying unnecessary operations or processes.
  • Understanding the preferences of consumers in order to offer customized services or products, which leads to a better competitive advantage.
  • Making effective and faster decisions based on the information provided.

Cloud computing
• Cloud computing is the use of a network of remote servers hosted on the Internet.
• It also refers to the on-demand availability of computing resources over the internet.
• The term "cloud" is a metaphor for the internet.
• The main purpose of cloud computing is to store and process data in the cloud, or to use remote IT services, without physically installing any IT resources.
• These remote servers are primarily responsible for data storage, management, and processing, work that a local server or a personal computer would otherwise perform.
• The cloud delivers hardware and software resources as a service.
• A Cloud Service Provider (CSP) offers cloud services. Examples include Microsoft Azure, Amazon Web Services, IBM, and Google.

Clouds have numerous advantages that make them a strong choice for organizations large and small. These include:
• Scalability, Robustness, Reliability, and Accessibility
• Flexible and Cost-Effective
• Agility and Increased Business Value
• Operation Efficiency Improvement
• Better security

Cloud services are classified by service model and by deployment model.

Service models include:
• Infrastructure as a Service (IaaS)
• Platform as a Service (PaaS)
• Software as a Service (SaaS)

• Infrastructure as a Service (IaaS): The complete infrastructure is provided to you. Maintenance-related tasks are done by the cloud provider, and you use the infrastructure as your requirements dictate. It can be used in both public and private clouds.
• Examples of IaaS are virtual machines, load balancers, and network-attached storage.
• Platform as a Service (PaaS): Here we get object storage, queuing, databases, runtimes, and so on directly from the cloud provider. It is our responsibility to configure and use them. The provider gives us the resources, but connectivity to our database and other similar activities are our responsibility.
• Examples of PaaS are Microsoft Azure (formerly Windows Azure) and Google App Engine (GAE).
• Software as a Service (SaaS): Here we simply use an application that is running on the cloud, and we have no responsibility for it. All infrastructure setup is the responsibility of the service provider.
• Examples of SaaS are Salesforce.com, Dropbox, and Google Drive.

Deployment models include:
• Private Clouds
• Public Clouds
• Hybrid Clouds

• Private Clouds: In this model the cloud infrastructure is used exclusively by a single organization. The organization may run the private cloud itself or outsource it to a hosting company. The services and infrastructure are maintained on a private network. This model is expensive but offers a high level of security. Examples: AWS, VMware.
• Public Clouds: This is a more cost-efficient model that is generally available on the internet and can be used by the general public. All hardware, software, and other supporting infrastructure is owned and managed by the cloud provider. Public clouds include Google Drive, Gmail, and so on. In this model, the data is less secure than in a private cloud.
• Hybrid Clouds: This model is a mix of private and public clouds. In this case, there is a higher risk of a security breach. Example: NASA.

Big data
• Big data refers to data that is huge in size and also growing rapidly over time. Big data includes structured, unstructured, and semi-structured data.
• Big data cannot be stored and processed with traditional data management tools; it needs specialized big data management tools.
• It refers to large, complex data sets that are characterized by the 5 V's. Working with it involves data storage, data analysis, data mining, and data visualization. Examples of big data include social media data, e-commerce data, weather station data, and IoT sensor data.

Characteristics of big data
• Variety: the different types of data, namely structured, unstructured, and semi-structured.
• Velocity: the speed at which data is generated or moves.
• Volume: the huge size of the data being generated.
• Value: extracting useful information from the data and making it valuable.
• Variability: the inconsistency that the data can show at times.

Advantages of Big Data:
• Cost savings
• Better decision-making
• Better sales insights
• Increased productivity
• Improved customer service

Disadvantages of Big Data:
• Incompatible tools
• Security and privacy concerns
• Need for cultural change
• Rapid change in technology
• Specific hardware needs

Relationship between data science and big data
• Big data is a vast resource of information collected in structured and unstructured forms, but it needs additional steps and processes to uncover the underlying information.
• Hence, big data cannot support business decision-making without data science to process it.
• Data science handles big data by transforming, analyzing, and visualizing it to bring out meaningful insights.
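
The transforming, analyzing, and visualizing steps named above can be illustrated on a very small scale. What follows is a minimal Python sketch, not a real big data pipeline. The order records, the field names (`region`, `amount`), and the helper functions `clean`, `average_by_region`, and `text_chart` are all hypothetical and were chosen only for illustration. It uses only the Python standard library.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical raw records (made-up values, for illustration only).
RAW_ORDERS = [
    {"region": "north", "amount": "120.50"},
    {"region": "south", "amount": "80"},
    {"region": "north", "amount": None},      # missing value
    {"region": "South ", "amount": "40.00"},  # inconsistent label
    {"region": "north", "amount": "99.50"},
]


def clean(records):
    """Transform: drop records with no amount and normalise both fields."""
    cleaned = []
    for rec in records:
        if rec["amount"] is None:
            continue
        cleaned.append({
            "region": rec["region"].strip().lower(),
            "amount": float(rec["amount"]),
        })
    return cleaned


def average_by_region(records):
    """Analyze: group amounts by region and compute the mean per region."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec["region"]].append(rec["amount"])
    return {region: mean(amounts) for region, amounts in groups.items()}


def text_chart(summary, scale=10):
    """Visualize: one '#' per `scale` units, as a crude text bar chart."""
    return [
        f"{region:<6} {'#' * int(value // scale)} {value:.2f}"
        for region, value in sorted(summary.items())
    ]


summary = average_by_region(clean(RAW_ORDERS))
for line in text_chart(summary):
    print(line)
```

At real big data volumes the same three steps would run on specialized big data management tools rather than on in-memory Python lists.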
As a result, data science and big data are distinct yet complementary, and each has its own significance.

THANK YOU!!