You are on page 1of 298
Sy Artificial Intelligence and Machine Learning (Code : 302049) Semester VI - Mechanical and Automobile Engineering (Savitribai Phule Pune University) Strictly as per the New Credit System Syllabus (2019 Course) Savitribai Phule Pune University w.e.f. academic year 2021-2022 Pravin Goyal OF Techtnoutedye ml Artificial Intelligence and Machine Learning (Code : 302049) Pravin Goyal Semester VI - Mechanical and Automobile Engineering (Savitribai Phule Pune University) Copyright © Author. All rights reserved. No part of this publication may be reproduced, copied, or stored ina retrieval system, distributed or transmitted in any form or by any means, including photocopy, recording, or other electronic or mechanical methods, without the prior written permission of the publisher. ‘This book is sold subject to the condition that it shall not, by the way of trade or otherwise, be lent, resold, hired out, or otherwise circulated without the publisher’s prior written consent in any form of binding or cover other than which it is published and without a similar condition including this condition being imposed on the subsequent purchaser and without limiting the rights under copyright reserved above. First Printed in India: March 2024 First Edition 2 January 2022 This edition is for sale in India, Bangladesh, Bhutan, Maldives, Nepal, Pakistan, Sri Lanka and designated countries in South-East Asia, Sale and purchase of this book outside of these countries is unauthorized by the publisher. ISBN ; 978-93-5563-007-0 Copyright Application Number - L-98904/2021 reeks TECHKNOWLEDGE PUBLICATIONS : d Office’ 37/2, Ashtavinayak Industrial Estate, B/S, First floor, Maniratna Complex, Taware Colony, Rone cnrenn ‘Aranyeshwar Corner, Pune - 411 009. Maharashtra State, India Ph :91-20-24221234, 91-20-24225678. Narhe, Pune, Maharashtra State, India. “4 Pune- 471041 Email :info@techknowledgebooks.com, Website : www.techknowledgebooks.com Subject Code : 302049 Book Code : PELS6A |] serrate se re eC ‘We dedicate this Publication soulfully and wholeheartedly, in loving memory of our beloved founder director, Late Shri. Pradeepji Lalchandji Lunawat, who will always be an inspiration, a positive force and. strong support behind us. ‘My work is my prayer to God” ~ Late Shri, Pradeepji L. Lunawat Soulful Tribute and Gratitude for all Your Sacrifices, Hardwork. and 40 years of Strong Vision... Dedicated to Krishna, the Greatest Almighty Dear Readers, Congratulations! If you are reading this page, you are amongst that fortunate and blessed sroup of individuals who have survived the pandemic Covid-19. 1 salute your bravery to support yourself and several around you to jointly fight over the dreadful, Nevertheless, it has touched each of our lives and our dear and near ones in various unprecedented ways. It has touched our lives as well at TechKnowledge Publications where our entire team has worked hard throughout these tough times and did every possible thing they could to bring this book on time in your hands, My deepest gratitude to them for everything they did. Isincerely. acknowledge the never let go attitude of our team leads Arunoday sit, Chandroday sir and, Shtal si. Their invincible belief that this too shall pass, ‘motivated me to keep working on this book. {feel short of words as I close this book and hand it over to my team to bring this to you. I wish by the time you read it, we would have come across a long way towards a bright horizon having a great sense of human hope. Take care of yourself. Pravin Goyal gga Savitribai Phule Pune University Third Year of Mechanical and Automobile Engineering (2019 Course) 302049 : Artificial Intelligence and Machine Learning Teaching Scheme : Credits : Examination Scheme and Marks TH : 03 Hours/Week 03 Mid Semester (TH) : 30 Marks End_Semester (TH): 70 Marks Prereqt ites Linear Algebra, Probability, Statistics, Logical Reasoning. Course Objectives 1. ACQUAINT with fundamentals of artificia intelligence and machine learning. 2. LEARN feature extraction and selection techniques for processing data set. 3, UNDERSTAND basic algorithms used in classification and regression problems. 4. OUTLINE steps involved in development of machine learning model. 5, FAMILIARIZE with concepts of reinforced and deep learning. 6. IMPLEMENT AND ANALYZE machine learning model in mechanical engineering problems. Course Outcomes On completion of the course, learner will be able to C01 : DEMONSTRATE fundamentals of artificial intelligence and machine learning. (C02 : APPLY feature extraction and selection techniques. C03 : APPLY machine learning algorithms for classification and regression problems. C04: DEVISE AND DEVELOP a machine learning model using various steps. COS : EXPLAIN concepts of reinforced and deep learning, C06 : SIMULATE machine learning model in mechanical engineering problems. Unit I: Introduction to AT and ML 06 hrs History of Al, Comparison of Al with Data Science, Need of Al in Mechanical Engineering, Introduction to Machine Learning, Basics: Reasoning, problem solving, Knowledge representation, Planning, Learning Perception, Motion and manipulation. Approaches to AI : Cybernetics and brain simulation, Symbolic, Sub-symbolic, Statistical, Approaches to ML: Supervised learning, Unsupervised learning, Reinforcement learning, (Refer Chapter 1) Unit 17 : Feature Extraction and Selection 08 hrs Feature extraction : Statistical features, Principal Component Analysis, Feature selection : Ranking, Decision tree - Entropy reduction and information gain, Exhaustive, best first, Greedy forward & backward, Applications of feature extraction and selection algorithms in Mechanical Engineering. (Refer Chapter 2) Unit IIT : Clas: ication and Regression 08 hrs Classification : Decision tree, Random forest, Naive Bayes, Support vector machine. Regression : Logistic Regression, Support Vector Regression, Regression trees: Decision tree, random forest, K-Means, K-Nearest Neighbor (KNN). Applications of classification and regression algorithms in Mechanical Engineering, (Refer Chapter 3) Unit IV : Development of ML Model 07 hrs Problem identification : classification, clustering, regression, ranking, Steps in ML modeling, Data Collection, Data pre-processing, Model Selection, Model training (Training, Testing, K-fold Cross Validation), Model evaluation (understanding and interpretation of confusion matrix, Accuracy, Precision, Recall, True positive, false positive etc), Hyper parameter Tuning, Predictions (Refer Chapter 4) Unit V : Reinforced and Deep Learning 08 hrs Characteristics of reinforced learning ; Algorithms : Value Based, Policy Based, Model Based; Positive vs Negative Reinforced Learning; Models: Markov Decision Process, Q Learning. Characteristics of Deep Learning [Artificial Neural Network, Convolution Neural Network. Application of Reinforced and Deep Learning in Mechanical Engineering, (Refer Chapter 5) Unit VI : Applications 08 hrs Human Machine Interaction, Predictive Maintenance and Health Management, Fault Detection, Dynamic System Order Reduction, Image based part classification, Process Optimization, Material Inspection, Tuning of control algorithms. (Refer Chapter 6) gaa BE _Avtilicial inteligence & Machine Learning (SPPU) 1 Table of Contents Chapter 1: Introduction to Al and ML 1-110 1-38 11 Introduction to Machine Learning ——— eect Lh Ld Howdoes Machine Learning Work 2wsw-- so 13 11.2 Key Terms Associated with Machine Learning —— . aS 113 Data Formats 7 a oe 15 LL DIKWPyramldnonenn — - 1-6 115 Categories of Data Analytic, een ane 7 12 Typesof Machine Lesrhingaeunn - 18 L2A Supervised Learning won . - 7 . 19 122 Unsupervised Learning. ven = - 19 1.23 Reinforcement Learning ce aaa 124 Howto Choose the Right Machine Learning Algorithm 2sewsne cnn ance) 13 Issues in Machine Learning. cnn even 13 14 Stepsin Developing a Machine Learning Application eo 15 Introduction to Artificial Inteligence : pe FID 151 Types of Alem sen - ; - 1-20 152 History of Alensnuonn a : cence ET 153 Comparison of Al with Data Science... nen oo 122 154 Approaches 10 Aloo - oe = LS.A(A)_ Symbol . ——— nan 22M 154(8)_ Sub-symbolie(Connectionist Approach) nay 1SA(O) Stasi eer : - _128 155 Need ol Mechanical Engineering: eeeeeee nen 128 16 Basics ss ntnctonnaneenn hens 28 1.7 Application of Machine Learning and Artifical inteligence : = 1-28 (Chapter 2: Feature Extraction and Selection 2100228 2 Feature and Feature Boge eeeeeneannrennneenent cen ant 2A Dataennmmnnennn a sone | DAZ TS rennin even . a 213 Modelseeanneennn nnn 22 a oo . . 23 218 Feature Engineering son . _ 23 : Wise WH Artificial intelligence & Machine Learning (SPPU) 2 Table of Contents Tie Data Engineering -vs Feature Engineeringnnnm : = enna 2S 2.2 Dimensionalty Reduction Techniques nm = oeeeee 26 2.22 ‘Typesof Dimensionalty Reduction Techniques. 28 23 Feature Transformation... ee eB 234 Feature Construction... oe —— = 28 232 Quantization oF Binning = = 28 233° Log Transform a —_— 2 — sneer AO 234 Feature Scaling or Normalisation... : — aa 235. MisMax Sealing wo — — zat 236 Standardisation (Variance Sealing) onsen = ene PAD 213.7 Encoding Categorical Variable ne-nenewnvere seen 2A 238 — OneHot Encoding. - : = a 214 23:9 Dummy Coding.enninnaemnnnnnnninnmn 23.10 Peature Hashingionw 23.11. Handling Textual Features nner ee _ 217 23.412 Feature Bxtraction : a = : eon 2B 2:4 Principal Component Analysis (PCA) nn — a - 218 25 Feature (Subset) Select wane = : i : 2-20 26 Key Drivers of Feature Sel&tit monn : ee 2.20 26.1 Measures of Feature Relevance... nn 228 262 Measures of Feature Redundancy... : a enn 22 2.7 Overall Feature Selection Proces$ xn a . 2.24 28 Feature Selection Approach mn — a 224 ne on - ener BBS 210 Decision Treen Lecce een eee penn 2°26 2.41 Applications of Feature Extraction and Selection Algorithms in Mechanical Engineering enon 2.26 ‘Chapter 3: Classification and Regression 3-1 103-108 34 Gassification Model —— BAL Decision Tre wncsnnnmnnnnan 34.2. KeyTerms and Concepts. 3..2(8) Entropy. 3.1.2(B) Information Gain... BA.2(0) Gain Ratio enn 3.1.2(0) Gini Index. F_Ariticil intelligence & Machine Learning (SPPU) 3 Table of Contents 32 324 323 324 332 333 334 335 34 341 342 343 344 35 351 352 36 364 362 363 a7 371 3.7.4(8) 372 3.7.2(8) 38 3a1 39 340 3a Decision Tree Algorithms. : wn - on 828 ‘The General Agorithtcensonnnanninneneinnnn 328 1D3 Algorithme an 329 CAS Algor thE een se IB CART Algorithm, peeeeeeeness cence FB Evaluating a Decision Tree.. Nalve Bayer Bayes’ Algorithm (Theorem) nu Naive Bayes Classtiet cmon Smoothing, = ‘Advantages of Nalve Bayes Classifier. Disadvantages of Naive Bayes Classifier. : a conten 345 Support Vector Machines (SVM)... 7 er 347 Maximuin Margin Linear Separators ee 348 Quadratic Programing Solution to Finding Maximum Margin SeparatorS.o.. eed) Kernels for Learning Non-Linear Functions (Kernel Trick). . one 8ST Comparison between Logistic Regression and SVM.. — ~ 352 Diagnostics (Evaluation Measures} of ClassiflS.nne1-ne aes 353 ROC Curve a 3.56 ‘Area Under the Curve (AUC).. - eee oT) Additional Classification Methods... 7 357 Bagging nna — a : een 357 Boosting. eee 3-59 Random Forests enn = 3-60 Regression Analysis. ee. 361 Linear Regression nn Use Cases (or Applications of for Linear Regressi0R xn Logistic Regression Use Cases (or Applications of for Logistic Regr@S$10M w-memmens Support Vector Regression (SVR)... Regression Trees nn. Reasons to Choose and Caution uw Additional Regression Modes.. Clustering. _ We reaieonnag YF antiicial intelligence & Machine Learning (SPPU) 4 ‘Table of Contents BALL Properties of Chistefnenn : on 3S 3112 Types of Clustering. —— een TS 3113 _ Use Cases (Applications) o = nnn 36 BALA Kemeans. oe oo Soeen Ane 73 31S Determining the Number of Clusters neuen : eee BAL6 —DiagnostleSenninnnnnnnnnnnne : — 3.11.7 Reasons to Choose and Cautions (Drawbacks / Challenges). - 342 keNearest Neighbours (kNN) Classification Algorithm. —— a 3.93 3.13 Applications of Classification and Regression Algorithms in Mechanical Engineering... Chapter 4: Development of ML Modet 411049 4:1 Problem dent BI01 earner oer a a eon 42 Steps in Mb Modeling nner _ at 43, Enhancing Performance of CaSificatioh axeo-neeernnnnrninenttctentnttntenannnmer 434 Cross-Validation. oc = 7 = Se neh d 43:1(8) Fold Cross-Validation.... - - - . 43 434(C)_Leave-P-Out Cross Validation (LpOCV) : : +4 43.2. Sub-Sampling nnn : senna 433 Hyperparameter Tuning Techniques o--neanun —— vent 433A) What Do HyperparametersDo?. a . eee 43.3(8) How is Hyperparameter Tuning Carried out? aon 6 43.3(C) Hyperparameter Tuning Algorithms. aoe een Chapter 5: Reinforced and Deep Learning 5-110 556 5.1 Introduction to Neural NEWOFKS nnneennneen nen Sa. Fundamental Concept, 5.1.2 Biological Neuron. 5.13 Evolution of Neural Networks (Hebb'S RULE) esmnraesnnn 5.2 Artificial Neural Networks (ANN) onsen 521 The Perceptron.. 5.2.2 The Bias Inptt een a ——— ec 5.2.3 Multilayer Perceptr0M new _ a . . pac AZ 5.24 Shallow and Deep Neural Network, e — conan 513 Se festetee & Artficiat Intelligence & Machi Learning (SPU) 5 ‘Table of Contents 53 Neural Network (NN) Architecture. : = 7 nso SE 54 Activation Functions : —— 7 S47 S41 Types of Activation Functions wa. - eS o 5-17 55 Learning Process. — es —— monn 2 56 Convolution Neural Network (CNN or ConvNets). a ~ . 5.22 561 How do Convolutional Neural Networks Work?... a 7 5.23 562 Convolutional Layer — . sone S23 563 Pooling Layer. ss : ee 564° Fully-Connected Layer = . 5.65 Convolutional Neural Networks and Computer Vision, — 57 Reinforcement Learning... : — = 5.25 571 Characteristis of Reinforcement Learning. — 7 . 5-26 57.2 Positive vs Negative Reinforcement Learning = a - 530 573 The Reinforcement Learning Cycle, a 7 58 Markov Models. . aa 5-33 58.1 Steady state = 7 537 5.9 Markov Decision Process (MDP). —— 7 5-40 510 Reinforcement Learning AlgOFIAMS ccsuca-sneerse — ~5-42 540. Mathematical Foundation fr Reinforcement Learning Algorithms... = 543 $4102 _ Learmable Functions in Reinforcement Learning a . : S44 5102(4) Policy-Based Reinforcement Learning Algorithms. — . 546 5.10.2(B) Value-Based Reinforcement Learning Algorithms — 546 5:10.2(C) Model-Based Reinforcement Learning Algorithms. 5-46 $210.2{0) Comparison between Policy-Based, Value-Based, and Model-Rased Algorithms : 5.47 5103 On-Policy and Off Policy Algorithms = = . 5-48 S104 QeLearming (TD Learning) ce avannneoro = : ee san) 5.11 Application of Reinforced and Deep Learning in Mechanical Engineering... ss, Chapter 6: Applications 106-49, 61 Concept Bulding- Internet of Things (1oT) —— s 61.1 Characteristics ofl... — —— 62 PWT Visi0R ess nnnnninnerirncnmnnney ee — — : 64 63 loTand mam — : a — sovane 10) 63.1 Introduction to M2M. —— — so 610 63.1(A) Applications of MaM. e : 2 ae et Pe Feataonieage Er SH ariticial intelligence & Machine Learning (SPPU) 6 64 65 651 652 652(A) 65.218) 65.2(0) 66 661 67 674 674(A) 6748) 67100) 6710) 68 63 610 612 oa2a 6122 6123 6124 Table of Contents Tang OT : = nen 6A Sensors, Actuators, and Smart Ob} an = ae 613 Sensors = : a cone 613 Types of $ens0FS neem ee — 616 Voltage Sensor —— - oon 619 Humidity Sensor. — a - 619 Level Sensor, 7 7 _ ee 6.20 AetUaROFS nar ——= : on ‘Types of Actuators. 7 —— . 621 mart Ob}¢C5 evrenroneen : = a ox Common Smart Objects (IoT Devices) nnn — —— 623 ome Automation. —— eee : . 6-24 Industrial Lo scemorene=nentennnnn 625 Personal and Health (are —nn1-v-nneeemnnn = on 2 6.26 other Uses. eee : : 626 Need of Alin Mechanical Engineering enennonsnnnenn - 627 Human Machine Interaction (Connected Factory) _ : 628 Predictive Maintenance and Health Management (Prognosts) . ~ 631 Fault Detection (Machine Diagnosts and Prognds) nrmnernmnn - enone Dynamic System Order Reduction (Model Order Reduction (MOR) xm 637 Image Based Part Classification nnn : 638 Process Optimisation. ee 638 Material Inspection = : aeneee ee 6-40 “Tuning of Control Algorithms en-mnm —- sneeennnnnns 642 goo A Note to Readers from the Author ee LE AUTNOF Dear Readers, Thank you for choosing to read this book to learn about the fascinating world of Artificial Intelligence and Machine Learning. I welcome and appreciate your decision Artificial Intelligence and Machine Learning involves several key concepts that are ¢rucial for your understanding and appreciating the dept ofthe subject. To make it easy for you to arasp the subject, Cave carefully added some related reading material tha, at times, you may tot find listed directly inthe course syllabus. Do not panic! These topies would make you exam ready a5 well as ready for real world, Not having a solid understanding of these topies would make it very difficult for you to understand the listed course sjlabus tops. So, read on and read all Also, if you are looking to build your career in Artificial Intelligence and Machine Learning domain, then 1 would suggest that you retain this book for your future reference. This book isnot written only to give you pointed and limited understanding of the slabus topics simed at only passing the exam, This can be a good reference aid when you are actually onthe job. Finally, 1 hope you enjoy reading this book and build a strong foundation and ‘understanding of the subject that is required for your success! Pravin Goyal Introduction to Al and ML ‘Ate end of this unit, you shouldbe abo to understand and comprehend the following syllabus topics Introduction to Artificial Intelligence © History of AL ‘© Comparison of Al with Data Science © Need of Al in Mechanical Engineering © Approaches to Al * Cybemetics and brain simulation + symbotc | + Subsymbotc + Statist Preece © Approaches to Mi. + Supenised earning | * Unsupervised teaming | + Reinforcement teaming | + Basics | © Reasoning © Problem Solving © Knowledge representation © Planning © Leaming © Perception © Motion Manipulation 1.1__ Introduction to Machine Learning * What do you mean by learning? What do you mean when you tell someone to “lean* something? The dictionary ‘meaning ofthe word learns "to gain knowledge or understanding of or skillin By study, instruction, or experience’ * You, as a reader or student of a particular course, lean something and then become equipped or capable of carrying Okt Various tasks based on what you learnt. You might have heard about Bloom's Taxonomy. Blooms Taxonomy is a classification ofthe different objectives and skills that learners could achieve out ofa particular learning or a course Fig. 1.1.1 outlines Bloom's Taxonomy. WF Artifical intelligence & Machine Learning (SPPU) 12 Introduction to Aland ML ‘Roping the Concepts, and ideas, Undersianding what ‘the facts mean Rocognang and recahing facts Understand Remember Fig. 1.1.1 : Bloom's Taxonomy « So,as you understand, learning helps to gain skills and carry out various tasks. Like human beings, animals and other organisms also learn which helps them to carry out tasks required as per their lifecycle «You would also agree that, in general, the more you learn (through experience, courses, instructions, manuals, taining workshop or anything else) the more capable (and performant) you become at carrying out tasks based on learning objectives. So, if E = Experience P = Performance fora given task T= Task «Then, you can mathematically say that performance ata given taskis directly proportional to the experience, Isnt? Pq) a E «Ashumans and other living organisms learn, similarly t was thought, can machines made to learn and carry out tasks ata decent performance level? «One of the most notable work towards Machine Learning (ML) was dene by Alan Turing in 1950, yes 70 years ago! In his 1950 paper, “Computing Machinery and Intelligence” Alan Turing asked, "Can machines think?” {See http://www .cseeumbceducourses/471/papers/turing pdf forthe full paper) 4+ The paper describes the“lnitation Game", which involves three participants a human acting as a judge, another human, ‘and a computer that is attempting to convince the judge that it is human, The judge would type into a terminal program to “talk to the other two participants. Both the human and the computer would respond, and the judge would decide ‘which response came from the computer. 4+ Ifthe judge couldn't consistently tell the difference between the human and computer responses, then the computer jwon the game, The test continues today in the form of the Loebner Prize, an annual competition in artificial intelligence. The aim is simple enoug! chat bot program. convince the judges that they are chatting to a human instead of a computer ‘+ in 1959, Arthur Samuel defined machine learning as, “[A] Field of study that gives computers the ability to learn ‘without being explicitly programmed.” Samuel is credited with creating one of the self-learning computer programs with his workat IBM, Samuel is widely known for his work in artificial intelligence. «One the most common definition of Machine Learning came from Tom M, Mitchell, the Chair of Machine Learning at Carnegie Mellon University. Its as following. (Copyright No. -L-98904/2021) WH Arica iteigence & Machine Leaning (PRU) 1.3 Inwoducton to Aand Mt #5 Definition : A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, a8 measured by P, improves with the experience E. Hence, Definition + Machine Learning is the study of computer algorithms and programs that automatically improve their performance, for a given set of tasks, with increase in their experience. + Machine learning is branch of artificial inteigence (making machines intelligent). Machine learning uses various statistical (mathematical) models to learn from collected data and then uses tho {asks such as predictions and classifications, 1A se trained models to carry out various ‘The Fig. 1.1.2 outlines how machine learning works ata high-level Training Data Train the Machine Learning Algorithm L Test ihe Model with new input Ld tsthe model ves Mactine Leaning Mode! "ely Ready ° Fig, 1.1.2 : Machine Learning Model * [am sure you would have made several predictions many a times in ife already, isn't it? Whether iis going to rain, ‘whether a team Is going to win, whether the stock market would go up or down and likewise. Similarly, you can identify cat pretty confidently even if tis of different shape and size and is in almost any position (sleeping, jumping, turning upside down, running eating or whatever), (Copyright No. -1-98908/2021) Wm SF _Antticial intelligence & Machine Learning (SPPU) et Introduction to Al and ML. ‘© How do you do these things? You can do this because you have some sort of mental model trained from your past experiences (and learning’s), Based on those experiences, you can easily predict facts or identify things. Ifit occurs to you that what you predicted or identified was not right, you take that feedback and update your mental model so that you can do a better job next time. Probably, you are less likely to make the same mistake, Unfortunately, machines ‘aren't as intelligent as you are, but we take the similar approach to make them intelligent! «© Machine learning is all about using a huge volume of training (sample) data (also called as big data analysis) and then ‘using that data to build models that can carry outa set of tasks when a similar but new input is provided. «+ For example, ifyou want a machine to automatically identify a cat from various animal pictures fed to it, you first train the machine to recognise different types of cats and also in various positions and environments. For example, assume that the following picture has the training data for identifying cats ina picture, rr a es 0 20 a0 «Once the model has learnt “enough”, it can then successfully carry out a task such as identifying a cat in a picture that was not in the training data, For example, it can identify the cat in the following picture. «Thats a very high-level view of how machine learning works. However, there are many more complications and effort required for building and training a complex machine learning mode for real-life applications. Buta general high-level ‘understanding of how machine learning works would go along way to help you build the complex concepts later on, (Copyright No. -1-98904/2021) re WF Artificial intelligence & Machine Learning (SPPU) 15 Introduction to Al and ML 1.1.2 Key Terms Associated with Machine Learning Let's understand some of the key terms associated with machine learning that you would commonly encounter Table 1.1.4 r Term Description Features or | Anything that you can measure and build data for. For example, the typical length of various animal. ibutes | Feature could be numeric, set of characters, Boolean values, or anything else that describes the data Training | (Big data) or the complete set of sample data (training examples) based on which you would train your Dataset machine learning model Training | Each example (or data point) within a dataset. Example | Testing | The set ofsample data (testing examples) based on which you could test your trained machine learning Dataset ‘model fo its correctness and determining if further training or adjustment is required to the model. Target ‘The Feature or value that you want to predict or identify or the output that the trained machine learning variable | model should produce when an inputis fed to it 1.4.3 Data Formats ‘The training data (also called as Big Data) for machine learning comes in different formats. Data can be machine generated (such as log files) or could be human generated (such as tabular data), Overall, the data format is classified as the following Data Formats Fig. 1.1.3 : Data Formats 1. Structured Data : Structured data exhibits a particular order (also known as model or schema) for storing and working with the data, The data attributes are usuc!ly related and are often the basis of analysis. The structured data is usually generated by machines or compiled by humans. For example, spreadsheets, customer records, transaction records, sales reports, etc. are all structured data. The structured data is usually stored in relational databases or simple CSV or spreadsheet files. 2. Semi-structured Data : Semi-structured data has some definitive patterns for storage, but the data attributes may not be inter-related. The data could be hierarchical or graph-based in nature. The semi-structured data is usualy stored in text files as XMLs, JSON or YAML format. The common sources for semi-structured data is usually machines such as sensors, website feeds, or other application programs, 3. Unstructured Data ; Unstructured data does not exhibit a fixed pattern or a particular schema, This is the most common format of Big Data. Examples of unstructured data are video, audio, tweets, likes, shares, text documents, PDFs, and scanned images. Special tools and mechanisms are required to process unstructured data. Also, itis usually Cleaned (sanitised) before itcan be used for analysis, (Copyright No. -1-98904/2021) WH _antficia intelligence & Machine Learning (SPPU) 1.6 Introduction to Aland ML ‘Comparison between Data Formats Table 1.1.2 comparison Attributes | Structured Data _ | Semi-structured Data | Unstructured Data [ Vohime ofa vw wed |e Proessngconpety | ___tow Min gh Data generated by Humans and Machines Machines | Humans owmomlysorein | Reon Oates | Towa es | any ls Pacers and Seen fi Hel fandom [ssc ot woveegivet | __wowromed | Requbed 1.4.4 DIKW Pyramid It is important to understand how data can be enriched and the journey it takes with each stage of enrichment: To understand this journey, typically DIKW Pyramid is referenced. DIKW is an acronym forthe four stages of data enrichment that are (@) Data (©) Information (c) Knowledgeand (4). Wisdom Wisdom Pr Understanding Knowiodge Knowledge + Meaning information Information + Context Data Events, Records and Transactions Fig. 1.1.4 : DIKW Pyramid 1. Data : This is the lowest bottom of the DIKW pyramid, This is the raw data collected from various events, records, and transactions around you. The data could be generated by machines or by humans, The data itself does not have very high value until itis enriched with more attributes that can be used for further analysis. For example, you could just have a data set that has list of millions of people with their demographic details and how they died. This data may not give you anything actionable. 2. Information : At the next level, the collected data is enriched with the context to give information. You start to build perception about the data to give you hindsight. The hindsight about the information reflects or acknowledges what is contained in the data. For example, you could begin to see that the list of people is actually the list of cancer patients and their life patterns. (Copyright No. -L-98904/2021) here BW Arica Inteigence & Machine Learoing (PU) 4.7 Introduction to Aland ML 3. Knowledge : When you add meaning othe information, you start to gain knowledge This is where you precisely stare analysing the information at hand and make it more useful and meaningful, You could gain deep insights about the {information and be able to answer high-level questions. For example, you can analyse the information on cancer pationts and build patterns around life expectancy after cancer detection with ot without chemotherapy. You could further analyse the effect of various chemotherapy medicines to understand their dosage and effectiveness level A company could then invest in building more effective chemotherapy medicines to improve life expectancy of cancer patients after diagnosis So, understand here thatthe objectives of data analysis must be clear to derive meaningfl knowledge from the information at hand, 4 Wisdom : The final level of wisdom is achieved when you add understanding tothe derived knowledge. Note here that ‘wisdom is not achieved using a technical algorithm ot formula but is based on the human understanding ofthe data analysis that was carried out, For example, after analysis of data on cancer patients, you could understand what Ufestye to follow in terms of det, sleep, and exercise to avoid or delay occurrence of cancer. Ths understanding could be shared withthe world as foresight As you see, the DIKW pyramid helps to put a perspective on visualising the data analytics stages, Now let's take this discussion further and understand the categories of data analytics, 1.1.5 Categories of Data Analytics Now that you havea fair understanding of what Big Data, let's touch upon the spectrum of analytics that is possible on the given data sets, The different types of analytics requite different tools and techniques. Categories of Data Analytics Diagnostic | [Precicive | [ Prescriptive Analytics |] Analytics || Analytics |] Analytics [ Descriptive Increasing value and complexity hindsight insight foresight Fig. 1.1 ategories of Data Analytics Descriptive Analytics : Descriptive analytics answers questions about events that have already occurred, Usually raw data Is queried without adding any contextual information. Ths isthe simplest form of analytes and typically answers questions such as (2) How many units ofa particularitem sold in last 6 months? (b) How many patients died ofa particular cancer type? (6) How many calls did you receive fora particular issue? ‘This kind of analytics is usually done using database queries or simple spreadsheet filters. You could have periodic dashboards and reports that can be used to visualise results ofthe descriptive analytics, 2+ Diagnostic Analytics: Diagnostic Analytics is done to find out cause of a phenomenon or derive reasoning behind events. This analytics goes a level deeper to provide information that can be used to fica particular situation or event. Diagnostic analytics usually adds more context to the data to get information about a particular interest. For example, following are afew questions that can be answered using diagnostic analytics (@)_ Why the sales in quarter 2 lower than quarter 1? (Copyright No. -L-98904/2021) WF Artificial intelligence & Machine Learning (SPPU) 18 Introduction to Aland ML (b) Why are people falling il after eating a particular type of biscults? () Why the mode! X ofthe car preferable over the model ¥ ofthe car? Diagnostic analytics require careful examination of data frm multiple sources and isa litle more involved and skilful exercise than descriptive analytics. 3, Predictive Analytics : Predictive analytics is carried out to forecast and predict future events. The information is further enriched by adding meaning to it to derive knowledge. The predictive data models are carefully created that can pase off future predictions based on the past events, Predictive analytics could possibly answer questions such as (a) What would be the improved life expectancy if choosing medicine A over medicine B? (b) | What would be the sales figure for model X ofthe car in third quarter ? ()_ Which team would likely win the world cup this year? Predictive analytics assumes that certain set of conditions are met or would exist. If there are changes to those conditions, then predictive analytics may not be accurate, ‘4. Prescriptive Analytics : Prescriptive analytics takes the results from predictive analytics and further adds human judgment to prescribe or advise further actions. This reflects the wisdam level from the DIKW pyramid that you learnt earlier. The prescriptive analytics could answer questions such as (a) What should you do to delay cancer? (b) Whats the best time to leave home to reach airport on time? (6) Which medicine would have higher chances of survival for the patient? Prescriptive analytics is the most dificult out of all other analytics. It requires significant skills and time to give effective actions and results, It could also be dependent on not only the analysed data but external conditions such as political pressure, social acceptability, and personal preferences. Comparison between Categories of Data Analytics Table 1.4.3 Comparison Descriptive Diagnostic Predictive Prescriptive Attribute Analytics Analytics Analytics Analytics Complexity Least |__ Medium High Highest | | time requirement to Low Medium High Very High produce results Value of results Shart Term, Medium Term Long Term Very long term Data enrichment Data Information Knowledge Wisdom level | Analytics Frequency | _ Most Common Frequent Not often Rare 1.2 Types of Machine Learning ‘As a human learn in various ways (auditory, visual, kinesthesis), so does machines. You also understand that there could be different types of data formats and various analytics and learning requirements, Ata high-level, machine learnfhg algorithms could be categorized as following, {Copyright No. 1-98904/2021) Aric neigence & Machine Learning SPPU) 4.9 Introduction to Aland ML 1, Supervised Leaming 2, Unsupervised earring | Fig. 1.2.1 : Types of Machine Learning 1.2.1. Supervised Learning 4% Definition : Supervised learning algorithms build a mathematical model of a set of data that contains both the inputs and the desired outputs. * In supervised learning, the machine earning algorithm is trained on labelled data (the training dataset has both input as well as output). This is very similar to you teaching a toddler by showing a picture of an apple and saying, “Look kid, this is an apple’. Fig, 1.2.2 provides a high-level outline of how supervised machine learning algorithm work. ‘Supenised Learning —— nee ‘Supervised Machine Learning ‘Agoritn, These are apples aa : eft fear Fig. 1.2.2: Supervised Machine Learning * The training data set has various examples that contain both input (Features) as well as output (target). Supervised learning is typically used for classification or predicting a particular value (regression). For example, classifying an animal picture as a cat or a dog or whether itis likly to rain. Some ofthe common supervised learning algorithms are Nearest Neighbours, Linear regression, Logistic regression, Nave Bayes, Support Vector Machines, and Decision Trees 1.2.2 Unsupervised Learning % Definition : Unsupervised learning algorithms take a set of data that contains only inputs and ‘automatically find structure in the data in order to group or arrange them in a eluster, anaage (Copyright No. -1-98904/2021) WH Anificial intelligence & Machine Learning (SPPU) 110 Introduction to ALand ML ® Inunsupervised learning, once the model is given a training dataset, it automatically finds patterns and relationships in the dataset by creating clusters (or groups) within it, Unsupervised learning algorithms, however, cannot label the cluster or determine what those created clusters might mean. «For example, ifyou feed a training dataset having thousands of pictures af cats, dogs, and monkeys, an unsupervised earning algorithm can potentially create three different clusters ~ one for each animal but it cannot ell you which luster is what. Its up to the observer or human to make the sense out ofthe created clusters of data + When you feed a new input to such an algorithm, it can place iin one ofthe already created clusters or could create a new group ifitis dissimilar to any ofthe already created clusters. «Fig. 1.23 provides a high-level outline of how unsuy AOA OA* Ax O wOA Training Data ‘Unsupervised Machine Learing ‘Algorithm (Receives only input data and the finds patterns ‘automatically) Fig. 1.2.3 : Unsupervised Machine Learning «Thus, unsupervised learning algorithms learn from the training dataset that has not been explicitly labelled and netther thas the output (or target) associated with i. Unsupervised learning algorithms ate commonly used to find patterns of data and build recommendation engine. For example, you would have commonly seen various recommendations based on your purchase history on e-commerce portals or streaming portals such as YouTube based on genre of videos you have watched, Fig. 1.24 «Some of the common unsupervised learning algorithms are k-Means clustering, Expectation maximization, Hidden Markov Model, DBSCAN, and Parzen window. (Copyright No. -1-98904/2021) erent 5 # BE Arsicialinteigence & Machine Learning (SPPU) 1.4 Introduction to AL and ML 1.2.3 Reinforcement Learning + Treat reinforcement learning as learning from mistakes. The reinforcement learning algorithms work very similar to how you ear by yourself without any guidance ~ basically through hit and trial When you get something right, you get a reward, you feel happy, and you move ahead. When you get something wrong, you get a penalty, you take a step back and then you try to avold the incorrect path while exploring another correct path 4 Definition : Reinforcement machine algorithms improves upon themselves and learn from new situations using a trial-and-error method. + The favourable outputs are encouraged, or ‘reinforced’, and non-favourable outputs are discouraged or ‘punished! Fig. 1.2.5 outlines how reinforcement learning works at a high-level, Action Reinforcement ‘Machine Learning ‘Algorithm Environment (Tees to maximise reward and avoid penalty) State Fig, 1.2.5 : Reinforcement Machine Learning * Reinforcement learning is studied in many other disciplines, such as game theory, control theory, operations research, information theory, simulation-based optimization, mult-agent systems, swarm intelligence, statistics, and genetic algorithms. [ts not used frequently for machine learning problems such as classification or prediction, * Many reinforcement learning algorithms use dynamic programming techniques. Reinforcement learning algorithms do not assume knowledge of an exact mathematical model and are used when exact models are infeasible. Reinforcement ‘earning algorithms are used in autonomous vehicles or in earning to play a game against a human opponent. Comparison between Super sed and Unsupet sed Learning The Table 1.2.1 provides a quick comparison between supervised and unsupervised learning. Table 1.21 Comparison Attribute | Supervised Learning Unsupervised Learning ‘Training Dataset ae Both input and output Only input Used for Classification and Prediction | Finding patterns and understanding data Training Data Is Labelled Not labelled Number of targets Known beforehand Not known Feedback from user Provided Not provided Complexity High Low _ (Copyright No. -L-98904/2021) WF Artificial intelligence & Machine Learning (SPPU) 1.12 Introduction to Aland ML 1.2.4 How to Choose the Right Machine Learning Algorithm 2 + So, how do you choose which machine learning type to go with? Itis simple - ifyou need to predict a target value, then you should choose supervised learning. For example, if you want to find out chances of raining tomorrow, a team ‘winning a tournament, a picture being an apple or a cat, or problems such as these, then you should choose supervised learning method. + However, if you are not solving a prediction or classification problem and your goal Is to group the data and find “interesting” patterns, then you are better of using unsupervised learning + Once you have decided which type of machine learning algorithm you need, you then need to choose a suitable algorithm from that machine learning type. Note here that there is no single answer to what the best algorithm is or what will give you the best results * You will need to try different algorithms and carefully evaluate how they perform as per your requirements, + Comparison between Artificial Intelligence and Machine Learning ‘+ Machine learning is often confused with Artificial Intelligence, Without going too much into details, understand that artificial inteligence is about making smart machines to solve complex problems similar to humans. Machine learning Isa subset of Artificial Intelligence. When you interact with Google Assistant or Apple Siri or Alexa to check weather conditions or ask it to play your favourite song, you experience an Artificially built Intelligent system. The Table 1.2.2 provides quick comparison between Artificial Intelligence and Machine Learning for your reference Table 1.2.2 Comparison Attribute | Machine Learning | Artificial Intelligence | Focus Learn from data_| Solve complex problems Complexity Low High Scope —_ Narrow Broad Human interaction Minimal High Comparison between Data Mining and Machine Learning ‘+ Data mining is 2 process used by companies to turn raw data into useful information, By using software to look for patterns in large batches of data, businesses can learn more about their customers to develop more effective marketing strategies, increase sales and decrease costs. However, there are significant differences between data mining and machine learning. ‘© The Table 1.2.3 provides a quick comparison between data’ ing and machine learning Table 1.23 Comparison Attribute | Machine Learning Data Mining Building atrained model | Required Notrequired Human effortrequired | Only for bullding model | For extracting information from data Use of specific algorithms | Frequent Rare Accuracy High Low ‘Tasks carried out by | Machines Humans Self-learning. Yes No (Copyright No. 1-98904/2021) W Arca nteligence & Machine Learning SPPU) 4.43 Introduction to Aland ML 1.3 _Issues in Machine Learning ‘Some of the common issues or challenges involved in machine learning are as following, Challenges in Machine Learning 1. Data Labelling 2, Shortage of Exparis| 3, Oblaining massive training data sets 6. Bias in data and algorithm eee Fig. 1.3.1 : Challenges in Machine Learning 1. Data Labelling: Today, there isa massive amount of data that is unlabelled and raw. As you know, supervised machine '8 works on labelled data, Without adequate data labels in the training dataset, it is not feasible to build robust ‘machine learning models. Companies are putting thousands of man hours to label data so that it can be appropriately ised for machine learning, This is an active area of research where the labels can be attached to the data as it is used. But today data labelling is far most one ofthe biggest challenges in machine learning 2. Shortage of Experts : Machine learning is an emerging field and there are not many experts around the world. You require experts who can (2) Understand the wide variety of data () Model the data correctly so as to meet the desired objectives (2) Build and manage software and hardware tols and techniques required for machine learning (@) Design appropriate visual interfaces and (©) Also communicate the findings effectively Building a team of experts that have all the required capabilites is challenging and time consuming, 3. Obtaining massive training data sets: As you understand, the more intensive is the training dataset the better and Tobust isthe trained machine learning model. However, i is dificult to obtain massive training datasets for various areas and applications. You may lack historical data, Also, the quality of data for the training dataset matters if the ‘dataset obtained does not represent. fair sample size, then the resultant machine learning models could be erroneous For example if you are trying to build a machine earning model to predict a particular type of cancer from a given set of symptoms, lifestyle, and blood related parameters, then you would require quality data for thousands of patients that have had that particular type of cancer and the details oftheir symptoms, lifestyle, and blood related parameters You would not be able to build a robust and trained machine learning model without having a decent sized and highs uality training data set. I'there are areas that are new, you may not have the training dataset available for it. ‘4. Hard to explain problems and results: Larger and more complex machine learning models make it hard to explain, in human terms, why a certain decision was reached (and even harder when it was reached in real time) (Copyright No. -L-98904/2021), SF _amtficial intelligence & Machine Learning (SPPU) 114 Introduction to Aland ML For example, if you tell healthy person that she has an 80% chance of geting a particular disease, then she may require additional detalls behind that statement. Complex machine learning modes, often bull by experts may not be self-explanatory when used by common people inthe field. This might make it hard for people to believe such systems and may lead them to question their predictions 5. Limited possibilities to reuse a model ; Unlike humans, who can apply their learnings from one circumstance to another circumstance, machine learning models are built specifically per use case. It is difficult to reuse an existing machine learning model for other use cases or solving problems in even related domains. Hence, companies have to invest time and resources to build new model for solving new use cases 6. Bias in data and algorithm : What you read is what you lear, isn't it? Haven't you heard your parents and teacher say that learn from right sources and do not trust everything you get to read. Similarly, the traning datasets use to build machine learning models cou have hidden biases ether knowingly or unknowingly. You will agree for sure iT tll you that our society does have its own space for prejudices and biases and not every decision made i fair to every group. ‘This bias can creep-in into the training dataset and make the machine learning models biased towards a particular group, decision, or societal element. ‘The process and frequency of data collection itself could be uneven across groups and observed behaviours which could further distort how algorithms analyse that data, learn, and make predictions. Negative consequences can Include inisinformed recruiting decisions, misrepresented scientific or medical prognoses, distorted financial models and criminal justice decisions, and several other predictions and actions. In many cases, these bases go unrecognised or disregarded as they are believed to be beyond general human understanding and treated as “advanced data sciences’, “proprietary data and algorithms," or “objective analysis 1.4 _ Steps in Developing a Machine Learning Application In the previous section, you learnt about how machine learning works at a high-level, Extending the discussion from there, ata high-level, developing a machine learning application involves the following steps. Each of the steps could be @ ‘steps in Developing a Machine Learning Application sequence of activities in itself. '5. Test the algorithm NS 6. Use the algorithm 7, Periodic Revist Fig, 1.41 : Steps in Developing a Machine Learning Application {Copyright No. -1-98904/2021) src neigece& Machine earning GPPU) tas Introduction ty Aland ML 1, Collect Data As you understand, to build a machine leaning model, you need a huge volume of training data. You could possibly have this training dataset available internally from your historical business operations or you could engage with external agencies and websites that have training datasets for various purposes either freely available or available for a fee. ‘Wherever you get data from, you need to ensure that itis accurate and verted to be as real as possible. Your trained machine learning model would only be as accurate as your traning data. Some of the popular, publicly availabe, dataset sources are as following. a. Kaggle htps://www kaggle.com/datasets, ataset : As of Sep 2020, Kaggle has 54,876 datasets on various topics. ‘They are available on = haggle a nV Hae sie b. Amazon Web Services (AWS) : As of Sep 2020, AWS has 188 datasets on various topics. They are available on hetps:/ /registry.opendataaws, eens About “The Cancer Genome Atlas ee Ie ono eo Therapeutcally Applicable Research to Generate Efective Treatments (TARGET) om ee eee (Copyright No. -1-98904/2021) nee WH _Antficial Ineltigence & Machine Learning (SPPU) 116 Introduetion to Aland ML c. UCI Machine Learning Repository : As of Sep 2020, UCI has $57 datasets on various topics. They are available on sta = espe iaamartpa] ew [ote] Yor a eae cosmo | gaat [om [| o | oe ce | anaes en oe ee coe oe a cone | See | om | fee |B mmrmsmnnmsm Frnomamtwom| commen | cm | me | we 7 | eaten, esse conenn | Gams | ee [am | ntsc ce re Ee ee ee sc cwnsein | commen | ae | | mm em | a momen] oe | 8 | ay d. Google's Tensor Flow ; Google has a huge repository of various datasets ~ audio, video, images, text, and various others, They are available at https:/ /www.tensorflow.org/datasets/catalog/overview. AP Tensorrow sat awesome yin = Qo sete cake ome f Dateets ay Imetan ¥ fssne ¥ oats > Sa seis | ane Swen Datasets = 1 Some ‘ic neointima iene un ite pi hy cores = ephemera con Usage - Google also provides a search option for finding various datasets across. various dataset repositories at Intps/ /datasetsearch research google.com. (Copyright No. --98904/2023) Wis WH Arcil eigonce& Machine Learning SPPU) 1.17 Introduction to Aland ML Dataset Search sles Unite States Consumer Pe ver ape App (Global expats of apple woe 2019720 Apples Parkin - Source FAO erage eal rice fx apples in Canad 2 byfod, na applesauce and cere, aared [APPLESEED FOUNDATION NC fica yr enn June 2016 ‘elane of es aples expan om Canad 2010:11 2018/18 set von Apples ae Cmputen waiter bis 2019 Produton volume of applesin te European Union 2911-2019 ‘Aoples Saranse Oranges & ‘Mlrasft: Wicrosot has a epostory of various datasets. Thay ae aaleble at htps/msropendatacom categories EH Microsoft | wera asus oir Gam Gnegui AML Than Sada Categories aovoay ==, couputen science VIEW DATASETS > eee, VIEW DATASETS > EanTi science 1) ,eoueanon WEW OATASETS > EW DaTaSETS > HeALTHoanE INFORMATION SciENCE VIEW DATASETS > VIEW DATASETS 5 < ariewarics omer VIEW DATASETS > View OATABETS > puvsics SOCIAL science EW DATASETS » VIEW DATASETS > (Copyright Wo, -1-96904/2021) aa Introduction to Al and Ml f OpenM : As of Sep 2020, Opendil has 3192 datasets on various topics ‘They are available on ttps://vww.openml.org, ano 1 sear Brows Sessa Orwwin @ aust pwns Bem ame So,as you see, there are quite a many dataset repositories around from where you can eolect data 2. Prepare the Input Data once you have the data, you need to ensure that it sin the right format such that can De processed by the chosen algorithm and computer programs. The publicly avallable datasets are usually available In ‘various formats so that you can skip ths step and could directly use the procured training dataset 3. Analyse the Input Data «This s-a crucial step where you need to ensure thatthe input dataset could be parsed properly for your chosen computer program. You also need to ensure thatthe examples are complete (they are not missing values) and are also not skewed (too high or too low compared to rest of the examples). Again if you trust the soure of dataset and ifyou ae sre that the dataset has accurate values;You may choose to skip his step. This stp jst ensures thatthe dataset, based on which you are going to build your masking learning model, meet the desired quality. 4, Train the Algorithm «This s the core step where you start to train your machine learning algorithm to build a model. Based on the chosen algorithm this step cou e simple or could be very comple. You use the collected and analysed nPit training dataset “ind feed ito the chosen algorithm to check how it works onthe input data and make adjustments and corrections as required, «+ Note here thatin te case of unsupervised learning there sno expt traning step because you donot have a farget value, Unsupervised learning algorithms work on the provided input to find patterns However, You may have to “hoote which features would you choose to feed the unsupervised learning algorithm so thatthe discovered patterns are meaningful forthe purpose. {Copyright No. -1-98904/2021) eee W Arica neigence & Machine Learning SPPU) 3-19 Introduction to Aland Mt 5. Test the Algorithm + Asamatter of practice, when you get a dataset, you partition it into 80-20, where 80% of the examples in the dataset are used to train the model and 20% ofthe examples inthe dataset are used to test the trained model. When you are ‘ralninga supervised learning algorithm, once you are suficintly confident thatthe model is well-trained, you put it to test by feeding it new known inputs and confirming if t produces the desired output (since desired output is also known from the fed input data). * In unsupervised learning, you may have to use various evaluation parameters, such as number of clusters created and distance between the cluster objects, to ensure that the model is working as expected, + Hfthe test results are promising you move farther with using the model. However, ifthe test results are not satisfactory, ‘you need to Find the root cause and based on the root cause you may have to © Re-train the model © Make adjustments in the model or data © Trya different algorithm © Collect the dataset from a different source * Testing the algorithm before use isa crucial step to ensuring that your model doesnot product fase results. Do not skip rf 6. Use the Algorithm You spent aot of time collecting and cleaning the data and then building and testing the model. Once you are through ‘hese steps you are good to use the model and si back and enjoy the success from your hard work. You may develop an ‘application based out ofthe model For example, based on someone's health parameters, your machine leaning model coulé deuce health related problems that the person may fac in near future. Based on someone's credit history, your aplication may infer the chances of anew loan geting approved. The applications and usage are plenty as you learned earlier! 7. Periodic Revisit As you often revise to ensure that your learning is still effective, you should periodically review the results that the ‘model is producing and evaluate if there are opportunities for improving it in light of new data. You may carry out minor adjustments tothe model or may re-train it with latest data to fine tune it Tis step is very similar to you getting 4 master health check done for yourself annually to ensure that your body’ vital parameters are doing well. If any Parameter indicates a potential problem, then you either make lifestyle changes or seek medical advice, 1.5 __ Introduction to Artificial Intelligence you have ever interacted with Amazon Alexa, Apple Sie, Google Assistant, Microsoft Cortana, driverless ears, or any other WhatsApp based customer service representative, then you have personally experienced some sort of artificial intelligent system. Artificial Intelligence (Al) is rapidly transforming our world, Remarkable surges in Al capabilities have ted to a wide range of innovations including autonomous vehicles and connected Internet of Things devices in our homes. Al 's even contributing to the development of a brain-controlled robotic arm that can help a paralysed person feel again ‘hrough complex direct human-brain interfaces, These new Al-enabled systems are revolutlonising and benefitting neatly allaspects of our society and economy - everything from commerce and healthcare to transportation and cyber security So, what is AI? (Copyright No. -L-98904/2021) W Arica tteligence& Machine Learning SPPU)__1-20 Introduction ol and ML You may find several definitions of AL. No matter what the definition of Al is, at a high-level, Al system is capable of thinking and acting rationally very much like humans do. So, the simplest definition that you could possibly think of AL is as following, & Definition : Artificial intelligence leverages computers and machines to mimic the problem-solving and decision-making capabilities of the human mind. Artificial intelligence allows machines to understand and achieve specific goals. Al includes machine learning and deep sub-fields, learning as it ‘tical neigence Fig. 15.4 Machine learning helps machines to automatically learn from existing data without being assisted by human beings. Deep learning allows the machine to absorb huge amounts of unstructured data such as text, images, and audio, Any Al system typically has some of the basic capabilities such as observation (sense, listen for commands, etc), learning, understanding, analysing, problem solving, and responding, 1.5.1 Types of Al Ata high-level, there are two types of Al. 1. Strong Al: Strong Al is resembled by a system when it truly understands what is happening. There may even be emotions and creativity For the most par, itis what you seein science fiction movies. Strong Al is made up of Artificial Ceneral Intelligence (AGI) and Artificial Super Intelligence (ASI) Artificial general intelligence (AGI), or general Al isa theoretical form of Al where a machine would have an intelligence equalled to humans; it would have a self-aware consciousness that has the ability to solve problems, learn, and plan for the future, Artificial Super Intelligence (ASI) also known as super intelligence would surpass the inteligence and ability of the human brain. While strong Al is still entirely theoretical with no practical examples in use today, that doesn't mean Al researchers aren't also exploring its development. In the meantime, the best examples of ASI might be from science fiction, such as HAL, the superhuman, rogue computer assistant in 2001: A Space Odyssey. Note that there are only a handful of companies that focus on this category, such as Google's DeepMind. 2, Weak AI ; With this,a machine is pattern matching and usually focused on narrow and specific tasks. It fs also called fas Narrow Al or Artificial Narrow Intelligence (ANI). For example, Apple Siri, Amazon Alexa, IBM Watson, and ‘autonomous vehicles. The reality is that modern Al is in the early phases of weak (or better called narrow) AL. Reaching the point of strong Al could easily take decades. Some researchers think that it may never happen. (Copyright No -1-98904/2021) Wee W_ Arica ntetigence & Machine Learning (SPPU) Lat Introduction to Aland ML, 1.5.2 History of Al ‘The following infographic gives a general history and evolution of modern Al. | s055 a | 1984 | AL WINTER sr | Seton [Maca lenin aoe 2018 2017 oe |Saniomentaf Seana | Meats |. Boo } Le LL L = L. Some ofthe key dates for Al evolution are as following, * 1950 Alon Tuting published a paper titled “Computing Machinery and tnteligence. Inthe paper, Turing proposed to answer the question ‘can machines think?’ and introduced the Turing Test to determine ia computer can demonstrate the same intelligence (or the results ofthe same intelligence) as a human. * 1956 John McCarthy coined the term ‘artificial intelligence’ atthe first-ever Al conference at Dartmouth College, Later that year, Allen Newell, J.C. Shaw, and Herbert Simon created the Logic Theotis, the frstever running Al software program, * 1967 + Frank Rosenblatt built the Mark 1 Perceptron, the fst computer based on a neural network that ‘earned ‘hough trial and error, Justa year later, Marvin Minsky and Seymour Papert published a book titled Perceptrons, which became both the landmark work on neural networks and, atleast for awhile, an argument against future neural network research projects. * Al Winter: During the early 1970s, the enthuslasm for Al started to wane. It came tobe known as the “Al winter” ‘which lasted through 1980 or so Eventhough there were many strides made with Al, they sill were mostly academic 4nd involved in controlled environments At the time, the computer systems were still limited, * 19805: Neural networks which used a backpropagation algorithm to train itself became widely used in A applications. © 1897: 1BMs Deep Blue beats then word chess champion Garry Kasparov, na chess match (and rematch), * 2011:1BM Watson beats champions Ken Jennings and Brad Rutter a Jeopardy! © 2011: Apple Siri was launched. * 2014: Amazon Alexa and Microsoft Cortana were launched. * 2015; Baldu’s Minwa supercomputer uses special kindof deep neural nswork called a convolutional neural network to identify and categorise images with a higher rte of accuracy than the average human, (Copyright No, -1-98904/2021) eee aval metgene Machine Learning PPO) 222 noduetion to Aland Mi se rre pest’ Alpha program, powered by # deep neural network, bets Le Sol the wort chamion Co aie cee gan mathe victory ssi en the hue nunber of pesle moves the ae rogess aoe az eron afer four moves, ae, Google purchased Deep fr aeprted USD 40 mon. +2016: Gooel Assistant was launched ©2003: Tesla ely to lunch fly dienes as! 1.5.3 Comparison of Al with Data Science 4 Definition : Data science is an interdisciplinary fled that uses scientific methods, processes algorithms and eystems to extract knowledge and insights from noisy, structured and unstructured data and apply hnouledge and actionable insights from data across a broad range of application domains. «Dota science as a concept unifies several domains such as statistics, data analysis, informatics, and their related vnethods in order to understand and analyse real-world phenomena with data [t uses techniques and theories drasvn from many fields within the context of mathematics, statistics, computer sclence, information stience and domain nowledge. A data scientist is someone who creates programming code and combines it with statistical Knowledge to reate insights on business data, Data science is heavy on computer sclence and mathematics, Data science used in tsiness functions such as strategy formation, decsion making and operational processes. ft touches on practices such ‘as artificial intelligence, analytics, predictive analytics and algorithm design. «The Table 1.5:1 provides a quick comparison between Al and Data Science, Tablo 15. Comparison Avtiial intelligence Data Science ] tribute : : nee gman iene mans | Evacngwsfland acon ih ona Complexity Very high Medium | Response In terms of voice or action interns of prediction or visuals | 1.5.4 Approaches to Al here have been several approaches (thoughts) for making machines inteigent and human lke. These approaches are derived from various fields of work and try to discuss if there are possibilities to build an inteligent system by folowing similar behavioural and scientific patterns, Let's learn about some of the common approaches to Al Cybernetics and Brain Simul + cyemetics isthe science of communication and control theory thats concerned especially with the comparative study fautomatic control systems such asthe nervous system and brain and mechanical-electrcal communication systems tn 1948, Wiener published Cybernetics: Or Control and Communication inthe Animal and the Machine. Even though It ‘was a scholarly work filled with complex equations, the book stil became a bestseller, Some of the topes included Netwtonian mechanics, meteorology, statistics, astronomy, and thermodynamics, Wiener compared the huran Brain to the computer. Furthermore, he speculated that a computer would be able to play chess and eventually beat grand vrasters, The main reason is that he believed that a machine could Tearn as it played games. He even thought that computers would be able to replicate themselves. Wiener’ ideas were powerful and spurred the development of A (Copyright No, -L-98904/2021) hee Wari nelience & Machine earning (SPPU)_—_1.23 Inwroduction to Aland ML + The objective of control theory isto control a system so thatthe system's output follows a desired control signal, called the reference. To do this, a (normally feedback) controller is designed that determines what output needs to be monitored, how to compare it with the reference, which system behaviours need to be adjusted, and how to adjust them. The difference between actual and desired output, called the error signal, is applied as feedback to the system input, to bring the actual output closer to the reference, + The Fig. 1.5.2 shows a very simplistic view of cybernetic control loop. ‘Threshoid/Reference ‘ine systom } Contrary Diterence’ Action Reaction Feedoack Fig. 1.5.2 A control system is composed of twa components: The controlled object : The controlled object is what is given, which is the component implementing the business functional capabilities. 2 The controller : The controller is what is designed, based on the model of the controled object. In the presence of various disturbances to the controlled object, the system's goal is to maintain some specified properties of the controlled object's output at, or sufficiently close to, the reference input (also called the set point), To achieve the system's goal, the controller needs to activate some actions to affect the controlled object through the manipulated variables (i. the input ofthe controlled object). These actions are the controls adopted by the controller. Generally, there are three types of basic control systems. 1. An open loop control system imposes no controls on the controlled object even if itis disturbed by events from its interactive environment, 2 A feedforward control system detects an external disturbance and imposes some controls on the controlled object to reduce the impact ofthe disturbance, 3. A Feedback control system detects the output of the controlled object and imposes controls on the abject to reduce deviation in the output, but it does not take care ofthe external disturbance. The Fig, 1.5.3 illustrates these three types of control systems. [Disturbances Fosdforwara Disturbances TL controtier Controled ]_ Output Contrafed ] Output input L_biect irpatL_obict {@) Open toop control (6) Feedforward control DOstiance | Disubances aaa Mesures ‘Souter Error [Feedback ConntoTae |, Output Sep ao ete cenvolor oot] “obec ag Enemas] gear] um f ‘purl_22h race Lester | “On| eet he Nesta Measured output cut | (c) Feedback control (d) Feedforward-feedback control Fig. 153 (Copyright No, -L-98904/2021) kanal WH Artificial tnteligence & Machine Learning (SPPU) 124 introduction to Aland Mt © In open loop and feedforward control systems the control action from the controller is independent ofthe output ofthe controlled object, whereas in the feedback contral system, contra action from the controller depends on the output of the controlled object. «Brain simulation isthe concept of creating a functioning computer model ofa brain or part ofa brain. Brain simulation. projects intend to contribute to a complete understanding ofthe brain, and eventually also assist the process of treating and diagnosing brain diseases, I is thus believed that an Al system could be built using the principles of cybernetics and brain simulation, 1.5.4(A) Symbolic Imagine that I show you the shown in Fig. 15.4 Pek Masi hihvahua Terror Fig. 1. Now, (show you a different picture and ask you what itis, Fig. 1.55 Will you be able to tell? «You will notice that i is not exactly the same as any of the earlier dog pictures, but you could still make a connection and learnt that It fs a dog. You might not know what exactly you have learned, and you probably have trouble describing how you learned that it isa dog, but chances are that your human brain was able to make the connection. «© On the other hand, computers are far better at memorisation. So, you could program a computer to memorise those ight dog Hlustrations. Then if you show the computer one ofthe eight, It could quickly make a connection. But ft is much more difficult to have a computer learn. It could easily fumble on the ninth image as it tries to find the perfect match. «+ Symbolic Al is about memorising and matching these diferent symbols. So a symbolic system could easily memorise the eight dog illustrations and make exact matches. (Copyright No, -1-98904/2021) ee W Areca nteigence& Machine Learing(SPPU) 1.25 Introduction to Aland ML Butif€could not find an exact match, it cannot give an answer. You also could certainly create a symbolic system that translates written text from different languages. A machine could memorise millions of different typed words and Phrases. The challenge is that it would not really be learning a new language. It would just be a digital version of an old- style language phrasebook. In the 1970s and 1980s, symbolic systems were used to create Al software that could make expert decisions. These ‘were commonly called expert systems. Symbolic systems are based on traditional compute logic or pre-programming, that is, the use of approaches like I-Then-Else statements to derive a response. Expert systems involve complex Pathways were often developed by domain experts in particular fields like medicine, finance, and auto manufacturing ‘The Fig. 1.5.6 highlights the key components of an expert system, Knowledge Nonexpert y= oo “trom an expert gor Expert System Query! fg £ knowiedge ij: base — 5 Advice! |S. Fig. 1.5.6 Domain experts “put” their knowledge into a database called Knowledge Base, There isa user interface through which a hot-expert user can ask questions and is then provided a response by the inference engine after consulting the knowledge base, IF you have ever used Microsoft's builtin troubleshooting programs, then you have experienced an expert system behind the scenes that asks you questions about the issue you are facing and help you find the possible causes, For example, the following screenshot is from audio troubleshooting, 1 Payrg ha Which ofthese devices do you want to troubleshoot? ‘Oiiphones sedope 621 Sarso No ckfomitin asl Obiatet- ope 62 Heder AS Ae ‘No tkinomaten sisi © Seeskesnteaiphones ste ud {Curent Def Device "jc neat alae ces! (Copyright No, -L-28904/2021) WH Autificia Intelligence & Machine Learning (SPPU) 1.26 Introduetion to Aland ML lying Ao Jur off Sound Effects and Enhancements Oreo improve sound gusty. To turn them fuse he Tyas a checkbox or ence ter nial then ck apply’ H you dont see an enhancements tab thon your deuce doesthave ay enhancements ~> Yes, open audio enhancements No, don't opan Audio Enhancements «gut there were issues with expert systems, They were often narrow and difficult to apply across other cateb0re® rurthermore,as the expert systems got larger, it became more challenging fo manage them art feed dat, ‘The result vwas that there were more errors inthe outcomes. Next testing the systems often proved to be a complex Proce Let's face it there were times when the experts would disagree on fundamental matters. Finally, expert s¥stemt did not vearn overtime. Instead, there had tobe constant updates to the underying logic models, which added greatly to the cost and complexities, By the late 1980s expert systems started to lose favour in the business world and many start ‘ups merged or went bust. «arly expert systems started to disappear inthe late 19805, but the symbolle approach remained Today, you see tin vats called artifical intelgence planning branch of Al that employs strategies and action sequences to enhance the computer's ability to match symbols and patterns. Al planning attempts to solve the problem of combinatorial explosion (many posible combinations of information) by using something called heuristic reasoning “7 approach hat attempts to give Al aform of common sense, thus liniting the number of patterns the program has 0 rmatch at any one time. This approach is sometimes referred to as limiting the search space. A‘ planning is common with navigation systems suchas Google Maps. You enter your location and Your destination, and the system nds the shortest fastest outst ill uses a symbolic approach that relies om lists, and Google must gather the data to create those lists It does so by pulling data from numerous sources, including satellite and aerial imagery: state cy, and county maps; ts own Street View cars; and from users who contribute their own map information. All this data ts earelly vetted and then stitched together to creat highly detailed maps. Google Maps also extracts current traffic data from local highway authorities to help route drivers around accidents and backups «Unlike early symbolic systems, Google Maps uses heuristic reasoning to limit its search to a certain geographical ares based on the location and destination you enter, and itcan provide detailed directions, such as whether fo Wart left o right ata given intersection without having to search through is entire database of symbols and pattors «Even though it’ considered old-fashioned Al, symbolic systems and Al planning are still used in many ne projects. performs well in systems that have predefined symbols and patterns You can see this with driving directions, but vigo works with contracts logit, and even video games Ifyou ae considering a new Al project don't Pe quick t dismiss the benefits of good old-fashioned Al Newer approaches may not be the right fit {Copyright No. -L-98904/2021) WF Arial aoigence & Machine Leaning SPPU) 1.27 Introduction to Aland 1.5.4(B) _ Sub-symbolic (Connectionist Approach) + 1Flask you to describe an apple, you will usually come up with phrases such as “it is red" “itis round, "it is sweet, “it is fruit’, ete Such a system is typically how symbolic systems ate designed and operated, While this is great, it becomes tough for mathematical models to be built on top of such constructs as there is no clear relationship between these factors. This Is precisely where sub-symbolie or connectionist approach comes into play. The main assumption of the sub-symbolic paradigm is that the ability to extract a good model with limited experience makes a madel successful. Here, Instead of clearly defined human-readable relations. you design less explainable mathematical equations to solve Problems. Once you have a mathematical model (or representation) of something, then you can reasonably predict or classify that thing. You will earn about several mathematical representations and functions throughout the book that attempt to solve prediction or classification problems. ‘+ The Figs. 1.5.7 and 1.5.8 illustrates the difference between symbolic and sub-symbolic approaches, Sr Fig. 1.5.7 : Symbolie approach Fig, 1.5.8 : Sub-symbolic approach ‘+ Contrary to symbolic methods, where the learning happens through the human supervision and intervention, sub- symbolic methods establish correlations between input and output variables. Such relations have high complexity and are offen formalised by functions that map the Input to the output data or the target variables. Sub-symbolic methods Tepresent the Connectionism movement that i trying to mimic a human brain and its complex network of {interconnected neurons with the Artificial Neural Networks (ANN). The sub-symbolic At includes statistical leaning ‘methods, such as Bayesian learning, deep learning, back propagation, and genetic algorithms, * The sub-symbolic methods are more robust against noisy and missing data, and generally have high computing Performance, They are easier to scale up therefore, they are well suitable for big datasets and large knowledge graphs. Moreover, they are better for perceptual problems, and they require less knowledge upfront. However, connectionist ‘methods have some challenges as v.ell. The most important one is the lack of interpretability in these methods. This Presents a big obstacle to their applicability in domains where explanations and interpretations are key polis. * Furthermore, they are highly dependent on the training data they process. At first glance, it might not seem lke a problem, however, this results in an inability to extrapolate results to unseen instances or data which do not follow a ‘similar distribution as the training data, Additionally, due tothe typically large number of parameters that need to be estimated in sub-symbolic models, they require huge computation power and huge amounts of data. Another issue arising is the availabilty of high quality data for training the algorithms, which often are difficult to find. Data need to be correctly labelled and to have decent representatives of the normal not to lead to biased outcomes. Irrespective of the challenges, sub-symbolic methods are the ones that primarily used today. ‘+ Most common applications of sub-symbolic methods include prediction, clustering, pattern classification and ecognition af objects, and Natural Language Processing (NLP) tasks Sub-symbolic Al i also used in applications such as text classification, categorization, speech and text recognition. Sub-symbolic Al is what modern Al is based off mostly (Copyright No, -L-98904/2021) ee BH Artifica ntligence & Machine Learning (SPPU) 1.28 Introduction to Aland ML Comparison between Symbolic and Sub-symbolic Approaches -the Table 15.2 provides a quick comparison between symbolic and sub-symbolic approaches to AL Comparison Attribute ‘symbolic Sub-symbolic Based on symbols Numbers Relationship Logical Mathematical Computation Serial Parallel Data required Less. Alot Learning ‘Through domain experts | Machine-based ‘Training Difficult Easy Processing Speed Slow Fast Easy to explain Yes No 1.5.4(C) Statistical ‘The statistical approach to AL takes the sub-symbolc Al further and uses various statistical concepts such as mean, median, mode, distributions, standard deviations, variances, et. to derive machine learing rnodels based off which Al systems could work, You will learn several statistial models throughout the book, 4.5.5 Need of Alin Mechanical Engineering ‘This topic is covered in Unit 6. Please refer Section 68. 1.6 Basics ‘There is nothing specfct lean for subtopics mentioned inthe syllabus under this heading, You would learn about. several of these concepts and thet role in Aland ML throughout this book: You are good to go 4.7 Application of Machine Learning and Artificial intelligence. 4s There is hardly any area or domain today that does not use machine learning (or artificial inteligencs based on vrachine earning) in some or te other ways So, its hard to just talk about one or more applications of machine learning «But Jet me fist give you some perspective that would help you to understand what machine learning has made possible for human life, The fllowing snapshot from = MeKinsey’s| report available at intps/ /www.mekinsey. com Teatured-insights/artiiciaFinteligence/notesfrom-the-ai-frontier-appicatons-and: vucoflep-earting, shows that Artifical Iteligence (based on machine learning) and surrounding technologies could have a potential value of $95 trillion to $15.4 trillion annually! (Copyright No, -L-98904/2021) W Arcilnegence& Machine Learning (SPPU) 1.29 Introduction to Aland ML (O— Aerosnac and Detense Om. Art CS Scie ‘Tap anit 1s 0 cpp Pott tal anra vale of Al and anaes wens nde $9.5T - $15.4T CO Pramaceutcals and Medical Prodscts CO > Piste and sos Sactr O (O— mec an Enoranment O—otani cas Q-Hnsincare Sse and Sevees O— tees Qpsutcnaiion O—crencat Corer tata cots O- Hon ron 709} 600 509 Atiopact, £00 Reta (© Transport ane togitos Travel Hestncare systems ‘and eons ‘Sailion 6. OConsumer packaged goods Pubic and social sectors ‘OAutomotve and assembly 200) ‘Advanced elacranies! semiconductors © Banking ©msurance © Basic materials 200] Ober Tox eta ans einen Oda Ges [elecommunicatons © omen) 109] 5 ‘apeite Parmacourats ‘and ieaeel Aerospace are Dele Peas ‘OAaospace an a 2 30 rn 50 60 Shore of Al impact n totl impact derived from analytes, % Fig. 1.7.1 ‘+ _Ineach of these areas, Artifical intelligence can create value across the value chain in four ways as highlighted Inthe following snapshots from the report. (Copyright No. -L-98904/2021) aa ate SE _ Artificial Intelligence & Machine Learning (SPPU) 130 Introduction to Aland ML Machine leaming: (2 Nelural Language oslatity ‘Autonomous vohictes computervsion @ vgn C)Mecum — Low mar rovotes OF Viral agents a Projet | BS] | Promotes Enfgrioned | Operations wih | Products and and, higher erie atthe | Enriched, alored, ‘Batting | product. ower | right pice, wt he | "and convenient optcatie | forccasing and | cast anatatier | ight message, nd] user experince arenes | ‘sman sourcing | eficency | tote right targets Personalize ips cpa omana | aaa and suggestions Ancpnte cena | eas | Onin pcig. | ‘alert cima wio, | reapers, | "paeontzo” | asstnca wih petal czemznaand, | “gies” | prenoios, and | wi agri vonaing 0p%'] ‘maanding, | ter webate | sume nctoe sono ar |e aanrtent|cdplys neal tine| | checo, a and micospace complete laste Satvery by drones Eanance comand Trtorae sap ‘a supply |Opimize preventive selection, provider as [°Phmanancas | etme ning | “Sesion raliabiity of improve electricity with time-of-day | insight, automate: sete apes | ‘poacenned | amarante | cudaner soe utes genaraton asso, | ‘reduce energy | HH: mach | wan vituat agents, ‘Ind automate | waste, and prevent], PECTS | and talor usage to “mraade | ektryther | S°BEENSUMETS |" conus pone preferences: repponse Tow paae |remepe| pananasa | omnes tio wmorerae, |"iyneunc | Pesastees | ging wet tyra] acne asorty] 2S. | and te alos Hanutctring ar ine de oor | sevens, otmze |" erance assent and [it pot wor fe | panteance sessment lemraacemerra| | eeS | gee andi ‘requirements delivery time 7 ‘taining Auiomate and ~ i adapt naoies ang Pret seats | speaions; | Prodi costmare |e orudstons area; | ecart, ct on [patents sew | stupas end Hiwalth ca Famer preveriion diagnostic paints’ risk ‘agents to help wipereion | emanate | “axon | pase ravgnt thom tan ther oop imate poner va Personae Anica 0 | puget wchers jargon et caman, tne eee | “ently early “taconinuus | ‘of perlormance 10 Pstass cues, | a a help pradustes | SO snd opts Pagrionnar | are aman or Genghis |_| eames Education (Copyright No. - L-98904/2024} Ft Arca imoigence & Machine Learning (SPPU) Introduction to Aland ML 1.31 Enriched, talored, and convenient user Higher productivity and Products and services minimized maintenance at the right price, with Accurate demand forecasting, smart sourcing, and enlightened R&D 1-2% EBIT improvement using machine leaming to anticipate fruit and vegetable sales 20% stock reduction using deep learning to predict e-commerce purchases 2 milion fewer product ‘etumns per yeer Objective to eur 1 national electricity usage by using deep learning to predict power demand and supply Retail + Electric utilities in Manufac- + 10% yiold turing improvement for integrated-circut roduets using Al to Improve R&D process 39% IT staff reduction by using lt fully automate procurement processes, $300 billion possible savings in the Unites States using machine learning tocls for Population health forecasting £3.3 billion possible savings in the United Kingdom using Alto provide preventive care ‘and reduce nonelective hospital admissions Health care Education and repairs *# 30% reduction of stocking time using ‘autonomous vehicies in ‘warehouses + 20% energy production increase Using machine learning and smart sensors to ‘optimize assets’ yield 10-20% ESIT improvement by using machine learning to ‘enhance predictive maintenance, automate fault prediction, and increase capital productivity 30% increase of material delivery time using machine leaming to determine timing of ‘goods transfer 5% production yield improvement + 30-80% productivity improvement for ‘nurses supported by Al tools + Upto 2% GDP ‘savings for operational efficiencies in developed countries Virtual teaching assistants can answer 40% of students’ routine questions the right massage, the right targets + 50% improvement of assortment officiency + 4-6% salos increase Using geospatial modeling to improve mictomarket attractiveness 20% online sales increase by using dynamic pricing and personalization + 13% EIT improvement by using machine learning to predict sources of servicing revenues enc optimize sales efforts + 5-9% health expenditure reduction by using machine learning to tailor treatments and keep patients engages + 1% increase in enroliment by using @ virtual assistant to follow up with applicants ‘oxperience + $10-$30 savings on ‘monthly bills by using machine teaming to automaticaly suitch electricity supply deals 12% fuel savings for ‘manufacturers customers, afines. by using machine learning to optimize fight routes $2 villion-$10 trillion savings globally by tailoring drugs and treatments 0.2-1.3 additional years of average ite expectancy 85% match with human grading, using ‘machine leaening and Predictive modeling (Copyright No. -L-98904/2021) Besa ric eigen acne aaring SPRY) 22 Inwoductono Aland ML Machine learning personalizes ‘promotions to match shoppors’ profiles; in-store boacons send ‘offers to their smart phones as they browse ‘through the store Retailers can know more about what shoppers want-sometimes before shoppers themselves ‘Autonomous drones using deep leaming technology complete last-mile ‘dalivery, and are able to handle cbstacles of absent recipients Facial recognition provide directions shoppers’ fesyle profile software, machine ‘deep learning learning, and natural Identifies articles language enable bagged by shoppers: virtual agents to greet ‘adding data from you personally, sensors, A allows anicipate orders, and non-stop checkout and ‘automatic payment ior seeans ad Stas opiate ans tabletops enabled wih cpl poesia ret ‘computer vision and ‘tine, wit machine dsp tearing can only tearing leveraging ‘rele and recommend Gata on composts! Somelemarary peci's GF proeswoshe, and ceemea part SEES GEL iveron evelsto maximize revenues Al-enhanced robots ‘An autonomous ‘continuously track nopping at flows Fini sor, and inventory recognize ‘can find is way © rot has and yourvehicl or10@ reper ther her roel crne fo reba bags ne ome delvery eaehouse (Copyright No. -L-98904/2021) Wri etigence & Machine earning (SPU) Introduction to Al and ML. ‘Alcan make the smart grid smarter and reduce the need for utilities to add power plants Machine learning Drones and enabled forcasting Insect-sze robots anticipates supply ‘dently defects, ‘and demand peaks, precict fairs, and ‘and maximizes the inspect assets use of intermittent ‘without interrupting renewable power production ‘Sensors and machine learning allow for by-the-minute adjustments to maximize generation efficiency by adjusting fo changes in wind condtions, for example ‘Smart wires combine with machine learning to ‘enable realtime power ‘dispatching, and ‘optimize it to currant grid toad and to buildings: ‘and machine learning enable utiles to offer services based on ‘sages, weather and ‘other factors (Copyright No, -1-98904/2021) Wise SH Artifical intelligence & Machine Learning (SPPU) 1.34 Introduction to Aland ML ‘Autonomous agnestic devioes Using machine learning and aher Al tecnooogios ‘ean conduct simple medical tests ‘webout men assistance, rebeving ‘Mactine leéming ‘doctors and nurses of program analyzes patents health ‘emote via mobile evice, compares ito mesic records, and recommends a finoss outne or was of possible disease Alin health care; quicker diagnoses, better treatment plans, and improved health insurance Alpowsred agnostic tools dont dsoase faster and with greater accuracy, ‘using historical ‘medical data and patient records Alagorthms optimize hospital operation taing sched, and inventory by using medical and environmental factors te forecast pation behavioe and ‘ésease probebilies the form of Interactive Kiosks, register patients ‘nd tefer thom ‘appropriate. - doctors, improving their experience = ‘and reducing ‘walng tio treatment plans ‘designed by machine leaning to's improve ‘thorapy etisncy by taoring Meaiment fo speci patients! needs and medical Alisights from population health analyses give payers ‘an opportunity reduce ‘hospitalization and ‘realmont costs by encouraging cre providers fo manage. patients wanes (Copyright No. -L-98904/2021) W Artifical ntetignce & Machin earing SPPU) 1.35 Introduction to Aland Mt + Alright, so you got some perspective now, isn’t it? That is what machine learning can potentially do! + Today, machine learning serves various purposes. However, some of the most common application of machine learning areas following. ‘Major Applications of Machine Learning 1. Medical advancement 2. E-commerce and marketing 3. Financial Sector 4, Weather Pattorns 5. Media and Entertainment Fig. 1.7.2 : Applications of Machine Learning 1. Medical advancement : ate used for research and operations in various areas such as. he healthcare industry is one ofthe biggest proponents of machine learning, Large datasets (2) Life threatening diseases such as cancer and AIDS (©) Reduce healtheare costs (©) Find cure for new epidemics and viruses (@)_ Predict the diseases, such as Alzheimer’s, in early stages to improve chances of survival (©) Genetic sequencing to predict genes-based diseases Machine learning provides a way to look at various patients data and build models around which new study and research can be carried out, 2 ECommerce and Marketing: Identifying the next product or marketing effort that would yield results is a great way ‘or companies to build adequate strategies. Consumer patterns, their likes, their purchase potential, time ofthe year when the users buy, market trends, global economy et. are all accounted to come out with the new products and services, The consumer's data is analysed to identify patterns and make predictions about the future demands. For sample, if you are shopping early stage pregnancy products, i is likely that you would purchase infant's products soon, Companies can use this information to push new schemes and offers to you and also equip their warehouses to ship baby products faster to you by placing the inventory closer to you, 3+ Financial Sector : Machine learning is also heavily used in the financal sector. Some ofthe common applications of ‘machine learning in financial sector are {@) _ Detecting frauds. For example, ifyou made a physical transaction in Delhi and then another physical transaction in ‘Mumbal within one hour, iti highly unlikely that you could have made both the transactions. (©) Credit score analysis based on your past reputation to pay dues (©) Quoting customised insurance premiums based on your lifestyle (2) Selling new products and services to you. For example, ifyou have surplus balance in your account throughout the year, then you could be sold investment products (Copyright No, --98904/2021) eran Wri nteigence & Machine Learning (SPPU) 1:36, Introduction to Aland ML part from these, the financial sector also uses machine learning for detecting money laundering, shell companies, Fraudulant transactions, and reporting. Based on the transactions carried out By individuals, the companies can bulld financial health profil of ts consumers and identify future spend patterns and requirements 4c. Weather Patterns : Machine learning i crucial for detecting changes In the weather patterns. You would have heard about (2) The rising ocean temperatures (b) Global warming (Melting glaciers in Antarctica (@)_ Reducing Oxygen level ‘there is a huge amount of data that can be used to predict the weather changes and report how itis affecting our environment It an be used to predict weather forecast, natural disasters and any other changes that could affect our well-being s. Media and Entertainment: The media and entertainment InAUSHY US rachine learning to understand viewing and king patterns forthe media content. Based on the time ofthe day, season, device you are on, your personal interests and ta, the content ean automaticaly be recommended for you sta rure YS would have seen YouTube vecommendations as you watch YouTube videos. Similarly, companies ike spotty can automatically create curated and customised playlists for you based on your listening profile, Review Questions rere area few review questions to help you gauge your understanding of this chapter. Try to attempt these questions and ensure that you can recall the points mentioned in the chapter. TA] Introduction to Machine Learning 0.1 _ Define Machine Learning. Explain the coo idea behind machine learing (4 Marks) 0.2. Howare experiance, performance, and tasks related wih respect 10 machine learning. [4 Marks) @.3 Wiha block diagram, explain how machine Fearing works at a high-level [6 Marks} 0.4. Explain the key tems associated with machine learning {4 Marks) @.5 Explain the various Big Data formats. [6 Marks} @.6 Explain he various traning data formats that could be use for machine leaming [6 Marks} @.7 With exainples, explain the types of Big Data formats. 16 Marks} 10.8 Compare vaious Data Fomats that could be used in mache learning, [6 Marks} @.9 Compare structured, somi-stuctured and unstructured deta. [6 Marks) @.10. Write a short note on DIKW pyramid. [6 Marks} @.44 Draw DIKW pyramid and explain. [6 Marks} (Copyright No, ~L-98904/2021) paket WF _Ariticial inetgence & Machine Learning (SPPU) 1.37 Introduction to Aland ML @ 12 Explain the journey of data enrichment as it moves from hindsight to foresight . 13. Describe the various categories of data analytics. . 14 What tyes of analytics can be performed on a dataset? 15 Write a short note on Descriptive Analytics, 16 Write a short note on Diagnostic Analytics, . 17 White a short note on Predictive Analytics, |. 18 Write a short note on Prescriptive Analytics. 1 19 Compare the various categories of data analytics. . 20 Explain the various types of machine leaming algorithms. 21. Explain supervised learning. 22 Explain unsupervised learning. 28 Explain reinforcement learning 24 Compare supervised and unsupervised learning, - 25 Joo contused on vtich machine leaming algorithm to choose for his project What do you tel hi? 26 Compare ArticialIntoligence and Machine Learning 27. Compare Data Mining and Machine Learning . 28 Explain common issues in machine learning, . 28. Explain common challenges in machine learning, such a model? . 31, Machine learning models could be biased. Comment. . 82 Describe the various steps involved in developing a machine learning application, 138. You could collect training dataset internally or extemally. Comment, {B] Introduction to Artificial intelligence a a 34 Write a short note on artifical Intetigence, 385 Explain the types of Al [6 Marks} [6 Marks} (6 Marks} (4 Marks} (4 Marks} [a Marks} (4 Marks} [6 Marks} [6 Marks} (4 Marks] (8 Marks} [4 Mares} (4 Marks} [4 Marks} [4 Marks] {4 Marks} [6 Maris} [6 Marks) + $0 John is suggesting that he uit @ machine learning model using a taining dataset of 10 examples, Would you trust [4 Marks) [4 Marks] [6 Marks} [4 Marks] [4 Marks) [4 Marks] (Copyright No, -L-98904/2021} ee

You might also like