Introduction to Machine Learning in the Cloud with Python: Concepts and Practices
By Pramod Gupta and Naresh K. Sehgal
About this ebook
This book provides an introduction to machine learning and cloud computing, both at a conceptual level and in terms of their use on the underlying infrastructure. The authors emphasize fundamentals and best practices for using AI and ML in a dynamic infrastructure with cloud computing and high security, preparing readers to select and apply the appropriate techniques. Important topics are demonstrated using real applications and case studies.
Part I: Concepts
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2021
P. Gupta, N. K. Sehgal, Introduction to Machine Learning in the Cloud with Python, https://doi.org/10.1007/978-3-030-71270-9_1
1. Machine Learning Concepts
Pramod Gupta¹ and Naresh K. Sehgal²
(1)
NovaSignal, San Jose, CA, USA
(2)
NovaSignal, Santa Clara, CA, USA
Keywords
Machine learning · Supervised learning · Unsupervised learning · Reinforcement learning · Prediction · Classification · Clustering · Regression
Over the last decade, machine learning (ML) has been at the core of our journey toward achieving larger goals in artificial intelligence (AI). It is one of the most influential and important technologies of the present time. ML is considered an application of AI, based on an idea that given sufficient data, machines can learn the necessary operational rules. It impacts every sphere of human life as new AI-based solutions are being developed.
Recently, machine learning has given us practical speech recognition, effective web search, and a vastly improved understanding of the human genome. Machine learning is so pervasive today that one probably uses it dozens of times daily without realizing it. There is no doubt that ML will continue to make headlines in the foreseeable future; it has the potential to improve as more data, more powerful hardware, and newer algorithms continue to emerge. As we progress through the book, we will see that ML has a lot of benefits to offer.
The rate of development and the complexity of the field make it difficult even for experts to keep up with new techniques, so it can be overwhelming for beginners. This provided sufficient motivation for us to write this text, which offers a conceptual-level understanding of machine learning and the current state of affairs.
1.1 Terminology
Dataset: The starting point in ML is a dataset, which contains the measured or collected data values represented as numbers or text: a set of examples with the important features describing the behavior of the problem to be solved. There is one important nuance, though: if the given data is noisy, or has a low signal-to-noise ratio, then even the best algorithm will not help. This is sometimes referred to as garbage in, garbage out. Thus, we should try to build the dataset as accurately as possible.
Features/attributes: These are also referred to as parameters or variables. Examples include a car's mileage, a user's gender, and a word's frequency in text; in other words, they are the properties or information contained in the dataset that help to better understand the problem. These features are the factors for a machine to consider, and they are used as input variables in machine learning algorithms to learn, infer, and take an intelligent action. When the data is stored in tables, it is simple to understand, with features as column names. Selecting the right set of features is very important and will be considered in a later chapter. It is the most important part of a machine learning project's process and usually takes much longer than all other ML steps.
Training data: An ML model is built using the training data. This is data that has been validated and includes the desired output. The outputs or results are generally referred to as labels. The labeled training data helps an ML model to identify the key trends and patterns essential to predicting the output later on.
Testing data: After the model is trained, it must be tested to evaluate its accuracy. This is done with the testing data, where the ML-generated output is compared to the desired output; if they match, the tests have passed. It is important for both the training and testing datasets to resemble the situations that the ML algorithm will encounter later in the field. Think of a self-driving car's ML model that has never seen a stop sign during its training phase: it will not know how to react when one appears on an actual drive later on.
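The split between training and testing data can be sketched with scikit-learn; the toy arrays and the 30% test fraction below are illustrative choices, not from the book:

```python
import numpy as np
from sklearn.model_selection import train_test_split

# A toy dataset: 10 samples with 2 features each, and a label per sample.
X = np.arange(20).reshape(10, 2)
y = np.array([0, 1, 0, 1, 0, 1, 0, 1, 0, 1])

# Hold out 30% of the data for testing; the model never sees it while training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42
)
print(len(X_train), len(X_test))  # 7 3
```

Both portions are drawn from the same dataset, so they resemble the situations the trained model will face later.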
Model: There are many ways to solve a given problem. The basic idea is building a mathematical representation that captures relationships between the input and output. In other words, it is a mapping function from input to output. This is achieved by a process known as training. For example, logistic regression algorithm may be trained to produce a logistic regression model. The method one chooses will affect the precision, performance, and complexity of the ML model.
To sum up, an ML process begins by inputting lots of data to a computer, then by using this data, the computer/machine is trained to reveal the hidden patterns and offer insights. These insights are then used to build an ML model, by using one or more algorithms to solve other instances of the same problem.
Let us take an example on the following dataset:
In the above example, there are five features (i.e., Outlook, Temperature, Humidity, Windy, and Class) and nine observations or rows. Class is the target or desired output (i.e., play or no play), which the ML algorithm must learn and then predict for unseen new data. This is a typical classification problem. We will discuss the various tasks performed by ML later in this book.
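The structure of such a dataset can be sketched in pandas; the rows below are hypothetical stand-ins, not the book's actual nine observations:

```python
import pandas as pd

# Hypothetical weather observations with the five columns named in the text.
data = pd.DataFrame({
    "Outlook":     ["Sunny", "Overcast", "Rainy", "Sunny", "Rainy"],
    "Temperature": ["Hot", "Mild", "Cool", "Mild", "Mild"],
    "Humidity":    ["High", "Normal", "Normal", "High", "High"],
    "Windy":       [False, False, True, True, False],
    "Class":       ["No Play", "Play", "No Play", "No Play", "Play"],
})

X = data.drop(columns=["Class"])  # the input features
y = data["Class"]                 # the target the algorithm must learn to predict
print(X.shape, y.unique().tolist())
```

Separating the Class column from the feature columns mirrors the split between inputs and target that a classification algorithm expects.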
1.2 What Is Machine Learning?
To demystify machine learning, and to offer a learning opportunity for those who are new to this domain, we will start by exploring the basics of machine learning and the process involved in developing a machine learning model. Machine learning is about building programs with tunable parameters, which are adjusted automatically. The goal is to improve the behavior of an ML model by adapting to previously seen data.
Machine learning is a subfield of artificial intelligence (AI). ML algorithms are the building blocks to make computers learn and act intelligently by generalizing, rather than just storing and retrieving data items like a database system.
While the field of machine learning has been widely explored only recently, the term was first coined in 1959 [1], and most foundational research was done through the 1970s and 1980s. The popularity of machine learning today can be attributed to the availability of vast amounts of data, faster computers, efficient data storage, and the evolution of newer algorithms.
At a higher level, machine learning (ML) is the ability of a system to adapt to new data. The learning process advances through iterations, offering a better quality of response. Applications can learn from previous computations and transactions, using pattern recognition to produce reliable and better-informed results.
Arthur Samuel, a pioneer in the field of artificial intelligence, coined the term machine learning in 1959 while at IBM [1]. He defined machine learning as the "field of study that gives computers the capability to learn without being explicitly programmed."
In layman's words, machine learning (ML) can be explained as automating and improving the learning process of computers based on experiences, without explicit programming. The basic process starts with feeding data to an algorithm to train the computer (machine) and build ML models. The choice of algorithm depends upon the nature of the task; machine learning algorithms can perform various tasks using methods such as classification and regression.
Machine learning algorithms can identify patterns in the given data and build models that capture relationships between input and output. This is useful to predict outcome for a new set of inputs without explicit pre-programed rules or models.
1.2.1 Mitchell’s Notion of Machine Learning
Another widely accepted definition of machine learning was proposed by the computer scientist Tom M. Mitchell [2]. His definition states that a machine is said to learn if it is able to take experience and utilize it such that its performance improves upon similar experiences in the future.
His definition says little about how machine learning techniques actually learn to transform data into actionable knowledge.
Machine learning also involves the study of algorithms that improve at a defined category of tasks by optimizing a performance criterion using past experiences. ML uses data and past experiences to realize a given goal or performance criterion.
The most desirable property of a machine learning algorithm is generalization, i.e., the model should perform well on new or unseen data. The real aim of learning is to do well on test data that was not seen during learning or training. The objective of machine learning is to model the true regularities in the data and to ignore its noise.
1.3 What Does Learning Mean for a Computer?
A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks T, as measured by P, improves with experience E. It can be used as a design tool to help us think about which data to collect (E), what decisions software needs to make (T), and how to evaluate its results (P).
Example: playing tennis
E = the experience of playing many games of tennis
T = the task of playing tennis
P = the probability that the program will win the next game
1.4 Difference Between ML and Traditional Programming
Traditional programming: Feed in data and a program (logic), run it on a machine, and get the output.
Machine learning: Feed in data and its corresponding observed output, and run it on a machine during the learning (training) phase. The machine then generates its own logic, which can be evaluated during the testing phase, as shown in Fig. 1.1.
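The contrast can be sketched in a few lines of Python; the doubling rule here is an invented toy example:

```python
import numpy as np

# Traditional programming: the logic (double the input) is written by hand.
def handwritten_program(x):
    return 2 * x

# Machine learning: the machine derives the same logic from input/output pairs.
X = np.array([1.0, 2.0, 3.0, 4.0])   # inputs
y = np.array([2.0, 4.0, 6.0, 8.0])   # corresponding observed outputs
w = (X @ y) / (X @ X)                # least-squares fit of the model y = w * x

def learned_program(x):
    return w * x

print(handwritten_program(5.0), learned_program(5.0))  # 10.0 10.0
```

In the first case the programmer supplies the logic; in the second, the machine infers the parameter w from examples and can then be evaluated on inputs it has not seen.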
Fig. 1.1 Basic differences between traditional programming and machine learning
1.5 How Do Machines Learn?
Regardless of whether the learner is a human or a machine, the basic learning process is similar to that shown in Fig. 1.2. It can be divided into three components:
Data input: It comprises observations, memory storage, and recall to provide a factual basis for further reasoning.
Abstraction: It involves interpretation of data into broader representations.
Generalization: It uses abstracted data to form a basis for insight and taking an intelligent action.
Fig. 1.2 Basic learning process
1.6 Steps to Apply ML
The machine learning process involves building a predictive model that can be used to find a solution for a given problem. The following steps are used in developing an ML model, as shown in Fig. 1.3.
1.
Problem definition: This is an important phase, as the choice of the machine learning algorithm/model depends on the problem to be solved; for example, it may call for classification or regression. The problem can be defined only after the system has been studied well, with the aim of understanding the principles of its behavior in order to make predictions or informed choices. Both the definition step and the corresponding documentation (deliverables) of the scientific or business problem are important to focus the analysis on getting results.
2.
Data collection/data extraction: The next requirement for a machine learning model is a dataset. This step is the most important and forms the foundation of the learning. The predictive power of a model depends not only on the quality of the modeling technique but also on the ability to choose a good dataset upon which to build the model. The search for data, its extraction, and its subsequent preparation therefore deserve care because of their importance to the success of the results. Input data must be chosen with the basic purpose of building a predictive model, and its selection is crucial for the success of the analysis. A poor choice of data, or analysis of a dataset that is not representative of the system, will lead to models that deviate from the system under study. Better variety, density, and volume of relevant data result in better learning prospects for the machine.
3.
Prepare the data: Once the data has been selected and collected, the next stage is to make sure that it is in the proper format and of good quality. As mentioned earlier, the quality of the data determines the predictive power of machine learning algorithms. One needs to spend time assessing data quality and then take steps to fix issues such as missing data, inconsistent values, and outliers. Exploratory analysis is one method to study the nuances of the data in detail, thereby enriching its relevant content.
4.
Train the algorithm: By the time the data has been prepared for analysis, one is likely to have a sense of what one hopes to learn from it. The specific machine learning task leads to the selection of an appropriate algorithm, which represents the data in the form of a model. The cleaned-up data is split into two parts, train and test: the first part (training data) is used for developing the model, and the second part (test data) is used as a reference. The proportion of the split depends on prerequisites such as the number of input variables and the complexity of the model.
5.
Test the algorithm: Each machine learning model results in a biased solution to the learning problem, so it is important to evaluate how well the algorithm has learned. Depending on the type of model used, one can evaluate the accuracy of the model using a test dataset, or one may need to develop measures of performance specific to the intended application. To test the performance of the model, the second part of the data (the test data) is used. This step determines the precision of the chosen algorithm with respect to the desired outcome.
6.
Improving the performance: A better test of a model's performance is to observe it on data that was not used during model building. If better performance is needed, more advanced strategies must be utilized to augment the performance of the model. This step may involve choosing a different model altogether or introducing more variables to improve the accuracy. Hence, a significant amount of time needs to be spent in data collection and preparation. One may need to supplement with additional data or perform additional preparatory work, as described in step 2 of this process.
7.
Deployment: After the above steps are completed, if the model appears to be performing satisfactorily, it can be deployed for the intended task. The successes and failures of a deployed model might even provide additional data for the next generation of model.
Fig. 1.3 Machine learning process
The above steps 1–7 are used iteratively during the development of an algorithm.
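Steps 2 through 5 above can be sketched with scikit-learn; the Iris dataset and the decision-tree model are stand-ins for whatever data and algorithm a real project would use:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# Steps 2-3: obtain and prepare the data (Iris arrives already clean).
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Step 4: train the algorithm on the training portion only.
model = DecisionTreeClassifier(random_state=0)
model.fit(X_train, y_train)

# Step 5: test the algorithm on data it has never seen.
accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"test accuracy: {accuracy:.2f}")
```

Steps 6 and 7 would follow from here: if the test accuracy is unsatisfactory, one returns to data preparation or tries a different model; otherwise the trained model is deployed.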
1.7 Paradigms of Learning
Computers learn from data in many different ways, depending upon what we are trying to accomplish. The No Free Lunch Theorem is famous in machine learning: it states that no single algorithm works well for all problems. Each problem has its own characteristics and properties, and there are many algorithms and approaches to suit each problem and its individual quirks. Broadly speaking, there are three types of learning paradigms:
Supervised learning
Unsupervised learning
Reinforcement learning
Each form of machine learning has differing approaches, but they all follow an underlying iterative process and comparison of actual vs. desired output, as shown in Fig. 1.4.
Fig. 1.4 Three types of learning paradigms
1.7.1 Supervised Machine Learning
Supervised learning, as shown in Fig. 1.5, is the most popular paradigm for machine learning. It is very similar to teaching a child with flash cards: if you are learning a task under supervision, someone is judging whether you are getting the right answers. Similarly, supervised learning means having a fully labeled dataset while training an algorithm, where each observation is tagged with the answer the algorithm should learn. In other words, input is mapped to output using labeled input–output pairs. Since the expected response is known, the model is trained with a teacher, and it is imperative to provide both inputs and outputs to the computer for it to learn from the data. The computer generates a function based on the data that can be used for prediction on unseen data. Once trained, the model can observe a new, never-seen-before example and predict an outcome for it; it no longer expects the target and instead predicts the most likely outcome from a new set of observations. The solution can use classification or regression, depending on the type of the target.
Fig. 1.5 A supervised learning model
Depending upon the nature of the target, supervised learning can be useful for classification as well as regression type of problems.
If target y takes values in a fixed set of categorical outcomes (e.g., male/female, true/false), the task of predicting y is called classification.
If target y takes continuous values (e.g., a price or a temperature), the task of predicting y is called regression.
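A minimal illustration of how the target's type determines the task; the rule of thumb coded here is a simplification for illustration:

```python
import numpy as np

y_gender = np.array(["male", "female", "female", "male"])  # fixed set of categories
y_price = np.array([199.5, 240.0, 315.25, 180.0])          # continuous values

def task_for(y):
    """Rough heuristic: numeric targets suggest regression, otherwise classification."""
    return "regression" if np.issubdtype(y.dtype, np.number) else "classification"

print(task_for(y_gender), task_for(y_price))  # classification regression
```

In practice the decision also depends on meaning, not just dtype (e.g., integer class codes are still categorical), but the target's nature is the deciding factor either way.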
1.7.2 Unsupervised Machine Learning
Unsupervised learning is the opposite of supervised learning: it uses no labels. Instead, the machine is provided with just the inputs to develop a model, as shown in Fig. 1.6. It is a learning method without a target/response. The machine learns through observations and finds structures in the data. Here the task of the machine is to group unsorted information according to similarities, patterns, and differences without any prior training. Unlike supervised training, no teacher is provided, which means no training answers are given to the machine. The machine is therefore restricted to finding hidden patterns in unlabeled data; an example would be customer segmentation, or clustering. What makes unsupervised learning an interesting area is that an overwhelming majority of the data in our world is unlabeled. Intelligent algorithms that can take terabytes of unlabeled data and make sense of it are a huge source of potential profit in many industries. This is still a largely unexplored field of machine learning, and many big technology companies are currently researching it.
Fig. 1.6 An unsupervised learning model
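A small sketch of unsupervised learning with k-means clustering; the points and the choice of two clusters are illustrative:

```python
import numpy as np
from sklearn.cluster import KMeans

# Six unlabeled points forming two well-separated groups; no answers are given.
X = np.array([[1.0, 1.1], [1.2, 0.9], [0.9, 1.0],
              [8.0, 8.2], [8.1, 7.9], [7.9, 8.0]])

# The algorithm groups the points purely by similarity.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_)  # the first three points share one label, the last three the other
```

Note that the algorithm only discovers groupings; deciding what each group means is still left to the analyst.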
1.7.3 Reinforcement Machine Learning
Reinforcement learning allows a machine to automatically determine the ideal behavior within a specific context, in order to maximize its performance. Reinforcement learning can be viewed as learning from mistakes, as shown in Fig. 1.7: over time, the learning algorithm learns to make fewer mistakes than it used to. It is very behavior driven.
Fig. 1.7 Reinforcement learning
This learning paradigm is like a dog trainer teaching a dog to respond to specific signs, such as catching a ball or jumping. Whenever the dog responds correctly, the trainer gives it a reward, such as a bone or a biscuit.
Reinforcement learning is said to be the hope of artificial intelligence because of its immense potential for many complex real-life problems, such as self-driving cars.
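The reward-driven loop behind the dog-trainer analogy can be sketched as a tiny two-action learner; the action names, reward probabilities, and learning rate below are all invented for illustration:

```python
import random

random.seed(0)

reward_prob = {"sit": 0.2, "catch": 0.9}   # "catch" is the trick that earns treats
values = {"sit": 0.0, "catch": 0.0}        # the learner's estimate of each action
alpha = 0.1                                # learning rate

for _ in range(500):
    # Mostly exploit the best-known action, but explore 10% of the time.
    if random.random() < 0.1:
        action = random.choice(list(reward_prob))
    else:
        action = max(values, key=values.get)
    reward = 1.0 if random.random() < reward_prob[action] else 0.0
    values[action] += alpha * (reward - values[action])  # learn from the outcome

# After enough trials, the frequently rewarded action is valued more highly.
print(values["catch"] > values["sit"])
```

No labeled answers are ever provided; the learner shapes its behavior purely from the rewards that follow its own actions, which is the defining trait of this paradigm.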
1.7.3.1 Types of Problems in Machine Learning
As depicted in Fig. 1.8, there are three main types of problems that can be solved using machine learning:
Fig. 1.8 Types of problems solved using machine learning
Classification problem: Classification is the process of predicting the class of given data points. Classification predictive modeling is the task of approximating a mapping function from input variables to discrete output variables, e.g., spam detection in emails or credit card fraud detection. In these cases, we try to draw a boundary between the different classes, as shown in Fig. 1.9. A classifier uses training data to understand how the given input variables relate to the class. The problem may be bi-class (e.g., is an incoming mail spam or non-spam?) or multi-class (e.g., the health status of a patient). Other examples of classification problems include speech recognition, fraud detection, and document classification. The various ML algorithms for classification will be discussed later.
Fig. 1.9 Classification using machine learning
Regression problem: Regression is the task of predicting the value of a continuously varying variable (e.g., the sale price of a house or the height of a tree) given some input variables (aka the predictors, features, or regressors). A continuous output variable is a real value, such as an integer or floating-point value, often a quantity such as an amount or a size. Regression tries to model the data distribution with the best line/hyperplane that goes through the points, as shown in Fig. 1.10. Regression is based on a hypothesis that can be linear, polynomial, nonlinear, etc.; the hypothesis is a function of some hidden parameters and the input values.
Fig. 1.10 Regression using machine learning
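A minimal regression sketch: fitting a line to illustrative house-size/price pairs (the numbers are invented and deliberately exactly linear, so the fit is perfect):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Illustrative data: price (in thousands) is exactly 3x the size in square meters.
size = np.array([[50], [70], [90], [110], [130]])
price = np.array([150, 210, 270, 330, 390])

model = LinearRegression().fit(size, price)

# The fitted line generalizes to sizes not seen during training.
prediction = model.predict(np.array([[100]]))[0]
print(round(float(prediction)))  # 300
```

Real data is noisy, so the fitted line would only approximate the points; the hypothesis here is linear, but polynomial or nonlinear hypotheses follow the same pattern.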
Clustering: This type of problem involves assigning the input to two or more clusters based on similarity, as shown in Fig. 1.11; for example, clustering customers into similar groups based on their spending habits, age, geography, items they buy, etc. This is unsupervised learning, as no target is available in advance.
Fig. 1.11 Clustering
Figure 1.12 sums up the differences between regression, classification, and clustering.
Fig. 1.12 Regression vs. classification vs. clustering
1.8 Machine Learning in Practice
Machine learning algorithms are only a small part of what a data analyst or data scientist does in practice. In reality, the process often looks like this:
Start loop
Understand the domain, prior knowledge, and goals. Start by talking to the domain experts. Often the goals are unclear, and one may have to try multiple approaches before starting to implement.
Data integration, selection, cleaning, and pre-processing. This is often the most time-consuming part. It is important to have high-quality data. The more data one has, the more work may be required, because the data can be noisy; remember GIGO (garbage in, garbage out).
Learning models. This is an exciting phase with availability of many tools to experiment with.
Interpreting results. Sometimes it does not matter how a model works, as long as it delivers results. Other domains require that the model be understandable, so we need to be prepared to be challenged by the experts.
Consolidating and deploying discovered knowledge. Many projects that succeed in the lab are never used in practice. In such cases, compare the model's output with the desired results.
End loop
Clearly this is not a one-shot process but an iterative cycle. Learning happens by observing the gaps between actual and desired results. The loop iterates until we obtain a model and results that can be used in practice. Incoming data may also change, requiring a new loop.
1.9 Why Use Machine Learning?
It is important to remember that machine learning (ML) does not offer solutions to every type of problem at hand. There are