Professional Documents
Culture Documents
COURSE DESCRIPTION
This course will cover artificial intelligence (AI) and in particular machine learning (ML), with a focus on
applications in business decisions and processes. The course will look in-depth at the two major types of
ML: supervised learning (including classification and deep learning) and unsupervised learning (including
clustering, association rule learning, and recommender systems). The course will survey key
technologies and applications that are driving the ML revolution. The course will provide students with a
robust theoretical understanding of popular models and techniques, but the course will be application-
focused. The overall goal of the course is to provide a foundation and framework for understanding
how, when, and why to use AI and ML models in data-driven decision-making.
The course will also cover practical topics in the machine learning process, including feature
engineering, feature selection, feature normalization, model evaluation, parameter tuning, model
selection, and ensemble building.
On completion of this course, students will be able to frame various classes of business problems as AI
and ML problems. Students will understand which AI and ML model to use for a given problem, how to
use the model, how to evaluate the model, and how to deploy the model.
The course will include a team project that will provide an opportunity to apply various AI and ML
models to a real-world business dataset.
INSTRUCTOR
Stephen W. Thomas, PhD
TEACHING ASSISTANT
Cecilia Ying, MMA, MSc, PhD (in progress). y.ying@queensu.ca
Page 1 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
REFERENCES
Textbooks
Geron, Aurelien. “Hands-on Machine Learning with Scikit-Learn & Tensorflow.” 2017. O’Reilly
Media. ISBN: 9781491962299.
o GitHub resources: github.com/ageron/handson-ml
Kuhn, Max and Johson, Kjell. "Applied Predictive Modeling." 2nd Edition, 2018. Springer. ISBN-
13: 978-1461468486.
o Resources: appliedpredictivemodeling.com
We will also draw from the following free resources, all available on the course portal:
James, Gareth and Witten, Daniela and Hastie, Trevor and Tibshirani, Robert. "An Introduction
to Statistical Learning with Applications in R." 7th Printing. Springer. www.statlearning.com
Burkov, Andriy. “Machine Learning Engineering.” September 2020.
www.mlebook.com/wiki/doku.php
VanderPlas, Jake. “Python data science handbook: Essential tools for working with data.”
O'Reilly Media, Inc.", 2016. jakevdp.github.io/PythonDataScienceHandbook/index.html
Molnar, Christoph. “Interpretable Machine Learning.” April 26, 2021.
christophm.github.io/interpretable-ml-book
Course Portal
The course portal contains supplementary readings, lecture notes, and interactive material for each
course session. I highly recommend you visit and review the material before each course session.
EVALUATION
Item Value GMMA 2022 Due* MMA 2022S Due* MMAI 2022 Due*
Team Project 50% Nov 12, 2021 Nov 30, 2021 Dec 12, 2021
Assignment 50% Nov 15, 2021 Dec 12, 2021 Dec 26, 2021
*by 11:59 pm Eastern Time on the date indicated.
TEAM PROJECT
Please see the separate file: Project Brief – DrivenData.docx.
THE ML CUP
There can only be one winner. Will it be you?
The ML Cup is an individual competition held throughout the course. There will be multiple events in
which students can earn points. The student with the highest number of total points at the end of the
course will win the cup. The winner will receive a prize and, more importantly, life-long bragging rights.
Page 2 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
I will not reveal any details about the events until 24 hours before they occur. What I will say is this:
come to each event having prepared for that week’s course material. Be nimble, be quick, and jump
over that candlestick!
Page 3 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
COURSE SESSIONS
Page 4 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Session 2: Classification
In this session, we will learn all about classification, a type of supervised machine learning. We will learn
about the different algorithms for building classification models: how they work under the hood, what
their strengths and weakness are, and when to use which.
Naïve Bayes: using Lecture slides Kuhn: Ch 13.6 StatQuest: Nave Bayes
Bayes’ Theorem Lecture notes James: Ch 4.4 3Blue1Brown: Bayes Theorem
Pre-recorded lecture VanderPlas: Ch 5.5
video
GitHub tutorial
Neural Networks: a Lecture slides Geron: Ch 10, 11 StatQuest: Neural Networks Part 1
powerful black box Lecture notes Kuhn: Ch 13.2 StatQuest: NN Part 2
that simulates the Pre-recorded lecture (Backpropagation)
human brain video StatQuest: NN Part 3 (ReLU)
GitHub tutorial StatQuest: NN Part 4 (Inputs and
Outputs)
StatQuest: NN Part 5 (ArgMax and
SoftMax)
StatQuest: NN Part 6 (X Entropy)
LL: Artificial Intelligence
Page 5 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Page 6 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Data leakage: target Lecture slides Burkov: Ch 3.2.8, 3.5 StatQuest Q&A
leakage, train-test Lecture notes Data School: fit vs.
contamination Pre-recorded lecture transform
video Data School: fit_transform
vs transform
Scikit-learn: Common
pitfalls
Page 7 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Pre-recorded lecture
video
Page 8 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
video
GitHub tutorial
Page 9 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Page 10 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Page 11 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
OTHER RESOURCES
I have created a list of notebooks and datasets that students may find helpful:
github.com/stepthom/869_course
I also curate a list of ML books, articles, blogs, data sets, tools, and more:
github.com/stepthom/data_mining_resources
Data Cleaning
Data School
Page 12 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Dimensionality Reduction
StatQuest: PCA, Step-by-Step
DataCamp: Principal Component Analysis (PCA) in Python
IEEE: An Analysis of Dimensionality Reduction Techniques on Big Data
scikit-learn: Decomposing signals in components
Page 13 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
COURSE POLICIES
Late Work
There will be a 5% penalty per day for late work.
Extensions
Deadline extensions will only be given for extenuating circumstances.
Rounding
I round marks to the nearest whole number. I.e., 89.49999 gets rounded to 89; 89.50000 gets rounded
to 90.
Appealing Marks
You may appeal a mark if you believe an error has occurred. To appeal, please do the following.
1. Wait until at least THREE DAYS AFTER you receive your mark to submit your appeal. (Cool down
period.)
2. Write a brief email/memo as follows:
a. Clearly state that you want to appeal a mark.
b. For each question/portion you would like to appeal, describe what you believe the error
was and how you recommend it should have been marked.
3. You must submit your appeal within ONE WEEK of receiving your initial mark.
Please note that if I re-mark your assignment, I reserve the right to decrease your mark if I feel that the
initial mark was too high.
PRIVACY
All students are expected to respect the privacy of others. Capturing of still and moving images and
audio of other individuals without their express consent is a violation of privacy. Similarly, recordings
and images taken of teaching materials and/or the instructor without permission is a violation of privacy
and fails to recognize that instructors have intellectual property rights to their own teaching materials.
The teaching and learning environment should be a safe space in which all participants are expected to
uphold the values of respect, dignity and trust.
Taking and Using Images with Consent | Records Management and Privacy Office (queensu.ca)
Privacy and Remote Teaching and Learning | Records Management and Privacy Office (queensu.ca)
Page 14 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
ACADEMIC INTEGRITY
Students are responsible for familiarizing themselves with, and adhering to, the regulations concerning
academic integrity. General information on academic integrity is available at Academic Integrity @
Queen's University; an overview of Smith's own policies and procedures are also important to review.
You may also find these frequently asked questions on academic integrity helpful for your understanding
of the concept and the regulations surrounding it. Departures from academic integrity include, but are
not limited to, plagiarism, use of unauthorized materials, facilitation, forgery, and falsification. Actions
which contravene the academic integrity regulations carry sanctions that can range from a warning, to
loss of grades on an assignment, to failure of a course, to requirement to withdraw from the university.
Individual Work
I will clearly indicate when students can consult with one another or with experts or resources.
Otherwise, you are required to develop an original response to the assigned topic. Assignments and
examinations identified as individual in nature must be the result of the student's individual effort.
Individuals must not look at, access or discuss any aspect of anyone else's solution (including a student
from a previous year), nor allow anyone else to look at any aspect of their own solution. Likewise,
students are prohibited from utilizing the internet or any other means to access others' solutions to, or
discussions of, the assigned material. If the assignment requires outside research, all sources must be
properly cited and referenced; be careful to cite all material, not only of direct quotations but also of
ideas. Help for citing sources is available through the Queen's University library:
http://library.queensu.ca/help-services/citing-sources.
Group Work
I will clearly indicate when groups may consult with one another or with other experts or resources.
Otherwise, in a group assignment, the group members will work together to develop an original,
consultative response to the assigned topic. Group members must not look at, access or discuss any
aspect of any other group's solution (including a group from a previous year), nor allow anyone outside
of the group to look at any aspect of the group's solution. Likewise, you are prohibited from utilizing the
internet or any other means to access others' solutions to, or discussions of, the assigned material. If the
assignment requires outside research, all sources must be properly cited and referenced; be careful to
cite all material, not only of direct quotations but also of ideas. Help for citing sources is available
through the Queen's University library: http://library.queensu.ca/help-services/citing-sources. The
names of each group member must appear on the submitted assignment, and no one other than the
people whose names appear on the assignment may have contributed in any way to the submitted
solution. In short, the group assignments must be the work of your group, and your group only. All
group members are responsible for ensuring the academic integrity of the work that the group submits.
Page 15 of 16
MMAI/MMA/GMMA 869: Machine Learning and AI
Turnitin.com
When assignments are submitted through the dropbox on the course website, they will be processed
through turnitin.com. Turnitin is a plagiarism detection tool that checks your submission against other
texts, including websites, journal articles, books, and other student submissions in order to verify the or
Page 16 of 16