Professional Documents
Culture Documents
COURSE OVERVIEW
This course uses a learning-by-doing approach to understand key concepts behind Big Data and Artificial
Intelligence (AI), the strategic drivers of these technologies and the value propositions that they provide to
industry. The course covers technologies within this ecosystem including Hadoop, Pig, Hive, Spark, and
Amazon Web Services (AWS). Examples of AI using deep learning will be presented. The focus is on creating
awareness of the technologies, providing some level of familiarity with them through hands-on exercises and
projects, and enabling strategic thinking around the use of these technologies in business.
MATERIALS
Supplement:
Learning with Big Data – The Future of Education (http://www.big-data-book.com/)
Authors: Viktor Mayer-Schönberger, Kenneth Neil Cukier
Hadoop Book (http://www.hadoopbook.com/)
Author: Tom White
Mining of Massive Datasets (http://infolab.stanford.edu/~ullman/mmds/book.pdf)
Authors: Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman
A Human's Guide to Machine Intelligence (https://wsp.wharton.upenn.edu/book/a-humans-guide-to-machine-
intelligence/)
Author: Kartik Hosanagar
Deep Learning (https://www.deeplearningbook.org/)
Authors: Ian Goodfellow, Yoshua Bengio, Aaron Courville
TECHNOLOGY REQUIREMENT
Much of the software needed for big data applications tend to be open source. Therefore, the source programs
are free and will be provided in class. We will be using Amazon Web Services (AWS), Google Colab, as well as
PyTorch and Keras.
COURSE DESCRIPTION:
This course uses a hands-on, learning-by-doing approach to understanding the concepts behind big data, IoT,
and AI, the strategic drivers of these technologies and the value propositions that they provide to industries. In
addition, the course will also serve as an introduction to some of the key technologies within this ecosystem,
such as Hadoop, Pig, Hive, Spark, and AWS. Examples of AI using deep learning will be conducted in class.
The focus is on creating awareness of the technologies, allowing some level of familiarity with them through
assignments, and enabling some strategic thinking around the use of these in business.
LEARNING OBJECTIVES
This is a demanding course. The technologies are still evolving very rapidly. Therefore, there is a level of
experimentation with new material that will take place during the semester. Students are required to be flexible
as and when topics or material in class are revised or modified. We will do our best to ensure that no undue
burden is placed on students. The course has two primary objectives:
1. To allow students to have working knowledge and exposure to key elements of a big data technology
platform, and a basic AI example.
2. To allow students to understand critical business and strategic issues around the use of these technologies in
organizations and to help guide the successful design and implementation of complex data strategy.
Though mastery of this content requires more than one course, an introductory course, such as this, is useful in
allowing students to gain much-needed familiarity with these technologies and concepts. That is the objective of
this course.
LECTURES
The lectures will focus on discussing concepts with simple examples. The lecture meetings will be in the
classroom. The student will be responsible for all the material and updates presented and discussed in the
lectures. The text, lecture notes, and lectures will provide a base of knowledge that participants can use in
assignments and the course project.
GRADING
The Smith School of Business offers rigorous, academically challenging courses and provides meaningful
feedback on student performance to facilitate learning. Transparency and consistency in grading are important
elements that ensure the integrity of the curriculum.
Grading Method
In-Class Activities 400
Individual Assignments 400
Team Project 200
Maximum Points Possible 1000
Final Grades
The following are indicative of how the points on the course would be mapped to a grade. This is only indicative
and may be adjusted at my discretion.
In-Class Activities
To reinforce database concepts and encourage you to read the chapters, activities and exercises will be given
during class time throughout the semester. These activities are open-book individual exercises. Each activity
will be worth 20 points. The activities will be relatively simple and based on the readings and lectures.
Individual Assignments
There will be homework assignments to be completed individually. Each assignment will reinforce specific
concepts related to the relevant chapter/lecture. All assignments include conceptual and applied components. As
you will notice, a significant part of the course grade is determined by your performance on the assignments so
working on the assignments will help you learn information systems concepts and their implementation. It is
not wrong to seek clarifications and minor help with completion of each assignment but there is a very
fine line between seeking assistance and cheating. Please stay on the legal side of that line. You are always
welcome to the office hours should you have questions.
To prevent plagiarism in homework assignments, the grading of homework may also include an oral part. For
each homework assignment, I may randomly select a subset of students to come to my office hours and explain
the logic of the code that was submitted. If a student cannot clearly explain how own code works, it is up to my
discretion to decide whether plagiarism occurs and corresponding measures will be taken.
Each assignment will require you to spend time on the computer. Please plan on it. Each assignment will be
worth 60 points. Submission instructions will be covered at the beginning of the each assignment. All
assignments are due by 11:59pm on the day due. ELMS permits multiple submissions, but only the latest
submission will be graded. Late assignments (by less than 24 hours) will result in a grade reduction of
25%. Assignments submitted late by more than 24 hours will receive a zero. You have been warned!
Team Projects
The team project is expected to give you an opportunity to combine the skills you have acquired in the course.
The description of the team project will provide you with more detailed information. All project deliverables are
due via ELMS by the date and time indicated on the schedule.
All projects are due by 11:59pm on the day due. ELMS permits multiple submissions, but only the latest
submission will be graded. Late assignments (by less than 24 hours) will result in a grade reduction of
25%. Assignments submitted late by more than 24 hours will receive a zero. You have been warned!
COURSE-RELATED POLICIES
Students are responsible for knowing their rights and reviewing all course related policies found at The Graduate
School website. The primary goal of our discussions is to enhance the collective learning of the class. The
following guidelines can help you make effective contributions to our class discussions. Please see the following
list of tips for effective and meaningful discussion participation.
I invite you, if you wish, to tell us how you want to be referred to both in terms of your name and your pronouns
(he/him, she/her, they/them, etc.). The pronouns someone indicates are not necessarily indicative of their gender
identity. Additionally, how you identify in terms of your gender, race, class, sexuality, religion, and dis/ability,
Technology Policy
Please refrain from using cellphones, laptops, and other electronic devices during class sessions unless we have
designated such use as part of a class exercise.
Score/Grade Appeals
You must make the appeal in writing. However, score changes are at the discretion of the instructor and may be
up or down based upon a complete review of the work in question. It is important to recognize that a grade
reflects another person’s judgment of your work. In this sense, all grading is subjective. Appealing scores is
discouraged. Changing a few points on assignments rarely makes a difference in the final grade. Time is much
better spent discussing and clarifying the information content presented in the course. In the event of disputes at
the end of the semester, you will be required to produce the testing material in question.
In the case of a grading mistake (i.e., grade is posted incorrectly, grader did not give credit for an item that
exists) you must first contact the TA who graded the assignment.
In this course, I will expect you to strive for excellence in meeting these six standards in your contributions to
class discussions and your written work. You should expect me and your classmates to push you to meet these
standards. For example, in a class discussion, please fully explain your point and how it relates to the question
that was posed to the class. Describe the evidence on which you base your claims. Be open to feedback about
where you can do more to meet these standards.
The University's Code of Academic Integrity is designed to ensure that the principles of academic honesty and
integrity are upheld. In accordance with this code, the Smith School does not tolerate academic dishonesty.
Please ensure that you fully understand this code and its implications because all acts of academic dishonesty
will be dealt with in accordance with the provisions of this code. All students are expected to adhere to this
Code. It is your responsibility to read it and know what it says, so you can start your professional life on the
right path. As future professionals, your commitment to high ethical standards and honesty begins with
your time at the Smith School.
It is important to note that course assistance websites, such as CourseHero, Chegg, etc., are not permitted
sources for Smith School courses. Material taken or copied from these sites can be deemed unauthorized
material and a violation of academic integrity. These sites offer information that might not be accurate and that
shortcut the learning process, particularly the critical thinking steps necessary for college-level assignments.
Additionally, it is understandable that students may use a variety of online or virtual forums for course-wide
discussion (e.g., GroupMe or WeChat). Collaboration in this way regarding concepts discussed in this course is
permissible. However, collaboration on graded activities and assignments is strictly prohibited. Examples of
prohibited collaboration include: asking for answers on grading components, asking for access codes to grading
components, etc.
To help you avoid unintentional violations, the following table lists levels of collaboration that are acceptable
for each type of assignment. If you ever feel pressured to comply with someone else’s academic integrity
violation, please reach out to me straight away. Also, if you are ever unclear about acceptable levels of
collaboration, please ask!
Open Notes Use Book Ask Friends Search Online Work in Groups
In-Class
yes yes no no no
Activity
Individual
yes yes cited cited no
Assignment
only within
Team Project yes yes cited cited
your team
Course materials are copyrighted and may not be reproduced for anything other than personal use without
written permission. Although every effort has been made to be complete and accurate, unforeseen circumstances
arising during the semester could require the adjustment of material given here. Consequently, given notice to
students, the instructor reserves the right to change any information on this syllabus or in other course materials.
(end of syllabus)