
Korea University International Summer Campus (KU ISC) 2023

Embark on a unique summer


June 27, 2023 ∼ August 3, 2023

ISC356A – Big Data Analytics

Ⅰ. Instructor
Professor : Suyong Song
E-mail : suyong-song@uiowa.edu
Home Institution : University of Iowa
Class Time : 6 Week, Period 1 (9:00am-10:40am KST)
Office : TBA
Office Hours : TBA

Ⅱ. Textbook
Required Textbook : There is no required textbook for this course.
Recommended Additional Readings :

Ⅲ. Course Description and Objectives


To develop insights from so-called big data, researchers go through a series of phases: data
collection, cleaning, processing, storage, analysis, and visualization. Among the programming
languages and software tools for data analysis, Python is widely used. It is known for its usability,
which lets users perform every step needed to handle big data, and for its ecosystem of active
contributors and users. In this course, students will learn how to handle big data using Python at
each phase of big data analysis. Students will also gain hands-on experience with social media
analysis by applying text mining and visualization techniques to real data collected from Twitter,
one of the most popular sources for social media analysis.
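
As a rough illustration of the cleaning, processing, analysis, and visualization phases described above, the following is a minimal sketch using only the Python standard library. It is not course material: the sample posts, the stop-word list, and all function names are hypothetical, and the course itself works with real Twitter data and libraries such as NLTK, TextBlob, and Matplotlib.

```python
import re
from collections import Counter

# Hypothetical sample posts (assumption); the course uses real Twitter data.
SAMPLE_POSTS = [
    "Loving the new phone! Great camera, great battery. #tech",
    "Battery died again... not great. @support please help",
    "Camera quality is great, see https://example.com/review",
]

# A tiny illustrative stop-word list (assumption, not taken from any library).
STOP_WORDS = {"the", "is", "a", "an", "not", "see", "again", "please", "new"}


def clean(text):
    """Cleaning: lowercase a post and strip URLs, @mentions, and '#' signs."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # remove URLs
    text = re.sub(r"@\w+", " ", text)          # remove mentions
    return text.replace("#", " ")


def tokenize(text):
    """Processing: split cleaned text into alphabetic tokens, dropping stop words."""
    return [w for w in re.findall(r"[a-z]+", text) if w not in STOP_WORDS]


def word_counts(posts):
    """Analysis: count how often each token appears across all posts."""
    counts = Counter()
    for post in posts:
        counts.update(tokenize(clean(post)))
    return counts


def text_bar_chart(counts, top_n=3):
    """Visualization: render the most frequent words as a plain-text bar chart."""
    lines = [f"{word:<10}{'*' * n}" for word, n in counts.most_common(top_n)]
    return "\n".join(lines)


if __name__ == "__main__":
    print(text_bar_chart(word_counts(SAMPLE_POSTS)))
```

In practice each helper would likely be replaced by a library call, but the order of the steps stays the same.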

Ⅳ. Grading
Midterm Exam : 30%
Final Exam : 35%
Assignments : 30%
Participation : 5%
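
The weights above combine into a single course score as a weighted average. The sketch below shows the arithmetic under the assumption that each component is scored from 0 to 100; the function name and the example scores are hypothetical, and the syllabus does not specify how scores map to letter grades.

```python
# Weights taken from the Grading section (percent of the course score).
WEIGHTS = {"midterm": 30, "final": 35, "assignments": 30, "participation": 5}


def weighted_score(scores):
    """Combine component scores (each assumed to be 0-100) into a 0-100 course score."""
    assert sum(WEIGHTS.values()) == 100  # the listed weights add up to 100%
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100


# Hypothetical student scores, for illustration only.
example_scores = {"midterm": 80, "final": 90, "assignments": 100, "participation": 100}

if __name__ == "__main__":
    print(weighted_score(example_scores))
```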
Ⅴ. Class Outline
Date             Topic                                                      Chapter   Remarks
June 27 (Tue)    Orientation Day
June 28 (Wed)    Module 0 - Introduction to Big Data and Big Data Analytics
June 29 (Thu)    Module 1 - Introduction to Python and Jupyter Notebook
June 30 (Fri)    Module 2 - Python Basics Part 1: Data Types, Built-in Functions, and Operators
July 3 (Mon)     Module 2 - Python Basics Part 1: Data Types, Built-in Functions, and Operators
July 4 (Tue)     Module 2 - Python Basics Part 1: Data Types, Built-in Functions, and Operators
July 5 (Wed)     Module 3 - Python Basics Part 2: Flow Control, User-Defined Functions, Modules and Packages, and Exceptions
July 6 (Thu)     Module 3 - Python Basics Part 2: Flow Control, User-Defined Functions, Modules and Packages, and Exceptions
July 10 (Mon)    Module 4 - Handling Numbers with NumPy
July 11 (Tue)    Module 4 - Handling Numbers with NumPy
July 12 (Wed)    Module 4 - Handling Numbers with NumPy
July 13 (Thu)    Midterm exam
July 17 (Mon)    Module 5 - Data Manipulation and Analysis with Pandas
July 18 (Tue)    Module 5 - Data Manipulation and Analysis with Pandas
July 19 (Wed)    Module 6 - Handling Files
July 20 (Thu)    Module 6 - Handling Files
July 24 (Mon)    Module 7 - Text Processing with NLTK and TextBlob
July 25 (Tue)    Module 7 - Text Processing with NLTK and TextBlob
July 26 (Wed)    Module 8 - Data Visualization with Matplotlib and Ipywidgets
July 27 (Thu)    Module 8 - Data Visualization with Matplotlib and Ipywidgets
July 31 (Mon)    Module 8 - Data Visualization with Matplotlib and Ipywidgets
Aug 1 (Tue)      Case study
Aug 2 (Wed)      Final exam
Aug 3 (Thu)      Graduation Day (Available both Online / Offline)

Software

The software used for the empirical exercises is Python, which can be downloaded from
https://www.anaconda.com/download/. We will discuss how to install and use Python during the
course. Most of our work will be performed in Jupyter Notebooks. Students are free to use other
software such as R, Matlab, or Stata. R is available at http://www.r-project.org/.

Class Attendance and Participation

Class attendance is very important. You are expected to be in class on time. I will check your
attendance regularly through the Blackboard tool. The course is ultimately meant to be a forum
for the discussion of tools and topics in econometrics. As such, it will rely on active and continuous
participation.

Assignments

Homework will be assigned regularly during the course. Homework is due at the beginning of
class. No late homework will be accepted. Each homework has equal weight.
For homework, you will be asked to form groups of three. Members of a group work together
and submit one report representing the best of the group's work. If group members prefer to
work individually, homework can be submitted individually. Copying solutions from any
other source without attribution is considered academic misconduct.

Exams

The midterm exam will take place on July 13 and the final exam on August 2, both in class. The
midterm covers all material up to that point. The final exam is comprehensive but focuses mainly
on the material covered after the midterm exam.
