You are on page 1of 1

ICT583 Data Science Applications

Tutorial 1
Exercise: Visit the following two links which are famous for open datasets in data science:
https://archive.ics.uci.edu/ml/datasets.php
https://www.kaggle.com/
Identify two data sets that sound interesting to you. For each write brief description and
propose three interesting things you might do with them.

 Kaggle also provide a competitive forum for data scientists. New challenges are posted on a
regular basis, providing a problem definition, training data, and a scoring function over hidden
evaluation data. A leader board displays the scores of the strongest competitors, so you can see
how well your model stacks up in comparison with your opponents.

Lab work: R Basics


R is a powerful language used widely for data analysis and statistical computing.
Your task:
1. Download RStudio (https://www.rstudio.com/products/RStudio ) in your own
computer.
2. By the end of Week 3, complete every exercise/demo in self-exercises: R
Basics in LMS. It will help you quickly master the basics of R, including 'Introduction to
basics', 'Vectors', 'Matrices', 'Factors', 'Data frames' and 'Lists', and pave the foundation
of future data analysis in this unit.

You might also like