Professional Documents
Culture Documents
Since 2004
Hanoi, 02/2023
Course Introduction
● Course Name/Code: Web Mining / INT6132 (Postgraduate)
● #Credits: 03
● Time: 6.00-8.50pm every Thursday (23/02/2023 - 28/05/2023)
● Course Plan: 10 weeks = 9 (lecture + group presentation) + 1 (midterm)
● Learning outcomes:
○ Understand basic concepts of data/text/web mining
○ Understand techniques and algorithms for web mining
○ Utilise tools/libraries to crawl and preprocessing web data
○ Can apply machine learning models in revealing new knowledge from web
○ Can apply the obtained knowledge to solve real world problems
● Primary Textbook:
○ Bing Liu (2011). Web Data Mining: Exploring Hyperlinks, Contents and Usage
Data (2nd Edition), Springer. http://www.cs.uic.edu/~liub/
● Reference Textbook:
○ Data Mining: Practical Machine Learning Tools and Techniques, by Ian
Witten and Eibe Frank, 3rd Ed., Morgan Kaufmann, 2011
○ Phan Xuân Hiếu, Đoàn Sơn, Nguyễn Trí Thành, Hà Quang Thụy (chủ biên),
Nguyễn Thu Trang, Nguyễn Cẩm Tú (2009). Giáo trình Khai phá dữ liệu Web,
NXBGD, Hà Nội, 2009.
Thank you
Email me
trongld@vnu.edu.vn