Professional Documents
Culture Documents
Due Friday by 11:59pm Points 100 Submitting a file upload (Turnitin enabled)
File Types ipynb and html Available until Feb 14 at 11:59pm
You will select a web page that has at least one table in it, and create a report with insights drawn from the
information of the site using web scraping methods (BeautifulSoup).
Your report should be professional, contain an objective, the findings and recommendations, all based on
the data collected.
Your report should include at least 4 different plots (two plots of the same type are not considered different,
for example, two scatter plots are considered similar, even if they plot different aspects of the data).
Your notebook should clearly specify and describe the website you crawled.
Advanced Students:
Advanced students will scrape 2 or more sites and use regular expressions to find patterns between them.
Submission:
Jupyter Notebook
HTML of the Jupyter Notebook
HW Week 3) (1)
https://pacific.instructure.com/courses/56622/assignments/215328?module_item_id=347280 1/2
2/13/2020 HW Week 4) Insights from Web Scraping
Data mining 25.0 pts 13.0 pts 7.0 pts 0.0 pts
Demonstrate professionalism on the Web Messy or No
25.0 pts
web scraping using BeautifulSoup scraping is unclear web Marks
basic mining
https://pacific.instructure.com/courses/56622/assignments/215328?module_item_id=347280 2/2