
2/13/2020 HW Week 4) Insights from Web Scraping

HW Week 4) Insights from Web Scraping



Due: Friday by 11:59pm
Points: 100
Submitting: a file upload (Turnitin enabled)
File Types: ipynb and html
Available: until Feb 14 at 11:59pm

You will select a web page that contains at least one table, scrape it using BeautifulSoup, and create a
report with insights drawn from the data you collect from the site.
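As a starting point, the table-scraping step might look like the minimal sketch below. It is only an illustration: `SAMPLE_HTML`, the `table_to_rows` helper, and the city values are made up for this example, and a real notebook would first download the chosen page (for instance with the `requests` library) instead of using an inline string.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a downloaded page (values are made up).
SAMPLE_HTML = """
<html><body>
<table id="cities">
  <tr><th>City</th><th>Population</th></tr>
  <tr><td>Springfield</td><td>1,200</td></tr>
  <tr><td>Shelbyville</td><td>950</td></tr>
</table>
</body></html>
"""


def table_to_rows(html, table_index=0):
    """Return the table at table_index as a list of dicts keyed by header text."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find_all("table")[table_index]
    rows = table.find_all("tr")
    # Treat the first row as the header row.
    headers = [cell.get_text(strip=True) for cell in rows[0].find_all(["th", "td"])]
    records = []
    for row in rows[1:]:
        values = [cell.get_text(strip=True) for cell in row.find_all("td")]
        if len(values) == len(headers):  # skip malformed or spacer rows
            records.append(dict(zip(headers, values)))
    return records


rows = table_to_rows(SAMPLE_HTML)
print(rows)
```

The cell values come back as strings (note the comma in "1,200"), so numeric columns usually need cleaning before any analysis or plotting.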

Your report should be professional and should contain an objective, findings, and recommendations, all
based on the data collected.

Your report should include at least 4 different plots (two plots of the same type are not considered different,
for example, two scatter plots are considered similar, even if they plot different aspects of the data).
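One way to satisfy the four-different-plots requirement is sketched below with matplotlib, which is an assumption here since the assignment does not name a plotting library. The `labels`, `values`, and `other` lists are made-up placeholders; in a real report they would be columns taken from the scraped table.

```python
import matplotlib
matplotlib.use("Agg")  # render without a display; drop this line inside Jupyter
import matplotlib.pyplot as plt

# Made-up placeholder values; replace with columns from the scraped table.
labels = ["A", "B", "C", "D"]
values = [4, 7, 1, 8]
other = [2.0, 3.5, 0.5, 4.1]

# One figure with four different plot types, one per panel.
fig, axes = plt.subplots(2, 2, figsize=(8, 6))

axes[0, 0].bar(labels, values)
axes[0, 0].set_title("Bar")

axes[0, 1].scatter(values, other)
axes[0, 1].set_title("Scatter")

axes[1, 0].hist(values, bins=4)
axes[1, 0].set_title("Histogram")

axes[1, 1].pie(values, labels=labels)
axes[1, 1].set_title("Pie")

fig.tight_layout()
```

Each panel uses a different plot type, so the four count as different under the rule above; a second scatter plot of other columns would not.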

Your notebook should clearly specify and describe the website you crawled.

Advanced Students:

Advanced students will scrape 2 or more sites and use regular expressions to find patterns across them.
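A minimal sketch of the regular-expression step is shown below, assuming the pattern of interest is a four-digit year that appears on both sites. The two text snippets and the `years_in` helper are hypothetical; in practice the text would come from the pages scraped with BeautifulSoup.

```python
import re

# Hypothetical text standing in for content scraped from two different sites.
site_a_text = "Founded in 1998, expanded in 2005 and again in 2012."
site_b_text = "Key dates: 2005 (merger), 2012 (IPO), 2019 (relaunch)."

# Four-digit years starting with 19 or 20, matched as whole words.
YEAR_PATTERN = re.compile(r"\b(?:19|20)\d{2}\b")


def years_in(text):
    """Return the set of year strings found in text."""
    return set(YEAR_PATTERN.findall(text))


# Years mentioned on both sites.
shared = sorted(years_in(site_a_text) & years_in(site_b_text))
print(shared)
```

The same approach carries over to other patterns, such as prices or product codes, by swapping the regular expression.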

Submission:

Jupyter Notebook
HTML of the Jupyter Notebook

HW Week 3) (1)

https://pacific.instructure.com/courses/56622/assignments/215328?module_item_id=347280 1/2

Criteria (Pts): Ratings

Name of the Student and Project (10.0 pts)
  10.0 pts: Full Marks
  0.0 pts: No Marks

Objective (10.0 pts)
  10.0 pts: Clearly identified, realistic and logical
  6.0 pts: Vague, incomplete or illogical
  0.0 pts: No Marks

Data mining (25.0 pts)
  25.0 pts: Demonstrates professionalism in web scraping using BeautifulSoup
  13.0 pts: Web scraping is basic
  7.0 pts: Messy or unclear web mining
  0.0 pts: No Marks

Findings (20.0 pts)
  20.0 pts: Findings reported are based on the data and meet the objective of the project
  10.0 pts: Findings reported are vague, not based on the data
  0.0 pts: No Marks

Plots (10.0 pts)
  10.0 pts: At least 4 different plots
  5.0 pts: Fewer than 4 plots
  0.0 pts: No Marks

Recommendations (15.0 pts)
  15.0 pts: Recommendations are logical, follow the flow of events and are based on the data collected
  7.8 pts: Recommendations are unrelated to the data collected or insights
  0.0 pts: No Marks

Professionalism (10.0 pts)
  10.0 pts: Report is highly professional, ready to be published
  5.0 pts: Report looks like a class exercise
  3.0 pts: Report is messy, all over the place
  0.0 pts: No Marks

Total Points: 100.0

