Parsing the Web: Let's Find the Following Data for the First 100 Movies

The document describes parsing web data to extract the release date, movie title, and production budget for the first 100 movies from a website. It uses the Beautiful Soup and Pandas libraries in Python to make a request to the target URL, parse the HTML response, extract the data from table rows into a dictionary, add it to an info array, and convert that into a Pandas dataframe for output.

Original Description:

Searching for specific information by filtering tags with Beautiful Soup

Original Title

Parsing Web

Uploaded by

Josue Sanchez

PARSING THE WEB

Let's find the following data for the first 100 movies:

Release Date, Movie, Production Budget

Implemented code:
import requests
# Beautiful Soup parses the fetched HTML
from bs4 import BeautifulSoup
# pandas builds the final table
import pandas as pd

TARGET_URL = 'https://www.the-numbers.com/movie/budgets/all'

info = []  # general list, one entry per movie
data = {}  # final dictionary

myData = requests.get(TARGET_URL)
# Parse the fetched data with Beautiful Soup
soup = BeautifulSoup(myData.text, 'html.parser')
elements = soup.find_all("tr")
for elem in elements:
    valores = []
    dat = {}
    itemtd = elem.find_all("td")
    if itemtd:  # rows without <td> cells (such as a header row) are skipped
        valores.append(itemtd[1].text)  # release date
        valores.append(itemtd[2].text)  # movie
        valores.append(itemtd[3].text)  # production budget
        # store the values in a dictionary keyed by the row's position number
        dat[itemtd[0].text] = valores
        # append to the general list used to build the final dictionary
        info.append(dat)

data["peliculas"] = info  # add the list to the data dictionary as a key-value pair

dataFrame = pd.DataFrame.from_dict(data)
print(dataFrame)
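
The script above leaves every movie as a nested dictionary inside a single "peliculas" column. As a possible alternative, the sketch below puts each field in its own named column and stops after a row limit. It is not the author's code: the function name rows_to_frame, the limit parameter, and the SAMPLE_HTML fragment are hypothetical, and the sample only mimics the column order the script assumes (position, release date, movie, production budget), so it runs without a network request.

```python
from bs4 import BeautifulSoup
import pandas as pd

COLUMNS = ["Release Date", "Movie", "Production Budget"]

# Hypothetical fragment with made-up rows; it only imitates the assumed layout.
SAMPLE_HTML = """
<table>
  <tr><th>#</th><th>Release Date</th><th>Movie</th><th>Production Budget</th></tr>
  <tr><td>1</td><td>Jan 1, 2001</td><td>Example Movie A</td><td>$200,000,000</td></tr>
  <tr><td>2</td><td>Feb 2, 2002</td><td>Example Movie B</td><td>$150,000,000</td></tr>
</table>
"""


def rows_to_frame(html, limit=100):
    """Collect up to `limit` table rows into a DataFrame with one column per field."""
    soup = BeautifulSoup(html, "html.parser")
    records = []
    for row in soup.find_all("tr"):
        cells = row.find_all("td")
        if len(cells) < 4:
            # Skip header rows and any row too short to hold all fields.
            continue
        records.append({
            "Release Date": cells[1].get_text(strip=True),
            "Movie": cells[2].get_text(strip=True),
            "Production Budget": cells[3].get_text(strip=True),
        })
        if len(records) == limit:
            break
    return pd.DataFrame(records, columns=COLUMNS)


frame = rows_to_frame(SAMPLE_HTML)
print(frame)
```

To try it on the live page, pass requests.get(TARGET_URL).text instead of SAMPLE_HTML. Whether the real table still follows this column order is an assumption worth checking first.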

Result of running the code:
