The document provides a 2 day assignment to scrape provider details from a given clickable URL across all pages and store it in a CSV, XLSX, or JSON file using Python. The details to extract include name, title, gender, expertise, research interests, phone, location, and education. Multithreading is preferred for the scraping. The deliverable is a zip file with the data in one of the specified formats and the Python code.
The document provides a 2 day assignment to scrape provider details from a given clickable URL across all pages and store it in a CSV, XLSX, or JSON file using Python. The details to extract include name, title, gender, expertise, research interests, phone, location, and education. Multithreading is preferred for the scraping. The deliverable is a zip file with the data in one of the specified formats and the Python code.
The document provides a 2 day assignment to scrape provider details from a given clickable URL across all pages and store it in a CSV, XLSX, or JSON file using Python. The details to extract include name, title, gender, expertise, research interests, phone, location, and education. Multithreading is preferred for the scraping. The deliverable is a zip file with the data in one of the specified formats and the Python code.
Hilbert Disease 287-3 Center University;S , MD Medicine, Sleep 550 8 Devine St UNY Medicine, Critical North Haven, Downstate Care Medicine, CT 06473 College of Internal Medicine Medicine, Brooklyn, NY Dwain PsyD Male Psychiatry - (203) Yale Medicine University Fehon, 688- Psychiatry of Hartford PsyD 20 York St. 9779 New Haven, CT 06510
Note:
1. All Providers to be stored in CSV, XLSX or Json (Any One Format)
2. Languages to be used : Python 3. Non Selenium approach is preferred 4. Using of Multi-Threading is an added advantage.
Deliverables : A Zip (.zip) file containing CSV, XLSX or Json (Any One Format) and Python Code.