This project was commissioned by a client on 2023-09-26. If you're interested in similar work, check out my freelance data analyst profile on Fastwork.
The aim of this project was to scrape contact information for dentists across Indonesia: name, address, and phone number. The data was sourced from the following links:
- Pelayanan Kesehatan - Praktek Mandiri
- Pelayanan Kesehatan - Klinik (Gigi)
- Pelayanan Kesehatan - Klinik (Dental)
- Pelayanan Kesehatan - Rumah Sakit (Gigi)
- Ikatan Ortodontis Indonesia
I used Selenium for the scraping. Each link has its own page format, and the fifth differs substantially from the others, so I wrote a dedicated script for each source. For instance, scraping_1_link.py corresponds to the first link, is automated through GitHub Actions by scraping_1_link.yml, and saves its data in the Data folder. The remaining links follow the same pattern. After scraping, the data from the individual sources was combined into a single Excel file to produce one comprehensive dataset.
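The combining step can be sketched roughly as follows. This is a minimal illustration, not the code used in this repository. It assumes each script writes a CSV file with `name`, `address`, and `phone` columns, which the README does not confirm. The function names and the extra `source` column are hypothetical. To stay free of third-party dependencies the sketch writes a combined CSV. Producing the Excel file itself would need a library such as pandas or openpyxl.

```python
import csv
from pathlib import Path

# Assumed shared columns; the real per-source files may use other headers.
FIELDS = ["name", "address", "phone"]


def load_source(path):
    """Read one per-source CSV, keeping only the shared columns.

    A hypothetical 'source' column (the file name without extension)
    records where each row came from.
    """
    path = Path(path)
    with path.open(newline="", encoding="utf-8") as f:
        rows = []
        for row in csv.DictReader(f):
            # Missing or empty cells become empty strings.
            record = {field: (row.get(field) or "").strip() for field in FIELDS}
            record["source"] = path.stem
            rows.append(record)
        return rows


def combine_sources(paths):
    """Concatenate the rows of every source file, in the order given."""
    combined = []
    for path in paths:
        combined.extend(load_source(path))
    return combined


def write_combined(records, out_path):
    """Write the combined rows to a single CSV file."""
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS + ["source"])
        writer.writeheader()
        writer.writerows(records)
```

Under these assumptions, something like `write_combined(combine_sources(sorted(Path("Data").glob("*.csv"))), "combined.csv")` would gather every file in the Data folder into one table.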
Thank you for reviewing this repository. Please don't hesitate to reach out for further information or collaboration opportunities.