Scrape Premier League Data and load it into Google BigQuery

hamza99-elback/data_scrapping_pl

data_scrapping_pl_google_big_query

In this project, I have implemented a straightforward ETL (Extract, Transform, Load) process. I used Python, along with the Selenium library, to scrape data from the Premier League website. After extracting the data, I performed various transformations using Pandas and NumPy. Finally, I loaded the transformed data into a Google BigQuery database.
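A minimal sketch of the three stages described above could look like the following. It is not the repository's actual code. The CSS selector, the cell positions (`PLAYER_CELL`, `CLUB_CELL`, `GOALS_CELL`), the column names, and the `table_id` format are all assumptions made for illustration, and the real page markup may differ. Third-party imports are kept inside the functions so each stage can be read and tried separately.

```python
STATS_URL = "https://www.premierleague.com/stats/top/players/goals"

# Hypothetical positions of the fields within each table row (assumption).
PLAYER_CELL, CLUB_CELL, GOALS_CELL = 1, 2, 4


def parse_int(text):
    """Turn scraped text such as '1,234' into an int; return None if it is not a number."""
    digits = text.replace(",", "").strip()
    return int(digits) if digits.isdigit() else None


def extract(url=STATS_URL):
    """Extract: open the page with Selenium and return (player, club, goals) text tuples."""
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()
    try:
        driver.get(url)
        rows = []
        # "table tbody tr" is an assumed selector, not taken from the project.
        for tr in driver.find_elements(By.CSS_SELECTOR, "table tbody tr"):
            cells = [td.text for td in tr.find_elements(By.TAG_NAME, "td")]
            if len(cells) > GOALS_CELL:
                rows.append((cells[PLAYER_CELL], cells[CLUB_CELL], cells[GOALS_CELL]))
        return rows
    finally:
        driver.quit()


def transform(rows):
    """Transform: clean the raw strings and build a DataFrame sorted by goals."""
    import pandas as pd

    cleaned = [(player.strip(), club.strip(), parse_int(goals)) for player, club, goals in rows]
    # Drop rows whose goal count could not be parsed.
    cleaned = [row for row in cleaned if row[2] is not None]
    df = pd.DataFrame(cleaned, columns=["player", "club", "goals"])
    return df.sort_values("goals", ascending=False).reset_index(drop=True)


def load(df, table_id):
    """Load: write the DataFrame to BigQuery, replacing the table contents."""
    from google.cloud import bigquery

    client = bigquery.Client()  # uses the default Google Cloud credentials
    job_config = bigquery.LoadJobConfig(write_disposition="WRITE_TRUNCATE")
    job = client.load_table_from_dataframe(df, table_id, job_config=job_config)
    job.result()  # wait for the load job to finish


def run_pipeline(table_id):
    """Run the three stages in order; table_id looks like 'project.dataset.table'."""
    load(transform(extract()), table_id)
```

Calling `run_pipeline("my-project.my_dataset.top_scorers")` (a placeholder table name) would run the whole chain. It requires a local Chrome installation, the `selenium`, `pandas`, `google-cloud-bigquery`, and `pyarrow` packages, and configured Google Cloud credentials.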

For the next version of this project, I plan to enhance the ETL pipeline by utilizing Apache Airflow and the BigQueryOperator to automate and streamline the data extraction, transformation, and loading processes.

Below is the result of this ETL pipeline.

image

LINKS: https://www.premierleague.com/stats/top/players/goals image
