This project uses the Extract, Transform and Load (ETL) method on the Amazon Top 50 Bestselling List from 2009 to 2020 to determine whether sales of books within the Race and Social Justice genre have increased in the last 2-3 years.

tnguyen0306/Race-and-Social-Justice-in-Literature


ETL Project

Project Title

Race and Social Justice in Literature

Team Members

  • Brandy Knust
  • Tyler Nguyen

Project Proposal

Our project identifies the genre of each book on the Amazon Top 50 Bestselling List from 2009 to 2020 and uses the Extract, Transform and Load (ETL) method to determine whether sales of books within the Race and Social Justice genre have increased in the last 2-3 years.

Questions

  1. How many books about Race or Social Justice appeared in the top 50 between 2009 and 2020?

  2. Why is this important?

  3. Which other genres are more popular?

Data References

The Amazon Top 50 Bestselling data comes from a publicly available Kaggle dataset, downloaded and loaded in Jupyter Notebook.
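As a rough illustration of the extract step, the sketch below loads a bestseller CSV into Pandas and normalizes it. The column names (`Name`, `Author`, `Year`) and the placeholder rows are assumptions made for this example, not the actual contents of the Kaggle file; in the notebook, the path of the downloaded CSV would be passed in place of the in-memory sample.

```python
import io

import pandas as pd

# Placeholder rows standing in for the downloaded Kaggle CSV.
# Column names and values are assumptions, not real bestseller data.
SAMPLE_CSV = """Name,Author,Year
Book A,Author X,2009
Book A,Author X,2010
Book B,Author Y,2010
Book B,Author Y,2010
"""


def load_bestsellers(source):
    """Read a bestseller CSV and return a cleaned DataFrame."""
    df = pd.read_csv(source)
    # Normalize headers so later steps can rely on lowercase names.
    df.columns = [c.strip().lower().replace(" ", "_") for c in df.columns]
    df["year"] = df["year"].astype(int)
    # A title can reappear across years; drop only exact repeats
    # of the same title within the same year.
    return df.drop_duplicates(subset=["name", "year"]).reset_index(drop=True)


bestsellers = load_bestsellers(io.StringIO(SAMPLE_CSV))
print(bestsellers)
```

Keeping one row per title per year preserves books that stayed on the list for several years, which matters when counting list appearances by year.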

Genre details missing from the dataset are scraped from publicly available book information.
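The scraping step might look something like the following sketch, which pulls genre labels out of a book page with BeautifulSoup. The HTML snippet and the `genres` class name are hypothetical; a real page will use different markup, and the page source would come from Selenium or an HTTP request rather than an inline string.

```python
from bs4 import BeautifulSoup

# Hypothetical markup for a book page; real pages will differ.
SAMPLE_HTML = """
<html><body>
  <h1 class="book-title">Book A</h1>
  <ul class="genres">
    <li>Nonfiction</li>
    <li>Social Justice</li>
  </ul>
</body></html>
"""


def extract_genres(html):
    """Return the genre labels found in a book page, or [] if none."""
    soup = BeautifulSoup(html, "html.parser")
    genre_list = soup.find("ul", class_="genres")
    if genre_list is None:
        # Page has no genre block; let the caller decide how to handle it.
        return []
    return [li.get_text(strip=True) for li in genre_list.find_all("li")]


genres = extract_genres(SAMPLE_HTML)
print(genres)
```

Returning an empty list for pages without a genre block lets the cleanup step flag those books for manual review instead of failing mid-scrape.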

Files

Rough Breakdown of Tasks

  • Data identification
  • Data scraping and extraction (Selenium, BeautifulSoup, Pandas)
  • Data cleanup (SQL, Pandas)
  • Data aggregation (SQL, Pandas)
  • Data analysis
  • Data visualization (Pandas)
  • Summary
  • Documentation
  • Presentation
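For the aggregation task, one possible approach is to count how many books in the target genre appear on the list each year. This is a minimal sketch that assumes each book has already been assigned a single `genre` label; the DataFrame below holds placeholder rows, so the counts are not project results.

```python
import pandas as pd

# Placeholder rows; titles, years and genres are illustrative only.
books = pd.DataFrame(
    {
        "name": ["Book A", "Book B", "Book C", "Book D"],
        "year": [2018, 2019, 2020, 2020],
        "genre": [
            "Fiction",
            "Race and Social Justice",
            "Race and Social Justice",
            "Race and Social Justice",
        ],
    }
)

TARGET_GENRE = "Race and Social Justice"


def count_by_year(df, genre):
    """Count books of the given genre per year, keeping years with zero."""
    years = sorted(df["year"].unique())
    counts = df[df["genre"] == genre].groupby("year").size()
    # Reindex so years without a matching book show 0 instead of vanishing.
    return counts.reindex(years, fill_value=0)


yearly = count_by_year(books, TARGET_GENRE)
print(yearly)
```

Reindexing over every year in the data keeps zero-count years visible, so a plot of `yearly` shows the full trend rather than only the years with matches.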

Languages

  • Jupyter Notebook 100.0%