Skip to content

satyamgeek/Web_scraping_IMDB_Top250

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web_scraping_IMDB_Top250

This is a Web scraping application which scraps key features from Top250 movie on IMDB website.

Here, in this application we have used Python3, BeautifulSoup, Sqlite3 to extract all the data off the site to store it in two differnt forms.
1> CSV format.
2> SQL database.

Libraries:

1> bs4
2> lxml
3> requests
4> csv
5> pandas as pd
6> sqlite3

The project folder contains the application notebook and python file.

This application gives user the option to select the format to save in.

image SQL_save

User can also execute QUERY over the database.

The name of the database is given with the schemas of the database: Query_statement

Data of the movies collected:

1>Movie Name.
2>Release Year.
3>Actors.
4>Directors.
5>Link for the review page.

Preview

csv_file