Skip to content

searching through the images from many newspapers looking for the occurrences of keywords and faces

Notifications You must be signed in to change notification settings

kapil-verma/Newspaper_Image_Search

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Newspaper_Image_Search

This project code allows one to search through the images from many newspaper looking for the occurrences of keywords and faces. E.g. if you search for "Mike" it will return a contact sheet of all of the faces which were located on the newspaper page which mentions the name "Mike"

Contents

  • Main.py takes a ZIP file of images (newspaper images) and process them, and finally it will return a contact sheet of all of the faces which were located on the newspaper page which mentions the name we search for.
  • readonly folder contains face (front profile) detection classifier and an image containing face detection result for searching the key word "Mark".
  • we use OpenCV to detect faces, tesseract to do optical character recognition, and PIL to composite images together into contact sheets.

Dataset

Each page of the newspapers is saved as a single PNG image in a file images.zip These newspapers are in english, and contain a variety of stories, advertisements and images.
Note: This file is fairly large (~200 MB) and may take some time to work with, I would encourage you to use a smaller subset of these images for testing.
Dataset link

About

searching through the images from many newspapers looking for the occurrences of keywords and faces

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages