Skip to content

Latest commit

 

History

History
10 lines (9 loc) · 287 Bytes

README.md

File metadata and controls

10 lines (9 loc) · 287 Bytes

Preprocessing

Create dataset from Greek Parliament Records.

The excecution order is:

  1. collect_url_documents.py
  2. speech_files_extraction.py
  3. unprocessed_csv_extraction.py
  4. csv_preprocessing.py
  5. collect_names_from_political_parties.py
  6. name_and_political_party_matching.py