HyejinWon/wiki-history-extractor

wiki-history-extractor

Extracts ko-wiki history data from a Wikipedia XML dump file. Only the Korean language is supported.


How to use

  1. Set up WikiExtractor.

  2. Run WikiExtractor on your wiki dump file:

python -m wikiextractor.WikiExtractor 'your wiki dump file' -o 'output path'

  3. Run processing.py, passing the WikiExtractor output path as --path:

python processing.py --path 'output path' --write-path 'result output path'
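
The two commands above can also be chained from a single script. The following is only a minimal sketch, not part of this repository: the helper names `build_commands` and `run_pipeline` are hypothetical, it assumes the WikiExtractor module is importable as `wikiextractor.WikiExtractor`, and it assumes the script is run from the directory that contains processing.py.

```python
import subprocess
import sys


def build_commands(dump_file, extract_dir, result_dir):
    """Return the two command lines from the steps above, in order.

    Hypothetical helper: only the flags shown in this README are used
    (-o for WikiExtractor, --path and --write-path for processing.py).
    """
    extract_cmd = [
        sys.executable, "-m", "wikiextractor.WikiExtractor",
        str(dump_file), "-o", str(extract_dir),
    ]
    process_cmd = [
        sys.executable, "processing.py",
        "--path", str(extract_dir),
        "--write-path", str(result_dir),
    ]
    return [extract_cmd, process_cmd]


def run_pipeline(dump_file, extract_dir, result_dir):
    """Run both steps one after the other, stopping at the first failure."""
    for cmd in build_commands(dump_file, extract_dir, result_dir):
        # check=True raises CalledProcessError on a non-zero exit code,
        # so processing.py never runs on an incomplete extraction.
        subprocess.run(cmd, check=True)
```

Calling `run_pipeline('your wiki dump file', 'output path', 'result output path')` would then run both steps with the same arguments as the commands above. Passing the arguments as a list rather than a shell string avoids quoting problems with paths that contain spaces.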

Dependency
