The application returns a summary of a text. It is currently implemented with a simple ranking algorithm based on word occurrences.
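
As a rough illustration of what ranking by word occurrences can look like, here is a minimal sketch in Java. It is not the project's actual code: the class name FrequencySummarizer, its methods, the naive sentence splitting, and the scoring rule (a sentence scores the sum of the text-wide counts of its non-stop words) are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical class for illustration only; not taken from the project.
final class FrequencySummarizer {

    /** Splits text into lowercase tokens made of letters and digits. */
    static List<String> tokenize(String text) {
        List<String> words = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!token.isEmpty()) {
                words.add(token);
            }
        }
        return words;
    }

    /** Returns the `count` highest-scoring sentences, in their original order. */
    static List<String> summarize(String text, Set<String> stopWords, int count) {
        // Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
        String[] sentences = text.trim().split("(?<=[.!?])\\s+");

        // Count how often each non-stop word occurs in the whole text.
        Map<String, Integer> frequency = new HashMap<>();
        for (String word : tokenize(text)) {
            if (!stopWords.contains(word)) {
                frequency.merge(word, 1, Integer::sum);
            }
        }

        // Score each sentence as the sum of the counts of its words.
        int[] scores = new int[sentences.length];
        List<Integer> ranked = new ArrayList<>();
        for (int i = 0; i < sentences.length; i++) {
            for (String word : tokenize(sentences[i])) {
                scores[i] += frequency.getOrDefault(word, 0);
            }
            ranked.add(i);
        }

        // Highest score first; the sort is stable, so ties keep text order.
        ranked.sort(Comparator.comparingInt((Integer i) -> -scores[i]));
        int keep = Math.max(0, Math.min(count, ranked.size()));
        List<Integer> top = new ArrayList<>(ranked.subList(0, keep));
        Collections.sort(top); // restore the original sentence order

        List<String> summary = new ArrayList<>();
        for (int index : top) {
            summary.add(sentences[index]);
        }
        return summary;
    }
}
```

Summing raw counts favors long sentences; dividing each score by the sentence length would be one way to offset that, at the cost of favoring very short ones.
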
See example for a sample dump.
- file_x.txt is a file with dummy text, where x is the language code according to the ISO 639-1 standard.
- stopwords-x.txt is a list of stop words for the respective language.
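
The naming convention above can be resolved in code roughly as follows. This is only a sketch: the class ExampleFiles and its methods are hypothetical, and it assumes that a stop-word file is UTF-8 encoded with one word per line, which the project does not state.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper for illustration only; not taken from the project.
final class ExampleFiles {

    /** Rejects anything that is not a two-letter ISO 639 code known to the JDK. */
    static String requireLanguageCode(String code) {
        if (!Arrays.asList(Locale.getISOLanguages()).contains(code)) {
            throw new IllegalArgumentException("Not a two-letter ISO 639 code: " + code);
        }
        return code;
    }

    /** Path of the dummy text for a language, e.g. file_en.txt. */
    static Path textPath(Path directory, String code) {
        return directory.resolve("file_" + requireLanguageCode(code) + ".txt");
    }

    /** Path of the stop-word list for a language, e.g. stopwords-en.txt. */
    static Path stopWordsPath(Path directory, String code) {
        return directory.resolve("stopwords-" + requireLanguageCode(code) + ".txt");
    }

    /** Loads stop words; assumes UTF-8 and one word per line (an assumption). */
    static Set<String> loadStopWords(Path directory, String code) throws IOException {
        Set<String> words = new HashSet<>();
        for (String line : Files.readAllLines(stopWordsPath(directory, code), StandardCharsets.UTF_8)) {
            String word = line.trim().toLowerCase(Locale.ROOT);
            if (!word.isEmpty()) {
                words.add(word); // blank lines are skipped
            }
        }
        return words;
    }
}
```

Validating the code against Locale.getISOLanguages() catches typos such as "eng" before any file access is attempted.
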
Documentation can be found in docs/ with entry point index.html at the root level.
Since the core functionality is implemented, further improvements can be built on top of it. These include (listed in order of likelihood, most likely to be done first):
- Generate Javadocs
- Generate executable
- Analyze text reduction
- Write final summary to a file
- Import text from other file formats (currently only .txt)
- Create GUI
- Generate UML diagram