The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
-
Updated
Jul 2, 2024 - Rust
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Faster, modernized fork of the language identification tool langid.py
Natural language detection library for .NET, suitable for long and short text alike
Recognize languages in text in a Laravel application
GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Implementation of a Pushdown Automaton that recognizes strings belonging to a language valid arithmetic expressions over floating point numbers
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
The todo app Select20 leverages language recognition to manage tasks more efficiently. The distraction-free and blazing fast app supports offline usage and compatibility to CalDav.
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Natural language detection library for Rust. Try demo online: https://whatlang.org/
👄 Fork of the language detector Lingua, with the intention to increase detection speed and reduce memory consumption
Implementation of a parser, a compiler and an interpreter for a programming language called “SimplanPlus” which is based on ANTLR.
A TensorFlow-based spoken language identification
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа
This project focuses on language translation of images to texts using Pytesseract. This program successfully translates 4 different images in terms of languages and sources into english. This program is capable to translate more than 50 languages using Pytesseract and google translate.
The LALR parser generator (LPG) is a tool for developing scanners and parsers. Supports multi-language . Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
This project is about creating an automated youtube videos scraper using Airflow, Selenium, ytb-dlp library.
Add a description, image, and links to the language-recognition topic page so that developers can more easily learn about it.
To associate your repository with the language-recognition topic, visit your repo's landing page and select "manage topics."