Amharic-English machine translation corpus prepared through website crawling and custom preprocessing.
Updated Aug 2, 2018 - Python
Python package that converts Amharic written in the English (Latin) alphabet into Amharic script.
The set of files used for the development of the Amharic Corpus.
A simple bs4-based web crawler for building a corpus for statistical machine translation.
This repository is an implementation of an Amharic speech-to-text setup.
Amharic-Word Embedding-Word2vec is a pre-trained distributed word representation (word embedding) that is free to use and aimed at Amharic NLP researchers.
Speech-to-text data collection with Kafka, Airflow, and Spark: a deployable pipeline that posts and receives text and audio files to and from a data lake, applies transformations in a distributed manner, and loads the results into a warehouse in a format suitable for training a speech-to-text model.
A simple Google-like Telegram bot: search for anything and get links to the first three results, search for a song to get its lyrics, or translate English to Amharic.
A project for scraping and preprocessing data to enhance large language models (LLMs). Provides a scalable and flexible foundation for developing APIs focused on fine-tuning LLMs.