3SA: Semantic Search for Speeches in Audio

Semantic search retrieves documents by understanding the overall meaning of a query rather than relying on simple keyword matches. Recent breakthroughs in NLP, such as BERT, ALBERT, and RoBERTa, have paved the way for powerful semantic search engines. Most of these search algorithms, however, focus on textual information: both the document and the query are in natural language. In this project, we aim to develop a semantic search algorithm for arbitrary objects (objects that are not in natural language), specifically speeches in audio, by leveraging advanced NLP techniques. We introduce 3SA, Semantic Search for Speeches in Audio, which enables semantic search over audio files. We run our experiments on the LibriSpeech dataset and evaluate the search results using basic information retrieval metrics.
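To make the idea concrete, here is a minimal sketch of embedding-based retrieval followed by two common information retrieval metrics. It is an illustration only and not necessarily how the project notebook works: it assumes each audio clip and the text query have already been mapped to vectors in a shared embedding space, and the toy vectors, the clip names, and the helper functions `search`, `precision_at_k`, and `reciprocal_rank` are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vec, index, k=3):
    """Return the names of the k clips most similar to the query vector."""
    scored = sorted(
        index.items(),
        key=lambda item: cosine(query_vec, item[1]),
        reverse=True,
    )
    return [name for name, _ in scored[:k]]


def precision_at_k(ranked, relevant, k):
    """Fraction of the top-k results that are relevant."""
    return sum(1 for name in ranked[:k] if name in relevant) / k


def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant result, or 0.0 if none is found."""
    for pos, name in enumerate(ranked, start=1):
        if name in relevant:
            return 1.0 / pos
    return 0.0


# Toy embeddings standing in for real model outputs (assumption).
index = {
    "clip_a.flac": [0.9, 0.1, 0.0],
    "clip_b.flac": [0.1, 0.8, 0.1],
    "clip_c.flac": [0.7, 0.3, 0.1],
}
query = [1.0, 0.2, 0.0]          # embedding of a text query (assumption)
relevant = {"clip_c.flac"}       # hypothetical ground-truth label

ranked = search(query, index, k=3)
print(ranked)
print(precision_at_k(ranked, relevant, k=2))
print(reciprocal_rank(ranked, relevant))
```

In a real system the vectors would come from a trained model, and the metrics would be averaged over many queries rather than computed for a single one.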

Run instructions

Make sure you have Python >= 3.6. Set up a virtual environment, install the dependencies, and start Jupyter:

python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
jupyter notebook

Run the project notebook, project.ipynb.
