GitHub - Shivamkak19/Deepfake-Detector

VoiceProtect - Deepfake Audio Detector - Implemented on the Tortoise-TTS Library

VoiceProtect estimates whether an audio file or a live audio stream was generated with AI. As deepfake audio scams become more prevalent, reliable deepfake audio detectors will become increasingly valuable. The app is built on the Tortoise-TTS library: it uses the AudioMiniEncoderWithClassifierHead class together with the classifier weights published by Tortoise-TTS on Hugging Face (saved as classifier.pth in the root of this repository).

The app was originally intended for real-time deployment on iOS/Android call data, which is not accessible through public APIs. The pyaudio-based audio recording input serves as a prototype for live scam-call detection on call data.
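As a rough illustration of the recording prototype, the sketch below saves raw PCM chunks (such as the buffers a pyaudio input stream returns) to a WAV file that could then be passed to the classifier. It is not the repository's inputAudio.py: the function names, the 16 kHz sample rate, and the 16-bit mono format are all assumptions chosen for the example, and only the standard-library wave module is used.

```python
import wave


def save_pcm_as_wav(path, frames, sample_rate=16000, channels=1, sample_width=2):
    """Write raw PCM chunks (e.g. microphone buffers) to a WAV file.

    All defaults are illustrative assumptions: 16 kHz, mono, 16-bit samples.
    """
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)  # bytes per sample; 2 means 16-bit
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"".join(frames))


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()
```

In a live setting, `frames` would be filled by repeatedly reading fixed-size chunks from the input device; here the capture step is deliberately left out so the example stays independent of any audio hardware.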

View Product · Report Bug · Request Feature

Table of Contents

- Built With
- Getting Started
  - Prerequisites
  - Installation
- Usage
- Roadmap
- Contributing
- License
- Contact
- Acknowledgments

Built With

(back to top)

Getting Started

The steps below set up VoiceProtect on your local machine. Be careful to install both the Python library requirements and the system requirements.

Prerequisites

To run this project, upgrade pip to the latest version and install the system requirements listed below.

- ffmpeg (used by pydub): https://ffmpeg.org/download.html
- PortAudio (needed by pyaudio; packaged as portaudio19-dev on Debian/Ubuntu): on macOS, install it with Homebrew as shown below; on Windows, it should be installed implicitly with pyaudio.

Upgrade pip:
```
pip install --upgrade pip
```
macOS only:
```
brew install portaudio
```
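Before moving on, it can help to confirm that the system requirements are actually visible from the shell. The following is a minimal sketch, not part of the repository, that checks whether command-line tools such as ffmpeg are on PATH; the function name `missing_system_tools` is hypothetical. PortAudio is a library rather than a command, so this check does not cover it.

```python
import shutil


def missing_system_tools(tools=("ffmpeg",)):
    """Return the names of required command-line tools that are not on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_system_tools()
    if missing:
        print("Missing system tools:", ", ".join(missing))
    else:
        print("All required system tools were found on PATH.")
```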

Installation

Clone the repo

```
git clone https://github.com/Shivamkak19/Deepfake-Detector.git
```

Switch to the tortoise_tts folder inside the cloned repository
```
cd Deepfake-Detector/tortoise_tts
```
Install dependencies
```
pip install -r requirements.txt
```
Deploy Streamlit app on local server
```
streamlit run voiceProtect_app.py
```

(back to top)

Usage

Use the local VoiceProtect deployment to estimate how likely it is that an input audio file or live audio recording contains audio created with generative AI. Results are available once the Streamlit app has finished processing its function calls (as indicated in the product pictures). The accuracy of the detection depends on the pretrained Tortoise-TTS model and functions described in the project description above.
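To make the notion of a likelihood concrete, the sketch below shows how a pair of raw classifier scores (logits) can be turned into a probability with a softmax. It is only an illustration under stated assumptions: that the classifier produces two logits per clip, and that index 0 corresponds to the "AI-generated" class. Neither the function names nor the index choice is taken from this repository.

```python
import math


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]


def ai_generated_probability(logits, ai_index=0):
    """Probability of the assumed 'AI-generated' class (index 0 is an assumption)."""
    return softmax(logits)[ai_index]
```

For example, equal logits such as `[0.0, 0.0]` give a probability of 0.5, meaning the classifier has no preference either way.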

Make sure to launch ./tortoise_tts/voiceProtect_app.py. The main app must be launched from within the tortoise_tts folder, because tortoise_tts must run in the main thread to resolve signal issues with the atlastk library (see issues.txt).
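The README does not spell out the signal issue, so the following is only a plausible illustration, offered as an assumption rather than a diagnosis: in Python, signal handlers can be registered only from the main thread, so a library that calls `signal.signal` while running in a worker thread fails with a ValueError. The helper names below are hypothetical.

```python
import signal
import threading


def try_register_handler(results):
    """Attempt to register a SIGINT handler and record the outcome."""
    try:
        signal.signal(signal.SIGINT, signal.SIG_DFL)
        results.append("ok")
    except ValueError as error:
        # Raised when this function runs outside the main thread.
        results.append(f"failed: {error}")


def register_from_worker_thread():
    """Run the registration attempt in a worker thread and return its outcome."""
    results = []
    worker = threading.Thread(target=try_register_handler, args=(results,))
    worker.start()
    worker.join()
    return results[0]
```

Calling `register_from_worker_thread()` reports a failure, which is the kind of error that launching the code in the main thread avoids.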

Uploaded file audio classification:

Live audio stream classification:

Results processing:

Note: the live Streamlit deployment of VoiceProtect currently has trouble detecting an input device for audio recording with pyaudio. Check back here for updates.

(back to top)

Roadmap

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contributing

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

- Fork the Project
- Create your Feature Branch (git checkout -b feature/newFeature)
- Commit your Changes (git commit -m 'Add some new feature to Deepfake-Detector')
- Push to the Branch (git push origin feature/newFeature)
- Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE for more information.

(back to top)

Contact

Shivam Kak: [email protected]
Project Link: https://github.com/Shivamkak19/Deepfake-Detector

(back to top)

Acknowledgments

- AI Anytime, for tutorials on the Tortoise-TTS library, useful function calls, and integration with other relevant libraries (torchaudio, librosa, etc.).
- AI Anytime YouTube Channel

(back to top)
