Skip to content

Latest commit

 

History

History
8 lines (5 loc) · 455 Bytes

File metadata and controls

8 lines (5 loc) · 455 Bytes

Malicious-Prompt-Filter-for-RAG-Database

GitHub repository for a tool that detects and filters malicious prompts before they are entered into a Retrieval-Augmented Generation (RAG) database, ensuring data integrity and security.

This filter is currently for Text data. Image data can be handled after Stephanalysis and converted to text to check for malicious prompts.

Model 1 - 98.4% accuracy.

Random Forest based model - 0.9954359274429491 accuracy