evolvingstuff/kNNGen: Using k-nearest neighbors and infinite-lookback n-grams with LLMs

kNNGen

Experimenting with some of the ideas in this paper:

Generalization through Memorization: Nearest Neighbor Language Models

Later, it might also incorporate ideas from this paper:

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens
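For orientation, the central idea of the first paper (kNN-LM) is to mix the base model's next-token distribution with a distribution built from retrieved nearest neighbors. The sketch below illustrates that interpolation in plain Python. It is not taken from main.py: the function names, the toy inputs, and the value of `lam` are assumptions made for illustration only.

```python
import math
from collections import defaultdict


def knn_distribution(neighbors):
    """Turn retrieved neighbors into a next-token distribution.

    `neighbors` is a list of (distance, next_token) pairs, as a vector
    store would return them. Each neighbor votes for its token with
    weight exp(-distance); the weights are then normalized.
    """
    if not neighbors:
        return {}
    # Subtract the smallest distance for numerical stability; this
    # cancels out after normalization.
    min_d = min(d for d, _ in neighbors)
    weights = defaultdict(float)
    for d, token in neighbors:
        weights[token] += math.exp(-(d - min_d))
    total = sum(weights.values())
    return {token: w / total for token, w in weights.items()}


def interpolate(p_lm, p_knn, lam=0.25):
    """Mix the two distributions: lam * p_knn + (1 - lam) * p_lm.

    `lam` is a tunable hyperparameter; 0.25 here is an arbitrary
    illustrative default, not a value taken from this repository.
    """
    vocab = set(p_lm) | set(p_knn)
    return {
        t: lam * p_knn.get(t, 0.0) + (1.0 - lam) * p_lm.get(t, 0.0)
        for t in vocab
    }
```

In a setup like this one, the (distance, next_token) pairs would presumably come from a vector search over stored context embeddings, which is where Milvus fits in.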

Setup

Requires Docker.

Install and run Milvus, as explained here:

# Download the installation script
$ curl -sfL https://raw.githubusercontent.com/milvus-io/milvus/master/scripts/standalone_embed.sh -o standalone_embed.sh

# Start the Docker container
$ bash standalone_embed.sh start

Optionally, run milvus_test.py to check that Milvus started correctly.
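If you want a quick check that does not depend on the Milvus client library, a plain TCP probe can show whether anything is listening. This is a minimal sketch, not a copy of milvus_test.py (whose contents are not shown here). It assumes Milvus standalone is exposed on its default port, 19530, on localhost, and the helper name `milvus_port_open` is made up for this example.

```python
import socket


def milvus_port_open(host="localhost", port=19530, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds.

    19530 is assumed to be the Milvus port (its default); adjust it if
    your container maps a different one. An open port only shows that
    something is listening, not that Milvus is fully healthy.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `milvus_port_open()` after starting the container should return True once the service is up; a False result suggests the container is not running or the port is mapped differently.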

Create a .env file and add your Hugging Face API token to it, like so:

HF_TOKEN=your_hugging_face_api_token_here

Alternatively, set HF_TOKEN as an environment variable on your system.
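The two options above (environment variable or .env file) can be combined in a small lookup helper. The sketch below uses only the standard library and is an assumption about how such a lookup could work, not a description of what main.py does. The function name `load_hf_token` and the precedence order (environment first, then file) are illustrative choices.

```python
import os
from pathlib import Path


def load_hf_token(env_path=".env"):
    """Return HF_TOKEN from the environment, else from a .env file.

    Returns None if the token is found in neither place.
    """
    token = os.environ.get("HF_TOKEN")
    if token:
        return token

    path = Path(env_path)
    if path.is_file():
        for line in path.read_text().splitlines():
            line = line.strip()
            # Skip blank lines, comments, and lines without KEY=VALUE.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            if key.strip() == "HF_TOKEN":
                # Drop optional surrounding quotes.
                return value.strip().strip("'\"")
    return None
```

Checking the environment first means a token exported in your shell takes priority over the one in the file, which is convenient for temporarily switching accounts.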

Files in this repository:

.gitignore
README.md
TODO.txt
main.py
milvus_test.py
requirements.txt
