Skip to content

Is there a way to download models without engaging GPU memory? #416

Answered by AlpinDale
murtaza-nasir asked this question in Q&A
Discussion options

You must be logged in to vote

Yes, you may download the model with huggingface_hub first.

pip install huggingface_hub hf_transfer
export HF_HUB_ENABLE_HF_TRANSFER=1
huggingface-cli download LoneStriker/MiquMaid-v2-2x70B-DPO-GPTQ --local-dir-use-symlinks False --local-dir ./MiquMaid-v2-2x70B-DPO-GPTQ

You can also use the --cache-dir flag to specify the cache directory, if ~/.cache doesn't have enough space to accommodate each model chunk.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@murtaza-nasir
Comment options

Answer selected by murtaza-nasir
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants