Create guide for how to deploy an existing Hugging Face model on self-hosted LLM Engine #141

Open
rkaplan opened this issue Jul 19, 2023 · 2 comments
Labels: documentation, enhancement

rkaplan (Contributor) commented Jul 19, 2023

Requested by Biju on Twitter.

yixu34 added the documentation and enhancement labels Jul 19, 2023
yixu34 (Member) commented Jul 20, 2023

We're currently wrapping up some testing for a self-contained helm install on your own EKS cluster. Once that's ready, we'll ship the docs too.

yixu34 (Member) commented Jul 21, 2023

Btw, just to clarify and acknowledge: #153 solves part, but not all, of the ask. It shows how to deploy an existing model from our Model Zoo on a self-hosted LLM Engine, and the Model Zoo covers only a subset of Hugging Face models. It does not show how to add to the Model Zoo, i.e. build an endpoint from an arbitrary Hugging Face model. That will require some follow-up work.
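
For reference, a minimal sketch of the part #153 does cover: querying a Model Zoo model through a self-hosted deployment with the Python client. The package name (scale-llm-engine) and the model name (llama-2-7b) are illustrative assumptions; see the self-hosting docs for how to point the client at your own gateway.

```python
# Minimal sketch, assuming the Python client is installed via
# `pip install scale-llm-engine` and the self-hosted LLM Engine gateway is
# reachable from this environment (see the self-hosting docs for client
# configuration). "llama-2-7b" stands in for any existing Model Zoo name.
from llmengine import Completion

# Query a model that is already in the Model Zoo; arbitrary Hugging Face
# models are not covered by this path (that is the follow-up work above).
response = Completion.create(
    model="llama-2-7b",
    prompt="Why is the sky blue?",
    max_new_tokens=100,
    temperature=0.2,
)
print(response.output.text)
```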
