Skip to content

Pinned Loading

  1. FastChat FastChat Public

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    Python 36.3k 4.5k

Repositories

Showing 6 of 6 repositories
  • arena-hard-auto Public

    Arena-Hard-Auto: An automatic LLM benchmark.

    lm-sys/arena-hard-auto’s past year of commit activity
    Jupyter Notebook 396 Apache-2.0 48 3 1 Updated Sep 3, 2024
  • FastChat Public

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    lm-sys/FastChat’s past year of commit activity
    Python 36,325 Apache-2.0 4,469 752 (3 issues need help) 93 Updated Sep 2, 2024
  • lm-sys/lm-sys.github.io’s past year of commit activity
    JavaScript 49 20 1 1 Updated Aug 29, 2024
  • RouteLLM Public

    A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

    lm-sys/RouteLLM’s past year of commit activity
    Python 2,830 Apache-2.0 210 20 6 Updated Aug 10, 2024
  • llm-decontaminator Public

    Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

    lm-sys/llm-decontaminator’s past year of commit activity
    Python 185 Apache-2.0 13 4 0 Updated Dec 20, 2023
  • vicuna-blog-eval Public archive

    The code and data for the GPT-4 based benchmark in the vicuna blog post

    lm-sys/vicuna-blog-eval’s past year of commit activity
    Python 33 Apache-2.0 8 0 0 Updated Aug 2, 2023

Most used topics

Loading…