Skip to content
Change the repository type filter

All

    Repositories list

    • HELM for hidden dataset eval
      Python
      Apache License 2.0
      2011Updated Nov 21, 2024Nov 21, 2024
    • Website for NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day
      HTML
      MIT License
      8700Updated Aug 2, 2024Aug 2, 2024
    • Python
      0000Updated Jun 5, 2024Jun 5, 2024
    • The eval scripts used to run and eval submissions
      Shell
      MIT License
      0000Updated Dec 13, 2023Dec 13, 2023
    • Jupyter Notebook
      0001Updated Nov 19, 2023Nov 19, 2023
    • helm

      Public
      Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
      Python
      Apache License 2.0
      254100Updated Oct 26, 2023Oct 26, 2023
    • NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
      Python
      5625400Updated Oct 23, 2023Oct 23, 2023