    Repositories list

    • Baselines for Neural MMO -- new users should treat this repo as a starter project
      Python
      MIT License
      39501 Updated Nov 26, 2024
    • Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
      Python
      MIT License
      2631500 Updated May 30, 2024
    • DRLX

      Public
      Diffusion Reinforcement Learning Library
      Python
      MIT License
      717481 Updated Feb 13, 2024
    • trlx

      Public
      A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
      Python
      MIT License
      4704.5k7915 Updated Jan 8, 2024
    • autocrit

      Public
      A repository for transformer critique learning and generation
      Python
      178533 Updated Dec 7, 2023
    • OpenELM

      Public
      Evolution Through Large Models
      Python
      MIT License
      8669661 Updated Nov 15, 2023
    • Jupyter Notebook
      Apache License 2.0
      52201 Updated Aug 27, 2023
    • Jupyter Notebook
      0100 Updated Aug 10, 2023
    • magiCARP is an API used for crossencoder training.
      Python
      31040 Updated Jul 27, 2023
    • tinypar

      Public
      Python
      8000 Updated Jul 16, 2023
    • Polygraph

      Public
      RLHF Mechanistic Interpretability and Deception
      MIT License
      1600 Updated Jul 14, 2023
    • squeakily

      Public
      A library for squeakily cleaning and filtering language datasets.
      Jupyter Notebook
      Apache License 2.0
      94522 Updated Jul 10, 2023
    • maxtext

      Public
      A simple, performant and scalable Jax LLM!
      Python
      Apache License 2.0
      297100 Updated Jun 30, 2023
    • sft

      Public
      Python
      3201 Updated Jun 29, 2023
    • Python
      MIT License
      4800 Updated Jun 21, 2023
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language model based chatbots.
      Python
      Apache License 2.0
      4.6k400 Updated Apr 26, 2023
    • This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
      Jupyter Notebook
      Apache License 2.0
      32500 Updated Apr 21, 2023
    • pilev2

      Public
      Python
      MIT License
      91113 Updated Mar 24, 2023
    • goosebox

      Public
      sandboxed eval server for running code snippets
      MIT License
      1100 Updated Mar 1, 2023
    • cheese

      Public
      Used for adaptive human in the loop evaluation of language and embedding models.
      Python
      MIT License
      2430430 Updated Mar 1, 2023
    • Code-Pile

      Public
      This repository contains all the code for collecting large scale amounts of code from GitHub.
      Python
      MIT License
      29105184 Updated Feb 17, 2023
    • Python
      MIT License
      53413 Updated Jan 29, 2023
    • Stuff related to scraping the Code Review StackExchange
      Python
      51101 Updated Jan 19, 2023
    • For experiments involving instruct gpt. Currently used for documenting open research questions.
      MIT License
      371250 Updated Nov 8, 2022
    • Code used for sourcing and cleaning the BigScience ROOTS corpus
      E
      Apache License 2.0
      41400 Updated Nov 6, 2022
    • 👀
      MIT License
      0320 Updated Oct 7, 2022
    • Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
      Python
      Other
      60200 Updated Jul 28, 2022