Pretrained Language Models (PLMs) have achieved great success on many NLP tasks. With the rapid development of PLMs, this list collects these successful and remarkable language models.
Models are grouped by the companies and organizations that released them.
- PLUG (Pre-training for Language Understanding and Generation)
- Alexa Teacher Model
- Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems
- AlexaTM 20B
- github repo (model hasn't been publicly released yet, stay tuned!)
- ERNIE family: github repo
- ERNIE 1.0
- ERNIE 2.0
- ERNIE 3.0
  - ERNIE 3.0 Titan
- ERNIE 3.0 Zeus
- ERNIE-GEN
- ERNIE-ViL
- ERNIE-ViLG
- ERNIE-Gram
- ERNIE-M
- ERNIE-Doc
- ERNIE-Tiny
- ERNIE-GeoL
- ERNIE-SPARSE
- ERNIE-Search
- PLATO family: github repo
- PLATO-1
- PLATO-2
- PLATO-XL
- PLATO-KAG
- T0 series
- BLOOM
- Gopher
- Chinchilla
- Retro
- Improving language models by retrieving from trillions of tokens
  - github repo (non-official implementation)
- GPT-Neo
- GPT-J
- GPT-NeoX
- XGLM
- XLM-R
- M2M-100
- Blender
- Recipes for building an open-domain chatbot
- Internet-Augmented Dialogue Generation
- Beyond Goldfish Memory: Long-Term Open-Domain Conversation
- BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
- github repo
  - BlenderBot: A Conversational AI Prototype
- Hugging Face 🤗
- OPT
- NLLB
- Atlas
- PEER
- CPT
- T5 family
- T5 series
- ByT5
- ExT5
- LongT5
- LaMDA
- LaMDA: Language Models for Dialog Applications
  - github repo (non-official)
- FLAN
- GShard
- Meena
- Towards a Human-like Open-Domain Chatbot
  - github repo (non-official)
- PaLM
- PaLM: Scaling Language Modeling with Pathways
  - github repo (non-official)
- UL2
- Minerva
- Flan-PaLM
HFL refers to the Joint Laboratory of HIT and iFLYTEK Research (哈工大讯飞联合实验室).
- PanGu (盘古) family
- PanGu-Alpha
- PanGu-Bot
- PanGu-Coder
- NEZHA (哪吒)
Huawei-Noah's official github repo for pretrained language models.
- Yuan 1.0
- Fengshenbang (封神榜): an open-source initiative for Chinese pretrained language models
- Erlangshen (二郎神)
- Wenzhong (闻仲)
  - Randeng (燃灯)
- Yuyuan (余元)
- Bigan (比干)
- Zhouwenwang (周文王)
- Taiyi (太乙)
IDEA refers to the International Digital Economy Academy (粤港澳大湾区数字经济研究院).
- Vega v1 (织女)
- Megatron-Turing NLG family
- DeBERTa series
- DeepNMT
- DeepNet
- GODEL
- METRO
- Megatron-LM
- GPT-family
- GPT, GPT-2, GPT-3
- WebGPT
- InstructGPT
- Codex
- OBERT
- Motian (摩天)
- ShenZhou (神舟)
- ShenNonG (神农)
- HunYuan (混元)
- THUDM
- Wenhui (文汇)
- GLM-130B
THUDM refers to "Data Mining Research Group at Tsinghua University".
- Tsinghua AI
- THU-CoAI
THU-CoAI refers to the Conversational AI group at Tsinghua University.
- BAAI
- Chinese-Transformer-XL
- GLM
🔥Coming soon!🔥
Thanks to all these companies and organizations for the money and effort they have invested in building and training these large models for the benefit of all human beings.
This repo is inspired by awesome pretrained Chinese NLP models.