- 大模型(大型语言模型,LLMs)是当下AI和NLP研究与产业中最重要的方向之一。
- 模型一览
Model | 作者l | Size l | 类型 l | 开源l |
---|---|---|---|---|
LLaMa | Meta AI | 7B-65B | Decoder | open |
OPT | Meta AI | 125M-175B | Decoder | open |
T5 | 220M-11B | Encoder-Decoder | open | |
mT5 | 235M-13B | Encoder-Decoder | open | |
UL2 | 20B | Encoder-Decoder | open | |
PaLM | 540B | Decoder | no | |
LaMDA | 2B-137B | Decoder | no | |
FLAN-T5 | 同T5 | Encoder-Decoder | open | |
FLAN-UL2 | 同U2 | Encoder-Decoder | open | |
FLAN-PaLM | 同PaLM | Decoder | no | |
FLAN | 同LaMDA | Decoder | no | |
BLOOM | BigScience | 176B | Decoder | open |
T0 | BigScience | 3B | Decoder | open |
BLOOMZ | BigScience | 同BLOOM | Decoder | open |
mT0 | BigScience | 同T0 | Decoder | open |
GPT-Neo | EleutherAI | 125M-2.7B | Decoder | open |
GPT-NeoX | EleutherAI | 20B | Decoder | open |
GPT3 | OpenAI | 175B (davinci) | Decoder | no |
GPT4 | OpenAI | unknown | OpenAI | no |
InstructGPT | OpenAI | 1.3B | Decoder | no |
Alpaca | Stanford | 同LLaMa | Decoder | open |