このページについて
GPTに代表される生成モデルの主要論文を備忘録も兼ねてまとめます。
言語モデルそのもの系
- Language Models are Few-Shot Learners
- Scaling Language Models: Methods, Analysis & Insights from Training Gopher
- Unsupervised Neural Machine Translation with Generative Language Models Only
- Training Compute-Optimal Large Language Models
- Flamingo: a Visual Language Model for Few-Shot Learning
- JURASSIC-1: TECHNICAL DETAILS AND EVALUATION
ユースケース系
- Competition-Level Code Generation with AlphaCode
- Training language models to follow instructions with human feedback
- WebGPT: Browser-assisted question-answering with human feedback
- Evaluating Large Language Models Trained on Code - arXiv