An Overview of LLMs (Large Language Models)

Last updated at 2024-06-10, posted at 2024-05-05

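Introduction

LLMs (Large Language Models) have been a hot topic in recent years. I had heard the term often but never had a clear picture of what they are or what they can do, so I put together this summary of my own understanding.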

No.	Table of contents
1	Introduction
2	What is an LLM?
3	Implementation example
4	Conclusion
5	References

Implementation example

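from transformers import T5Tokenizer, AutoModelForCausalLM

# Load the tokenizer and the pretrained Japanese GPT-2 model published by rinna
tokenizer = T5Tokenizer.from_pretrained("rinna/japanese-gpt2-medium")
model = AutoModelForCausalLM.from_pretrained("rinna/japanese-gpt2-medium")

# Prompt to continue (roughly: "It's raining today. So I")
first_sentence = "今日は雨。だから私は"
# Convert the prompt to a PyTorch tensor of token IDs, without special tokens
x = tokenizer.encode(first_sentence, return_tensors="pt", add_special_tokens=False)

# Generate a continuation; max_length caps the total token count, prompt included
y = model.generate(x, max_length=100)
# Convert the generated token IDs back to text
generated_sentence = tokenizer.batch_decode(y, skip_special_tokens=True)
print(generated_sentence)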

Output

['今日は雨。だから私は、いつもより少しだけ早めに起きて、洗濯物を干して、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を畳んで、洗濯物を']
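The output above gets stuck repeating the same phrase until it hits the length limit. As far as I understand, this is typical of greedy decoding (always picking the single most likely next token), which is what `generate` does when no sampling or beam-search options are given and the model's generation config does not say otherwise. Below is a minimal pure-Python sketch of that behaviour. It does not use the rinna model: `TOY_SCORES`, `greedy_decode`, and `no_repeat_bigram` are hypothetical names made up for illustration, and the scores are invented.

```python
# Hypothetical toy "language model": for each previous token, a score for each
# candidate next token. The numbers are invented purely for illustration.
TOY_SCORES = {
    "I": {"fold": 0.6, "hang": 0.4},
    "fold": {"laundry": 0.9, "<eos>": 0.1},
    "hang": {"laundry": 0.8, "<eos>": 0.2},
    "laundry": {"and": 0.7, "<eos>": 0.3},
    "and": {"fold": 0.6, "hang": 0.3, "<eos>": 0.1},
}


def greedy_decode(scores, start, max_length, no_repeat_bigram=False):
    """Repeatedly append the highest-scoring next token.

    max_length counts the start token too. With no_repeat_bigram=True, a
    candidate is skipped if it would recreate a (previous, next) pair that
    already appears in the generated sequence.
    """
    tokens = [start]
    seen_bigrams = set()
    while len(tokens) < max_length:
        prev = tokens[-1]
        candidates = dict(scores.get(prev, {}))
        if no_repeat_bigram:
            candidates = {
                tok: s for tok, s in candidates.items()
                if (prev, tok) not in seen_bigrams
            }
        if not candidates:
            break  # nothing left to choose from
        next_token = max(candidates, key=candidates.get)
        if next_token == "<eos>":
            break  # the toy model decided to stop
        seen_bigrams.add((prev, next_token))
        tokens.append(next_token)
    return tokens


# Plain greedy decoding loops over "fold laundry and" until max_length is hit.
print(" ".join(greedy_decode(TOY_SCORES, "I", 10)))
# -> I fold laundry and fold laundry and fold laundry and

# Blocking repeated bigrams breaks the loop.
print(" ".join(greedy_decode(TOY_SCORES, "I", 10, no_repeat_bigram=True)))
# -> I fold laundry and fold
```

In transformers itself, `generate` accepts options such as `no_repeat_ngram_size`, `repetition_penalty`, and `do_sample` (with `top_k` / `top_p`) that target this kind of repetition. I have not tried them with this model, so how much they improve the output here is untested.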

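Conclusion

This article briefly summarized what LLMs are and walked through a simple implementation example.
I do not yet understand the underlying mechanisms, such as the Transformer, in any detail, so I plan to study those next.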

References

・rinna's GPT-2 large language model
・Udemy course: Introduction to How Large Language Models (LLMs) Work
