MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI, AI(8)
garet Mitchell, Dhruv Batra, C. Lawrence Zitnick, and Devi Parikh. VQA: Visual Question ...
1
1
Comment0
3 search resultsShowing 1~3 results
You need to log-in
garet Mitchell, Dhruv Batra, C. Lawrence Zitnick, and Devi Parikh. VQA: Visual Question ...
n. 2022. Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question ...
n. 2022. Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question ...
3 search resultsShowing 1~3 results
Qiita is a knowledge sharing service for engineers.