Classifying the SNLI Dataset with Word2Vec + LSTM

Posted at 2017-08-23. Last updated at 2017-08-23.

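SNLI is a dataset for classifying the relationship between two given sentences, a premise and a hypothesis (entailment, contradiction, or neutral).
This article describes a Keras implementation of a model that combines pretrained Word2Vec¹ with an LSTM to solve this task.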

Details of the SNLI² dataset are covered in the article "SNLIデータセットの読み込み方" (How to load the SNLI dataset)³.
I hope you find this useful.

Preparation

Downloading the SNLI dataset

The dataset can be downloaded from the Stanford Natural Language Inference (SNLI) Corpus page.

wget https://nlp.stanford.edu/projects/snli/snli_1.0.zip
unzip snli_1.0.zip
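
As a rough illustration of what the unzipped data looks like, the sketch below reads SNLI's JSON-lines files and yields (premise, hypothesis, label) triples. The field names `sentence1`, `sentence2`, and `gold_label` come from the SNLI distribution; the function name `read_snli` and the integer label mapping are assumptions made for this example, not taken from the original code.

```python
import json

# Hypothetical label-to-integer mapping chosen for this sketch.
LABELS = {"entailment": 0, "neutral": 1, "contradiction": 2}


def read_snli(lines):
    """Yield (premise, hypothesis, label_id) from SNLI JSON-lines records."""
    for line in lines:
        record = json.loads(line)
        label = record["gold_label"]
        # A gold_label of "-" marks pairs without annotator consensus; skip them.
        if label not in LABELS:
            continue
        yield record["sentence1"], record["sentence2"], LABELS[label]


# Example usage (path assumes the archive was unzipped in the current directory):
# with open("snli_1.0/snli_1.0_train.jsonl", encoding="utf-8") as f:
#     train = list(read_snli(f))
```

Any iterable of JSON strings works as input, so the same function can be used on an open file or on a small in-memory sample.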

Downloading the pretrained Word2Vec weights

Word2Vec weights pretrained on Google News can be downloaded via the "GoogleNews-vectors-negative300.bin.gz" link on the referenced site¹.
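
One common way to use these weights is to copy them into an embedding matrix indexed by the vocabulary of the dataset. The following is a minimal sketch under stated assumptions: `word_index` is a hypothetical dict mapping each word to an integer from 1 to N (row 0 is reserved for padding), and the vectors are loaded with gensim's `KeyedVectors.load_word2vec_format`. The original code may do this differently.

```python
import numpy as np


def load_word2vec(path="GoogleNews-vectors-negative300.bin.gz"):
    """Load the pretrained vectors with gensim (imported lazily; path is an assumption)."""
    from gensim.models import KeyedVectors

    return KeyedVectors.load_word2vec_format(path, binary=True)


def build_embedding_matrix(word_index, vectors, dim=300):
    """Return a (len(word_index) + 1, dim) matrix; row 0 is reserved for padding."""
    matrix = np.zeros((len(word_index) + 1, dim), dtype="float32")
    for word, idx in word_index.items():
        # Words missing from the pretrained vocabulary keep an all-zero row.
        if word in vectors:
            matrix[idx] = vectors[word]
    return matrix
```

`build_embedding_matrix` only needs `vectors` to support `word in vectors` and `vectors[word]`, so a plain dict of arrays can stand in for the full Google News model when experimenting.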

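Implementation

The code⁴ trains a model that combines pretrained Word2Vec with an LSTM on the SNLI dataset. lystdo's "LSTM with word2vec embeddings"⁵ is a useful reference.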
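
To give a sense of the overall shape of such a model, here is a minimal Keras sketch. It is not the author's code⁴. It assumes a frozen embedding layer initialised from a precomputed `embedding_matrix`, a single LSTM shared between premise and hypothesis, and a small dense classifier over the concatenated encodings. The function name `build_model` and the defaults `max_len=42` and `lstm_units=300` are illustrative assumptions.

```python
from tensorflow import keras
from tensorflow.keras import layers


def build_model(embedding_matrix, max_len=42, lstm_units=300):
    """Sentence-pair classifier sketch; layer sizes are illustrative assumptions."""
    vocab_size, dim = embedding_matrix.shape

    premise = keras.Input(shape=(max_len,), dtype="int32", name="premise")
    hypothesis = keras.Input(shape=(max_len,), dtype="int32", name="hypothesis")

    # Frozen embedding layer initialised from the pretrained Word2Vec vectors.
    embed = layers.Embedding(
        vocab_size,
        dim,
        embeddings_initializer=keras.initializers.Constant(embedding_matrix),
        trainable=False,
        name="word2vec_embedding",
    )
    # One LSTM shared by both sentences encodes each into a fixed-size vector.
    encode = layers.LSTM(lstm_units, name="sentence_encoder")

    merged = layers.Concatenate()(
        [encode(embed(premise)), encode(embed(hypothesis))]
    )
    hidden = layers.Dense(lstm_units, activation="relu")(merged)
    # Three output classes: entailment, neutral, contradiction.
    output = layers.Dense(3, activation="softmax")(hidden)

    model = keras.Model([premise, hypothesis], output)
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

Training would then pass two padded integer arrays of shape `(n_samples, max_len)` and an integer label array to `model.fit`. Keeping the embedding layer frozen is a design choice that preserves the pretrained vectors and reduces the number of trainable parameters.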

Experimental results

Train: 82.01%
Valid: 80.12%
Epochs: 18

These results confirm that the model performs roughly on par with the existing 300D LSTM⁶ baseline.
To improve accuracy further, models such as Decomposable Attention⁷ ⁸ or ESIM⁹ ¹⁰ can be used.

References

1. Mikolov et al., Efficient Estimation of Word Representations in Vector Space, 2013.
2. Bowman et al., A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015.
3. namakemono, SNLIデータセットの読み込み方 (How to load the SNLI dataset), 2017.
4. namakemono, Trains a LSTM with Word2Vec on the SNLI Dataset, 2017.
5. lystdo, LSTM with word2vec embeddings, 2017.
6. Bowman et al., The Stanford Natural Language Inference (SNLI) Corpus.
7. Parikh et al., A Decomposable Attention Model for Natural Language Inference, 2016. (Original Decomposable Attention paper)
8. namakemono, Decomposable Attentionアルゴリズムの解説と実装 (Explanation and implementation of the Decomposable Attention algorithm), 2017.
9. Chen et al., Enhanced LSTM for Natural Language Inference, 2017. (Original ESIM paper)
10. namakemono, ESIMアルゴリズムの解説と実装 (Explanation and implementation of the ESIM algorithm), 2017.
