

HTML Parser Notes

Posted at 2014-07-31

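Apparently, a tool that extracts only the text portions of HTML, or pulls out the contents of specific tags, is called an HTML parser.
I picked out the candidates that look usable, license included, from the references.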
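
To make the two use cases concrete (extracting only the text, and extracting the contents of a specific tag), here is a minimal sketch using `html.parser` from Python's standard library. This is not one of the libraries evaluated in this memo. The class name `TextAndTagExtractor`, the helper `extract`, and the sample HTML are all hypothetical and exist only for illustration.

```python
from html.parser import HTMLParser


class TextAndTagExtractor(HTMLParser):
    """Collects all visible text, plus the text found inside one chosen tag."""

    def __init__(self, target_tag):
        super().__init__()
        self.target_tag = target_tag
        self.all_text = []      # every non-empty text node outside script/style
        self.target_text = []   # text nodes found inside target_tag
        self._depth = 0         # nesting depth inside target_tag
        self._skip = 0          # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        if tag == self.target_tag:
            self._depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip > 0:
            self._skip -= 1
        if tag == self.target_tag and self._depth > 0:
            self._depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if not text or self._skip:
            return  # ignore whitespace-only nodes and script/style bodies
        self.all_text.append(text)
        if self._depth:
            self.target_text.append(text)


def extract(html, target_tag):
    """Return (all text nodes, text nodes inside target_tag)."""
    parser = TextAndTagExtractor(target_tag)
    parser.feed(html)
    parser.close()
    return parser.all_text, parser.target_text


# Hypothetical sample document.
sample = (
    "<html><head><title>Demo</title><style>p{color:red}</style></head>"
    "<body><h1>Heading</h1><p>First <b>bold</b> paragraph.</p>"
    "<p>Second paragraph.</p></body></html>"
)
all_text, paragraphs = extract(sample, "p")
print(all_text)
print(paragraphs)
```

A dedicated parser library typically offers selector-style queries on top of this kind of event handling, which is one reason to compare candidates rather than hand-roll the extraction.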

Evaluation result

As of 2014-07-31, jsoup looks like the best choice.
It is MIT-licensed, and both its parsing features and its usage look reasonable. Its last update is also fairly recent.

References
