More than 5 years have passed since last update.

[python] htmlファイルを読み込んでスクレイピング練習

Last updated at 2020-05-30Posted at 2020-02-20

はじめに

スクレイピングやってて躓いたときにcopy outerHTMLで要素解析すること多いと思うんだけど忘れていたのでメモ

コピーしてきたHTMLでファイルを作って読み込む

from bs4 import BeautifulSoup

with open('copy.html', encoding='utf-8') as f:
    html = f.read()

soup = BeautifulSoup(html, 'html.parser')