29
17

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?

More than 5 years have passed since last update.

UnicodeDecodeError: 'shift_jis' codec can't decode bytes in position

Last updated at Posted at 2016-08-05

Shift-JISだと思ったら違ったという話…

データ入力

read_csvの文字コード指定

基本的にread_csvでデータを読むことが多いので、read_csv利用時の注意事項。

csvデータのエンコード形式に注意、特にWindows+excelで作成したcsvファイルの読み込み時

In[36]: df = pd.read_csv('hoge.csv',header=0,encoding='Shift_JIS')

UnicodeDecodeError: 'shift_jis' codec can't decode bytes in position 16-17: illegal multibyte sequence

"UnicodeDecodeError:"が発生し。encodingの値が正しくない模様。
cp932 を指定するとうまく通った。

df = pd.read_csv('hoge.csv',header=0,encoding='cp932')

参考

http://minus9d.hatenablog.com/entry/2015/07/30/225841

29
17
1

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
29
17

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?