LoginSignup
57
68

More than 3 years have passed since last update.

Kaggleデータセットまとめ

Posted at

Fintech

Santander Customer Transaction Prediction
https://www.kaggle.com/c/santander-customer-transaction-prediction/data

Kaggle datasets in finance category (ファイナンス系kaggleデータ一覧)
https://www.kaggle.com/tags/finance

Bitcoin Price Prediction (LightWeight CSV)
https://www.kaggle.com/team-ai/bitcoin-price-prediction

Uniqlo (FastRetailing) Stock Price Prediction
https://www.kaggle.com/daiearth22/uniqlo-fastretailing-stock-price-prediction

Foreign Exchange (FX) Prediction - USD/JPY
https://www.kaggle.com/team-ai/foreign-exchange-fx-prediction-usdjpy

Foreign Exchange(FX) Prediction - EUR/USD
https://www.kaggle.com/meehau/EURUSD/kernels

Credit Card Fraud クレジットカードの詐欺検知データ (66MBなので重め)
https://www.kaggle.com/mlg-ulb/creditcardfraud

StockPrice and News ニュースと株価の相関分析(6MB)
https://www.kaggle.com/aaron7sun/stocknews

Loan Data for risk analysis 貸付リスク計算データ (6KB 軽め)
https://www.kaggle.com/zhijinzhai/loandata

Loan Data for risk analysis 貸付リスク計算データ (240MB very heavy )
https://www.kaggle.com/wendykan/lending-club-loan-data

Medical

Synchronized brainwave dataset 脳波
https://www.kaggle.com/berkeley-biosense/synchronized-brainwave-dataset

Breast Cancer Wisconsin (Diagnostic) Data Set 乳がん
https://www.kaggle.com/uciml/breast-cancer-wisconsin-data

Hospital General Information 病院
https://www.kaggle.com/cms/hospital-general-information

Zika Virus Epidemic ジカ熱
https://www.kaggle.com/cdc/zika-virus-epidemic

Cervical Cancer Risk Classification 子宮頸がん
https://www.kaggle.com/loveall/cervical-cancer-risk-classification

Medical Appointment No Shows 患者のドタキャン分析
https://www.kaggle.com/joniarroba/noshowappointments

Mental Health in Tech Survey テック業界のメンタルヘルス
https://www.kaggle.com/osmi/mental-health-in-tech-survey

NASA/Space

Exoplanet Hunting in Deep Space 惑星探査データ
https://www.kaggle.com/keplersmachines/kepler-labelled-time-series-data

Solar Radiation Prediction 太陽の放射線データ
https://www.kaggle.com/dronio/SolarEnergy

Climate Change: Earth Surface Temperature Data 地球の表面温度データ
https://www.kaggle.com/berkeleyearth/climate-change-earth-surface-temperature-data

Meteorite Landings 隕石の衝突データ
https://www.kaggle.com/nasa/meteorite-landings

UFO Sightings UFO発見データ
https://www.kaggle.com/NUFORC/ufo-sightings

Open Exoplanet Catalogue 太陽系外惑星データ
https://www.kaggle.com/mrisdal/open-exoplanet-catalogue

Kepler Exoplanet Search Results 太陽系外惑星データ2
https://www.kaggle.com/nasa/kepler-exoplanet-search-results

Marketing/Retail

Predict Future Sales EC売上予測
https://www.kaggle.com/c/competitive-data-science-predict-future-sales/data

Springleaf Marketing Response ダイレクトメールの反応分析 150MB
https://www.kaggle.com/c/springleaf-marketing-response/data

Coupon Purchase Prediction リクルートのポンパレのデータ
https://www.kaggle.com/c/coupon-purchase-prediction/data

Airbnb New User Bookings Airbnbの予約データ分析
https://www.kaggle.com/c/airbnb-recruiting-new-user-bookings/data

Rossmann Store Sales 小売店売上予測
https://www.kaggle.com/c/rossmann-store-sales/data

Home Depot Product Search Relevance
https://www.kaggle.com/c/home-depot-product-search-relevance/data

Acquire Valued Shoppers Challenge
https://www.kaggle.com/c/acquire-valued-shoppers-challenge/data

Getting real about fake news
https://www.kaggle.com/mrisdal/fake-news

Starbucks Locations Worldwide
https://www.kaggle.com/starbucks/store-locations

Retail rocket recommendation system dataset
https://www.kaggle.com/retailrocket/ecommerce-dataset

Grupo Bimbo Inventory Demand 食品の売上最適化と返品の最小化 (Trainデータ3GBデータあり)
https://www.kaggle.com/c/grupo-bimbo-inventory-demand/data

Innerwear Data from Victoria's Secret
https://www.kaggle.com/PromptCloudHQ/innerwear-data-from-victorias-secret-and-others

NLP(自然言語処理)

Shinzo Abe Twitter Data(安倍首相のTwitterデータ)
https://www.kaggle.com/team-ai/shinzo-abe-japanese-prime-minister-twitter-nlp/version/1

World News on Reddit 掲示板上のニュースデータ解析
https://www.kaggle.com/rootuser/worldnews-on-reddit

South Park Dialogue アニメ作品台本のセリフデータから話者を特定
https://www.kaggle.com/tovarischsukhov/southparklines

Deep NLP Chatbotと履歴書データの解析
https://www.kaggle.com/samdeeplearning/deepnlp

Python Questions from StackOverFlow プログラミングQ&AサイトのPythonに関する質問分析
https://www.kaggle.com/stackoverflow/pythonquestions

Japanese English Bilingual Corpus(日本語と英語のWikipediaコーパス)
https://www.kaggle.com/team-ai/japaneseenglish-bilingual-corpus

A list of the 15,000 most common word forms in Japanese 日本語の頻出語15000リスト
https://www.kaggle.com/rtatman/japanese-lemma-frequency

Japanese Whisky Review Dataset (日本のウイスキーのレビュー)
https://www.kaggle.com/koki25ando/japanese-whisky-review

(上級者向け) Q&AサイトQuoraの類似質問を分類するコンペ
https://www.kaggle.com/c/quora-question-pairs/data

HR

Kaggle ML and Data Science Survey, 2017 データ分析業界全体の分析
https://www.kaggle.com/kaggle/kaggle-survey-2017

U.S. Incomes by Occupation and Gender 性別による収入格差の分析
https://www.kaggle.com/jonavery/incomes-by-career-and-gender

Daily Happiness & Employee Turnover 業績と社員幸福度の相関性分析
https://www.kaggle.com/harriken/employeeturnover

IBM HR Analytics Employee Attrition & Performance IBMの離職率分析
https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset

2016 New Coder Survey 新人ソフトウエアエンジニア15000人分の属性データ
https://www.kaggle.com/freecodecamp/2016-new-coder-survey-

57
68
1

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
57
68