Dataset list
A list of datasets for studying machine learning methods and for benchmarking them.
Synthetic datasets
| dataset | Classes | Instances | Dimension | function |
|---|---|---|---|---|
| blobs | k | N | d | sklearn.datasets.make_blobs |
| moons | 2 | N | 2 | sklearn.datasets.make_moons |
| circles | 2 | N | 2 | sklearn.datasets.make_circles |
| S curve | 1 | N | 3 | sklearn.datasets.make_s_curve |
| swiss roll | 1 | N | 3 | sklearn.datasets.make_swiss_roll |
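
As a minimal sketch of how the generators listed above can be called, the snippet below creates one sample set from each. The sample count `N`, the noise levels, and the number of blob centers are illustrative assumptions, not values prescribed by this list.

```python
from sklearn.datasets import (
    make_blobs,
    make_circles,
    make_moons,
    make_s_curve,
    make_swiss_roll,
)

N = 300  # number of instances (assumed for illustration)

# Labeled 2-D toy datasets: X holds the points, y the class labels.
X_blobs, y_blobs = make_blobs(n_samples=N, centers=3, n_features=2, random_state=0)
X_moons, y_moons = make_moons(n_samples=N, noise=0.05, random_state=0)
X_circles, y_circles = make_circles(n_samples=N, noise=0.05, factor=0.5, random_state=0)

# Manifold datasets: the points are 3-D, and the second return value is the
# continuous position along the manifold rather than a class label.
X_s, t_s = make_s_curve(n_samples=N, random_state=0)
X_roll, t_roll = make_swiss_roll(n_samples=N, random_state=0)

for name, X in [
    ("blobs", X_blobs),
    ("moons", X_moons),
    ("circles", X_circles),
    ("S curve", X_s),
    ("swiss roll", X_roll),
]:
    print(name, X.shape)
```

Fixing `random_state` keeps the generated points reproducible, which helps when the same synthetic data is reused across benchmark runs.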
Datasets
Image datasets
| dataset | Classes | Instances | Dimension | function |
|---|---|---|---|---|
| Handwritten digits | 10 | 1,797 in load_digits (5,620 in the full UCI set) | 8 × 8 | sklearn.datasets.load_digits |
| MNIST | 10 | 60,000 (training), 10,000 (test) | 28 × 28 | torchvision.datasets.MNIST |
| Fashion-MNIST | 10 | 60,000 (training), 10,000 (test) | 28 × 28 | torchvision.datasets.FashionMNIST |
| CIFAR-10 | 10 | 60,000 (50,000 training, 10,000 test) | 32 × 32 | torchvision.datasets.CIFAR10 |
| STL10 | 10 | 500 training images, 800 test images per class | 96 × 96 | torchvision.datasets.STL10 |
| Olivetti face images | 40 | 400 | 92 × 112 (64 × 64) | sklearn.datasets.fetch_olivetti_faces |
| Stanford Dogs Dataset | 120 | 20,580 | 375 * 50 | |
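
As a small illustration of the loader functions named in the image table, the sketch below loads the handwritten digits bundled with scikit-learn, which needs no download. The torchvision entries follow a similar pattern (for example `torchvision.datasets.MNIST(root, train=True, download=True)`), but they fetch data on first use, so they are left out of this example.

```python
from sklearn.datasets import load_digits

# load_digits returns the copy of the digits data shipped with scikit-learn.
digits = load_digits()

images = digits.images  # one 8 x 8 grayscale image per instance
X = digits.data         # the same images flattened to 64 features
y = digits.target       # class labels 0..9

print(images.shape, X.shape, len(set(y.tolist())))
```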

| dataset | Classes | Objects | Dimension |
|---|---|---|---|
| t4.8k | 6 | 8000 | 2 |
| t7.10k | 9 | 10000 | 2 |
| Complex9 | 9 | 3031 | 2 |
| t8.8k | 8 | 8000 | 2 |
| Aggregation | 7 | 788 | 2 |
| Spiral | 3 | 312 | 2 |
| D31 | 31 | 3100 | 2 |
| R15 | 15 | 600 | 2 |
| Flame | 2 | 240 | 2 |
| Compound | 6 | 399 | 2 |
| pathbased | 3 | 300 | 2 |

Anomaly detection
Time series data
Data sites