Open Datasets

We believe in open science and make our datasets publicly available to advance AI research worldwide.

12
Total Datasets
45,000
Total Downloads
2.5 TB
Total Size
350
Citations
AICIL-3D: Large-Scale 3D Object Detection Dataset

AICIL-3D: Large-Scale 3D Object Detection Dataset

Computer VisionCC BY-NC 4.0v1.0
500 GB
45 citations
8,500 downloads

A comprehensive dataset for 3D object detection containing 100K annotated scenes from urban driving environments. Includes LiDAR point clouds, RGB images, and precise 3D bounding boxes.

100,000
Samples
10
Classes
MedAI: Medical Image Dataset for Multi-Disease Detection

MedAI: Medical Image Dataset for Multi-Disease Detection

Medical AIRestricted (Research Use Only)v2.0
800 GB
28 citations

Diverse collection of medical images (CT, MRI, X-ray) for training and evaluating AI diagnostic systems. Includes annotations for breast cancer, lung cancer, and colorectal cancer.

50,000
Samples
MultiLang: Low-Resource Language Dataset

MultiLang: Low-Resource Language Dataset

NLPCC BY 4.0v1.5
150 GB
62 citations
15,000 downloads

Parallel text corpus for 100+ low-resource languages, enabling machine translation and multilingual NLP research.

100
Languages
10.0M
Sentences
DownloadPaper