Hugging Face · Schema
Hugging Face Dataset
Schema for a dataset hosted on the Hugging Face Hub, including metadata, structure, splits, and repository information.
Properties
| Name | Type | Description |
|---|---|---|
| _id | string | Internal unique identifier for the dataset |
| id | string | Dataset repository ID in the format author/dataset-name or dataset-name |
| author | string | Author or organization that owns the dataset |
| sha | string | Latest Git commit SHA of the dataset repository |
| lastModified | string | Timestamp of the last modification |
| createdAt | string | Timestamp when the dataset was created |
| private | boolean | Whether the dataset is private |
| disabled | boolean | Whether the dataset has been disabled |
| gated | object | Access gating configuration |
| tags | array | Tags associated with the dataset |
| downloads | integer | Number of downloads in the last 30 days |
| likes | integer | Number of likes/favorites |
| description | string | Short description of the dataset |
| citation | string | Citation text for the dataset (BibTeX format) |
| siblings | array | Files in the dataset repository |
| cardData | object | Parsed metadata from the dataset card YAML front matter |