Evals · Example Payload
Evals Dataset Example
Measuring Massive Multitask Language Understanding — multiple-choice benchmark spanning 57 subjects from STEM and international law to nutrition and religion.
knowledgemultitaskmultiple-choicebenchmark
Evals Dataset Example is an example object payload from Evals, with 12 top-level fields. It illustrates the shape of data this provider's APIs accept or return.
Top-level fields
idnamedescriptionversiontasksourcelicensesplitscase_counttagscreatedmodified