Documentation Index
Fetch the complete documentation index at: https://docs.arkor.ai/llms.txt
Use this file to discover all available pages before exploring further.
Dataset
Thedataset field on createTrainer takes one of two shapes: a HuggingFace repository name or a blob URL.
HuggingFace (most projects)
triage, translate, redaction) all use this form. Optional split and subset let you target a specific split or named subset.
Blob URL (your own data)
GET once at the start of the run.
Local files are not in DatasetSource today. To use one, host it as a blob URL or upload it to a private HuggingFace repo first.
Reference
For the full discriminated union, every field, thetoken semantics, and the rationale behind picking each form, see the DatasetSource reference.