2023-2025Datasets

Training Datasets

Datasets for Training, Reasoning, and RLHF

★5 Public Datasets · Powering #1 Models & RLHF Experiments

Custom datasets built for targeted training experiments. The simple-math family explores minimal arithmetic corpora for reasoning under SFT and DPO. Tree of Knowledge introduces symbolic knowledge structuring. The german-humanlike pair demonstrates downstream artifacts produced by composing SingleMoM RLHF experts.

Client: Independent — Juanako.AI
Role: Dataset author
Duration: 2023-2025
Team: Solo

Outcomes

Public Datasets

800K rows

Largest

Yes

Used in #1 Models

Math · Knowledge · Style

Coverage

Datasets

Name	Date	Size	Type	Purpose / Used In
simple-math	20-Jan-2024	779K rows	SFT	Arithmetic reasoning · SimpleSmaug #1 34B
simple-math-DPO	27-Jan-2024	800K rows	DPO	Arithmetic preference training
Tree of Knowledge	24-May-2023	~5MB	Symbolic	Knowledge structure · Cybertron 7B v1/v2
german-humanlike-clean-1k	26-Mar-2025	856 rows	RLHF	SingleMoM expert composition (curated)
german-humanlike-large	26-Mar-2025	10.8K rows	RLHF	SingleMoM expert composition (full)

datasethuggingfacesynthetic-datarlhfdposftdata-engineering