2023-2025Datasets
Training Datasets
Datasets for Training, Reasoning, and RLHF
★5 Public Datasets · Powering #1 Models & RLHF Experiments
Custom datasets built for targeted training experiments. The simple-math family explores minimal arithmetic corpora for reasoning under SFT and DPO. Tree of Knowledge introduces symbolic knowledge structuring. The german-humanlike pair demonstrates downstream artifacts produced by composing SingleMoM RLHF experts.
- Client
- Independent — Juanako.AI
- Role
- Dataset author
- Duration
- 2023-2025
- Team
- Solo
Outcomes
5
Public Datasets
800K rows
Largest
Yes
Used in #1 Models
Math · Knowledge · Style
Coverage
Datasets
| Name | Date | Size | Type | Purpose / Used In |
|---|---|---|---|---|
| simple-math | 20-Jan-2024 | 779K rows | SFT | Arithmetic reasoning · SimpleSmaug #1 34B |
| simple-math-DPO | 27-Jan-2024 | 800K rows | DPO | Arithmetic preference training |
| Tree of Knowledge | 24-May-2023 | ~5MB | Symbolic | Knowledge structure · Cybertron 7B v1/v2 |
| german-humanlike-clean-1k | 26-Mar-2025 | 856 rows | RLHF | SingleMoM expert composition (curated) |
| german-humanlike-large | 26-Mar-2025 | 10.8K rows | RLHF | SingleMoM expert composition (full) |
datasethuggingfacesynthetic-datarlhfdposftdata-engineering