mlx-community/Dolci-Think-DPO-32B-Flat • 200k • 9 • 1
mlx-community/Josiefied-Qwen3-dpo-v1-flat • 500 • 29 • 1
mlx-community/dolma3_mix-common_crawl-art_and_design-160k • 160k • 12 • 1
mlx-community/Dolci-Instruct-SFT-No-Tools-400K • 402k • 10
mlx-community/Dolci-Instruct-SFT-No-Tools-200K • 202k • 6
mlx-community/Dolci-Instruct-SFT-No-Tools-100K • 102k • 19
mlx-community/Dolci-Think-RL-7B-2k • 2.2k • 59 • 2
mlx-community/ultrafeedback-prompts-flat-rlhf • 37.9k • 1 • 1
mlx-community/recycling_the_web-400K • 400k • 31
mlx-community/recycling_the_web-1k • 1.1k • 121 • 1
mlx-community/medfit-dataset • 6.44k • 19 • 3
mlx-community/recycling_the_web-100K • 100k • 49
mlx-community/recycling_the_web-200K • 200k • 23
mlx-community/recycling_the_web-1m • 1M • 34
mlx-community/mlx_lm_calibration_v5 • 1 • 3
mlx-community/Intermediate-Thinking-130k • 135k • 78 • 3
mlx-community/hermes-reasoning-tool-use • 51k • 56 • 4
• 959k • 33 • 6
mlx-community/dhanishtha-2.0-superthinker • 11.7k • 20 • 2
• 8.57k • 117
mlx-community/dclm-baseline-1.0-138k • 138k • 4 • 1
mlx-community/orpo-dpo-mix-40k-flat-mlx • 44.2k • 1
mlx-community/Human-Like-DPO • 972 • 43 • 4
mlx-community/orpo-dpo-mix-40k-mlx • 44.2k • 36
mlx-community/fineweb-200k • 200k • 33 • 1
mlx-community/qwen3_dwq_calibration_1332_235b • 1.33k • 13 • 2
mlx-community/qwen3_dwq_calibration_5328 • 5.33k • 26
mlx-community/qwen3_dwq_calibration_2664 • 2.66k • 1
mlx-community/qwen3_dwq_calibration_1332 • 1.33k • 3 • 2
• 1k • 47