Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
39
196
49
KABI
dongguanting
Follow
tcy6's profile picture
vanillaOVO's profile picture
derrickzhu's profile picture
59 followers
·
97 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
liked
a dataset
1 day ago
XXHStudyHard/EnvScaler-SFT-Traj-9K
upvoted
a
paper
2 days ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
upvoted
a
paper
2 days ago
ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition
View all activity
Organizations
dongguanting
's models
16
Sort: Recently updated
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
21 days ago
•
21
•
2
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation
•
33B
•
Updated
21 days ago
•
13
•
1
dongguanting/QwQ-32B-ARPO-DeepSearch
33B
•
Updated
21 days ago
•
9
•
1
dongguanting/aepo_light
8B
•
Updated
Nov 3, 2025
•
6
dongguanting/Qwen2.5-7B-AEPO
Text Generation
•
8B
•
Updated
Oct 27, 2025
•
13
•
1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
•
15B
•
Updated
Oct 21, 2025
•
8
•
1
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
Aug 19, 2025
•
31
•
2
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12, 2025
•
10
•
1
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
Aug 12, 2025
•
3
•
3
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12, 2025
•
13
•
5
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29, 2025
•
15
•
2
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30, 2025
•
6
•
2
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28, 2025
•
43
•
4
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6, 2025
•
3
•
1
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
Jun 6, 2025
•
2
dongguanting/Tool-Star-Qwen-3B
Text Generation
•
3B
•
Updated
May 25, 2025
•
7
•
5