Zhizhou Sha
JamesSand
AI & ML interests
None yet
Recent Activity
updated
a model
about 3 hours ago
JamesSand/qwen1.7b-adam-reset-muon-lr-1e-6-fp64-global_step_200
published
a model
about 3 hours ago
JamesSand/qwen1.7b-adam-reset-muon-lr-1e-6-fp64-global_step_200
updated
a model
about 3 hours ago
JamesSand/qwen3-4b-svd-muon-adam-1e-6-beta1-0.9-beta2-0.999-store-paramname-global_step_5