AI & ML interests
None yet
Organizations
None yet
MattBou00/SequentialLR001_2000samples_R1-checkpoint-epoch-60
Updated
MattBou00/SequentialLR001_2000samples_R1-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
MattBou00/SequentialLR001_2000samples_R1-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/SequentialLR001_2000samples-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
MattBou00/SequentialLR001_2000samples-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
•
1
MattBou00/SequentialLR001_2000samples-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/SequentialLR00001_2000samples-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR00001_2000samples-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR001-reward-2025-11-21_15-54-01
Updated
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR001-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR001-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR001-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR001-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleLR001-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE-reward-2025-11-21_15-24-09
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE-checkpoint-epoch-80
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5ACTUALRETRYRUNNINGCODE-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/ROUND5RETRYRUNNINGCODE-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE8_round1-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleRound1B-checkpoint-epoch-60
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleRound1B-checkpoint-epoch-40
Reinforcement Learning
•
1B
•
Updated
MattBou00/SingleRound1B-checkpoint-epoch-20
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE8_round3-reward-2025-09-22_18-35-27
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE8_round3
Reinforcement Learning
•
1B
•
Updated
MattBou00/llama-3-2-1b-detox_v1f_SCALE8_round3-checkpoint-epoch-100
Reinforcement Learning
•
1B
•
Updated