Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mashiro's picture
9

Mashiro

AlexMashiro

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
upvoted a paper 6 days ago
RM-R1: Reward Modeling as Reasoning
upvoted a paper 11 days ago
Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling
View all activity

Organizations

None yet

AlexMashiro 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs