✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash
✨ Multimodal reasoning is now the default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- Bytedance: Dolphin-v2
Only a year into open source, MiniMax is already making a great impact: not only through solid models and products, but also through how well the team uses community platforms like Hugging Face: HF Teams, blogs, Daily Papers, Spaces as project pages, and constant experimentation with new ways to engage. Super impressive!
Following up on LLaDA 2.0, the paper is now out on Daily Papers🔥 It has sparked a lot of discussion in the community for showing how discrete diffusion LLMs can scale to 100B parameters and run faster than traditional AR models. LLaDA2.0: Scaling Up Diffusion Language Models to 100B (2512.15745)
✨ Built from real enterprise data (Enron + financial institutions), not synthetic tasks
✨ Tests end-to-end finance workflows
✨ Multimodal & cross-file reasoning
✨ Expert annotated (700+ hours) and genuinely challenging
✨ Any-to-Any & World Models: one step closer to the real world
- BAAI Emu 3.5
- Ant Group Ming-flash-omni
- HunyuanWorld-Mirror: 3D
Aligning with the global "world model" trend
✨ Audio & Speech + Video & Visual: releases now span from entertainment labs to delivery platforms
- SoulX-Podcast TTS
- LongCat-Audio-Codec & LongCat-Video by Meituan, the delivery platform
- xiabs DreamOmni 2