Cerebras REAP Collection Sparse MoE models compressed with the REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 18 days ago • 77
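The REAP idea, per the collection description, is to score each expert by its router-weighted activation and drop the weakest ones. Below is a minimal sketch of that scoring step; the saliency formula, tensor shapes, and function names are illustrative assumptions, not the Cerebras implementation.

```python
import torch

def expert_saliency(gate_weights: torch.Tensor, expert_outputs: torch.Tensor) -> torch.Tensor:
    """Score each expert by its router-weighted activation magnitude.

    gate_weights:   (num_tokens, num_experts) softmax router weights
    expert_outputs: (num_tokens, num_experts, hidden) per-expert outputs
    Returns a (num_experts,) score; low scores are pruning candidates.
    """
    # Router weight times output norm, averaged over calibration tokens.
    norms = expert_outputs.norm(dim=-1)          # (num_tokens, num_experts)
    return (gate_weights * norms).mean(dim=0)    # (num_experts,)

def prune_experts(saliency: torch.Tensor, keep_ratio: float) -> torch.Tensor:
    """Return indices of the experts to keep (highest saliency first)."""
    num_keep = max(1, int(saliency.numel() * keep_ratio))
    return torch.topk(saliency, num_keep).indices

# Toy example: 8 experts, 512 calibration tokens, keep half.
tokens, experts, hidden = 512, 8, 64
gates = torch.softmax(torch.randn(tokens, experts), dim=-1)
outputs = torch.randn(tokens, experts, hidden)
kept = prune_experts(expert_saliency(gates, outputs), keep_ratio=0.5)
print("experts kept:", kept.tolist())
```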
VTP Collection Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 21 days ago • 39
Teacher Logits Collection Logits captured from large models to serve as teacher targets for distillation • 3 items • Updated 22 days ago • 7
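Captured teacher logits are typically consumed through a standard temperature-scaled KL distillation loss, which avoids running the teacher at training time. A minimal sketch, assuming the logits are stored as (batch, vocab) tensors; names are illustrative.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      temperature: float = 2.0) -> torch.Tensor:
    """KL divergence between temperature-softened teacher and student distributions.

    teacher_logits come from an offline capture pass, so no teacher
    forward pass is needed during student training.
    """
    t = temperature
    student_log_probs = F.log_softmax(student_logits / t, dim=-1)
    teacher_probs = F.softmax(teacher_logits / t, dim=-1)
    # Scale by t^2 so gradient magnitudes stay comparable across temperatures.
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * (t * t)

# Toy usage with random logits.
student = torch.randn(4, 32000, requires_grad=True)
teacher = torch.randn(4, 32000)
loss = distillation_loss(student, teacher)
loss.backward()
```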
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 13 days ago • 26
Ministral 3 Collection A collection of edge models with Base, Instruct, and Reasoning variants in three sizes (3B, 8B, and 14B), all with vision capabilities. • 9 items • Updated Dec 2, 2025 • 136
Trinity Collection Arcee AI models in the Trinity family • 8 items • Updated 25 days ago • 21
Olmo 3 Pre-training Collection All artifacts related to Olmo 3 pre-training • 10 items • Updated 14 days ago • 32
BERT Hash Nano Models Collection Set of BERT models with a modified embedding layer • 4 items • Updated 14 days ago • 9
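The description only says the embedding layer is modified; the collection name suggests hashing, so here is a generic hash-embedding sketch in which token ids are mapped by several cheap hash functions into a small shared bucket table and the lookups are summed. This is one common way to shrink a BERT-sized embedding matrix, not necessarily this collection's exact design.

```python
import torch
import torch.nn as nn

class HashEmbedding(nn.Module):
    """Embed a large vocab with a small bucket table via multiple hashes.

    Each token id is mapped by `num_hashes` multiplicative hash functions
    into a shared table of `num_buckets` vectors, and the looked-up vectors
    are summed. The table is far smaller than a full vocab-sized matrix.
    """
    def __init__(self, num_buckets: int, dim: int, num_hashes: int = 2):
        super().__init__()
        self.table = nn.Embedding(num_buckets, dim)
        self.num_buckets = num_buckets
        # Random odd multipliers for simple multiplicative hashing.
        self.register_buffer(
            "multipliers",
            torch.randint(1, 2**31 - 1, (num_hashes,)) | 1,
        )

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # (num_hashes, *token_ids.shape) bucket indices via broadcasting.
        shape = (-1,) + (1,) * token_ids.dim()
        buckets = (token_ids.unsqueeze(0) * self.multipliers.view(shape)) % self.num_buckets
        return self.table(buckets).sum(dim=0)

emb = HashEmbedding(num_buckets=4096, dim=128)
ids = torch.randint(0, 30522, (2, 16))    # BERT-sized vocab, tiny table
print(emb(ids).shape)                     # torch.Size([2, 16, 128])
```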
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Paper • 2510.01179 • Published Oct 1, 2025 • 25
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 26 items • Updated about 19 hours ago • 131
Article Welcome EmbeddingGemma, Google's new efficient embedding model • Sep 4, 2025 • 267
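A minimal usage sketch via the sentence-transformers library; the model id google/embeddinggemma-300m is assumed from the announcement, and accessing it may require accepting the Gemma license on the Hub.

```python
# Requires: pip install sentence-transformers
from sentence_transformers import SentenceTransformer

# Model id assumed from the EmbeddingGemma announcement.
model = SentenceTransformer("google/embeddinggemma-300m")

sentences = ["How do I prune MoE experts?", "Router-weighted expert pruning"]
embeddings = model.encode(sentences)             # (2, embedding_dim) array
print(model.similarity(embeddings, embeddings))  # pairwise cosine similarities
```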
Tfree-HAT-7b-pretrained Collection Tokenizer-free models based on the Hierarchical Autoregressive Transformer (https://arxiv.org/abs/2501.10322), trained from scratch. • 2 items • Updated Aug 1, 2025 • 10
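The linked HAT paper replaces the tokenizer with a byte-level hierarchy: a small encoder pools raw bytes into word-level states, and a larger backbone operates over those states. The toy sketch below shows only that hierarchy; whitespace word splitting, mean pooling, and the absence of a byte-level decoder are simplifying assumptions, not the paper's actual modules.

```python
import torch
import torch.nn as nn

class ToyHAT(nn.Module):
    """Toy two-level tokenizer-free model: bytes -> word states -> backbone."""
    def __init__(self, dim: int = 64):
        super().__init__()
        self.byte_emb = nn.Embedding(256, dim)  # one embedding per byte value
        self.byte_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=1)
        self.backbone = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True), num_layers=2)

    def forward(self, text: str) -> torch.Tensor:
        word_states = []
        for word in text.encode("utf-8").split(b" "):
            ids = torch.tensor(list(word)).unsqueeze(0)   # (1, n_bytes) byte values
            h = self.byte_encoder(self.byte_emb(ids))     # local byte-level encoding
            word_states.append(h.mean(dim=1))             # pool bytes into a word state
        words = torch.stack(word_states, dim=1)           # (1, n_words, dim)
        return self.backbone(words)                       # contextual word states

model = ToyHAT()
print(model("tokenizer free models operate on raw bytes").shape)  # (1, 7, 64)
```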