Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

ubergarm
/
GLM-4.7-GGUF

Text Generation
GGUF
English
Chinese
imatrix
conversational
ik_llama.cpp
glm4_moe
Model card Files Files and versions
xet
Community
10
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Is this a thinking model?

3
#10 opened 1 day ago by
geveent

Why does this double my PP and improve TG?

4
#9 opened 4 days ago by
gtkunit

anyone running via cpu+gpu+rpc gpu ?

3
#8 opened 9 days ago by
gopi87

EPYC, RTX 5090 vs RTX 6000

πŸ”₯ 1
7
#7 opened 11 days ago by
sousekd

Testing IQ5_K

πŸ‘ 1
1
#6 opened 12 days ago by
shewin

Stable run on 2x RTX 5090 and 2 Xeon E5 2696 V4 and DDR4 with ik_llama.cpp - 6.1 t/s on IQ4_K and 5.1 t/s on IQ5_K, opencode works with this

πŸ‘ 1
6
#5 opened 13 days ago by
martossien

IQ3_KS is awesome!

πŸ”₯ ❀️ 2
#4 opened 14 days ago by
mtcl

9.31mb first part Q5?

πŸ‘ 1
2
#3 opened 15 days ago by
inritwritten

Please make IQ2_KS version πŸ™

❀️ 2
2
#2 opened 15 days ago by
Buridda

Can't wait for a q4 quant from you

πŸ€— 1
5
#1 opened 15 days ago by
mtcl
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs