LemmyChan
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to Technology@lemmygrad.mlEnglish · 30 days ago

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers

github.com

external-link
message-square
0
link
fedilink
  • cross-posted to:
  • Aii@programming.dev
  • technology@lemmy.ml
  • hackernews@lemmy.bestiver.se
  • localllama@sh.itjust.works
7
external-link

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers

github.com

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to Technology@lemmygrad.mlEnglish · 30 days ago
message-square
0
link
fedilink
  • cross-posted to:
  • Aii@programming.dev
  • technology@lemmy.ml
  • hackernews@lemmy.bestiver.se
  • localllama@sh.itjust.works
GitHub - intel/auto-round: A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.
github.com
external-link
A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers....
alert-triangle
You must log in or # to comment.

Technology@lemmygrad.ml

technology@lemmygrad.ml

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmygrad.ml

A tech news sub for communists

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 104 users / week
  • 284 users / month
  • 399 users / 6 months
  • 4 local subscribers
  • 1.43K subscribers
  • 207 Posts
  • 306 Comments
  • Modlog
  • mods:
  • Muad'Dibber@lemmygrad.ml
  • burlemarx@lemmygrad.ml
  • egs81t@lemmygrad.ml
  • BE: 0.19.15
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org