• Deutsch
    • English
  • About bonndoc
  • Guidelines
  • English 
    • Deutsch
    • English
  • Login
Search 
  •   bonndoc Home
  • Zentrale wissenschaftliche Einrichtungen
  • Search
  •   bonndoc Home
  • Central Academic Institutions
  • Search
JavaScript is disabled for your browser. Some features of this site may not work without it.

Search

Show Advanced FiltersHide Advanced Filters

Filters

Use filters to refine the search results.

Now showing items 1-1 of 1

  • Sort Options:
  • Relevance
  • Title Asc
  • Title Desc
  • Issue Date Asc
  • Issue Date Desc
  • Results Per Page:
  • 5
  • 10
  • 20
  • 40
  • 60
  • 80
  • 100
Thumbnail

Towards Uncertainty-Aware Low-Bit Quantized LLMs for On-Device Inference 

Sparrenberg, Lorenz; Schneider, Tobias; Deußer, Tobias; Berger, Armin; Sifa, Rafet (2026-03-06)
Quantizing large language models (LLMs) significantly reduces memory usage and computational requirements, enabling efficient on-device inference. However, aggressive quantization can degrade model performance and exacerbate ...

Contact | Impressum
Indexed by 
BASE
Theme by 
Atmire NV
 

 

Discover

AuthorBerger, Armin (1)Deußer, Tobias (1)Schneider, Tobias (1)Sifa, Rafet (1)Sparrenberg, Lorenz (1)Subjectclassification (1)GPT (1)large language models (1)
LLM (1)
quantization (1)Qwen (1)regression (1)... View MoreClassification (DDC)004 Informatik (1)... View MoreResource Type
Konferenzveröffentlichung (1)
... View MoreDate Issued
2026 (1)

Browse

All of bonndocCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsClassification (DDC)Resource TypeOpen Access Fund (University Bonn)This CommunityBy Issue DateAuthorsTitlesSubjectsClassification (DDC)Resource TypeOpen Access Fund (University Bonn)

Contact | Impressum
Indexed by 
BASE
Theme by 
Atmire NV