MARVEL: Modular Abstention for Reliable and Versatile Expert LLMs

A modular framework that makes expert LLMs more reliable by letting them selectively abstain from questions they are uncertain about.

Bingbing Wen

Do Language Models Mirror Human Confidence? Exploring Psychological Insights to Address Overconfidence in LLMs

Exploring psychological insights to address overconfidence in LLMs by comparing their confidence patterns with those of humans.

Chenjun Xu*

AutoScale-Automatic Prediction of Compute-optimal Data Composition for Training LLMs

Automatic prediction of compute-optimal data composition for efficient LLM training.

Feiyang Kang

Characterizing LLM Abstention Behavior in Science QA with Context Perturbations

Characterizing LLM abstention behavior in science QA with context perturbations.

Bingbing Wen

CCQ: cross-class query network for partially labeled organ segmentation

Cross-class query network for partially labeled organ segmentation in medical images.

Xuyang Liu

ExpScore: Learning metrics for recommendation explanation

Learning metrics for evaluating recommendation explanations.

Bingbing Wen