Mitigating Overconfidence in Large Language Models: A Behavioral Lens on Confidence Estimation and Calibration

Dec 1, 2024
Bingbing Wen, Chenjun Xu, Bin Han, Robert Wolfe, Lucy Lu Wang, Bill Howe
Abstract
We take a behavioral perspective on confidence estimation and calibration in large language models, proposing methods to mitigate overconfidence by aligning models' expressed confidence with their actual reliability.
Type: Publication
NeurIPS 2024 Workshop on Behavioral Machine Learning

We examine overconfidence in large language models through a behavioral lens and propose approaches to improve calibration so that stated confidence better reflects actual reliability.
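
For readers unfamiliar with calibration, the sketch below illustrates one standard way to quantify the mismatch between a model's stated confidence and its accuracy: Expected Calibration Error (ECE). This is only an illustrative example of the calibration concept, not the method proposed in the paper; the function name and toy data are assumptions for the sketch.

```python
# Illustrative sketch of Expected Calibration Error (ECE), a standard
# calibration metric. Not the method from the paper; toy data is made up.
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Average |accuracy - confidence| over equal-width confidence bins,
    weighted by the fraction of samples falling in each bin."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    bin_edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bin_edges[:-1], bin_edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if not mask.any():
            continue
        bin_conf = confidences[mask].mean()   # average stated confidence in bin
        bin_acc = correct[mask].mean()        # empirical accuracy in bin
        ece += mask.mean() * abs(bin_acc - bin_conf)
    return ece

# Toy example of overconfidence: the model claims ~90% confidence
# but is correct only ~60% of the time, giving a large ECE (~0.3).
rng = np.random.default_rng(0)
conf = rng.uniform(0.85, 0.95, size=1000)
correct = rng.random(1000) < 0.6
print(f"ECE = {expected_calibration_error(conf, correct):.3f}")
```

A well-calibrated model would drive this gap toward zero: among answers stated with 90% confidence, roughly 90% would be correct.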