Escaping the SpuriVerse: Can Large Vision-Language Models Generalize Beyond Seen Spurious Correlations?

Sep 1, 2025·

Yifei Yang

Changping Lee

Sheng Shen Feng

Dongxu Zhao

Bingbing Wen

Andrew Z. Liu

Yulia Tsvetkov

Bill Howe

· 1 min read

Abstract

We study whether large vision-language models can generalize beyond spurious correlations observed during training, introducing a benchmark to probe robustness to spurious signals.

Type

Publication

NeurIPS 2025 Datasets and Benchmarks

We introduce a benchmark to evaluate whether large vision-language models can move beyond spurious correlations and generalize robustly to distribution shifts that break shortcut cues.

Last updated on Sep 1, 2025

Vision-Language Models Spurious Correlations Robustness

Authors

Bingbing Wen

PhD Student

← Asking the Missing Piece: Context-Driven Clarification for Ambiguous VQA Dec 1, 2025

Tensorized Clustered LoRA Merging for Multi-Task Interference Aug 15, 2025 →