InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models

Dec 1, 2023
Bingbing Wen, Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Bill Howe, Lijuan Wang
Abstract
We present InfoVisDial, a comprehensive visual dialogue dataset created by bridging large multimodal and language models to enable informative conversations about visual content.
Type
Publication
Internship at Microsoft Azure AI

This work was conducted during an internship at Microsoft Azure AI.