InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models
Dec 1, 2023
Bingbing Wen
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Bill Howe
Lijuan Wang

Abstract
We present InfoVisDial, a comprehensive visual dialogue dataset created by bridging large multimodal and language models to enable informative conversations about visual content.
Type
Publication
Internship at Microsoft Azure AI
This work was conducted during an internship at Microsoft Azure AI.