Multimodal Models

MMMG: A Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation

Evaluation suite for diverse multitask multimodal generation with large multimodal models.

Read more
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models featured image

InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models

An informative visual dialogue dataset created by bridging large multimodal and language models.

Read more