Clarify or Answer: Reinforcement Learning for Agentic VQA with Context Under-specification

Reinforcement learning for agentic VQA that balances asking for clarification against answering when the context is underspecified.

Tensorized Clustered LoRA Merging for Multi-Task Interference

Tensorized clustered LoRA merging to reduce multi-task interference in adapter-based LLM fine-tuning.

MMMG: A Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation

Evaluation suite for diverse multitask multimodal generation with large multimodal models.


InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models

An informative visual dialogue dataset created by bridging large multimodal and language models.

Towards Generating Robust, Fair, and Emotion-Aware Explanations for Recommender Systems

Improving robustness, fairness, and emotion-awareness of explanations in recommender systems.

EGCR: Explanation Generation for Conversational Recommendation

Generating explanations in multi-turn conversational recommendation with EGCR.
