Skip to content

Latest commit

 

History

History
100 lines (77 loc) · 14.5 KB

README.md

File metadata and controls

100 lines (77 loc) · 14.5 KB

awesome-multimodal-dialogue

Awesome License: MIT

List of Papers, Datasets and Code Repositories for open-domain multimodal dialogue. This repo contains a majority of research works in the multimodal dialogue (M.M.D) field, but it still may not encompass all the noteworthy works (especially those in 2023 and related to LLMs, which will be updated later).

This repo is under W.I.P. Please feel free to open issues and make PRs!

🗃 Datasets

  • Visual Dialog, [CVPR 2017] [data] [code]
  • Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation, [IJCNLP 2017] [data]
  • TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat, [arXiv 2023] [MM 2023] [data]

🏹 Methods

VisDial

Image Grounded

Session Level
Turn Level

Multimodal Response

Others

visit_count