논문 링크 : https://arxiv.org/abs/2305.06355
VideoChat: Chat-Centric Video Understanding
In this paper, we initiate an attempt of developing an end-to-end chat-centric video understanding system, coined as VideoChat. It integrates video foundation models and large language models via a learnable neural interface, excelling in spatiotemporal re
arxiv.org
Published : 2021.03.24 (arxiv - 24.01.14 기준)
Citation : 106회 (24.01.14기준)