[논문리뷰 : 개념] VideoChat : Chat-Centric Video Understanding

논문 링크 : https://arxiv.org/abs/2305.06355

VideoChat: Chat-Centric Video Understanding

In this paper, we initiate an attempt of developing an end-to-end chat-centric video understanding system, coined as VideoChat. It integrates video foundation models and large language models via a learnable neural interface, excelling in spatiotemporal re

arxiv.org

Published : 2021.03.24 (arxiv - 24.01.14 기준)

Citation : 106회 (24.01.14기준)

'논문 리뷰' 카테고리의 다른 글

[논문리뷰 : 개념] Semantic Scene Understanding with Large Language Models on Unmanned Aerial Vehicles (0)	2024.01.27
[논문리뷰 : 개념] mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (0)	2024.01.26
[논문리뷰 : 개념] In-flight positional and energy use data set of a DJI Matrice 100quadcopter for small package delivery (0)	2024.01.02
[논문리뷰 : 개념] CapERA: Captioning Events in Aerial Videos (0)	2023.12.06
[논문 리뷰 : 서베이] Multimodal Learning With Transformers: A Survey (0)	2023.11.11

시작은 미약하였으나 , 그 끝은 창대하리라

[논문리뷰 : 개념] VideoChat : Chat-Centric Video Understanding

'논문 리뷰' 카테고리의 다른 글

티스토리툴바

[논문리뷰 : 개념] VideoChat : Chat-Centric Video Understanding

'논문 리뷰' 카테고리의 다른 글

관련글

티스토리툴바