[Paper Review: Key Concepts Only] Unified Vision-Language Pre-Training for Image Captioning and VQA
by 애플파ol, 2024. 6. 3.

- This post covers only the parts of the paper that I need for my own research.