논문 리뷰21 [논문리뷰: 핵심개념만] VLAAD: Vision and Language Assistant for Autonomous Driving - 나의 개인연구에 필요한 정보만 취득하기 위해 필요부분만 정리함. 2025. 1. 22. [논문리뷰: 핵심개념만] DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model - 나의 개인연구에 필요한 정보만 취득하기 위해 필요부분만 정리함. 참고 번외 : LLaVA 논문 2025. 1. 10. [논문리뷰 : 서베이]. A survey on multimodal large language models 논문 링크 : https://arxiv.org/abs/2306.13549 A Survey on Multimodal Large Language ModelsRecently, Multimodal Large Language Model (MLLM) represented by GPT-4V has been a new rising research hotspot, which uses powerful Large Language Models (LLMs) as a brain to perform multimodal tasks. The surprising emergent capabilities of MLLM, such as wrarxiv.org 2024. 12. 29. [논문리뷰 : 서베이] Vision-Language Models for Vision Tasks: A Survey 논문링크 : https://ieeexplore.ieee.org/abstract/document/10445007 Vision-Language Models for Vision Tasks: A SurveyMost visual recognition studies rely heavily on crowd-labelled data in deep neural networks (DNNs) training, and they usually train a DNN for each single visual recognition task, leading to a laborious and time-consuming visual recognition paradigm. To addieeexplore.ieee.org 2024. 12. 16. [논문리뷰: 핵심개념만] Vision GNN : An Image Is Worth Graph of Nodes - 나의 개인연구에 필요한 정보만 취득하기 위해 필요부분만 정리함. 2024. 8. 19. [논문리뷰: 핵심개념만] Pure Transformers are Powerful Graph Learners - 나의 개인연구에 필요한 정보만 취득하기 위해 필요부분만 정리함. 2024. 6. 23. 이전 1 2 3 4 다음