리미

LLM (2 posts)

[Paper Review] Training-free Video Temporal Grounding using Large-scale Pre-trained Models

https://arxiv.org/abs/2408.16219 Training-free Video Temporal Grounding using Large-scale Pre-trained Models. Video temporal grounding aims to identify video segments within untrimmed videos that are most relevant to a given natural language query. Existing video temporal localization models rely on specific datasets for training and have high data collection cost.. Abstract Video Temporal ..

Paper Review/MultiModal 2024.12.06

[Paper Review] GROUNDED-VIDEOLLM: SHARPENING FINE-GRAINED TEMPORAL GROUNDING IN VIDEO LARGE LANGUAGE MODELS

https://arxiv.org/abs/2410.03290 Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models. Video Large Language Models (Video-LLMs) have demonstrated remarkable capabilities in coarse-grained video understanding, however, they struggle with fine-grained temporal grounding. In this paper, we introduce Grounded-VideoLLM, a novel Video-LLM adept at.. ABSTRACT..

Paper Review/MultiModal 2024.11.16
Copyright © AXZ Corp. All rights reserved.
