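A list of papers that caught my attention in the CVPR 2024 Day 4 PM session.
This is a memo so that I don't forget them later.
For the ones I want to understand in more detail, I plan to read the papers at a later date.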

multimodal

Text2Loc: 3D Point Cloud Localization from Natural Language

https://cvpr.thecvf.com/virtual/2024/poster/29628

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

https://cvpr.thecvf.com/virtual/2024/poster/30022

Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions

https://cvpr.thecvf.com/virtual/2024/poster/31839

Harnessing Large Language Models for Training-free Video Anomaly Detection

https://cvpr.thecvf.com/virtual/2024/poster/29246

Streaming Dense Video Captioning

https://cvpr.thecvf.com/virtual/2024/poster/31433

Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

https://cvpr.thecvf.com/virtual/2024/poster/30320

LLMs are Good Sign Language Translators

https://cvpr.thecvf.com/virtual/2024/poster/30247

VideoLLM-online: Online Video Large Language Model for Streaming Video

https://cvpr.thecvf.com/virtual/2024/poster/29835
Figure (training): the training method in the LIVE framework
Figure (inference): the inference pipeline in the LIVE framework

RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation

https://cvpr.thecvf.com/virtual/2024/poster/29730
Figure: task definition (semantic audio-visual navigation)
Figure: architecture

SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

https://cvpr.thecvf.com/virtual/2024/poster/31092

Others

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

https://cvpr.thecvf.com/virtual/2024/poster/30131

PEM: Prototype-based Efficient MaskFormer for Image Segmentation

https://cvpr.thecvf.com/virtual/2024/poster/30870

Extreme Point Supervised Instance Segmentation

https://cvpr.thecvf.com/virtual/2024/poster/30370

Looking 3D: Anomaly Detection with 2D-3D Alignment

https://cvpr.thecvf.com/virtual/2024/poster/31190

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

https://cvpr.thecvf.com/virtual/2024/poster/31171

Supervised Anomaly Detection for Complex Industrial Images

https://cvpr.thecvf.com/virtual/2024/poster/30567

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

https://cvpr.thecvf.com/virtual/2024/poster/30829

Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts

https://cvpr.thecvf.com/virtual/2024/poster/31830

Multi-Task Dense Prediction via Mixture of Low-Rank Experts

https://cvpr.thecvf.com/virtual/2024/poster/29418

Matching Anything by Segmenting Anything

https://cvpr.thecvf.com/virtual/2024/poster/29590

YOLO-World: Real-Time Open-Vocabulary Object Detection

https://cvpr.thecvf.com/virtual/2024/poster/30009
