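A list of papers that caught my eye at the CVPR 2024 Day 4 AM session.
These are notes so I don't forget them later.
I plan to read the papers I want to know more about at a later date.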

3D

3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces

https://cvpr.thecvf.com/virtual/2024/poster/29885
poster

CityDreamer: Compositional Generative Model of Unbounded 3D Cities

https://cvpr.thecvf.com/virtual/2024/poster/29266
poster

Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation

https://cvpr.thecvf.com/virtual/2024/poster/29444
poster

MonoCD: Monocular 3D Object Detection with Complementary Depths

https://cvpr.thecvf.com/virtual/2024/poster/30921
poster

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

https://cvpr.thecvf.com/virtual/2024/poster/29964
poster

Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis

https://cvpr.thecvf.com/virtual/2024/poster/31680
poster

MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models

https://cvpr.thecvf.com/virtual/2024/poster/30683
poster

LaneCPP: Continuous 3D Lane Detection using Physical Priors

https://cvpr.thecvf.com/virtual/2024/poster/30930
poster

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

https://cvpr.thecvf.com/virtual/2024/poster/30607
poster

Gated Fields: Learning Scene Reconstruction from Gated Videos

https://cvpr.thecvf.com/virtual/2024/poster/29275
poster

depth

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

https://cvpr.thecvf.com/virtual/2024/poster/31264
poster

UniDepth: Universal Monocular Metric Depth Estimation

https://cvpr.thecvf.com/virtual/2024/poster/31417
poster

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

https://cvpr.thecvf.com/virtual/2024/poster/30176
poster

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion

https://cvpr.thecvf.com/virtual/2024/poster/29435
poster

multimodal

Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images

https://cvpr.thecvf.com/virtual/2024/poster/29250
poster

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

https://cvpr.thecvf.com/virtual/2024/poster/31268
architecture

ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

https://cvpr.thecvf.com/virtual/2024/poster/29580
poster

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

https://cvpr.thecvf.com/virtual/2024/poster/31575
architecture

Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

https://cvpr.thecvf.com/virtual/2024/poster/31877
poster

GLaMM: Pixel Grounding Large Multimodal Model

https://cvpr.thecvf.com/virtual/2024/poster/31094
architecture

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

https://cvpr.thecvf.com/virtual/2024/poster/31492
poster

Pixel-Aligned Language Model

https://cvpr.thecvf.com/virtual/2024/poster/31639
poster

VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens

https://cvpr.thecvf.com/virtual/2024/poster/29676
poster

CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor

https://cvpr.thecvf.com/virtual/2024/poster/31270
CLIP as RNN

See Say and Segment: Teaching LMMs to Overcome False Premises

https://cvpr.thecvf.com/virtual/2024/poster/31231
poster

Segment and Caption Anything

https://cvpr.thecvf.com/virtual/2024/poster/29271
poster

RegionGPT: Towards Region Understanding Vision Language Model

https://cvpr.thecvf.com/virtual/2024/poster/31126
poster

LISA: Reasoning Segmentation via Large Language Model

https://cvpr.thecvf.com/virtual/2024/poster/30109
poster

Taming Self-Training for Open-Vocabulary Object Detection

https://cvpr.thecvf.com/virtual/2024/poster/29999
poster

Other

Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge

https://cvpr.thecvf.com/virtual/2024/poster/30824
poster

Long-Tailed Anomaly Detection with Learnable Class Names

https://cvpr.thecvf.com/virtual/2024/poster/31789
poster

The Manga Whisperer: Automatically Generating Transcriptions for Comics

https://cvpr.thecvf.com/virtual/2024/poster/31457
poster
