amusi/CVPR2026-Papers-with-Code

CVPR 2026 Papers and Open-Source Projects (Papers with Code)

CVPR 2026 decisions are now available on OpenReview! Acceptance rate: 25.42% = 4090 / 16092

Note 1: Everyone is welcome to submit an issue to share CVPR 2026 papers and open-source projects!

Note 2: For papers from previous years' top CV conferences, other high-quality CV papers, and comprehensive roundups, see: https://github.com/amusi/daily-paper-computer-vision

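Scan the QR code to join the 【CVer Academic Exchange Group】 and get the latest work from CVPR 2026 and beyond! It is the largest computer vision and AI Knowledge Planet (知识星球) community. It is updated daily and promptly shares the newest learning materials on computer vision, AIGC, diffusion models, multimodal learning, deep learning, autonomous driving, medical imaging, remote sensing, and more. Join now and start learning!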

【CVPR 2026 Open-Source Paper Directory】

3DGS(Gaussian Splatting)

Agent

Avatars

Backbone

CLIP

Mamba

Embodied AI

GAN

OCR

NeRF

DETR

Prompt

Multimodal Large Language Models (MLLM)

Large Language Models (LLM)

NAS

ReID (Re-Identification)

Diffusion Models

Vision Transformer

Vision-Language

StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues

ApET: Approximation-Error Guided Token Compression for Efficient VLMs

Object Detection

Anomaly Detection

Object Tracking

Medical Image

Medical Image Segmentation

Autonomous Driving

3D Point Cloud

3D Object Detection

3D Semantic Segmentation

Low-level Vision

Super-Resolution

Denoising

Image Denoising

3D Human Pose Estimation

3D Visual Grounding

Image Generation

ExpPortrait: Expressive Portrait Generation via Personalized Representation

Video Generation

Image Editing

Video Editing

3D Generation

3D Reconstruction

tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction

Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning

RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing

Human Motion Generation

Video Understanding

Embodied AI

Remote Sensing

Brewing Stronger Features: Dual-Teacher Distillation for Multispectral Earth Observation

Knowledge Distillation

Depth Estimation

Stereo Matching

Low-light Image Enhancement

Image Compression

Scene Graph Generation

Style Transfer

Image Quality Assessment

Video Quality Assessment

Compressive Sensing

Datasets

Others

Decoupling Defense Strategies for Robust Image Watermarking

Multi-Modal Representation Learning via Semi-Supervised Rate Reduction for Generalized Category Discovery

The Invisible Gorilla Effect in Out-of-distribution Detection