Media Summary: Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality. Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos.

CVPR 2026 ProcessMaker - Detailed Analysis & Overview

[CVPR 2026] ProcessMaker

ProcessMaker

[CVPR 2026]

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.

CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models (in person)

Project Page: https://aka.ms/task-transfer-vlms Paper: https://arxiv.org/abs/2511.18787.

[CVPR 2026] MUST

MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality.

[CVPR 2026] Learning to Drive is a Free Gift Official Video

Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos.

CVPR 2026 Paper: ReSAM: Refine, Requery, and Reinforce

Video Presentation of

[CVPR 2026] Dense Metric Depth Completion from Sparse Direct Time-of-Flight Sensors

Hakyeong Kim, Ruicheng Wang, Chengtang Yao, Jiaolong Yang, Min H. Kim (

[CVPR 2026] SketchDeco: Training-Free Latent Composition for Precise Sketch Colourisation

An overview of our paper, "SketchDeco: Training-Free Latent Composition for Precise Sketch Colourisation". Accepted in

CVPR 2026 - Tuna

Tuna: Taming Unified Visual Representations for Native Unified Multimodal Models.

CVPR 2026 Highlight, GOR-IS: 3D Gaussian Object Removal in the Intrinsic Space.

GOR-IS presents a 3D Gaussian object removal framework that edits scenes in the intrinsic space, enabling physically consistent ...

[CVPR 2026] Fine-Grained Multi-Image Object Hallucination Benchmark

Joonki Min, Chaeyun Kim, Hyungwook Choi, Yejin Kim, Kihyun Kim, Yohan Jo, Joonseok Lee. Fine-Grained Multi-Image Object ...

[CVPR 2026] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE

How much do video diffusion models know about the 4D world? By introducing a 4D VAE, we jointly estimate geometry and ...

[CVPR 2026] Align Once to Explain

Presentation for the paper: Raphael Maser*, Siddhartha Gairola*, Sukrut Rao, Bernt Schiele: Align Once to Explain: Feature ...