Media Summary: Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. Adapting In-context Generation for Enhanced Composed Image Retrieval. Rameen Abdal, James Burgess, Sergey Tulyakov, Kuan-Chieh Wang Snap Research , Stanford University ...

Cvpr 2026 One Patch To - Detailed Analysis & Overview

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement. Adapting In-context Generation for Enhanced Composed Image Retrieval. Rameen Abdal, James Burgess, Sergey Tulyakov, Kuan-Chieh Wang Snap Research , Stanford University ... ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers. Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos. MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality.

Photo Gallery

[CVPR 2026] One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework
CVPR 2026 paper of PL-Stitch
[CVPR 2026]
CVPR 2026 Paper Pre
[CVPR 2026] Visual PersonalizationTuring Test
[CVPR 2026] Memory-Efficient Fine-Tuning DiTs via Dynamic Patch Sampling and Block Skipping
CVPR 2026
[CVPR 2026] ProcessMaker
CVPR 2026 paper of BTTF
[CVPR 2026] Learning to Drive is a Free Gift Official Video
[CVPR 2026] MUST
CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models (in person)
View Detailed Profile
[CVPR 2026] One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework

[CVPR 2026] One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework

Short overview of our

CVPR 2026 paper of PL-Stitch

CVPR 2026 paper of PL-Stitch

CVPR 2026

[CVPR 2026]

[CVPR 2026]

Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement.

CVPR 2026 Paper Pre

CVPR 2026 Paper Pre

Adapting In-context Generation for Enhanced Composed Image Retrieval.

[CVPR 2026] Visual PersonalizationTuring Test

[CVPR 2026] Visual PersonalizationTuring Test

Rameen Abdal, James Burgess, Sergey Tulyakov, Kuan-Chieh Wang Snap Research , Stanford University ...

[CVPR 2026] Memory-Efficient Fine-Tuning DiTs via Dynamic Patch Sampling and Block Skipping

[CVPR 2026] Memory-Efficient Fine-Tuning DiTs via Dynamic Patch Sampling and Block Skipping

Presentation Slides for

CVPR 2026

CVPR 2026

CVPR 2026

[CVPR 2026] ProcessMaker

[CVPR 2026] ProcessMaker

ProcessMaker: A Generalized Process Visualization Framework with Adaptive Sequence Steps on Diffusion Transformers.

CVPR 2026 paper of BTTF

CVPR 2026 paper of BTTF

CVPR 2026

[CVPR 2026] Learning to Drive is a Free Gift Official Video

[CVPR 2026] Learning to Drive is a Free Gift Official Video

Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos.

[CVPR 2026] MUST

[CVPR 2026] MUST

MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality.

CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models (in person)

CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language Models (in person)

Project Page: https://aka.ms/task-transfer-vlms Paper: https://arxiv.org/abs/2511.18787.

[CVPR 2026] CarlaOcc

[CVPR 2026] CarlaOcc

CVPR 2026