Media Summary: Video for the paper of Virtual Sparse Convolution for ... Session Tag: THU-PM-104. Abstract: LiDAR and camera are two modalities available for ... A short video introducing "ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for ..."

CVPR2023 3D Spatial Multimodal Knowledge - Detailed Analysis & Overview

[CVPR2023] 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud
Multi-modal Gait Recognition via Effective Spatial-Temporal Feature Fusion (CVPR 2023)
CVPR2023 VirConv
[CVPR2023 Tutorial Talk] Large Multimodal Models: Towards Building and Surpassing Multimodal GPT-4
MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving (CVPR2023)
CVPR2023 Understanding the Robustness of 3D Object Detection with Bird's Eye View Representations...
[CVPR 2023] 3D Cinemagraphy from a Single Image
CVPR 2023 Demo: Interchange Transfer-based Knowledge Distillation for 3D Object Detection
CVPR2023 paper: ULIP
Learning 3D Scene Priors with 2D Supervision (CVPR'2023)
[CVPR 2023] Viewpoint Equivariance for Multi-View 3D Object Detection
[CVPR2023] WeakMono3D Representation
[CVPR2023] 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud

In-depth understanding of a

Multi-modal Gait Recognition via Effective Spatial-Temporal Feature Fusion (CVPR 2023)

Video presentation in 8 minutes of our

CVPR2023 VirConv

Video for the paper of Virtual Sparse Convolution for

[CVPR2023 Tutorial Talk] Large Multimodal Models: Towards Building and Surpassing Multimodal GPT-4

CVPR 2023

MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving (CVPR2023)

Session Tag: THU-PM-104 Abstract: LiDAR and camera are two modalities available for

CVPR2023 Understanding the Robustness of 3D Object Detection with Bird's Eye View Representations...

CVPR-2023

[CVPR 2023] 3D Cinemagraphy from a Single Image

We present

CVPR 2023 Demo: Interchange Transfer-based Knowledge Distillation for 3D Object Detection

itKD detection result for Waymo

CVPR2023 paper: ULIP

A short video for introducing "ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for

Learning 3D Scene Priors with 2D Supervision (CVPR'2023)

Project: https://yinyunie.github.io/sceneprior-page/ Holistic

[CVPR 2023] Viewpoint Equivariance for Multi-View 3D Object Detection

Paper at

[CVPR2023] WeakMono3D Representation

Weakly Supervised Monocular

Neural Kaleidoscopic Space Sculpting [CVPR 2023]

CVPR2023