Media Summary: In this AI Research Roundup episode, Alex discusses the paper: ' In this AI Research Roundup episode, Alex discusses the paper: 'Fine-Grained Preference Optimization Improves In this AI Research Roundup episode, Alex discusses the paper: 'Seeing Isn't Knowing: Do VLMs Know When Not to Answer ...

Spatialclaw Code Driven Vlm Spatial - Detailed Analysis & Overview

In this AI Research Roundup episode, Alex discusses the paper: ' In this AI Research Roundup episode, Alex discusses the paper: 'Fine-Grained Preference Optimization Improves In this AI Research Roundup episode, Alex discusses the paper: 'Seeing Isn't Knowing: Do VLMs Know When Not to Answer ... In this AI Research Roundup episode, Alex discusses the paper: 'S-Agent: In this AI Research Roundup episode, Alex discusses the paper: 'Think3D: Thinking with Presentation video for our paper, SpatialVLM: Endowing Vision-Language Models with

In this AI Research Roundup episode, Alex discusses the paper: 'Why Far Looks Up: Probing This paper introduces Ego3D-Bench, the first benchmark for evaluating 3D MLV Group Seminar (24.09.09) [Paper] SpatialVLM: Endowing Vision-Language Models with Vision-Language Models like GPT-4 are incredibly powerful, but they have a surprising blind spot: Ready to become a certified watsonx AI Assistant Engineer? Register now and use

Photo Gallery

SpatialClaw: Code-Driven VLM Spatial Reasoning
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning
SpatialReasoner-R1: VLM Spatial Logic
SPATIALUNCERTAIN: Testing VLM Spatial Limits
S-Agent: Teaching VLMs 3D Spatial Reasoning
Think3D: 3D Spatial Reasoning for VLMs
Spatial VLM presentation, CVPR 2024
SpatialTunnel: Probing 3D Spatial Bias in VLMs
[Daily Podcast] Ego3D-Bench: Advancing 3D Spatial Reasoning in VLMs
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities [Jihun Lee]
Teaching AI to See Like a Human: The SpatialLadder Breakthrough
What Are Vision Language Models? How AI Sees & Understands Images
View Detailed Profile
SpatialClaw: Code-Driven VLM Spatial Reasoning

SpatialClaw: Code-Driven VLM Spatial Reasoning

In this AI Research Roundup episode, Alex discusses the paper: '

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

SpatialClaw

SpatialReasoner-R1: VLM Spatial Logic

SpatialReasoner-R1: VLM Spatial Logic

In this AI Research Roundup episode, Alex discusses the paper: 'Fine-Grained Preference Optimization Improves

SPATIALUNCERTAIN: Testing VLM Spatial Limits

SPATIALUNCERTAIN: Testing VLM Spatial Limits

In this AI Research Roundup episode, Alex discusses the paper: 'Seeing Isn't Knowing: Do VLMs Know When Not to Answer ...

S-Agent: Teaching VLMs 3D Spatial Reasoning

S-Agent: Teaching VLMs 3D Spatial Reasoning

In this AI Research Roundup episode, Alex discusses the paper: 'S-Agent:

Think3D: 3D Spatial Reasoning for VLMs

Think3D: 3D Spatial Reasoning for VLMs

In this AI Research Roundup episode, Alex discusses the paper: 'Think3D: Thinking with

Spatial VLM presentation, CVPR 2024

Spatial VLM presentation, CVPR 2024

Presentation video for our paper, SpatialVLM: Endowing Vision-Language Models with

SpatialTunnel: Probing 3D Spatial Bias in VLMs

SpatialTunnel: Probing 3D Spatial Bias in VLMs

In this AI Research Roundup episode, Alex discusses the paper: 'Why Far Looks Up: Probing

[Daily Podcast] Ego3D-Bench: Advancing 3D Spatial Reasoning in VLMs

[Daily Podcast] Ego3D-Bench: Advancing 3D Spatial Reasoning in VLMs

This paper introduces Ego3D-Bench, the first benchmark for evaluating 3D

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities [Jihun Lee]

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities [Jihun Lee]

MLV Group Seminar (24.09.09) [Paper] SpatialVLM: Endowing Vision-Language Models with

Teaching AI to See Like a Human: The SpatialLadder Breakthrough

Teaching AI to See Like a Human: The SpatialLadder Breakthrough

Vision-Language Models like GPT-4 are incredibly powerful, but they have a surprising blind spot:

What Are Vision Language Models? How AI Sees & Understands Images

What Are Vision Language Models? How AI Sees & Understands Images

Ready to become a certified watsonx AI Assistant Engineer? Register now and use

[CVPR2026] SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving

[CVPR2026] SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving

Project Page: https://zhenghao2519.github.io/SpaceDrive_Page/ Paper: https://arxiv.org/abs/2512.10719