Foundation Model Based Open Vocab

Media Summary: Paper: Abstract: This paper describes a strategy for implementing a robotic system capable of ... Going dizzy with all the different AI-related terms being thrown around? What's a transformer, and what does it have to do with ... MERL researchers Kuan-Chuan Peng, Suhas Lohit, and Michael J. Jones and their co-authors Xinhao Xiang and Jiawei Zhang ...

Foundation Model Based Open Vocab - Detailed Analysis & Overview

Paper: Abstract: This paper describes a strategy for implementing a robotic system capable of ... Going dizzy with all the different AI-related terms being thrown around? What's a transformer, and what does it have to do with ... MERL researchers Kuan-Chuan Peng, Suhas Lohit, and Michael J. Jones and their co-authors Xinhao Xiang and Jiawei Zhang ... IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 Exploring the Potential of Large Explore watsonx → Building a trustworthy and efficient, high-performing SPEAKER: Shalini De Mello is a Director of Research, New Experiences at NVIDIA, where she leads a team on AI-Mediated ...

VIDEO DESCRIPTION ✍️ Let's kick things off by introducing the concept of AI and machine learning ICCV 2023 Tutorial on Learning with Noisy and Unlabeled Data for Large OVRCOAT: Mitigating Objectness Bias and Region-to-Text Misalignment for Join Ananya Kumar, a fifth-year PhD student at Stanford University, as he delves into the world of

Photo Gallery

Foundation Model based Open Vocab Task Planning & Executive System for General Purpose Service Robot

AI Buzzwords Explained: Foundation Model, Chatbot, and More

[BMVC 2025] Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

[CVPR'24] Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

Why Are There So Many Foundation Models?

How to Build Enterprise-Ready Foundation Models

Open Vocabulary Recognition with Large Text-to-Image Foundational Model

What is a Foundation Model ?

Open-Vocabulary Visual Perception upon Frozen Vision and Language Models (Yin Cui, Google)

Shalini De Mello: Open-Vocabulary Recognition with Large Image-Text Foundational Models

OVRCOAT: Open-Vocabulary Panoptic Segmentation | CVPR 2026

TiPToP: Modular Open-Vocabulary Robot Planning Without Training Data

View Detailed Profile

Foundation Model based Open Vocab Task Planning & Executive System for General Purpose Service Robot

Foundation Model based Open Vocab Task Planning & Executive System for General Purpose Service Robot

Paper: https://arxiv.org/abs/2308.03357 Abstract: This paper describes a strategy for implementing a robotic system capable of ...

AI Buzzwords Explained: Foundation Model, Chatbot, and More

AI Buzzwords Explained: Foundation Model, Chatbot, and More

Going dizzy with all the different AI-related terms being thrown around? What's a transformer, and what does it have to do with ...

[BMVC 2025] Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

[BMVC 2025] Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

MERL researchers Kuan-Chuan Peng, Suhas Lohit, and Michael J. Jones and their co-authors Xinhao Xiang and Jiawei Zhang ...

[CVPR'24] Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

[CVPR'24] Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 Exploring the Potential of Large

Why Are There So Many Foundation Models?

Why Are There So Many Foundation Models?

Check out watsonx: hhttps://ibm.biz/BdvyLa There are a lot of

How to Build Enterprise-Ready Foundation Models

How to Build Enterprise-Ready Foundation Models

Explore watsonx → https://ibm.biz/BdPANF Building a trustworthy and efficient, high-performing

Open Vocabulary Recognition with Large Text-to-Image Foundational Model

Open Vocabulary Recognition with Large Text-to-Image Foundational Model

SPEAKER: Shalini De Mello is a Director of Research, New Experiences at NVIDIA, where she leads a team on AI-Mediated ...

What is a Foundation Model ?

What is a Foundation Model ?

VIDEO DESCRIPTION ✍️ Let's kick things off by introducing the concept of AI and machine learning

Open-Vocabulary Visual Perception upon Frozen Vision and Language Models (Yin Cui, Google)

Open-Vocabulary Visual Perception upon Frozen Vision and Language Models (Yin Cui, Google)

ECCV 2022 CVinW Workshop Invited Talk:

Shalini De Mello: Open-Vocabulary Recognition with Large Image-Text Foundational Models

Shalini De Mello: Open-Vocabulary Recognition with Large Image-Text Foundational Models

ICCV 2023 Tutorial on Learning with Noisy and Unlabeled Data for Large

OVRCOAT: Open-Vocabulary Panoptic Segmentation | CVPR 2026

OVRCOAT: Open-Vocabulary Panoptic Segmentation | CVPR 2026

OVRCOAT: Mitigating Objectness Bias and Region-to-Text Misalignment for

TiPToP: Modular Open-Vocabulary Robot Planning Without Training Data

TiPToP: Modular Open-Vocabulary Robot Planning Without Training Data

Paper: TiPToP: A Modular

Foundation Models Tutorial, and Why Not to Fine Tune Them

Foundation Models Tutorial, and Why Not to Fine Tune Them

Join Ananya Kumar, a fifth-year PhD student at Stanford University, as he delves into the world of