Media Summary: Today, I want to share a new episode with Aman Khan. The best way to learn about Want your team maximizing Claude? I run 1:1 and team In this comprehensive talk (adapted from my presentation at ODSC), I provide a practical, hands-on framework for

How To Evaluate Ai Applications - Detailed Analysis & Overview

Today, I want to share a new episode with Aman Khan. The best way to learn about Want your team maximizing Claude? I run 1:1 and team In this comprehensive talk (adapted from my presentation at ODSC), I provide a practical, hands-on framework for Are you still relying on the "vibe check" to Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

Photo Gallery

How to evaluate AI applications
LLM as a Judge: Scaling AI Evaluation Strategies
Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan
The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)
How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)
How to Evaluate (and Improve) Your LLM Apps
A Practical Guide to Evaluating Generative AI Applications - Updated Nov 2025
Stop Guessing: How to Actually Measure AI Performance (AI Evals)
How to Evaluate AI Agents ?
How to evaluate your Gen AI models with Vertex AI
How to evaluate an LLM application
Must-Learn AI Skill for PMs: AI Evals (and how to set them up)
View Detailed Profile
How to evaluate AI applications

How to evaluate AI applications

Vertex

LLM as a Judge: Scaling AI Evaluation Strategies

LLM as a Judge: Scaling AI Evaluation Strategies

Ready to become a certified watsonx

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Today, I want to share a new episode with Aman Khan. The best way to learn about

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

Learn how to professionally

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

Want to learn real

How to Evaluate (and Improve) Your LLM Apps

How to Evaluate (and Improve) Your LLM Apps

Want your team maximizing Claude? I run 1:1 and team

A Practical Guide to Evaluating Generative AI Applications - Updated Nov 2025

A Practical Guide to Evaluating Generative AI Applications - Updated Nov 2025

In this comprehensive talk (adapted from my presentation at ODSC), I provide a practical, hands-on framework for

Stop Guessing: How to Actually Measure AI Performance (AI Evals)

Stop Guessing: How to Actually Measure AI Performance (AI Evals)

Are you still relying on the "vibe check" to

How to Evaluate AI Agents ?

How to Evaluate AI Agents ?

Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

How to evaluate your Gen AI models with Vertex AI

How to evaluate your Gen AI models with Vertex AI

Gen

How to evaluate an LLM application

How to evaluate an LLM application

How to evaluate

Must-Learn AI Skill for PMs: AI Evals (and how to set them up)

Must-Learn AI Skill for PMs: AI Evals (and how to set them up)

NOTE: see our updated

AI Agents, Clearly Explained

AI Agents, Clearly Explained

My