Media Summary: Shishir Patal, a Research Scientist at Meta, delivered a presentation on Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech. Just when it seems like we know how to govern Generative

Evaluating Ai Agents Outcome Process - Detailed Analysis & Overview

Shishir Patal, a Research Scientist at Meta, delivered a presentation on Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech. Just when it seems like we know how to govern Generative What You'll Learn: Complete DeepEval installation and configuration Writing your first test cases for This video introduces a new series on testing

Photo Gallery

Evaluating AI Agents: Outcome, Process, and Cost
Evaluating AI Agents: Outcome vs. Process and How to Test Them
Agentic Evals by Shishir Patil
LLM as a Judge: Scaling AI Evaluation Strategies
Evaluating and Debugging Non-Deterministic AI Agents
How to evaluate agents in practice
How to Evaluate AI Agents ?
Metrics for Measuring AI Agent Quality
How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems
Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil
Beginner's Guide to Agent Evaluations
The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)
View Detailed Profile
Evaluating AI Agents: Outcome, Process, and Cost

Evaluating AI Agents: Outcome, Process, and Cost

A flashy demo proves an

Evaluating AI Agents: Outcome vs. Process and How to Test Them

Evaluating AI Agents: Outcome vs. Process and How to Test Them

How do you know if an

Agentic Evals by Shishir Patil

Agentic Evals by Shishir Patil

Shishir Patal, a Research Scientist at Meta, delivered a presentation on

LLM as a Judge: Scaling AI Evaluation Strategies

LLM as a Judge: Scaling AI Evaluation Strategies

Ready to become a certified watsonx

Evaluating and Debugging Non-Deterministic AI Agents

Evaluating and Debugging Non-Deterministic AI Agents

Evaluate

How to evaluate agents in practice

How to evaluate agents in practice

Evaluating Agents

How to Evaluate AI Agents ?

How to Evaluate AI Agents ?

Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

Metrics for Measuring AI Agent Quality

Metrics for Measuring AI Agent Quality

Just when it seems like we know how to govern Generative

How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems

How to Evaluate AI Agents: Comprehensive Strategies for Reliable, High‑Quality Agentic Systems

Evaluating AI agents

Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil

Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil

Is 2025 the year of

Beginner's Guide to Agent Evaluations

Beginner's Guide to Agent Evaluations

When companies deploy their

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

What You'll Learn: Complete DeepEval installation and configuration Writing your first test cases for

The agent evaluation revolution

The agent evaluation revolution

This video introduces a new series on testing