Media Summary: Aparna Dhinakaran is Co-Founder and Chief Product Officer of Arize AI; Dat Ngo is an ML Solutions Architect at Arize AI. Join the AI Evals September 2026 cohort: Doing Hamel Husain and Shreya Shankar teach the world's most popular course on AI evals and have trained over 2000 PMs and ...

Deep Dive Into Llm Evaluation - Detailed Analysis & Overview

Aparna Dhinakaran is Co-Founder and Chief Product Officer of Arize AI; Dat Ngo is an ML Solutions Architect at Arize AI. Join the AI Evals September 2026 cohort: Doing Hamel Husain and Shreya Shankar teach the world's most popular course on AI evals and have trained over 2000 PMs and ... Most devs are using LLMs daily but don't have a clue about some of the fundamentals. Understanding tokens is crucial because ...

Photo Gallery

Deep Dive into LLM Evaluation with Weights & Biases
Deep Dive into LLMs like ChatGPT
Advanced LLM Evaluation: Classes of LLM Evals – A Deep Dive
How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)
A Deep Dive on LLM Evaluation
Why AI evals are the hottest new skill for product builders | Hamel Husain & Shreya Shankar
LLM Evaluation for QA Engineers | Complete Deep Dive (Part 1)
Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan
Evaluation for Large Language Models (LLMs) and Generative AI - A Deep Dive
How to Test GenAI Agents in Production: MLflow Tracing & Evaluation Deep Dive
Evaluating LLM-based Applications
LLM as a Judge: Scaling AI Evaluation Strategies
View Detailed Profile
Deep Dive into LLM Evaluation with Weights & Biases

Deep Dive into LLM Evaluation with Weights & Biases

In

Deep Dive into LLMs like ChatGPT

Deep Dive into LLMs like ChatGPT

This is a general audience

Advanced LLM Evaluation: Classes of LLM Evals – A Deep Dive

Advanced LLM Evaluation: Classes of LLM Evals – A Deep Dive

Aparna Dhinakaran is Co-Founder and Chief Product Officer of Arize AI; Dat Ngo is an ML Solutions Architect at Arize AI.

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

Want

A Deep Dive on LLM Evaluation

A Deep Dive on LLM Evaluation

Join the AI Evals September 2026 cohort: https://maven.com/parlance-labs/evals?promoCode=yt-2026 Doing

Why AI evals are the hottest new skill for product builders | Hamel Husain & Shreya Shankar

Why AI evals are the hottest new skill for product builders | Hamel Husain & Shreya Shankar

Hamel Husain and Shreya Shankar teach the world's most popular course on AI evals and have trained over 2000 PMs and ...

LLM Evaluation for QA Engineers | Complete Deep Dive (Part 1)

LLM Evaluation for QA Engineers | Complete Deep Dive (Part 1)

Want

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Today, I want

Evaluation for Large Language Models (LLMs) and Generative AI - A Deep Dive

Evaluation for Large Language Models (LLMs) and Generative AI - A Deep Dive

Evaluation

How to Test GenAI Agents in Production: MLflow Tracing & Evaluation Deep Dive

How to Test GenAI Agents in Production: MLflow Tracing & Evaluation Deep Dive

Dive into

Evaluating LLM-based Applications

Evaluating LLM-based Applications

Evaluating LLM

LLM as a Judge: Scaling AI Evaluation Strategies

LLM as a Judge: Scaling AI Evaluation Strategies

Ready

Most devs don't understand how LLM tokens work

Most devs don't understand how LLM tokens work

Most devs are using LLMs daily but don't have a clue about some of the fundamentals. Understanding tokens is crucial because ...