Pop Goes the Stack - Detailed Analysis & Overview

Pop Goes the Stack | Measuring what matters: Observability for agents | Agentic AI

Agents break the old rules of observability. Latency, throughput, and error rates still matter, but once software starts making ...

Pop Goes the Stack | Chasing Logic Chains: Inference tracing | Observability

Dive into the intricacies of AI observability and decision-making with F5's Lori MacVittie and special guest Chris Hain. Tune in ...

Pop Goes the Stack | The Impact of Inference: Performance | AI

Traditional performance meant deterministic response times. Identical inputs produced near-identical execution times.

Pop Goes the Stack | KV cache is the real inference bottleneck (Not GPUs) | Agentic AI

GPUs get all the attention, but in inference, the real bottleneck is often memory, specifically the KV cache. In this episode of F5's ...

Pop Goes the Stack | DevOps meets agents: Risk, audit, and the Deming playbook | AI

AI is no longer a lab tool—it's showing up in pipelines, production systems, and the places where “seemed like a good idea” ...

Pop Goes the Stack: Five nines of wrong - Detecting drift and errors in AI systems | LLM

Uptime used to mean reliability. But in the LLM era, five nines just means your liar is always available. Real reliability now ...

Pop Goes the Stack | The Impact of Inference: Reliability | AI

Traditional reliability meant consistency. Given identical inputs, systems produced identical outputs. Costs were stable and ...

Pop Goes the Stack | Model routing isn’t load balancing (And that’s why you’re not ready) | AI

Multi-model AI isn't a buzzword anymore; it's how organizations are actually operating. In this episode of F5's ...

Pop Goes the Stack | Data poisoning: You can’t patch what an LLM “learns” | AI

If you've been treating “garbage in, garbage out” as a metaphor, this episode turns it into a live-fire scenario. Lori MacVittie and ...

Pop Goes the Stack | OpenClaw: Multi-agent autonomy, secrets, and blast radius | AI

OpenClaw is what happens when the industry looks at autonomous agents and decides they should have more autonomy, more ...

Pop Goes the Stack | The Impact of Inference: Availability | AI

What does "availability" mean in a world of AI inferencing and ever-shifting workloads? It's no longer just about servers ...

Pop Goes the Stack | Alien autopsy of LLMs: Constitutions, deception, guardrails | AI

Why do researchers keep describing large language models like aliens? Because in enterprise environments, they often behave ...

Pop Goes the Stack | When context eats your architecture | GenAI

Anthropic lobbed a million-token grenade into the coding wars, and suddenly every AI startup with a “clever context ...