Every Step Evolves Scaling Reinforcement - Detailed Analysis & Overview

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model (Oct 2025)

Rishabh Agarwal: The Art of Scaling Reinforcement Learning Compute for LLMs

Guest lecture for the LLMs course at McGill https://mcgill-nlp.github.io/teaching/comp767-ling782-W26/ Slides are here: ...

Evolution Strategies at Scale: LLM Fine Tuning Beyond Reinforcement Learning

Description: Fine-tuning large language models (LLMs) for downstream tasks is an essential

Scaling Up Reinforcement Learning

Distributed machine learning is an important area that has been receiving considerable attention from academic and industrial ...

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 9: Scaling Laws

For more information about Stanford's online Artificial Intelligence programs, visit: https://stanford.io/ai To learn more about ...

Ep#84: Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

With Anthony Liang, Yigit Korkmaz, and Jesse Zhang ...

EGGROLL: Scaling Evolution Strategies Training

In this AI Research Roundup episode, Alex discusses the paper: '

A visual guide on Reinforcement Learning - the 6 things that makes it “click”

In this video, I will give you the "big picture" that makes everything click when it comes to learning

Reinforcement Learning Series: Overview of Methods

This video introduces the variety of methods for model-based and model-free

Every Level of Scale Explained in 23 Minutes

Want a $200M Operating System? Let's work together: https://go.scalable.co/491qxq2 ...

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

Why is

A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason and Agentic Systems

arxiv: https://arxiv.org/pdf/2504.09037
1) Overview and Main Themes
• Emergence of LLM Reasoning:
  – The survey discusses ...

EvoAgentX Talk: SkillOS: Learning Skill Curation for Self-Evolving Agents

In this episode of EvoAgentX Talk, Ouyang Siru (profile: https://siruo2.notion.site/) presents a detailed sharing on SkillOS, ...