LLM Reinforcement Learning Tutorial - Video Search

Reinforcement Learning with LLMs: a new era of AI agents

YouTube · Shaw Talebi

📈 Transform Your Business with AI: https://aibuilder.academy/yt/slJqu3N16Xc 🤓 Get the (free) Claude Code Course: https://aibuilder.academy/courses/yt/slJqu3N16Xc This is the 2nd video in a larger series on reinforcement learning (RL) with LLMs. Here, I discuss 3 ways people are using RL to train modern LLMs and AI agents. ️ Series ...

3,869 views · 2 months ago

JMI LLM Updated Syllabus 2026 📚 | Complete Breakdown + Subjects, Pattern & Preparation Strategy

YouTube · MLS LAW ACADEMY

288 views · 1 month ago

NISM LLM 2026 Preparation Strategy | Complete Syllabus & Subject-Wise Plan

YouTube · Jurisedge LLM & UGC Law

768 views · 2 months ago

Mastering LLM Chatbots And RAG Evaluation Crash Course

YouTube · Krish Naik

25K views · 1 month ago

Popular videos

LLM Fine-Tuning Course – From Supervised FT to RLHF, LoRA, and Multimodal

YouTube · freeCodeCamp.org

57K views · 1 month ago

Reinforcement Learning with Verifiable Rewards - Teaching LLMs to Solve Problems

YouTube · Adam Lucek

5,530 views · 5 months ago

LLM Full Course 2026 | LLM Tutorial For Beginners | Introduction to LLM | LLM Training | Simplilearn

YouTube · Simplilearn

1,467 views · 3 weeks ago

LLM Application Process

Generative AI+LLM Full Course 2026 | Gen AI & LLM Tutorial for Beginner | Gen AI Explained | Edureka

YouTube · edureka!

11K views · 7 months ago

Building LLM Application Part 1- Prompt Engineering

YouTube · M365 & Modern Tech Hub

1,837 views · 4 months ago

2025 BC Law LLM Application Workshop

87 views · 5 months ago

Lecture 4 - Reinforcement Learning - Basics | Reasoning LLMs from Scratch

7,554 views · April 17, 2025

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 10: RL for LLM Reasoning

3,754 views · 4 months ago

YouTube · Stanford Online

Microsoft Agent Lightning: Next-Gen LLM Reinforcement Learning Framework Explained

960 views · 5 months ago

YouTube · AI Learning Hub - Byte-Size AI Learn

Reinforcement Learning With Human Values - New LLM Reasoning Training Method

212 views · 6 months ago

YouTube · Vuk Rosić

How to finetune LLMs to THINK with Reinforcement Learning (GRPO from scratch!)

25K views · 10 months ago

YouTube · Neural Breakdown with AVB

Reinforcement Learning for LLMs in 2025

15K views · February 10, 2025

YouTube · Trelis Research

Reinforcement Learning: A (practical) introduction

2,783 views · 3 months ago

YouTube · Shaw Talebi

How to do Distributed RL Training for LLM? feat. Eric Yang from Gradient

1,316 views · 3 weeks ago

YouTube · Deep Learning with Yacine

Reinforcement Learning for LLM Reasoning. RL / RLHF / RLAIF.

86 views · 5 months ago

YouTube · AI Podcast Series. Byte Goose AI.

[UCLA RL-LLM] Chapter 3.2: Reinforcement learning with verifiable rewards (RLVR)

3,640 views · 9 months ago

YouTube · Ernest Ryu

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

112K views · 9 months ago

YouTube · AI Engineer

How Reinforcement Learning Works (Tutorial)

33K views · 4 months ago

YouTube · Matthew Berman

verl: Flexible and Scalable Reinforcement Learning Library for LLM Reasoning and Tool-Calling

4,921 views · 8 months ago

Train & Fine-Tune Your Own LLM - සිංහලෙන් | Pre-Training, Fine-Tuning with LoRA & QLoRA

2,006 views · 2 weeks ago

Efficient LLM RL Training with Experience Replay

20 views · 2 weeks ago

YouTube · AI Research Roundup

RLHF from scratch, step-by-step, in code

2,825 views · 10 months ago

YouTube · Ashwani Kumar

Intro to Fine-Tuning Large Language Models

57K views · 7 months ago

YouTube · freeCodeCamp.org

Evolution Strategies at Scale: LLM Fine Tuning Beyond Reinforcement Learning

454 views · 1 month ago

YouTube · alphaXiv

How LLMs Are Actually Trained: Pre-Training vs. Post-Training Explained (with Julien Launay)

4,571 views · 8 months ago

YouTube · Super Data Science: ML & AI Podcast with Jo…

GDPO Explained: NVIDIA Fixes GRPO for LLM Reinforcement Learning

3,196 views · 3 months ago

YouTube · AI Papers Academy

LLM Course – Build a Semantic Book Recommender (Python, OpenAI, LangChain, Gradio)

404K views · January 27, 2025

YouTube · freeCodeCamp.org

[UCLA RL-LLM] Chapter 1.1: MDP foundations, imitation learning, and value iteration

7,758 views · 9 months ago

YouTube · Ernest Ryu

Reinforcement Learning (RL) for LLMs

13K views · March 12, 2025

YouTube · Natasha Jaques

Training LLM to play chess using Deepseek GRPO reinforcement learning

19K views · March 1, 2025

YouTube · Efficient NLP

GRPO 2.0? DAPO LLM Reinforcement Learning Explained

6,369 views · March 25, 2025

YouTube · AI Papers Academy

Advanced LLM Post-Training: SFT, DPO, Reinforcement Learning w/ Maxime Labonne (Liquid AI)

274 views · 5 months ago

YouTube · Youth AI Initiative
