Back to feedSemiAnalysisTechnologypodcastRL Systems Mind the Gap: Matching Trainer and Generator ThroughputTuesday, June 16, 2026Kimbo ChenView original–––RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker