AI infrastructure for reinforcement learning and LLM post-training
Bespoke Labs builds infrastructure for RL-based AI agents and LLM fine-tuning, with a tech stack optimized for GPU-accelerated training (CUDA, PyTorch, vLLM, Triton). The project list reveals dual focus: core platform work (agentic AI, RL environment curation, CUDA kernel development) alongside applied work (trading systems, backtesting, adversarial analysis). Pain points cluster around scaling research to production and managing frontier-scale agent training — a gap their 11-engineer, 4-researcher mix is actively hiring to close.
Notable leadership hires: Tech Lead
Bespoke Labs is a venture-funded startup in Mountain View building AI tools for data curation and post-training of large language models, with a focus on reinforcement learning for agents. The company operates across infrastructure (GPU profiling, data pipeline scaling, CUDA kernel development), research (adversarial risk, uncertainty quantification, systematic strategy development), and customer delivery on multi-million-dollar contracts. They're hiring across engineering, research, operations, and security roles in the US, India, Singapore, and Uganda, reflecting an expansion from a 2–10-person founding team into a scaled engineering organization.
CUDA, PyTorch, vLLM, Triton for GPU-accelerated training; JAX, NumPy, SciPy, scikit-learn for numerical compute; Apache Airflow and Prefect for orchestration; AWS, GCP, Azure for cloud; PostgreSQL and Elasticsearch for data storage; Next.js and TypeScript for frontend.
Core platform infrastructure for agentic AI and RL environments; CUDA kernel development for training and inference; data pipeline scaling and calibration/uncertainty quantification pipelines; trading systems and backtesting frameworks; GPU profiling and benchmarking tooling; customer project delivery.
Bespoke Labs's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.