LLM inference and agent infrastructure for enterprise software automation
Poolside operates a model-serving and software-agent platform built on PyTorch, Kubernetes, and NVIDIA GPU infrastructure. The tech stack—heavy on ML frameworks (JAX, vLLM, Triton), orchestration (Kubernetes, Terraform, ArgoCD), and observability (Datadog, Prometheus)—reflects a company focused on distributed LLM training, inference optimization, and deployment at scale. Active projects around multi-device inference, source-code generation, and deployment architecture evolution signal a shift from model-building toward production infrastructure; pain points around latency, throughput, and experimentation-pipeline velocity indicate the technical hurdles between research and reliable enterprise deployments.
Poolside builds models and deployment infrastructure for enterprise software agents. Founded in 2023, the company operates from San Francisco with 51–200 employees structured heavily toward engineering (14 roles) and data (5 roles), with leadership representation across three C-level positions. Their active project list spans LLM training and inference, code generation, secure deployment in hybrid environments, and automation tooling. The technology footprint—PyTorch, Kubernetes, vLLM, CUDA, and NVIDIA—indicates deep ML ops maturity; challenges around synthetic data, deployment scaling, and experimentation latency suggest they are solving production reliability and velocity problems endemic to LLM deployment, not foundational model research.
PyTorch, JAX, Kubernetes, AWS, Terraform, ArgoCD, vLLM, NVIDIA, Triton, CUDA, Datadog, Prometheus, and Go. Recently adopting Rust.
San Francisco, California. Currently hiring only in the United States.
Other companies in the same industry, closest in size