Full-stack AGI company building open-source models and agentic systems
Zyphra is a full-stack AGI company operating a heavy research and engineering footprint (13 of 15 active roles), with concentrated hiring in research and core ML engineering. The tech stack reflects deep infrastructure work: PyTorch, vLLM, Ray, and SGLang for model training and inference; Kubernetes and Slurm for orchestration at scale; and active adoption of RAG plus browser automation (Puppeteer, Playwright) suggesting expansion into retrieval-augmented and agentic pipelines. Current project focus spans large-scale audio model training, novel architectures, and secure agent runtimes—indicating a shift from pure language models toward multimodal and interactive systems.
Zyphra is a San Francisco-based AGI company with 51–200 employees developing open-source models and infrastructure for large-scale machine learning. The company operates across three core areas: model research and training (audio and novel architectures), search and retrieval systems for structured and unstructured data at scale, and agentic systems with secure execution runtimes. Their engineering roadmap includes kernel optimization for GPU workloads, observability and reliability improvements for ML pipelines, and deployment tooling (Terraform, Ansible, Docker, Kubernetes) to support distributed training runs. The organization is U.S.-based and actively hiring in research and engineering roles.
PyTorch, Python, vLLM, Ray, SGLang, Kubernetes, Slurm for model training and inference. AWS, Azure, GCP for cloud. React, Electron, Tauri for frontend. Adopting RAG, Puppeteer, and Playwright for retrieval and automation.
Large-scale audio model training, novel model architectures, search/retrieval pipelines, open-source text-to-speech, GPU kernel optimization, agentic systems with secure runtimes, and observability infrastructure for ML workloads.
Other companies in the same industry, closest in size