Zyphra Tech Stack

Full-stack AGI company building open-source language models and agentic systems

Technology, Information and Internet San Francisco, California 51–200 employees Privately Held

Zyphra is an AI research and product company focused on large-scale model training, RAG systems, and agentic runtimes. The stack—PyTorch, vLLM, Ray, Pinecone, Weaviate—reflects deep infrastructure work; active adoption of RAG, Puppeteer, and Playwright across projects signals a shift toward retrieval-augmented and agent-driven capabilities. The hiring mix skews research and engineering (12 of 14 open roles), with intentional depth in mid-level and lead positions, suggesting they're scaling model development and backend systems in parallel.

Tech Stack 43 technologies

Core StackPyTorch Python Apache Spark Weaviate Pinecone RAG Ansible Terraform AWS Docker Kubernetes JavaScript TypeScript React Figma Playwright Apache Beam FAISS Azure GCP Apptainer Slurm vLLM Ray SGLang macOS Electron Tauri Puppeteer CUDA+13 more

AdoptingRAG Playwright Puppeteer

What Zyphra Is Building

◆Challenges

Reliability of ml workloads
Scalability of compute environments
Incident response improvement
Solving fundamental bottlenecks in contemporary models
Improving core modeling capabilities
Designing efficient architectures for gpu hardware
Performance optimization of large-scale language model training
Building active community
Increasing open-source adoption
Gathering community feedback

▲Active Projects

Large-scale audio training runs
Search and retrieval pipelines across large-scale structured and unstructured data
Next generation open-source text-to-speech and audio models
Community growth strategy
Build and deployment systems
Release processes
Agentic systems and interaction projects
Secure virtualized runtimes and backend services for agent execution
Observability systems
Backend layer for rag systems

Hiring Activity

Accelerating15 roles · 2 in 30d

Department

Engineering

Research

Sales

Seniority

Mid

Lead

Senior

Principal

Company intelligence

Find more companies like Zyphra by tech stack, pain points and active projects

Get started free

About Zyphra

Zyphra builds full-stack AGI technology from model training through deployment. The company is actively working on large-scale audio and language model training, search and retrieval pipelines, open-source text-to-speech models, agentic systems, and secure execution runtimes for agents. Infrastructure includes orchestration (Kubernetes, Slurm), distributed compute frameworks (Spark, Beam, Ray), and cloud-native tooling across AWS, Azure, and GCP. They are based in San Francisco with 51–200 employees and are currently hiring across the United States.

HeadquartersSan Francisco, California

Company Size51–200 employees

Hiring MarketsUnited States

Frequently Asked Questions

What technology does Zyphra use?

Core stack: PyTorch, Python, vLLM, Ray, Kubernetes, Slurm. Data layer: FAISS, Weaviate, Pinecone, Apache Spark, Beam. Infrastructure: AWS, Azure, GCP, Docker, Terraform. Actively adopting RAG, Puppeteer, Playwright.

What is Zyphra working on?

Large-scale audio and language model training, search/retrieval pipelines, open-source text-to-speech models, agentic systems, agent execution runtimes, observability systems, and RAG backends. Focus areas include community growth and increasing open-source adoption.

Similar Companies in Technology, Information and Internet

Other companies in the same industry, closest in size

Fluidstack

Technology, Information and Internet

Retell

Technology, Information and Internet

adaption

Technology, Information and Internet

Athennian

Technology, Information and Internet

How this profile is built

Zyphra's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →

This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.