echoloc

Baseten Tech Stack

AI infrastructure platform for deploying and scaling large language models

Software Development San Francisco, CA 201–500 employees Privately Held

Baseten operates an inference-focused AI infrastructure platform built on NVIDIA GPUs, Kubernetes, and custom performance optimization (vLLM, TensorRT, CUDA). The stack reveals a systems-level engineering focus: adoption of InfiniBand and RDMA signifies movement toward high-performance interconnect, while the project list spans frontier model APIs, billing platform integrations, and internal ML optimization — indicating product maturation beyond early-stage tooling. Hiring is engineering-heavy (55 of 102 roles) with senior-skewed seniority distribution, paired with active GTM hiring, suggesting scaling from technical adoption to enterprise sales motion.

Tech Stack 82 technologies

Core StackSlack Kubernetes PyTorch Docker Python Go JavaScript React TypeScript C++ Salesloft Grafana Loki Prometheus Terraform vLLM InfiniBand NVIDIA Nsight Systems GitHub Codespaces ComfyUI Whisper NVLink CUDA Nsight Systems Nsight Compute TensorRT Outreach TensorRT-LLM SGLang X+48 more
AdoptingCursor RDMA InfiniBand Codex NVLink

What Baseten Is Building

Challenges

  • Capacity management
  • Improving engineering productivity
  • Scalable infrastructure
  • Scaling large models
  • Billing strategic lever
  • Deploying ml models
  • Building foundation and processes for customer-facing teams
  • Multi-tenant isolation
  • Enhancing ci pipeline speed
  • Optimizing ml model inference performance

Active Projects

  • Model apis for frontier models
  • Baseten inference stack
  • Playbook style deployments
  • Go-to-market strategy for fastest growing customer segment
  • Building foundation for customer-facing teams
  • Compensation plan design and rollout
  • The fastest, most accurate whisper transcription
  • Customer pocs
  • Billing platform and orb integrations
  • Internal tooling for quoting and approvals

Hiring Activity

Steady100 roles · 35 in 30d

Department

Engineering
55
Marketing
10
HR
9
Sales
9
Product
5
Ops
3
Data
2
Legal
2

Seniority

Senior
46
Mid
27
Manager
17
Junior
5
Lead
3

Notable leadership hires: Head of Legal Operations

Company intelligence

Find more companies like Baseten by tech stack, pain points and active projects

Get started free

About Baseten

Baseten provides an inference platform designed to simplify deployment and scaling of large language models. The platform abstracts hardware provisioning and optimization, offering global availability with claimed 99.99% uptime. The company targets developers and machine learning teams who need to move models from experimental to production without managing underlying infrastructure. Core technical surface includes model APIs for frontier models, a proprietary inference stack, and billing integrations. The organization is headquartered in San Francisco and operates with 201–500 employees, currently hiring across engineering, sales, marketing, and product.

HeadquartersSan Francisco, CA
Company Size201–500 employees
Hiring MarketsUnited States, Canada

Frequently Asked Questions

What is Baseten's tech stack?

Baseten's core stack includes NVIDIA CUDA, PyTorch, Kubernetes, vLLM, TensorRT, and InfiniBand. Recent adoptions: RDMA, Cursor, and Codex. Monitoring and observability via Prometheus, Grafana, and Loki.

Where does Baseten hire?

Baseten hires in the United States and Canada. Headquarters is San Francisco, CA.

How this profile is built

Baseten's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →

This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.