AI inference platform unifying model deployment across hardware and cloud
Modular builds a compiler and runtime stack for AI inference, with heavy investment in low-level optimization (CUDA, MLIR, LLVM, kernel libraries) and multi-cloud deployment (AWS, GCP, Azure, Kubernetes). The engineering-dominated org (15 of 18 roles) and project focus on GPU kernel optimization, hardware abstraction, and portability infrastructure suggest they're solving friction around deploying trained models to heterogeneous hardware — a core pain point their roadmap explicitly targets.
Modular is an AI developer platform founded in 2022, focused on unifying the development and deployment of AI across diverse hardware and cloud environments. The company operates from a distributed footprint with 51–200 employees, concentrated in engineering and based primarily in the United States, United Kingdom, and Norway. Their core surface area spans AI inference optimization (LLM inference, GPU kernels, cold-start performance), model deployment tooling, and an emerging hardware abstraction strategy via open-source and Kubernetes operators. Active hiring skews toward senior and lead engineers, signaling maturity in execution and a scaling infrastructure play.
Modular uses C/C++, CUDA, LLVM, MLIR, Rust, Python, and Mojo for its core compiler and runtime. Deployment targets include Kubernetes, Helm, and cloud platforms (AWS, GCP, Azure). ML frameworks integrated are PyTorch, TensorFlow, and JAX.
Core projects include LLM inference platform development, GPU kernel optimization, AI inference kernel optimization, hardware architecture support, and portability infrastructure. The roadmap also covers Mojo standard library development and open hardware ecosystem strategy.
Modular's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.