Liquid AI builds optimized AI systems using low-level GPU infrastructure (CUDA, custom kernels, TensorRT) paired with production training frameworks (PyTorch, DeepSpeed, Megatron-LM). The stack reveals an engineering-first org solving performance constraints: projects span custom CUDA kernel development, data-loading bottleneck elimination, and on-device VLM inference under latency constraints. Adopting llama.cpp and ExecuTorch signals focus on efficient edge deployment alongside enterprise-scale training.
Liquid AI, founded in 2023 and headquartered in Cambridge, Massachusetts, develops efficient AI systems optimized for both training at scale and inference under real-time constraints. The company serves enterprise customers deploying large language and vision models, with active work on Japanese LLM fine-tuning and dataset curation suggesting regional expansion. Their technical surface spans distributed GPU training infrastructure, benchmark harnesses to track performance regression, and evaluation frameworks for multi-language model quality. The hiring velocity is accelerating, concentrated heavily in engineering roles across the United States and Japan.
Core: CUDA, PyTorch, DeepSpeed, Megatron-LM, cuDNN, cuBLAS for training. Inference: vLLM, TensorRT, llama.cpp, ExecuTorch. Supporting: JAX, TensorFlow, Hugging Face, SGLang, ONNX for model optimization and deployment.
Custom CUDA kernel development, distributed training infrastructure for GPU clusters, data pipelines for audio model training, Japanese LLM fine-tuning for enterprise, on-device VLM inference, benchmark and evaluation frameworks, and multimodal data loading systems.
Other companies in the same industry, closest in size
Liquid AI's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.