Cast AI Tech Stack

Kubernetes automation and cost optimization for multi-cloud AI workloads

Software Development Miami, FL 201–500 employees Founded 2019 Privately Held

Cast AI automates Kubernetes cluster management across AWS, GCP, and Azure, with a tech stack centered on Go, Python, and container orchestration (Kubernetes, ArgoCD, Helm). The company is actively adopting large language models (LLAMA, Grok) and deploying inference optimization tools (vLLM, SGLang, TensorRT), signaling a pivot toward AI workload automation alongside traditional cloud cost optimization. Engineering dominates hiring (77 of 123 roles), with senior-level concentration (91 senior vs. 13 mid), indicating focus on scaling core platform capabilities rather than sales or customer success.

Tech Stack 61 technologies

Core StackGo Python Kubernetes AWS PostgreSQL gRPC Prometheus Grafana Loki GitLab CI/CD ArgoCD MySQL C++ ClickHouse Helm AWS RDS PyTorch Auth0 Okta GCP Azure Google Cloud Pub/Sub REST Tempo Envoy Cloud SQL Azure SQL Database vLLM SGLang TensorRT+29 more

AdoptingGo LLAMA Grok Karpenter

What Cast AI Is Building

◆Challenges

Manual tuning inefficiencies
Improve performance
Boost productivity
Manual decision making
Manual tuning of kubernetes
Improving application performance
Reducing cloud costs
Reducing infrastructure costs
Scalable reporting system
Scaling kubernetes environments

▲Active Projects

Authn secure login experiences
Workload optimization (woop)
Billing and audit trails
Autoscaler
Automated llm selection
Infrastructure-as-code platform development
Repeatable environment creation for testing, disaster recovery, and production
Self-service sso integrations
Authz rbac system
Llm deployment and management in kubernetes

Hiring Activity

Accelerating120 roles · 55 in 30d

Department

Engineering

Sales

Product

Marketing

Support

Finance

Ops

Seniority

Senior

Mid

Manager

Company intelligence

Find more companies like Cast AI by tech stack, pain points and active projects

Get started free

About Cast AI

Cast AI builds an automation platform for Kubernetes and cloud-native infrastructure, targeting engineering teams managing multi-cloud deployments. The product addresses two operational pain points: manual tuning inefficiencies in Kubernetes cluster management and rising infrastructure costs across AWS, GCP, and Azure. Cast AI's active project roadmap emphasizes workload optimization, autoscaling, LLM deployment in Kubernetes, and infrastructure-as-code tooling. The company is headquartered in Miami and operates across 15 countries, with primary engineering centers in Eastern Europe and the United States.

HeadquartersMiami, FL

Company Size201–500 employees

Founded2019

Hiring MarketsBulgaria, Romania, Lithuania, Poland, Ukraine, Hungary, United States, Germany

Frequently Asked Questions

What tech stack does Cast AI use?

Go, Python, Kubernetes, AWS, GCP, Azure, PostgreSQL, Prometheus, Grafana, GitLab CI/CD, ArgoCD, and Helm. Recent additions include vLLM, SGLang, and TensorRT for LLM optimization.

What is Cast AI working on?

Workload optimization, Kubernetes autoscaling, LLM deployment and selection automation, authentication and authorization systems, billing/audit trails, and infrastructure-as-code platforms.