echoloc

Cast AI Tech Stack

Kubernetes automation and cost optimization for multi-cloud AI workloads

Software Development Miami, FL 201–500 employees Founded 2019 Privately Held

Cast AI automates Kubernetes cluster management across AWS, GCP, and Azure, with a tech stack centered on Go, Python, and container orchestration (Kubernetes, ArgoCD, Helm). The company is actively adopting large language models (LLAMA, Grok) and deploying inference optimization tools (vLLM, SGLang, TensorRT), signaling a pivot toward AI workload automation alongside traditional cloud cost optimization. Engineering dominates hiring (77 of 123 roles), with senior-level concentration (91 senior vs. 13 mid), indicating focus on scaling core platform capabilities rather than sales or customer success.

Tech Stack 61 technologies

Core StackGo Python Kubernetes AWS PostgreSQL gRPC Prometheus Grafana Loki GitLab CI/CD ArgoCD MySQL C++ ClickHouse Helm AWS RDS PyTorch Auth0 Okta GCP Azure Google Cloud Pub/Sub REST Tempo Envoy Cloud SQL Azure SQL Database vLLM SGLang TensorRT+29 more
AdoptingGo LLAMA Grok Karpenter

What Cast AI Is Building

Challenges

  • Manual tuning inefficiencies
  • Improve performance
  • Boost productivity
  • Manual decision making
  • Manual tuning of kubernetes
  • Improving application performance
  • Reducing cloud costs
  • Reducing infrastructure costs
  • Scalable reporting system
  • Scaling kubernetes environments

Active Projects

  • Authn secure login experiences
  • Workload optimization (woop)
  • Billing and audit trails
  • Autoscaler
  • Automated llm selection
  • Infrastructure-as-code platform development
  • Repeatable environment creation for testing, disaster recovery, and production
  • Self-service sso integrations
  • Authz rbac system
  • Llm deployment and management in kubernetes

Hiring Activity

Accelerating120 roles · 55 in 30d

Department

Engineering
77
Sales
19
Product
5
Marketing
4
Support
4
Finance
1
HR
1
Ops
1

Seniority

Senior
91
Mid
13
Manager
8
Company intelligence

Find more companies like Cast AI by tech stack, pain points and active projects

Get started free

About Cast AI

Cast AI builds an automation platform for Kubernetes and cloud-native infrastructure, targeting engineering teams managing multi-cloud deployments. The product addresses two operational pain points: manual tuning inefficiencies in Kubernetes cluster management and rising infrastructure costs across AWS, GCP, and Azure. Cast AI's active project roadmap emphasizes workload optimization, autoscaling, LLM deployment in Kubernetes, and infrastructure-as-code tooling. The company is headquartered in Miami and operates across 15 countries, with primary engineering centers in Eastern Europe and the United States.

HeadquartersMiami, FL
Company Size201–500 employees
Founded2019
Hiring MarketsBulgaria, Romania, Lithuania, Poland, Ukraine, Hungary, United States, Germany

Frequently Asked Questions

What tech stack does Cast AI use?

Go, Python, Kubernetes, AWS, GCP, Azure, PostgreSQL, Prometheus, Grafana, GitLab CI/CD, ArgoCD, and Helm. Recent additions include vLLM, SGLang, and TensorRT for LLM optimization.

What is Cast AI working on?

Workload optimization, Kubernetes autoscaling, LLM deployment and selection automation, authentication and authorization systems, billing/audit trails, and infrastructure-as-code platforms.

Similar Companies in Software Development

Other companies in the same industry, closest in size