GenAI infrastructure platform for banking, financial services, and insurance
AbleCredit operates a GenAI workflow infrastructure layer purpose-built for BFSI enterprises, running on AWS with Kubernetes, vLLM, and Temporal. The tech stack reveals a deep focus on inference optimization (NVIDIA, vLLM, SGLang, Triton, FastAPI, gRPC) and async execution (Kafka, Redis, Celery, Temporal), paired with extensive security hardening (zero-trust, GuardDuty, Vault, KMS, AWS WAF). Current hiring—concentrated in engineering and security, weighted toward senior roles, with velocity accelerating into India—signals both infrastructure maturity and a scaling phase driven by compliance and deployment complexity.
AbleCredit builds GenAI infrastructure for financial services firms, enabling them to deploy custom workflows across onboarding, claims, credit decisioning, and collections. Founded in 2023, the company is based in Milton, Delaware, and operates as a privately held firm with 11–50 employees. Their platform sits between foundation models and enterprise BFSI operations, handling model serving, async task orchestration, compliance automation, and inference scaling under load. The product is architected around containerized LLM deployment, real-time API exposure (gRPC/FastAPI), and queue-based workflow execution to isolate inference scaling from operational dependencies.
AWS (EKS, KMS, GuardDuels, WAF, CloudTrail), Kubernetes, NVIDIA GPUs, vLLM, Triton, Python, Go, FastAPI, gRPC, Kafka, Redis, Celery, Temporal, Terraform, Vault, Docker.
Zero-trust access control, security pipeline automation, GPU-based LLM deployment and inference tuning, FastAPI/gRPC API design, async queue-based AI workflow execution, and defense-in-depth architecture.
Other companies in the same industry, closest in size