Hebbia builds generative AI agents for finance, deployed across investment banks and major asset managers. The stack reveals infrastructure-heavy engineering (Kafka, Spark, Hadoop, Airflow, Elasticsearch) optimized for high-scale document indexing and retrieval—core to surfacing insights across filings and internal data. Active hiring skews heavily senior (34 of 45 open roles) and engineering-focused (22 roles), while projects center on scaling matrix/indexing platforms and multi-agent frameworks, indicating rapid expansion of core AI capabilities rather than stabilization.
Hebbia is a generative AI platform for financial services, enabling investment banks and asset managers to automate analyst workflows around investment research, due diligence, and deal origination. The product surfaces insights across filings, research, and internal documents with citations, and generates documents, presentations, and spreadsheets via AI agents. Built on a distributed data stack (Spark, Hadoop, Airflow, Snowflake) coupled with real-time search (Elasticsearch, Kafka), the platform handles millions of documents at institutional scale. The company is based in New York and operates across the United States and United Kingdom.
Hebbia runs on AWS and Azure with Apache Spark, Hadoop, Kafka, and Elasticsearch for data processing and search. The backend uses Python, Go, C++, Rust, and Java. Data pipelines rely on Apache Airflow and dbt on Snowflake. Frontend uses Figma and Adobe tools. Sales/ops layer includes Salesforce, Outreach, Gong, and Apollo.
Current projects include scaling matrix and deployment platforms, building a custom multi-agent framework, o(1) universal indexing, high-scale document systems, and performance optimization solutions. Internal focus includes observability frameworks and new go-to-market structures.
Hebbia's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.