AI answer engine powered by search and large language models
Perplexity operates an answer engine combining search, retrieval-augmented generation (RAG), and large-scale LLM inference. The tech stack reveals a production-heavy orientation: Kafka, Flink, Spark, and Dagster power real-time data pipelines; Kubernetes and Terraform manage infrastructure at scale; PostgreSQL, DynamoDB, Cassandra, and ClickHouse support varied query patterns. Active adoption of RAG and dbt signals maturation of grounding and data transformation layers, while pain points cluster around inference bottlenecks, search quality, and cluster utilization—classic scaling challenges for inference-driven products.
Notable leadership hires: Site Lead, Design Director
Perplexity builds an answer engine that combines web search with large language models to deliver sourced responses to user queries. The product stack spans search ranking, RAG pipelines for grounding, LLM inference orchestration, and an emerging agentic layer for infrastructure management. The company is 201–500 people, headquartered in San Francisco, with engineering-heavy hiring across the United States, Serbia, Germany, United Kingdom, and Japan. Active projects reveal a maturing platform: developing inference APIs for internal and external consumption, deploying ML models for real-time responses, building LLM-as-a-judge systems for quality control, and constructing AI-readable data warehouses to support reasoning at scale.
Perplexity runs on Python, Go, and Rust for services; Kafka, Apache Flink, and Spark for streaming data; Kubernetes and Terraform for infrastructure; PostgreSQL, DynamoDB, Cassandra, and ClickHouse for storage; and Databricks and Snowflake for analytics. React and TypeScript power front-end interfaces.
Current projects include search platform and model stack components, inference APIs for external customers, large-scale ML model deployment for real-time inference, agent-driven infrastructure management, LLM-as-a-judge systems, and RAG pipelines for answer grounding.
Perplexity's technology stack, projects, and hiring signals are inferred from public hiring and company data — career pages, public listings, and company web presence — then clustered and de-duplicated. Figures are estimates that refresh over time. Read our full methodology →
This is not an official vendor or customer list. It is a technology-adoption signal inferred from public data, intended for B2B research.