Application reliability platform for engineers to test, monitor, and observe
Checkly builds an application reliability platform combining testing, monitoring, and observability for engineering teams. The stack—Node.js, TypeScript, PostgreSQL, ClickHouse, Kubernetes, and OpenTelemetry—reflects a backend-heavy architecture designed to ingest and process observability signals at scale. Active projects cluster around infrastructure hardening, hybrid cloud optimization (AWS + bare metal), and ClickHouse deployment for check run logs, suggesting the platform is shifting toward higher-volume data ingestion and lower-latency query patterns.
Checkly is an Application Reliability Platform that helps engineering teams catch errors, monitor uptime, and reduce mean time to recovery (MTTR) across staging and production environments. The product combines testing (Playwright integration), monitoring (multi-region uptime checks), and observability (full-stack traces) into a single platform. Teams use it to alert on outages, update status pages in real-time, and identify root causes down to packet-level detail. Founded in 2018 and based in New York, Checkly operates with 51–200 employees across engineering and sales functions, hiring across North America and Europe.
Checkly's core stack includes Node.js, TypeScript, PostgreSQL, and ClickHouse for data, plus Kubernetes and AWS for infrastructure. They use OpenTelemetry for observability instrumentation and Playwright for browser testing.
Current projects focus on infrastructure reliability, hybrid cloud optimization (AWS and bare metal), ClickHouse deployment for observability data, status pages product improvements, and developer experience enhancements.
Other companies in the same industry, closest in size