World's largest trade publisher scaling AI and data infrastructure
Penguin Random House operates at massive scale—11,000+ employees across 300+ imprints, six continents, and 20+ countries, publishing 16,000+ new titles annually. The tech stack reveals a publisher in transition: core publishing systems (SAP, FileMaker, EPUB) sit alongside emerging ML infrastructure (Python, PyTorch, TensorFlow, RAG adoption) and web development tools (React, JavaScript). Active projects cluster around LLM applications, recommendation systems, and AI-driven marketing discovery, while pain points center on data accuracy, system stability, and operational fragmentation—typical friction points when legacy publishing workflows collide with modern data and ML demands.
Penguin Random House is the world's largest trade book publisher, operating through more than 300 imprints and brands with English, German, and Spanish-language publishing businesses. The company sells more than 700 million print, audio, and ebook copies annually and maintains publishing lists that include more than 80 Nobel Prize laureates and hundreds of globally recognized authors. Headquartered in New York and owned by Bertelsmann since 2020, the organization employs over 11,000 people globally. Current hiring focuses on data roles, marketing, and engineering, with active projects spanning title-specific marketing campaigns, recommendation systems, pricing products, and AI systems for discovery.
Core publishing systems include SAP, FileMaker Pro, and EPUB standards. Data and AI layers run on Python, PyTorch, TensorFlow, and SQL. Frontend development uses React and JavaScript. Creative work spans Adobe Creative Cloud suite (Photoshop, Illustrator, InDesign) and Canva.
Active projects include LLM-based application patterns, recommendation systems, AI systems for marketing and discovery, pricing products, online experimentation, and end-to-end software systems. Operational initiatives focus on backlist file updates, title-specific marketing campaigns, and reducing duplicated cross-team effort.
Other companies in the same industry, closest in size