2,607 companies are actively hiring PySpark talent across 50 countries.
PySpark is the Python interface for Apache Spark, used to process and analyze massive datasets at scale across distributed computing environments. When a company posts PySpark roles, they're signaling active investment in data pipeline infrastructure — typically alongside cloud data platforms, orchestration tools, and storage layers. These companies are likely evaluating or already purchasing solutions like cloud warehouses, data lakehouse platforms, or MLOps tooling.
A company hiring PySpark engineers is building or scaling a data engineering function — which means budget is flowing toward data infrastructure. They're likely replacing legacy ETL tools, expanding into real-time analytics, or migrating to a cloud-native stack. Vendors selling data integration, pipeline orchestration, cloud storage, or observability tools should be actively prospecting these accounts.
| Company | Website | Industry | HQ Location | Size | Jobs |
|---|---|---|---|---|---|
| Veeva Systems | veeva.com | Software Development | Pleasanton, United States | 5,001+ | 1,970 |
| Sii Poland | sii.pl | IT Services and IT Consulting | Warsaw, Poland | 5,001+ | 1,922 |
| AgileEngine | agileengine.com | Software Development | Boca Raton, United States | 1,001–5,000 | 1,926 |
| Apex Systems | apexsystems.com | IT Services and IT Consulting | Glen Allen, United States | 1,001–5,000 | 1,758 |
| Lensa | lensa.com | Internet Publishing | West Chester, United States | 51–200 | 3,768 |
| hackajob | hackajob.com | Software Development | Greater London, United Kingdom | 51–200 | 1,389 |
| Bright Vision Technologies | bvteck.com | IT System Custom Software Development | Bridgewater Township, United States | 51–200 | 992 |
| targetjobs UK | targetjobs.co.uk | Technology, Information and Internet | Greater London, United Kingdom | 51–200 | 1,727 |
| BioSpace | biospace.com | Internet News | West Des Moines, United States | 11–50 | 1,858 |
| NPAworldwide | npaworldwide.com | Professional Services | Grand Rapids, United States | 2–10 | 1,774 |
| Walmart | walmart.com | Retail | Bentonville, United States | 10,001+ | 2,311 |
| Tata Consultancy Services | tcs.com | IT Services and IT Consulting | Mumbai, India | 10,001+ | 2,070 |
| CVS Health | cvshealth.com | Hospitals and Health Care | Woonsocket, United States | 10,001+ | 2,175 |
| WPP Media | wppmedia.com | Advertising Services | New York, United States | 10,001+ | 1,448 |
| Wipro | wipro.com | IT Services and IT Consulting | Bengaluru, India | 10,001+ | 1,956 |
| ALDI Nord Group | aldi-nord.de | Retail | Essen, Germany | 10,001+ | 1,967 |
| PwC Deutschland | pwc.de | Business Consulting and Services | Frankfurt, Germany | 10,001+ | 1,936 |
| PNC | pnc.com | Financial Services | Pittsburgh, United States | 10,001+ | 2,213 |
| UST | ust.com | IT Services and IT Consulting | Aliso Viejo, United States | 10,001+ | 1,880 |
| JLL | co.jll | Real Estate | Chicago, United States | 10,001+ | 1,861 |
| Optum | optum.com | Hospitals and Health Care | Eden Prairie, United States | 10,001+ | 2,081 |
| Deloitte | deloitte.com | Business Consulting and Services | Sydney, Australia | 10,001+ | 1,789 |
| HCLTech | hcltech.com | IT Services and IT Consulting | Noida, India | 10,001+ | 1,863 |
| TEKsystems | teksystems.com | IT Services and IT Consulting | United States | 10,001+ | 1,866 |
| Burlington Stores, Inc. | burlington.com | Retail | Bordentown Township, United States | 10,001+ | 1,847 |
| Comcast | comca.st | Telecommunications | Philadelphia, United States | 10,001+ | 1,797 |
| Caterpillar Inc. | caterpillar.com | Machinery Manufacturing | Irving, United States | 10,001+ | 1,609 |
| Lam Research | lamresearch.com | Semiconductor Manufacturing | Fremont, United States | 10,001+ | 1,472 |
| Leidos | leidos.com | Defense and Space Manufacturing | Reston, United States | 10,001+ | 1,773 |
| Thermo Fisher Scientific | thermofisher.com | Biotechnology Research | Waltham, United States | 10,001+ | 2,172 |
| Honeywell | honeywell.com | Appliances, Electrical, and Electronics Manufacturing | Charlotte, United States | 10,001+ | 1,789 |
| Labcorp | labcorp.com | Hospitals and Health Care | Burlington, United States | 10,001+ | 1,809 |
| Cushman & Wakefield | cushmanwakefield.com | Real Estate | Chicago, United States | 10,001+ | 1,803 |
| Boeing | boeing.com | Aviation & Aerospace | Arlington, United States | 10,001+ | 2,323 |
| Capgemini | capgemini.com | IT Services and IT Consulting | Paris, France | 10,001+ | 1,629 |
| HSBC | hsbc.com | Financial Services | Greater London, United Kingdom | 10,001+ | 1,775 |
| Infosys | infosys.com | IT Services and IT Consulting | Electronic City, India | 10,001+ | 1,755 |
| EY | ey.com | Professional Services | Greater London, United Kingdom | 10,001+ | 1,635 |
| Capital One | capitalone.com | Financial Services | McLean, United States | 10,001+ | 1,764 |
| Oracle | oracle.com | IT Services and IT Consulting | Austin, United States | 10,001+ | 1,732 |
| AT&T | att.com | Telecommunications | Dallas, United States | 10,001+ | 2,229 |
| Cognizant | cognizant.com | IT Services and IT Consulting | Teaneck, United States | 10,001+ | 1,745 |
| TD | td.com | Banking | Toronto, Canada | 10,001+ | 1,726 |
| Booz Allen Hamilton | boozallen.co | IT Services and IT Consulting | McLean, United States | 10,001+ | 1,753 |
Nearly 2,800 companies posting over 93,000 PySpark-related roles indicates this isn't a niche skill — it's a mainstream data engineering requirement. That volume suggests a broad, competitive buyer pool actively investing in distributed data infrastructure right now.
PySpark hiring almost always co-occurs with Python, Databricks, and Spark, a combination that points to a lakehouse architecture or a Spark-native data platform build-out. Companies showing all four signals (PySpark, Python, Databricks, and Spark) are likely deep in a Databricks adoption cycle or evaluating one, making them high-fit targets for complementary tools like data quality, governance, or BI platforms. This stack cluster is one of the cleaner buying signals available for data infrastructure sales.
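As a rough illustration of how the "all four signals" filter could be applied to a prospect list, here is a minimal Python sketch. It assumes each company's job postings have already been reduced to a list of skill keywords. The function names, the `example_postings` records, and the equal weighting of signals are all hypothetical and do not come from the data on this page.

```python
# Hypothetical sketch: the names and records below are illustrative assumptions,
# not taken from the hiring data above.

# The four stack signals named in the text.
REQUIRED_SIGNALS = frozenset({"pyspark", "python", "databricks", "spark"})


def stack_signal_score(skills):
    """Return the fraction (0.0 to 1.0) of the four signals found in a skill list."""
    normalized = {skill.strip().lower() for skill in skills}
    return len(REQUIRED_SIGNALS & normalized) / len(REQUIRED_SIGNALS)


def high_fit_accounts(postings):
    """Return the sorted names of companies whose postings show all four signals."""
    return sorted(
        name
        for name, skills in postings.items()
        if stack_signal_score(skills) == 1.0
    )


# Made-up accounts, used only to exercise the functions.
example_postings = {
    "Example Retail Co": ["PySpark", "Python", "Databricks", "Spark", "Airflow"],
    "Example Bank": ["PySpark", "Python", "Hadoop"],
}

if __name__ == "__main__":
    for company, skills in example_postings.items():
        print(company, stack_signal_score(skills))
    print(high_fit_accounts(example_postings))
```

A team could just as reasonably weight Databricks more heavily than the other three keywords, or treat three of four as a partial match. The equal weighting here is only the simplest starting point.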
PySpark enables data teams to run large-scale data processing jobs using Python, typically on cloud infrastructure. Businesses use it to build data pipelines, train machine learning models, and power analytics at scale. Hiring for PySpark signals a company is past the proof-of-concept stage and actively building production-grade data systems.
PySpark hiring is a leading indicator that a company is investing in data infrastructure — often ahead of major platform or tooling purchases. An SDR tracking these signals can time outreach to match when budget decisions are actively being made, not after contracts are signed.
Vendors best positioned to sell into PySpark-hiring companies include cloud data platform providers, ETL and pipeline orchestration vendors, data observability and quality tools, and MLOps or model deployment platforms.