echoloc
COMPANIES USING

Companies Using PySpark

2,607 companies are actively hiring PySpark talent across 50 countries.

PySpark is the Python interface for Apache Spark, used to process and analyze massive datasets at scale across distributed computing environments. When a company posts PySpark roles, they're signaling active investment in data pipeline infrastructure — typically alongside cloud data platforms, orchestration tools, and storage layers. These companies are likely evaluating or already purchasing solutions like cloud warehouses, data lakehouse platforms, or MLOps tooling.

Buying Signal

A company hiring PySpark engineers is building or scaling a data engineering function — which means budget is flowing toward data infrastructure. They're likely replacing legacy ETL tools, expanding into real-time analytics, or migrating to a cloud-native stack. Vendors selling data integration, pipeline orchestration, cloud storage, or observability tools should be actively prospecting these accounts.

Top Companies44 shown

Top companies hiring PySpark talent in 2026
CompanyIndustryHQ LocationSizeJobsLinks
Veeva Systems
veeva.com
Software Development Pleasanton, United States 5,001+ 1,970
Sii Poland
sii.pl
IT Services and IT Consulting Warsaw, Poland 5,001+ 1,922
AgileEngine
agileengine.com
Software Development Boca Raton, United States 1,001–5,000 1,926
Apex Systems
apexsystems.com
IT Services and IT Consulting Glen Allen, United States 1,001–5,000 1,758
Lensa
lensa.com
Internet Publishing West Chester, United States 51–200 3,768
hackajob
hackajob.com
Software Development Greater London, United Kingdom 51–200 1,389
Bright Vision Technologies
bvteck.com
IT System Custom Software Development Bridgewater Township, United States 51–200 992
targetjobs UK
targetjobs.co.uk
Technology, Information and Internet Greater London, United Kingdom 51–200 1,727
BioSpace
biospace.com
Internet News West Des Moines, United States 11–50 1,858
NPAworldwide
npaworldwide.com
Professional Services Grand Rapids, United States 2–10 1,774
Walmart
walmart.com
Retail Bentonville, United States 10,001+ 2,311
Tata Consultancy Services
tcs.com
IT Services and IT Consulting Mumbai, India 10,001+ 2,070
CVS Health
cvshealth.com
Hospitals and Health Care Woonsocket, United States 10,001+ 2,175
WPP Media
wppmedia.com
Advertising Services New York, United States 10,001+ 1,448
Wipro
wipro.com
IT Services and IT Consulting Bengaluru, India 10,001+ 1,956
ALDI Nord Group
aldi-nord.de
Retail Essen, Germany 10,001+ 1,967
PwC Deutschland
pwc.de
Business Consulting and Services Frankfurt, Germany 10,001+ 1,936
PNC
pnc.com
Financial Services Pittsburgh, United States 10,001+ 2,213
UST
ust.com
IT Services and IT Consulting Aliso Viejo, United States 10,001+ 1,880
JLL
co.jll
Real Estate Chicago, United States 10,001+ 1,861
Optum
optum.com
Hospitals and Health Care Eden Prairie, United States 10,001+ 2,081
Deloitte
deloitte.com
Business Consulting and Services Sydney, Australia 10,001+ 1,789
HCLTech
hcltech.com
IT Services and IT Consulting Noida, India 10,001+ 1,863
TEKsystems
teksystems.com
IT Services and IT Consulting United States 10,001+ 1,866
Burlington Stores, Inc.
burlington.com
Retail Bordentown Township, United States 10,001+ 1,847
Comcast
comca.st
Telecommunications Philadelphia, United States 10,001+ 1,797
Caterpillar Inc.
caterpillar.com
Machinery Manufacturing Irving, United States 10,001+ 1,609
Lam Research
lamresearch.com
Semiconductor Manufacturing Fremont, United States 10,001+ 1,472
Leidos
leidos.com
Defense and Space Manufacturing Reston, United States 10,001+ 1,773
Thermo Fisher Scientific
thermofisher.com
Biotechnology Research Waltham, United States 10,001+ 2,172
Honeywell
honeywell.com
Appliances, Electrical, and Electronics Manufacturing Charlotte, United States 10,001+ 1,789
Labcorp
labcorp.com
Hospitals and Health Care Burlington, United States 10,001+ 1,809
Cushman & Wakefield
cushmanwakefield.com
Real Estate Chicago, United States 10,001+ 1,803
Boeing
boeing.com
Aviation & Aerospace Arlington, United States 10,001+ 2,323
Capgemini
capgemini.com
IT Services and IT Consulting Paris, France 10,001+ 1,629
HSBC
hsbc.com
Financial Services Greater London, United Kingdom 10,001+ 1,775
Infosys
infosys.com
IT Services and IT Consulting Electronic City, India 10,001+ 1,755
EY
ey.com
Professional Services Greater London, United Kingdom 10,001+ 1,635
Capital One
capitalone.com
Financial Services McLean, United States 10,001+ 1,764
Oracle
oracle.com
IT Services and IT Consulting Austin, United States 10,001+ 1,732
AT&T
att.com
Telecommunications Dallas, United States 10,001+ 2,229
Cognizant
cognizant.com
IT Services and IT Consulting Teaneck, United States 10,001+ 1,745
TD
td.com
Banking Toronto, Canada 10,001+ 1,726
Booz Allen Hamilton
boozallen.co
IT Services and IT Consulting Mclean, United States 10,001+ 1,753

2,607 companies use PySpark. Want the full list?

Export to CSV

Free account · no credit card required

Market Signal

Nearly 2,800 companies posting over 93,000 PySpark-related roles indicates this isn't a niche skill — it's a mainstream data engineering requirement. That volume suggests a broad, competitive buyer pool actively investing in distributed data infrastructure right now.

Stack Intelligence

PySpark hiring almost always co-occurs with Python, Databricks, and Spark — a combination that points to a Lakehouse architecture or a Spark-native data platform build-out. Companies showing all four signals are likely deep in a Databricks adoption cycle or evaluating it, making them high-fit targets for complementary tools like data quality, governance, or BI platforms. This stack cluster is one of the cleaner buying signals available for data infrastructure sales.

Hiring Landscape

PySpark in the Market

Where and how companies hire PySpark talent

Top Roles

Data Engineer2,047Software Engineer1,160Data Scientist1,147Project Manager1,067Business Analyst1,004Product Manager876Data Analyst864Devops Engineer851Full Stack Developer680Java Developer619Ai Engineer613Product Owner592

Related Technologies

What is PySpark?

PySpark is the Python API for Apache Spark, powering large-scale distributed data processing and big-data pipelines.

Take Action

See who’s actively building with PySpark

Open in echoloc →

Free account · no credit card required

Browse PySpark by Region & Industry

Frequently Asked Questions

What is PySpark?

PySpark enables data teams to run large-scale data processing jobs using Python, typically on cloud infrastructure. Businesses use it to build data pipelines, train machine learning models, and power analytics at scale. Hiring for PySpark signals a company is past the proof-of-concept stage and actively building production-grade data systems.

Why should sales teams track PySpark hiring?

PySpark hiring is a leading indicator that a company is investing in data infrastructure — often ahead of major platform or tooling purchases. An SDR tracking these signals can time outreach to match when budget decisions are actively being made, not after contracts are signed.

Who should be selling to companies that use PySpark?

Vendors best positioned to sell into PySpark-hiring companies include cloud data platform providers, ETL and pipeline orchestration vendors, data observability and quality tools, and MLOps or model deployment platforms.