Cloudera Logo

Cloudera

Staff Software Engineer - Apache Spark

Reposted 4 Days Ago
In-Office or Remote
3 Locations
165K-230K Annually
Senior level
In-Office or Remote
3 Locations
165K-230K Annually
Senior level
The Staff Software Engineer will architect and build scalable solutions for Cloudera's Data Platform, contribute to Apache Spark, enhance engineering processes, and work with large-scale distributed systems.
The summary above was generated by AI

Business Area:

Engineering

Seniority Level:

Mid-Senior level

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry.  Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. 

Are you ready to architect the future of big data? Cloudera is searching for a visionary Staff Software Engineer with deep expertise in distributed systems to join the Apache Spark Team. You will be at the forefront of innovation, building our next-generation, enterprise-grade system designed to conquer data challenges at a massive scale—running Spark on thousands of nodes and crunching petabytes of data for the world's largest companies. This is your chance to directly influence the open-source community as a key contributor to Apache Spark while collaborating with a high-impact, distributed team that includes multiple Spark committers. If you're passionate about pushing the boundaries of distributed data processing, come build the impossible with us.  

As a Staff Engineer you will:

  • Pioneer Scalable Solutions: Architect, implement, and deliver next-generation features for Cloudera’s Data Engineering Experience, operating at a massive scale on thousands of production nodes.

  • Drive Open-Source Innovation: Be a core contributor to Apache Spark, directly shaping the future of distributed data processing in the open-source community.

  • Build with Modern Stacks: Develop high-performance features using Scala, Java, and Python on modern data platforms.

  • Deepen Technical Mastery: Gain and apply expert-level knowledge in core distributed data processing concepts, including:

    • SQL Planners and Optimizers

    • Data layout and modern table formats like Apache Parquet and Iceberg

    • Fault tolerance and resilience in large-scale distributed systems.

  • Own the Technology Stack: Develop a deep technical understanding of components across the Cloudera Data Engineering Experience, with a focus on Iceberg and Spark, applying this knowledge to your daily tasks.

  • Conquer Large-Scale Challenges: Work hands-on with massive distributed systems, scaling from hundreds to thousands of nodes in live production clusters.

  • Ensure System Integrity: Conduct thorough root cause analysis, debug complex system-level deployment issues, and resolve failures to maintain high system quality.

  • Enhance Engineering Velocity: Improve internal infrastructure and tooling to streamline development, testing, and deployment processes.

  • Collaborate and Influence: Work closely with a high-impact, distributed team and stakeholders to drive product vision and delivery.

We are excited about you if you have:
  • Professional Experience: 5-7+ years of experience in professional software development.

  • Leadership & Delivery: Proven experience leading technical initiatives and delivering complex product enhancements from concept to production.

  • Core Languages: Strong proficiency in Java, Scala, or other JVM-based language.

  • Systems Expertise: Solid experience in the design and development of distributed systems.

  • Engineering Excellence: Passion for clean coding, attention to detail, and a focus on software quality and maintainability.

  • Communication: Strong oral and written communication skills for effective collaboration across a distributed team.

  • Autonomy: Demonstrated ability to research, problem-solve, and operate independently without constant supervision.

  • Growth Mindset: An open-minded approach with a desire to learn new technologies and an unwavering passion for building exceptional products.

You might also have:
  • Spark & Ecosystem Experience with using/developing Apache Spark, Apache Iceberg, or other related technologies.

  • Distributed Systems Mastery: Deep experience with large-scale, distributed systems design and development, including a strong understanding of scaling, performance optimization, and scheduling.

  • SQL Expertise: Experience with SQL Planners and Optimizers

  • Open-Source Contributions: Prior experience as a contributor to open-source projects.

Why this role matters:

You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.

 Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.
 

 The expected base salary range for this role in:

  • California & Washington is $184,000 - $230,000 USD

  • Canada is $165,000 - $206,000 CAD

The salary will vary depending on your job-related skills, experience and location.  

This position is not eligible for sponsorship.

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-REMOTE

#LI-SZ1

Cloudera Austin, Texas, USA Office

515 Congress, Austin, TX, United States, 78701

Similar Jobs

13 Hours Ago
Remote or Hybrid
United States
67K-101K Annually
Junior
67K-101K Annually
Junior
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Provide tactical HR support for Manheim Shared Services including employee relations, program implementation, talent and workforce initiatives, data analysis and reporting, and continuous improvement. Advise managers on policies, coordinate HR program logistics, conduct exit interviews, and partner with HRBPs and COEs to improve employee experience and organizational effectiveness. Up to 25% travel; US remote.
Top Skills: Excel
13 Hours Ago
Remote or Hybrid
United States
92K-154K Annually
Senior level
92K-154K Annually
Senior level
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Customer Success Manager is responsible for driving customer outcomes across a portfolio, managing relationships and retention, and collaborating across various teams to ensure value realization and maximize customer satisfaction.
Top Skills: AICloudCustomer SuccessManaged ServicesSaaS
13 Hours Ago
Remote or Hybrid
TX, USA
67K-101K Annually
Junior
67K-101K Annually
Junior
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Provide compliance support for employment laws and internal policies: administer posters and notices, manage communications and audits, conduct data analyses, update policies, support cyclical programs, and serve as project lead for small HR compliance initiatives while partnering with stakeholders.
Top Skills: HrisMicrosoft Office (ExcelPowerpoint)Reporting ToolsWordWorkday

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account