Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Python/PySpark Developer

AT Capgemini
Capgemini

Python/PySpark Developer

Charlottesville, VA

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and build a more sustainable, more inclusive world.

Job Location : Whippany NJ (Hybrid)

Key Responsibilities

  • We are looking for a highly skilled Python/PySpark Developer with hands-on experience in Big Data technologies to join our dynamic data engineering team.
  • The ideal candidate will have a strong background in building scalable distributed data processing systems using PySpark and working on large datasets.
  • You will collaborate with data scientists engineers and other stakeholders to design and implement efficient data pipelines.
  • Develop optimize and maintain ETL pipelines using PySpark to process large-scale datasets across distributed environments.
  • Design and implement complex data transformation logic using PySpark and other Big Data tools.
  • Work with various Big Data technologies such as Hadoop Hive HBase Kafka and Spark to build robust scalable data systems.
  • Collaborate with data engineers and data scientists to integrate data from multiple sources and create unified datasets.

Want more jobs like this?

Get jobs in Charlottesville, VA delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

Required Skills

  • Write efficient reusable and scalable Python code to handle both batch and real-time data processing tasks.
  • Ensure data quality consistency and reliability by implementing data validation monitoring and error handling.
  • Fine-tune and optimize PySpark jobs to improve performance in distributed environments.
  • Manage and maintain data flows in HDFS ensuring scalability and fault tolerance.
  • Perform data extraction aggregation and reporting using SQL and NoSQL databases.
  • Participate in system design discussions and provide recommendations for architecture and performance improvements

Life At Capgemini

Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:

  • Flexible work
  • Healthcare including dental, vision, mental health, and well-being programs
  • Financial well-being programs such as 401(k) and Employee Share Ownership Plan
  • Paid time off and paid holidays
  • Paid parental leave
  • Family building benefits like adoption assistance, surrogacy, and cryopreservation
  • Social well-being benefits like subsidized back-up child/elder care and tutoring
  • Mentoring, coaching and learning programs
  • Employee Resource Groups
  • Disaster Relief

Disclaimer

Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.

Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.

Click the following link for more information on your rights as an Applicant http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fuelled by its market leading capabilities in AI, cloud and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2023 global revenues of €22.5 billion.

Client-provided location(s): Bridgewater, VA, USA
Job ID: CapGemini-97588-en_US
Employment Type: Other