Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Senior Kafka DevOps Engineer

AT EPAM Systems
EPAM Systems

Senior Kafka DevOps Engineer

Madrid, Spain

Do you have a system engineering background and strong knowledge and experience in Kafka? Are you an open-minded professional with good English skills? If it sounds like you, this could be the perfect opportunity to join EPAM as a Senior DevOps Engineer with Kafka.
Our teams work in highly agile working environments for Fortune 1000 clients, following XP practices and best CI/CD practices. We are looking for a Senior DevOps Engineer with wide experience in Kafka and open-minded personality, who can join our friendly environment and become a core contributor to our team of experts.
We are building Kafka platform support team. The candidate will be responsible for installing, monitoring, troubleshooting, and maintaining Kafka platform, ensuring optimal performance, security, developing new features/automation/integration. Kafka Platform team are also involved into supporting PagerDuty and Uptrends SaaS including automation of these platforms support.

Want more jobs like this?

Get jobs in Madrid, Spain delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

This role requires participation in an on-call support rota, including up to 8 hours on one weekend day (Saturday or Sunday).

#LI-DNI

Responsibilities
  • Install and provision new Kafka clusters and supporting components
  • Regularly monitor the health and performance of the Kafka platform and data pipelines
  • Identify and fix issues related to the platform, including data pipelines, network problems, cloud or containerization resources failures, or software bugs
  • Perform regular performance tuning of Kafka platform components
  • Monitor and optimize the cost and performance of Kafka clusters
  • Upgrade the Kafka platform to newer versions, including planning, testing, and implementation
  • Manage the security of the Kafka platform, including access control lists, encryption, and regular security reviews
  • Perform regular backups and disaster recovery procedures
  • Manage the capacity of the Kafka platform, including projecting future growth and scaling needs
  • Document procedures, configurations, and issue resolutions, and share knowledge with the team
  • Work with Confluent Support for issues that cannot be resolved in-house
  • Support existing Infrastructure as Code (IaC) and Configuration Management (CM) automations/pipelines for Kafka platform management and maintenance
  • Maintain and enhance onboarding automation scripts
  • Support Kafka self-service automations for Topic, RBAC, Schema, Connectors management
  • Support application teams in setting up and onboarding Kafka consumers, producers, connectors, and streams
  • Provide support for team requests in Slack and convert to CLOUD Tickets if complexity is high
  • Implement new features released by the vendor as part of their product roadmap, in coordination with our client's Platform owner team
  • Implement and enhance automation scripts and processes to reduce the number of tickets via self-service
Requirements
  • Proven experience in the implementation and maintenance of Confluent Platform
  • Strong knowledge of HELM Kubernetes/Containers, Docker
  • Kafka Confluent Stack - Advanced
  • Terraform experience
  • Cloud GCP and AWS experience (compute, networking, storage, IAM)
  • Jenkins, Python/Shell scripting and Automation experience
  • Linux administration
  • PagerDuty and Uptrends
  • Excellent problem-solving skills and the ability to troubleshoot complex issues
  • Strong written and verbal communication skills
  • Experience in automating processes and maintaining automated scripts
  • Understanding of networking, and capability to coordinate with different teams including the Networking team, and CICD team, among others
Nice to have
  • Relevant certifications (such as Confluent Certified Developer or Administrator for Apache Kafka) would be an advantage
We offer
  • Private health insurance
  • EPAM Employees Stock Purchase Plan
  • 100% paid sick leave
  • Referral Program
  • Professional certification
  • Language courses
EPAM is a leading digital transformation services and product engineering company with over 52,800 EPAMers in more than 55 countries and regions. Since 1993, our multidisciplinary teams have been helping make the future real for our clients and communities around the world. In 2018, we opened an office in Spain that quickly grew to over 1,450 EPAMers distributed between the offices in Málaga and Madrid as well as remotely across the country. Here you will collaborate with multinational teams, contribute to numerous innovative projects, and have an opportunity to learn and grow continuously.
  • WORK & LIFE BALANCE. Enjoy more of your personal time with flexible work options, 24 working days of annual leave and paid time off for numerous public holidays.
  • CONTINUOUS LEARNING CULTURE. Craft your personal Career Development Plan to align with your learning objectives. Take advantage of internal training, mentorship, sponsored certifications and LinkedIn courses.
  • CLEAR & DIFFERENT CAREER PATHS. Grow in engineering or managerial direction to become a People Manager, in-depth technical specialist, Solution Architect, or Project/Delivery Manager.
  • STRONG PROFESSIONAL COMMUNITY. Join a global EPAM community of highly skilled experts and connect with them to solve challenges, exchange ideas, share expertise and make friends.

Client-provided location(s): Spain
Job ID: EPAM-epamgdo_blt8c56edf87e923e68_en-us_Other_Spain
Employment Type: Other