Do you have a systems engineering background and strong hands-on experience with Kafka? Are you an open-minded professional with good English skills? If this sounds like you, this could be the perfect opportunity to join EPAM as a Senior DevOps Engineer with Kafka.
EPAM is shaping the digital future for Fortune 1000 companies, building complex solutions using modern technologies. We are looking for a Senior DevOps Engineer with extensive Kafka experience and an open-minded personality to join our friendly environment and become a core contributor to our team of experts.
We are building a Kafka platform support team. The candidate will be responsible for installing, monitoring, troubleshooting, and maintaining the Kafka platform, ensuring optimal performance and security, and developing new features, automation, and integrations. The Kafka Platform team also supports the PagerDuty and Uptrends SaaS platforms, including automating their support.
This role requires participation in an on-call support rota, including up to 8 hours on one weekend day (Saturday or Sunday).
Responsibilities
- Install and provision new Kafka clusters and supporting components
- Regularly monitor the health and performance of the Kafka platform and data pipelines
- Identify and fix platform issues, including data pipeline problems, network problems, cloud or container resource failures, and software bugs
- Perform regular performance tuning of Kafka platform components
- Monitor and optimize the cost and performance of Kafka clusters
- Upgrade the Kafka platform to newer versions, including planning, testing, and implementation
- Manage the security of the Kafka platform, including access control lists, encryption, and regular security reviews
- Perform regular backups and disaster recovery procedures
- Manage the capacity of the Kafka platform, including projecting future growth and scaling needs
- Document procedures, configurations, and issue resolutions, and share knowledge with the team
- Work with Confluent Support for issues that cannot be resolved in-house
- Support existing Infrastructure as Code (IaC) and Configuration Management (CM) automations/pipelines for Kafka platform management and maintenance
- Maintain and enhance onboarding automation scripts
- Support Kafka self-service automations for topic, RBAC, schema, and connector management
- Support application teams in setting up and onboarding Kafka consumers, producers, connectors, and streams
- Provide support for team requests in Slack and convert them to CLOUD tickets when complexity is high
- Implement new features released by the vendor as part of their product roadmap, in coordination with our client's Platform owner team
- Implement and enhance automation scripts and processes to reduce the number of tickets via self-service
Requirements
- Proven experience in the implementation and maintenance of Confluent Platform
- Strong knowledge of Helm, Kubernetes/containers, and Docker
- Advanced knowledge of the Kafka Confluent stack
- Terraform experience
- Cloud experience with GCP and AWS (compute, networking, storage, IAM)
- Experience with Jenkins, Python/shell scripting, and automation
- Linux administration
- PagerDuty and Uptrends
- Excellent problem-solving skills and the ability to troubleshoot complex issues
- Strong written and verbal communication skills
- Experience in automating processes and maintaining automated scripts
- Understanding of networking and the ability to coordinate with other teams, including the Networking and CI/CD teams
- Relevant certifications (such as Confluent Certified Developer or Administrator for Apache Kafka) would be an advantage
We offer
- Private health insurance
- EPAM Employee Stock Purchase Plan
- 100% paid sick leave
- Referral Program
- Professional certification
- Language courses
Why Join EPAM
- WORK AND LIFE BALANCE. Enjoy more of your personal time with flexible work options, 24 working days of annual leave, and paid time off for public holidays.
- CONTINUOUS LEARNING CULTURE. Craft your personal Career Development Plan to align with your learning objectives. Take advantage of internal training, mentorship, sponsored certifications and LinkedIn courses.
- CLEAR AND DIFFERENT CAREER PATHS. Grow in an engineering or managerial direction to become a People Manager, in-depth technical specialist, Solution Architect, or Project/Delivery Manager.
- STRONG PROFESSIONAL COMMUNITY. Join a global EPAM community of highly skilled experts and connect with them to solve challenges, exchange ideas, share expertise and make friends.