We are seeking a talented Senior Kafka Engineer to join our team remotely.
The successful candidate will be responsible for installing, monitoring, troubleshooting, and maintaining Kafka platform, ensuring optimal performance and security, and developing new features, automation, and integration. This role requires participation in an on-call support rota, including up to 8 hours on one weekend day (Saturday or Sunday).
Join us to leverage your expertise - apply!
#LI-DNI#REF_CO_DBC#LI-AP13
Responsibilities
- Install and provision new Kafka clusters and supporting components
- Regularly monitor the health and performance of the Kafka platform and data pipelines
- Identify and fix issues related to the platform, including data pipelines, network problems, cloud or containerization resources failures, or software bugs
- Perform regular performance tuning of Kafka platform components
- Monitor and optimize the cost and performance of Kafka clusters
- Upgrade the Kafka platform to newer versions, including planning, testing, and implementation
- Manage the security of the Kafka platform, including access control lists, encryption, and regular security reviews
- Perform regular backups and disaster recovery procedures
- Manage the capacity of the Kafka platform, including projecting future growth and scaling needs
- Document procedures, configurations, and issue resolutions, and share knowledge with the team
- Work with Confluent Support for issues that cannot be resolved in-house
- Support existing Infrastructure as Code (IaC) and Configuration Management (CM) automations/pipelines for Kafka platform management and maintenance
- Maintain and enhance onboarding automation scripts
- Support Kafka self-service automations for Topic, RBAC, Schema, Connectors management
- Support application teams in setting up and onboarding Kafka consumers, producers, connectors, and streams
- Provide support for team requests in Slack and convert to Cloud Tickets if complexity is high
- Implement new features released by the vendor as part of their product roadmap, in coordination with the platform owner team
- Implement and enhance automation scripts and processes to reduce the number of tickets via self-service
Want more jobs like this?
Get jobs in Río Grande, Mexico delivered to your inbox every week.
- 3+ years of hands-on experience with Apache Kafka
- Proven experience in the implementation and maintenance of Confluent Platform
- Strong knowledge of Helm and excellent problem-solving skills with the ability to troubleshoot complex issues
- Experience in automating processes and maintaining automated scripts
- Understanding of networking and the capability to coordinate with different teams, including the networking team and CI/CD team, among others
- Proficiency in Python and UNIX shell scripting
- Excellent written and verbal communication skills
- Relevant certifications (such as Confluent Certified Developer or Administrator for Apache Kafka) would be an advantage
- Experience with Amazon Web Services and Google Cloud Platform
- Familiarity with Jenkins, Kubernetes, Linux, PagerDuty, Terraform, and Uptrends
- Career plan and real growth opportunities
- Unlimited access to LinkedIn learning solutions
- International Mobility Plan within 25 countries
- Constant training, mentoring, online corporate courses, eLearning and more
- English classes with a certified teacher
- Support for employee's initiatives (Algorithms club, toastmasters, agile club and more)
- Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)
- Flexible work schedule and dress code
- Collaborate in a multicultural environment and share best practices from around the globe
- Hired directly by EPAM & 100% under payroll
- Law benefits (IMSS, INFONAVIT, 25% vacation bonus)
- Major medical expenses insurance: Life, Major medical expenses with dental & visual coverage (for the employee and direct family members)
- 13 % employee savings fund, capped to the law limit
- Grocery coupons
- 30 days December bonus
- Employee Stock Purchase Plan
- 12 vacations days plus 4 floating days
- Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th & 31st)
- Monthly non-taxable amount for the electricity and internet bills
By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM's Privacy Notice and Policy.