As a Site Reliability Engineer you'll play a pivotal role in maintaining and enhancing our critical developer-facing tools at one of the biggest companies in the world. We're seeking a candidate with expertise in Kubernetes, Go, and operations/observability technologies.
#LI-DNI#LI-VC5
Responsibilities
- Develop, monitor, and maintain observability tooling on Kubernetes (e.g., Prometheus, Jaeger, Grafana/Plutono)
- Develop (Golang) and collaborate closely with other development team, including onsite engagements
- Provide occasional third-level support for internal tools
- Utilize and create Grafana/Plutono/Prometheus dashboards and queries
- Administer and leverage log aggregation tooling
- Operate and monitor Kubernetes workloads, adhering to best practices
- Implement and manage end-user monitoring tools
- Update workflows on GitHub Actions
- Use and update Terraform modules
- Enhance operational efficiency and productivity for 50k engineers
Want more jobs like this?
Get jobs in Sofia, Bulgaria delivered to your inbox every week.
- 3 years of experience in a similar role and knowledge of Kubernetes and Golang
- Hands-on experience with operations and observability tooling
- Knowledge of creating and managing dashboards and queries in Grafana/Plutono/Prometheus
- Experience with log aggregation tools like Splunk, Open Telemetry, fluentbit, and ELK Stack
- Proficiency in administering and operating Kubernetes workloads
- Experience with end-user monitoring tools (e.g., Dynatrace RUM)
- Familiarity with Sentry (sentry.io) for error management
- Expertise in developing Helm charts and Helm chart libraries
- Experience updating workflows on GitHub Actions
- Experience using and updating Terraform modules
- Very good proficiency in English (written and spoken)
- Willingness to work in a hybrid setup (home office and office in Sofia)
- Opportunity to Engineer your Future and to drive the world's digital transformation with top industry clients
- Personal development program that will allow you to be valued for your strengths
- Wide range of professional trainings and workshops
- Being part of a collaborative, fast-growing, and innovative design team
- Established and accelerated growth toward different career paths, competencies, and roles
- Broad projects variety and possible mobility between projects over the time
- Collaboration in a multicultural environment and exchange of best practices with colleagues around the world
- Varied social benefits, Sports, Transportation and Health programs
- Work-life balance and flexible schedule, team buildings and sport opportunities
- Modern office/collaboration spaces (incl. new Infinity Tower business center, Sofia)
- Hybrid By Design - we provide you with the best productivity options from the 2 worlds. Meet, socialize and enjoy F2F time with your colleagues, while working from the modern EPAM's office for a few days per week and benefit from the EPAM's virtual working environment - making you able to be productive and work from remote for the rest of the week