About Gap Inc.
Our past is full of iconic moments - but our future is going to spark many more. Our brands - Gap, Banana Republic, Old Navy and Athleta - have dressed people from all walks of life and all kinds of families, all over the world, for every occasion for more than 50 years.
But we're more than the clothes that we make. We know that business can and should be a force for good, and it's why we work hard to make product that makes people feel good, inside and out. It's why we're committed to giving back to the communities where we live and work. If you're one of the super-talented who thrive on change, aren't afraid to take risks and love to make a difference, come grow with us.
About the Role
In this role you will work with multiple cross-functional teams to develop, maintain & migrate various observability tools. You will play a significant role in implementing our NextGen Observability platform with logs, metrics & traces. You will be responsible for ensuring the reliability and availability of our tools through monitoring best practices. Your contributions will impact our ability to monitor, analyze, and optimize our systems for peak performance.
What You'll Do
- Configure and maintain observability solutions such as New Relic, Grafana, Prometheus & GCP Logging.
- Collaborate with multiple product teams and respective owners to design observability solutions as needed.
- Monitor system performance and troubleshoot issues.
- Automate deployment when needed using Chef and GitHub.
- Implement solutions for logging, metrics and traces using OpenTelemetry (OTEL).
- Participate in on-call rotations and incident response for the observability platform.
- Mentor junior team members when needed and collaborate efficiently with the wider team.
Who You Are
- 5-8 years of relevant experience in the observability space.
- Strong hands-on admin knowledge of Linux, Chef & GitHub.
- Strong understanding of Prometheus, OTEL & Grafana.
- Demonstrated experience implementing OTEL instrumentation and configuring OTEL collectors & node exporters to collect telemetry data.
- Experience with the OpenTelemetry API and SDKs.
- Fair understanding of application & infrastructure monitoring.
- Ability to adapt and learn quickly in a fast-paced environment with excellent communication skills to collaborate cross-functionally.
- Exposure to public Cloud solutions (preferably Azure & GCP).
- Exposure to Kubernetes, Docker and containerization is preferred.
Benefits at Gap Inc.
- One of the most competitive paid time off plans in the industry
- Comprehensive health coverage for employees, same-sex partners and their families
- Health and wellness program: free annual health check-ups, fitness center and Employee Assistance Program
- Comprehensive benefits to support the journey of parenthood
- Retirement planning assistance