We are looking for a seasoned and visionary Senior Site Reliability Engineer Architect to lead the architecture and implementation of highly reliable and scalable systems. As a Senior SRE Architect, you will play a pivotal role in shaping our technology infrastructure, driving innovation, and ensuring the optimal performance of our critical applications.
Responsibilities:
- Lead efforts to enhance the reliability, availability, and performance of critical systems
- Perform in-depth analysis of system behavior, identifying areas for improvement and implementing solutions
- Design, implement, and maintain automation tools and frameworks to streamline operational processes
- Drive the integration of observability automation into the CI/CD pipeline
Want more jobs like this?
Get Software Engineering jobs in Gurgaon, India delivered to your inbox every week.
- Evaluate and recommend new tools, technologies, and methodologies to enhance SRE capabilities.
- Stay abreast of industry trends and emerging technologies
- Lead incident response and resolution activities, ensuring timely and effective resolution of system issues
- Conduct post-incident reviews and implement preventive measures to mitigate future occurrences
- Collaborate with cross-functional engineering teams to conduct capacity planning and scalability assessments and design solutions for handling current and future growth
- Implement and maintain monitoring solutions to proactively identify and address capacity-related issues
- Implement performance optimization strategies to ensure optimal system response times
- Collaborate with development and operations teams to promote a culture of reliability and operational excellence
- Mentor junior team members and actively contribute to knowledge-sharing initiatives
Requirements:
- 15+ years with minimum of 5 years as SRE engineer or in a similar architectural role
- Proven experience in designing and implementing architectural solutions for highly reliable systems
Must-Have
- Proficiency in scripting and programming languages and Automation - Java / Python / Bash
- Advanced understanding of observability tools and their set up with hands on experience - Preferably New Relic (Others - Dynatrace / Datadog / Prometheus)
- Complete understating of setting up observability for back-end services - Java Springboot microservices
- Good understanding of cloud and container technologies with some hands-on experience - AWS / Docker / ECS / EKS/ Kubernetes
- Good understanding of front-end application frameworks - Preferably Angular / React / JavaScript / AJAX
- Hands on Performance improvement for front-end and back-end applications - preferably Angular and Java related
- Understanding of Distributed Microservices Architecture - Java / Springboot
Nice-To-Have
- Understanding of logging tools - ELK / Splunk
- Understanding of tools like - Gateway / Service Discovery / Circuit breaker etc.
- Knowledge of Automation Tools and CI/CD pipelines - Maven / Git / Ansible / Terraform / Jenkins / Github pipelines
- Understanding of SRE security practices and their implementation in CI/CD pipeline
- Bachelor's or Master's degree in Computer Science, Information Technology, or related field