Responsibilities
Experience & Qualifications (relate to TPxx levels)
- 3+ years of experience in Computer Science
- Application Development using Continuous Delivery for a SaaS or Managed Hosted application with operations experience
- Authoring and consuming Open API, gRPC based APIs
- Instrumenting metrics, logs & traces for applications & infrastructure you have worked on
- Implementing and Delivering robust Infrastructure as code (IaC)
- Designing, deploying and orchestrating microservices using Kubernetes
- Appropriate RHEL, Kubernetes & Cloud Certifications a plus
Responsibilities:
- Proactively identifying & eliminating excess operational work and poorly performing services
- Authoring observability for applications, infrastructure using RED & USE methods
- Defining the required reliability of your service through service-level indicators (SLI) and service-level objectives (SLO) & utilization of an error budget to manage the pace of innovation with reliability
- Implementing Resiliency Tests, Self-Healing & Circuit Breakers to handle chaotic conditions & ensure your service behaves reasonably even in the face of unexpected demand
- Practicing Chaos Engineering helping us implement and mature
- Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable
- Leading Blameless Postmortems analysis for Incidents
Want more jobs like this?
Get Software Engineering jobs in Bangalore, India delivered to your inbox every week.
Technology Stack
- Multi Cloud; Azure, AWS, GCP
- Programming (Python, Golang, Java, JavaScript)
- Kubernetes, Helm, & ArgoCD (OpenShift a plus)
- Terraform, Ansible and/or Puppet
- Prometheus, Grafana & Loki (Open Telemetry a plus)
- Data Services (delta lake, knative, mongodb, postgresql/cockroachdb, kafka, spark, camel)