Our Purpose
We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation and delivers better business results.
Want more jobs like this?
Get Software Engineering jobs in Sydney, Australia delivered to your inbox every week.
Title and Summary
Director, Site Reliability Engineer
Who is Mastercard?
At Mastercard technology, we work to connect and power an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation, and delivers better business results.
Technology at Mastercard
What we create today will define tomorrow. Revolutionary technologies that reshape the digital economy to be more connected and inclusive than ever before. Safer, faster, more sustainable. We need the best people to do it. Technologists who are energized by the challenges of a truly global network. With the talent and vision to create the critical systems and products that power global commerce and connect people everywhere to the vital goods and services they need every day.
Working at Mastercard means being part of a unique culture. Inclusive and diverse, a rich collaboration of ideas and perspectives. A place that celebrates your strengths, values your experiences, and offers you the flexibility to shape a career across disciplines and continents.
About the Role
We are seeking a Director, Site Reliability Engineer (SRE) to join our Business Operations team at Mastercard. As the production readiness steward for Mastercard products, you'll play a vital role in ensuring our platform's stability, scalability, and performance.
In this role, you will empower developers to build resilient, fault-tolerant products by providing support during the application build phase, focusing on operational design, automation, capacity planning, and monitoring. You'll also lead efforts in triage, root cause analysis, and proactive risk management to enhance customer experience and maximize application value.
Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.
A Biz Ops engineer will spend a bit of time throughout their career with all of these aspects of the role:
• Operational Readiness Architect:
oServe as the primary contact responsible for the overall application health, performance, and capacity
oSupport services before they go live through activities such as system design consulting, capacity planning and launch reviews.
oPartner with the development and product team of a new application to establish the right monitoring and alerting strategy and create the framework to achieve zero downtime during deployment.
• Site Reliability Engineering:
oPerforms operability and resilience design and implements and maintains highly reliable and scalable infrastructure.
oPerform root cause analysis of incidents and collaborate with development teams to resolve issues.
oStay up to date with the latest technologies and trends in SRE and cloud computing.
oParticipate in on-call rotations and be available to respond to critical incidents.
oComplete end-to-end run ownership of the product.
oPractice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover.
oAutomate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability.
• DevOps/Automation:
oTackle complex development, automation, and business process problems. Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation, and refinement.
oSupport the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
oPerforms operational and resilience Design and implements solutions for capacity planning and performance optimization.
oIncrease automation and tooling to reduce toil and manual intervention
• ITSM Practices:
oAnalyses ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
Role qualifications:
The ideal candidate will have experience in:
• BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
• Ability to read, write, and understand code in one of the programming languages such as Java, Spring Framework, Python, Go.
• Strong understanding of DevOps principles, practices along with configuration management.
• Experience in operational and resilience designing, building, and operating large-scale, distributed systems.
• A passion for observability, automation and continuous improvement.
• Familiarity with cloud platforms like AWS, Azure, or GCP (a plus).
• Experience in observability tools such as Splunk, Dynatrace, Prometheus, Datadog, Grafana, and Monitoring as a Code.
• Appetite for change and pushing the boundaries of what can be done with automation. Be curious about new technology, infrastructure, and practices to scale our architecture and prepare for future growth.
• Experience with algorithms, data structures, scripting, pipeline management, and software design.
• Systematic problem-solving approach, analytical, coupled with strong communication skills and a sense of ownership and drive.
• Interest in designing, analyzing, and troubleshooting large-scale distributed systems.
• Strong leadership and mentoring skills.
• Willingness and ability to learn and take on challenging opportunities and to work as a member of matrix based diverse and geographically distributed project team.
• Ability to balance doing things right with fixing things quickly. Flexible and pragmatic, while working towards improving the long-term health of the system.
• Comfortable collaborating with cross-functional teams to ensure that expected system behaviour is understood, and monitoring exists to detect anomalies.
Ready to Make an Impact?
Apply today to be part of a team that values innovation, resilience, and continuous improvement. Let's build something extraordinary together!
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard's security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach, and
- Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.