At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.
Role Overview
We are seeking a dynamic Senior Site Reliability Engineer (SRE) to lead the design, implementation, and operational support of our hybrid environments, spanning on-premises, private cloud, and public cloud platforms. This role will be pivotal in setting the foundation and strategy for our SRE practices while driving their implementation across the organization. The ideal candidate will combine technical expertise with leadership skills to guide our team on the SRE journey and ensure our environments are scalable, reliable, and secure.
Want more jobs like this?
Get Software Engineering jobs in Manila, Philippines delivered to your inbox every week.
Responsibilities
- You will manage applications running on Windows and Unix/Linux servers, perform application installations, modify configurations, and server maintenance.
- Create documentations, diagrams, procedures, turnover document for supporting products
- Ensure the production applications are running healthy and inefficiencies or service availability gaps are addressed
- Architect and participate in Disaster Recovery testing
- Automate processes within the environment to achieve higher efficiencies
- Work directly with the business partners and development teams to provide leadership on project and task statuses
- Perform and participate in annual system readiness, capacity planning, and provide recommendations to ensure the production environments meets SLA's
- You will participate on an On-Call rotation which provide off hour support
- Coordinate, support, and perform weekend changes when required to support project deliverables
- 5+ years' experience in SRE related roles and/or functional leadership role, managing systems and applications running on Windows and Unix
- In-depth knowledge of Windows operating systems and RHEL/Unix operating systems
- Understanding of tier 3 architecture design and concepts
- Experience automating processing using various scripting languages, such as PowerShell and Python
- Knowledge and management of software such as IIS, Apache, WebSphere, Tomcat, and Microsoft clustering technologies
- Experience with change control and incident management processes
- Knowledge of Ansible, Chef, Jenkins, Terraform
- Ability to troubleshoot complex problems, providing root cause analysis and remediation to mitigate future risk with appropriate Technical and Operational staff to resolve issues.
#LI-KA2 #LI-Hybrid
We are dedicated to fostering a collaborative, engaging, and inclusive environment and are committed to providing a workplace that empowers associates to be authentic and bring their best to work. We believe that associates do their best when they feel safe, understood, and valued, and we work diligently and collaboratively to ensure Broadridge is a company-and ultimately a community-that recognizes and celebrates everyone's unique perspective.