Meta is looking for a forward-thinking engineer with technical skills in production network operations to join the Edge and Network Services (ENS) Operations organization. In this role, you will lead initiatives to improve efficiency, reliability, and risk management via process, systems, and data in one of the largest-scale networks in the world. The right candidate will thrive in a fast-moving operations organization and enjoy digging into complex operational and reliability challenges in order to implement process and technical system solutions at a global scale.
Network Engineer, Operations & Support Responsibilities:
- Incident Response: Drive investigations into complex technical and process issues during major incidents and site events on edge, caching, and network infrastructure. These issues arise at a global scale, across thousands of locations, and span multiple reliability, security, and continuity disciplines. This will require you to work closely and effectively with a variety of cross-functional teams, managed service providers, and third-party vendor partners.
- Operational Leadership: As an operations practitioner within the team, you will be expected to drive improvement in everything we do. In this role, you will work with a large contingent workforce responsible for delivering roadmapped projects and executing recurring activities. You will encourage standards across the network and full compliance with those standards and policies.
- Escalation Management: Participate in the global team's Tier 3 and 4 on-call rotation with the goal of routing issues as needed and understanding how processes or tooling might be improved, skills can be developed, or automation can be implemented to prevent the need to escalate similar issues in the future.
- Risk Management and Assurance: Work internally and with upstream partner teams to ensure design, build, and operations align with applicable reliability, security, privacy, regulatory policy, and business continuity drivers.
- Information and Data Assurance: Ensure relevant operational process, procedure, and policy documentation is effectively managed and the data required to support operations is complete and accurate in systems.
- Automation: Be heavily involved in driving the team to analyze operational events in order to identify new automation opportunities and help us achieve our goal of all faults in the network being fully remediated by software. This will include helping others understand our requirements and driving their roadmaps, and may include directly implementing lightweight solutions in code.
- Data Measurement: As an operations practitioner supporting our network, you will be expected to drive quality into the metrics we report to assist us in focusing on the areas that give us the best return on investment. This could include measurement and analysis of our escalation issues, fault/event trends, infrastructure capacity, and vendor performance failures.
- Travel: International and domestic travel may be required, up to 15 percent of the time.
- Currently has, or is in the process of obtaining, a BS or MS in Computer Science, Computer Engineering, or a related technical discipline, or equivalent experience.
- Network Protocols: Knowledge of TCP/IP, IPv4/v6, Border Gateway Protocol (BGP), Intermediate System to Intermediate System (IS-IS), Open Shortest Path First (OSPF), and/or Multiprotocol Label Switching (MPLS) in complex troubleshooting scenarios.
- CDN and Edge: Knowledge of content delivery networks (CDN) and peering network strategies, including topology, traffic analysis, server platforms, and architectures in troubleshooting scenarios.
- Repair Function: Experience in logical troubleshooting and physical repair with an understanding of physical infrastructure such as cable types, connector types, racks, patch panels, power/cooling, hardware components, and facility infrastructure.
- Cisco or Juniper professional-level certifications.
- Operations Center Experience: Experience within a global Network Operations Center (NOC) or IT Operations Center environment to manage Service Level Agreements and continuous improvement against metrics at scale.
- Network and Infrastructure Design: Experience understanding and influencing network and infrastructure architectures to include constraint and dependency analysis and translating these into supportable solution requirements.
- Automation: Knowledge of coding and automation in higher-level languages such as Python, Go, or JavaScript.
- Vendor Partnership: Experience partnering with vendors such as network hardware platform providers (HPE servers, Cisco, Juniper, Ciena, Infinera, and Arista), ITAD vendors, logistics vendors, and colocation vendors.
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today, beyond the constraints of screens, the limits of distance, and even the rules of physics.
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.