SiteOps Global Production Platform Engineering Manager

Meta

Hendersonville, TN

Meta is seeking a forward-thinking, experienced Production Platform Engineering Manager to join the Data Center Site Operations team. The Production Platform Engineering (PPE) team is responsible for the overall performance of Meta's production compute, storage, and accelerator (GPU) platforms throughout their lifecycles in our data centers. This role will lead a subset of the overall PPE team. The role's scope is focused on maintaining and improving the health of platforms from operational testing into mass production through end-of-life. Key responsibilities include identifying systemic hardware, firmware, and tooling issues; engaging in hands-on problem solving; and collaborating effectively with cross-functional engineering and tooling teams to improve performance of the fleet.

Our data centers, and the tens of thousands of servers installed in them, are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our services are delivered. Meta is at the leading edge of the global data center industry in terms of both how data centers are designed and how they are operated.

This person should enjoy working in an environment where adaptability and flexibility will be key to their success. We seek an individual who can quickly absorb and understand the technical challenges of subject matter experts and local site operations teams, create alignment between these globally distributed teams as well as partner organizations, and set informed priorities and direction while getting buy-in and commitment from relevant stakeholders.

SiteOps Global Production Platform Engineering Manager Responsibilities:
  • Manage other Production Platform Engineering (PPE) team members through efforts that provide end-to-end lifecycle leadership (operational test through end-of-life decommissioning) of platforms and associated new technologies in the data centers
  • Serve as the central point of contact representing hardware platforms and associated new technologies across Site Operations, and as the subject matter expert on hardware platform issues for data center operations teams
  • Drive complex, global platform technical investigations spanning multiple disciplines such as Hardware, Software/Firmware, Networking, and Power & Cooling
  • Work closely with other PPE team members to share best practices and ensure appropriate feedback is given to cross-functional teams
  • Issue timely alerts and support fixes to operations teams, and assure a robust feedback pipeline to engineering teams
  • Provide serviceability feedback on production hardware to engineering design teams
  • Provide technical mentorship on large scale data center projects and initiatives to global, cross-functional teams
  • Build relationships and collaborate with engineering and cross-functional teams across the company. Actively solicit feedback from teams, and use that feedback to improve operational effectiveness as infrastructure scales
  • Own the cross-functional communication with other technical operations groups to help resolve incidents
  • Collaborate with stakeholders, functional owners and subject matter experts to interpret and articulate business and operations needs
  • Travel up to 30% required
Minimum Qualifications:
  • BS or BA in technical field (electrical, computer science, or mechanical engineering) or comparable experience
  • 10+ years of experience in NPI (New Product Introduction) hardware development and/or validation, working with cross-functional teams to deliver products to production
  • Experience working across a diverse global organization and building partnerships with cross-functional teams inside and outside of the organization
  • Experience troubleshooting and debugging hardware platforms
  • Experience in processing and analyzing large sets of data
  • Knowledge of server and storage platforms, principles, technologies, protocols, and standards
  • Experience with GPU and accelerator-based platform hardware that operates in computing clusters
  • Experience managing multiple concurrent projects and managing tight timelines
  • Experience working within an interdisciplinary team of hardware and operations engineers
  • Experience working with Linux or UNIX operating systems
  • Technical skills creating documentation for users of all levels
  • Experience mentoring others and leading technical teams
Preferred Qualifications:
  • Direct experience managing others
  • Large-scale data center environment experience, including hardware deployments, system knowledge of Linux, server hardware, networking, network protocols, supply chain, and data center automation
  • Bash, PHP, Python, or Perl scripting experience
  • Experience in data center system and process automation
  • Leadership presentation skills
About Meta:

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today, beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

$163,000/year to $225,000/year + bonus + equity + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.

Client-provided location(s): Gallatin, TN 37066, USA
Job ID: a1KDp00000E2RmdMAF_1012
Employment Type: Other

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA
    • FSA With Employer Contribution
    • HSA
    • HSA With Employer Contribution
    • Fitness Subsidies
    • On-Site Gym
    • Mental Health Benefits
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
  • Work Flexibility

    • Flexible Work Hours
    • Remote Work Opportunities
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Commuter Benefits Program
    • Casual Dress
    • Happy Hours
    • Snacks
    • Some Meals Provided
    • Company Outings
    • On-Site Cafeteria
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Unlimited Paid Time Off
    • Paid Holidays
    • Personal/Sick Days
    • Sabbatical
    • Leave of Absence
  • Financial and Retirement

    • 401(K)
    • 401(K) With Company Matching
    • Pension
    • Company Equity
    • Performance Bonus
    • Relocation Assistance
    • Financial Counseling
  • Professional Development

    • Learning and Development Stipend
    • Promote From Within
    • Mentor Program
    • Shadowing Opportunities
    • Access to Online Courses
    • Lunch and Learns
    • Internship Program
  • Diversity and Inclusion

    • Employee Resource Groups (ERG)
