National Manager, Site Reliability Engineering - Domains

Toyota North America

Plano, TX

Overview

Who we are

Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for diverse, talented team members who want to Dream. Do. Grow. with us.

An important part of the Toyota family is Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America. While TFS is a separate business entity, it is an essential part of this world-changing company, delivering on Toyota's vision to move people beyond what's possible. At TFS, you will help create a best-in-class customer experience in an innovative, collaborative environment.

To save you time when applying, please note that Toyota does not offer sponsorship of job applicants for employment-based visas or any other work authorization for this position at this time.

Who We're Looking For

Toyota Financial Services (TFS) is looking for a highly motivated person to fill a role as a National Manager, Site Reliability Engineering (SRE).

The primary responsibility of this role is overseeing the strategy, implementation, and management of the SRE function within the organization. This role will lead a team of highly skilled engineers and collaborate with cross-functional teams to ensure the reliability, availability, and performance of TFS's digital platforms and services.

What you'll be doing

  • Develop and communicate the strategic vision for the SRE function aligned with TFS's business objectives.
  • Collaborate with executive leadership to define SRE goals, KPIs, and performance metrics.
  • Drive continuous improvement initiatives to enhance the reliability and scalability of TFS's systems.
  • Lead, mentor, and guide a team of SRE engineers, fostering a culture of innovation, collaboration, and accountability.
  • Define team structure, roles, responsibilities, and career development paths.
  • Manage resource allocation and capacity planning to ensure adequate coverage for operational tasks and projects.
  • Establish and enforce best practices for SRE involvement in incident management, problem resolution, and post-incident analysis.
  • Collaborate with development and operations teams to implement automation, monitoring, and alerting solutions.
  • Ensure effective incident response processes that minimize downtime and impact to business operations.
  • Collaborate with architecture teams to design and implement scalable, reliable, and resilient systems.
  • Drive the adoption of cloud-native technologies and practices to optimize infrastructure and application deployments.
  • Monitor and analyze system performance to proactively identify and address potential bottlenecks or issues.
  • Implement strategies to optimize application and service availability, including disaster recovery planning.
  • Work closely with development, operations, security, and compliance teams to align SRE efforts with overall IT initiatives.
  • Participate in project planning and provide SRE expertise to ensure reliability is built into new systems and features.
  • Manage vendor relationships and contracts related to SRE tools and services.
  • Communicate SRE initiatives, progress, and challenges to executive leadership and stakeholders.
  • Prepare and present reports on system reliability, incidents, and improvements to relevant stakeholders.

What You Bring

  • Bachelor's or Master's degree in Computer Science, Information Technology, or a related discipline, or equivalent work experience, is required.
  • 10+ years of proven experience in site reliability engineering, with 3-5 years in a leadership or management role.
  • Strong leadership and people management skills, with a track record of leading and developing high-performing teams.
  • Strong understanding of cloud computing, infrastructure as code, and microservices architecture.
  • Proficiency in implementing DevOps and SRE best practices, including CI/CD pipelines and automated testing.
  • Hands-on experience with tools like Kubernetes, Docker, Prometheus, Grafana, etc.
  • Experience with application environments deployed using AWS cloud technologies.
  • Excellent problem-solving skills and the ability to lead teams in high-pressure situations.
  • Strong interpersonal, communication, and collaboration skills.
  • Experience in the financial services industry is a plus.

What we'll bring

During your interview process, our team can fill you in on all the details of our industry-leading benefits and career development opportunities. A few highlights include:

  • A work environment built on teamwork, flexibility, and respect.
  • Professional growth and development programs to help advance your career, as well as tuition reimbursement.
  • Team Member Vehicle Purchase Discount
  • Toyota Team Member Lease Vehicle Program (if applicable)
  • Comprehensive health care and wellness plans for your entire family.
  • Flextime and virtual work options (if applicable).
  • Toyota 401(k) Savings Plan featuring a company match, as well as an annual retirement contribution from Toyota regardless of whether you contribute.
  • Paid holidays and paid time off.
  • Referral services related to prenatal services, adoption, childcare, schools, and more.
  • Flexible spending accounts.
  • Relocation assistance (if applicable).
  • This position is based in Plano, TX, with a hybrid mix of some in-office time and some remote work.

Belonging at Toyota

Our success begins and ends with our people. We embrace diverse perspectives and value unique human experiences. Respect for all is our North Star. Toyota is proud to have 10+ different Business Partnering Groups across 100 different North American chapter locations that support team members' efforts to dream, do and grow without questioning that they belong. As a company that has been one of DiversityInc's Top 50 Companies for Diversity and a member of The Billion Dollar Roundtable supporting minority and woman-owned suppliers for over 10 years, we are proud to be an equal opportunity employer that celebrates the diversity of the communities where we live and do business.

Applicants for our positions are considered without regard to race, ethnicity, national origin, sex, sexual orientation, gender identity or expression, age, disability, religion, military or veteran status, or any other characteristics protected by law.

Have a question, need assistance with your application, or require any special accommodations? Please send an email to talent.acquisition@toyota.com.

Client-provided location(s): Plano, TX, USA
Job ID: Toyota_North_America-1771908752
Employment Type: Full Time

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA
    • HSA
    • On-Site Gym
  • Parental Benefits

    • Adoption Leave
    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Adoption Assistance Program
    • Family Support Resources
  • Work Flexibility

    • Flexible Work Hours
  • Office Life and Perks

    • On-Site Cafeteria
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
  • Financial and Retirement

    • Relocation Assistance
  • Professional Development

    • Internship Program
    • Tuition Reimbursement
    • Promote From Within
    • Mentor Program
    • Access to Online Courses