Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Senior Manager, D&AI AIOps, MLOps Operations

AT PepsiCo
PepsiCo

Senior Manager, D&AI AIOps, MLOps Operations

Plano, TX

Overview

We are seeking a highly skilled Senior Manager - AIOps & MLOps to lead and oversee the automation, scalability, and reliability of AI/ML operations across the enterprise.

Responsibilities

This role requires deep expertise in AI-driven observability, machine learning pipeline automation, cloud-based AI/ML platforms, and operational excellence. The ideal candidate will drive AI/ML model deployment, continuous monitoring, and self-healing automation to optimize system performance, minimize downtime, and enhance decision-making with real-time AI-driven insights.

  • Lead and sustain large-scale AIOps, MLOps programs, ensuring alignment with business objectives, data governance standards, and enterprise data strategy.
  • Oversee the implementation of real-time data observability, monitoring, and automation frameworks to enhance data reliability, quality, and operational efficiency.
  • Develop program governance models and execution roadmaps to drive efficiency across data platforms, including Azure, AWS, GCP, and on-prem environments.
  • Ensure seamless integration of CI/CD, data pipeline automation, and self-healing capabilities across the enterprise. Partner in building the next generation D&A platform(s), and leading a high-performing data operations team.
  • Lead and manage the full people, process and technology driven Data & Analytics platform technology strategy and cultural shift for PepsiCo IT to a world class data first organization working across all Sector S&T.
  • Champion of PepsiCo's Data & Analytics program and platform management supporting large scale global data engineering efforts partnering across S&T organization
  • Support Data & Analytics Technology Transformations to provide full sustainment capabilities across the PepsiCo Data Estate, including data platform management automation of proactive issue identification and self-healing abilities.

Want more jobs like this?

Get jobs in Plano, TX delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

AIOps & Observability Automation:

  • Design and implement AIOps strategies for automating IT operations using Azure Monitor, Azure Log Analytics, Azure Sentinel, and AI-driven alerting.
  • Deploy Azure-based observability solutions (Azure Monitor, Application Insights, Azure Synapse for log analytics, and Azure Data Explorer) to enhance real-time system performance monitoring.
  • Enable AI-driven anomaly detection and root cause analysis (RCA) using Azure Machine Learning (Azure ML) and AI-powered log analytics.
  • Develop self-healing and auto-remediation mechanisms using Azure Logic Apps, Azure Functions, and Power Automate to proactively resolve system issues.

MLOps & Machine Learning Pipeline Management:

  • Lead end-to-end ML lifecycle automation using Azure ML, Azure DevOps, and Azure Pipelines for ML (CI/CD).
  • Deploy scalable ML models with Azure Kubernetes Service (AKS), Azure Machine Learning Compute, and Azure Container Instances.
  • Automate feature engineering, model versioning, hyperparameter tuning, and drift detection using Azure ML Pipelines and MLflow.
  • Optimize ML workflows with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics for data preparation and ETL/ELT automation.
  • Implement monitoring and explainability for ML models using Azure Responsible AI Dashboard, Fairlearn, and InterpretML.

Operational Excellence & Cross-Team Collaboration:

  • Partner with Data Science, DevOps, CloudOps, and SRE teams to align AIOps/MLOps strategies with enterprise IT goals.
  • Collaborate with business stakeholders and IT leadership to implement AI-driven insights and automation for improving operational decision-making.
  • Define and track AI/ML operational KPIs, including model accuracy, latency, infrastructure efficiency, and predictive maintenance metric.

Risk, Compliance & AI Governance:

  • Implement AI ethics, bias mitigation, and responsible AI practices for model governance in Azure Responsible AI Toolkits.
  • Ensure compliance with Azure Information Protection (AIP), Role-Based Access Control (RBAC), and data security policies.
  • Develop robust risk management strategies for AI-driven operational automation in Azure environments.
  • Present program updates, risk assessments, and AIOps, MLOps maturity progress to senior executives and key stakeholders.
  • Work collaboratively with wider PepsiCo colleagues to ensure your customer is delighted with their Azure cloud experience.
  • Attract and build a diverse, high-performing team with capabilities needed to achieve current and future business objectives.
  • Remove barriers to agility and enable the team to shift priorities quickly without losing productivity.
  • Develop the appropriate organizational structure, resource plans and culture to support the business objectives and customer deliverables.
  • Leverage your technical and operations expertise in cloud and high-performance computing to establish a solid understanding of the business, customers need, and ability to earn trust in relationships.

Compensation and Benefits:

  • The expected compensation range for this position is between $118,700 - $198,800.
  • Location, confirmed job-related skills, experience, and education will be considered in setting actual starting salary. Your recruiter can share more about the specific salary range during the hiring process.
  • Bonus based on performance and eligibility target payout is 15% of annual salary paid out annually.
  • Paid time off subject to eligibility, including paid parental leave, vacation, sick, and bereavement.
  • In addition to salary, PepsiCo offers a comprehensive benefits package to support our employees and their families, subject to elections and eligibility: Medical, Dental, Vision, Disability, Health, and Dependent Care Reimbursement Accounts, Employee Assistance Program (EAP), Insurance (Accident, Group Legal, Life), Defined Contribution Retirement Plan.

Qualifications

  • 10+ years of technology work experience in a large-scale Global organization - CPG preferred.
  • 10+ years of experience working in Data& Analytics field.
  • 10+ years of experience working within a cross-functional IT organization.
  • 6+ years of experience in leadership/management experience.
  • Excellent Communication: must have the ability to empathize with customers and convey confidence.
  • Able to explain highly technical issues to varied audiences.
  • Able to prioritize and advocate customer's needs to the proper channels.
  • Take ownership - Make it happen - Delight the customer.
  • Customer Obsession: Passion for customers and focus on delivering the right customer experience.
  • Growth mindset: Openness and ability to learn new skills and technologies in a fast-paced environment.
  • Experience in a leadership role in technical support for mission critical solutions in an Microsoft Azure environment.
  • Site Reliability Engineering experience with modern site reliability practices including automated remediation of issues, or improved scalability, etc.
  • Experience driving Operational Excellence in operating large complex mission critical solutions.
  • Significant experience in delivering large scale operational services in a complex-change environment.
  • Ability to create strategic plans spanning multiple time horizons and across multiple partner Teams.
  • Ability to build cross-functional relationships through trust, respect, and partnership.
  • Ability to discern perceived differing priorities between the business and IT, and identifying a path forward that is mutually beneficial.
  • Experience in driving consensus around and across virtual teams and multiple functions through clear communication of vision and objectives, thorough planning, effective execution, and realization of desired benefits.
  • Track record of consistently delivering excellent results in challenging and/or transformational environments.
  • Experience working across the PepsiCo organization, ideally with multi-country or global implementation experience involving data.
  • Knowledge of some of the key concepts around master data management, data standards, analytics, and digital transformation.
  • Strong knowledge and understanding of data acquisition, data catalogues, data standards, and data management tools.
  • Strong Communication Skills/Able to Persuade/Influence Others at all Organization Levels and the ability foster lasting partnerships.

Client-provided location(s): Plano, TX, USA
Job ID: PepsiCo-367837-en-us
Employment Type: Other

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • HSA
    • Pet Insurance
    • Mental Health Benefits
    • On-Site Gym
  • Parental Benefits

    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
    • On-site/Nearby Childcare
  • Work Flexibility

    • Flexible Work Hours
    • Remote Work Opportunities
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Commuter Benefits Program
    • Snacks
    • Company Outings
    • On-Site Cafeteria
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
    • Leave of Absence
    • Summer Fridays
  • Financial and Retirement

    • 401(K)
    • 401(K) With Company Matching
    • Stock Purchase Program
    • Financial Counseling
  • Professional Development

    • Tuition Reimbursement
    • Mentor Program
    • Access to Online Courses
    • Leadership Training Program
    • Associate or Rotational Training Program
    • Internship Program

Company Videos

Hear directly from employees about what it is like to work at PepsiCo.