Senior Manager, D&AI AIOps, MLOps Operations

AT PepsiCo

Plano, TX

Overview

We are seeking a highly skilled Senior Manager - AIOps & MLOps to lead and oversee the automation, scalability, and reliability of AI/ML operations across the enterprise.

Responsibilities

This role requires deep expertise in AI-driven observability, machine learning pipeline automation, cloud-based AI/ML platforms, and operational excellence. The ideal candidate will drive AI/ML model deployment, continuous monitoring, and self-healing automation to optimize system performance, minimize downtime, and enhance decision-making with real-time AI-driven insights.

  • Lead and sustain large-scale AIOps and MLOps programs, ensuring alignment with business objectives, data governance standards, and enterprise data strategy.
  • Oversee the implementation of real-time data observability, monitoring, and automation frameworks to enhance data reliability, quality, and operational efficiency.
  • Develop program governance models and execution roadmaps to drive efficiency across data platforms, including Azure, AWS, GCP, and on-prem environments.
  • Ensure seamless integration of CI/CD, data pipeline automation, and self-healing capabilities across the enterprise. Partner in building the next-generation D&A platform(s) and lead a high-performing data operations team.
  • Lead and manage the people-, process-, and technology-driven Data & Analytics platform strategy and the cultural shift of PepsiCo IT into a world-class, data-first organization, working across all Sector S&T.
  • Champion PepsiCo's Data & Analytics program and platform management, supporting large-scale global data engineering efforts in partnership across the S&T organization.
  • Support Data & Analytics Technology Transformations to provide full sustainment capabilities across the PepsiCo Data Estate, including data platform management and automation of proactive issue identification and self-healing capabilities.

AIOps & Observability Automation:

  • Design and implement AIOps strategies for automating IT operations using Azure Monitor, Azure Log Analytics, Azure Sentinel, and AI-driven alerting.
  • Deploy Azure-based observability solutions (Azure Monitor, Application Insights, Azure Synapse for log analytics, and Azure Data Explorer) to enhance real-time system performance monitoring.
  • Enable AI-driven anomaly detection and root cause analysis (RCA) using Azure Machine Learning (Azure ML) and AI-powered log analytics.
  • Develop self-healing and auto-remediation mechanisms using Azure Logic Apps, Azure Functions, and Power Automate to proactively resolve system issues.

MLOps & Machine Learning Pipeline Management:

  • Lead end-to-end ML lifecycle automation using Azure ML, Azure DevOps, and Azure Pipelines for ML (CI/CD).
  • Deploy scalable ML models with Azure Kubernetes Service (AKS), Azure Machine Learning Compute, and Azure Container Instances.
  • Automate feature engineering, model versioning, hyperparameter tuning, and drift detection using Azure ML Pipelines and MLflow.
  • Optimize ML workflows with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics for data preparation and ETL/ELT automation.
  • Implement monitoring and explainability for ML models using Azure Responsible AI Dashboard, Fairlearn, and InterpretML.

Operational Excellence & Cross-Team Collaboration:

  • Partner with Data Science, DevOps, CloudOps, and SRE teams to align AIOps/MLOps strategies with enterprise IT goals.
  • Collaborate with business stakeholders and IT leadership to implement AI-driven insights and automation for improving operational decision-making.
  • Define and track AI/ML operational KPIs, including model accuracy, latency, infrastructure efficiency, and predictive maintenance metrics.

Risk, Compliance & AI Governance:

  • Implement AI ethics, bias mitigation, and responsible AI practices for model governance using Azure Responsible AI toolkits.
  • Ensure compliance with Azure Information Protection (AIP), Role-Based Access Control (RBAC), and data security policies.
  • Develop robust risk management strategies for AI-driven operational automation in Azure environments.
  • Present program updates, risk assessments, and AIOps and MLOps maturity progress to senior executives and key stakeholders.
  • Work collaboratively with colleagues across PepsiCo to ensure customers are delighted with their Azure cloud experience.
  • Attract and build a diverse, high-performing team with capabilities needed to achieve current and future business objectives.
  • Remove barriers to agility and enable the team to shift priorities quickly without losing productivity.
  • Develop the appropriate organizational structure, resource plans and culture to support the business objectives and customer deliverables.
  • Leverage your technical and operations expertise in cloud and high-performance computing to establish a solid understanding of the business and customer needs, and to earn trust in relationships.

Compensation and Benefits:

  • The expected compensation range for this position is between $118,700 and $198,800.
  • Location, confirmed job-related skills, experience, and education will be considered in setting actual starting salary. Your recruiter can share more about the specific salary range during the hiring process.
  • Bonus is based on performance and eligibility; the target payout is 15% of annual salary, paid out annually.
  • Paid time off, subject to eligibility, including paid parental leave, vacation, sick leave, and bereavement leave.
  • In addition to salary, PepsiCo offers a comprehensive benefits package to support our employees and their families, subject to elections and eligibility: Medical, Dental, Vision, Disability, Health, and Dependent Care Reimbursement Accounts, Employee Assistance Program (EAP), Insurance (Accident, Group Legal, Life), Defined Contribution Retirement Plan.

Qualifications

  • 10+ years of technology work experience in a large-scale global organization (CPG preferred).
  • 10+ years of experience working in the Data & Analytics field.
  • 10+ years of experience working within a cross-functional IT organization.
  • 6+ years of leadership/management experience.
  • Excellent Communication: must have the ability to empathize with customers and convey confidence.
  • Able to explain highly technical issues to varied audiences.
  • Able to prioritize customers' needs and advocate for them through the proper channels.
  • Take ownership - Make it happen - Delight the customer.
  • Customer Obsession: Passion for customers and focus on delivering the right customer experience.
  • Growth mindset: Openness and ability to learn new skills and technologies in a fast-paced environment.
  • Experience in a leadership role in technical support for mission-critical solutions in a Microsoft Azure environment.
  • Site Reliability Engineering experience with modern site reliability practices, such as automated remediation of issues and improved scalability.
  • Experience driving Operational Excellence in operating large, complex, mission-critical solutions.
  • Significant experience in delivering large-scale operational services in a complex-change environment.
  • Ability to create strategic plans spanning multiple time horizons and across multiple partner teams.
  • Ability to build cross-functional relationships through trust, respect, and partnership.
  • Ability to discern perceived differences in priorities between the business and IT and to identify a path forward that is mutually beneficial.
  • Experience in driving consensus around and across virtual teams and multiple functions through clear communication of vision and objectives, thorough planning, effective execution, and realization of desired benefits.
  • Track record of consistently delivering excellent results in challenging and/or transformational environments.
  • Experience working across the PepsiCo organization, ideally with multi-country or global implementation experience involving data.
  • Knowledge of some of the key concepts around master data management, data standards, analytics, and digital transformation.
  • Strong knowledge and understanding of data acquisition, data catalogues, data standards, and data management tools.
  • Strong communication skills: able to persuade and influence others at all organizational levels and to foster lasting partnerships.

Client-provided location(s): Plano, TX, USA
Job ID: PepsiCo-367837-en-us
Employment Type: Other

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • HSA
    • Pet Insurance
    • Mental Health Benefits
    • On-Site Gym
  • Parental Benefits

    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
    • On-site/Nearby Childcare
  • Work Flexibility

    • Flexible Work Hours
    • Remote Work Opportunities
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Commuter Benefits Program
    • Snacks
    • Company Outings
    • On-Site Cafeteria
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
    • Leave of Absence
    • Summer Fridays
  • Financial and Retirement

    • 401(K)
    • 401(K) With Company Matching
    • Stock Purchase Program
    • Financial Counseling
  • Professional Development

    • Tuition Reimbursement
    • Mentor Program
    • Access to Online Courses
    • Leadership Training Program
    • Associate or Rotational Training Program
    • Internship Program
