Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

ML Infrastructure Software Engineer

AT Apple
Apple

ML Infrastructure Software Engineer

Austin, TX

Do you love creating elegant solutions to highly complex challenges? Do you intrinsically see the importance in every detail? As part of our Silicon Technologies group, you'll help build AI-driven solutions that solve pressing business challenges. You'll ensure Apple products and services can seamlessly and efficiently handle the tasks that make them beloved by millions. Joining this group means you'll be responsible for crafting and building the technology that fuels Apple's devices. We are looking for an individual who is passionate about joining Apple's engineering team as an ML Infrastructure Software Engineer to enable the deployment and integration of AI models supporting our domains.

Description

In this highly visible role, your primary responsibilities will include: - Deploying, optimizing, and integrating industry-standard AI models within internal infrastructure to support silicon design workflows. - Collaborating with internal teams to evaluate model needs, define selection and benchmarking standards, and ensure our infrastructure remains state-of-the-art by tracking industry advancements. - Managing pipelines for fine-tuning and model conversion, and implementing monitoring to ensure scalable and efficient model deployment. - Contributing to compute planning and hardware decisions, including evaluating third-party silicon and supporting adoption of internal chip solutions.

Want more jobs like this?

Get Data and Analytics jobs in Austin, TX delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


Minimum Qualifications

  • Experience in Python
  • Experience with at least one of the following model deployment frameworks: VLLM, Triton, or TensorRT-LLM
  • Experience scaling or optimizing machine learning models in production environments
  • Minimum requirement of BS and 3+ years of relevant industry experience

Preferred Qualifications

  • Understanding of model optimization techniques (e.g., quantization, pruning, or format conversions)
  • Familiarity with containerization and orchestration tools such as Docker or Kubernetes
  • Ability to evaluate model choices based on hardware efficiency and constraints
  • Exposure to performance monitoring and observability systems for ML workloads
  • Designed and optimized RESTful services

Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Submit Resume

Client-provided location(s): Austin, TX, USA
Job ID: apple-200599493
Employment Type: Other

Company Videos

Hear directly from employees about what it is like to work at Apple.