We are seeking a highly skilled and experienced Lead MLOps Engineer to join our team.
The ideal candidate will be responsible for overseeing the entire machine learning operations lifecycle, from design and deployment to maintenance and optimization. This role requires a blend of technical expertise, leadership skills, and a deep understanding of MLOps practices.
If this resonates with you, this could be the perfect opportunity to join the EPAM team!
Responsibilities
- Design, build, maintain, troubleshoot, and optimize the complete end-to-end ML deployment lifecycle
- Establish and configure CI/CD/CT processes
- Identify technical risks and gaps, and devise mitigation strategies
- Evaluate new tools and techniques
- Promote and support MLOps practices
- Collaborate with cross-functional teams to ensure seamless integration of ML models
- Develop monitoring systems to track the performance of deployed models
- Drive innovation and stay updated with the latest industry trends in MLOps
- Provide technical leadership and mentorship to junior team members
- Lead the development and implementation of best practices for ML model management and deployment
Requirements
- 5+ years of production experience as an MLOps Engineer, ML Engineer (MLE), or Data Engineer
- Confirmed team or project leading experience, with proven ability to mentor and coach other team members
- Experience with at least one MLOps-related platform or technology, such as Azure ML (preferred), AWS SageMaker, GCP Vertex AI, or Databricks MLflow
- Proficiency in Databricks
- Experience in Python development and the Python ML ecosystem
- Skills in automated data pipeline and workflow management tools
- Competency in basic software engineering tools, e.g., Git, CI/CD environments, PyPI, Docker, Kubernetes
- Agile development experience
- Strong problem-solving and decision-making skills
- Clear, concise communication skills and good command of written and spoken English (B2+)
- Good understanding of ML fundamentals, data preparation, and feature engineering
- Experience with one of the infrastructure as code (IaC) frameworks (e.g., Terraform / CDK for Terraform, Ansible, AWS CloudFormation / AWS CDK)
- Understanding of the Apache Spark ecosystem (Spark SQL, MLlib/Spark ML)
- Software architecture and design experience
We offer
- Opportunity to Engineer your Future and to drive the world's digital transformation with top industry clients
- Personal development program that will allow you to be valued for your strengths
- Wide range of professional training and workshops
- Being part of a collaborative, fast-growing, and innovative design team
- Established and accelerated growth toward different career paths, competencies, and roles
- A broad variety of projects and the possibility to move between projects over time
- Collaboration in a multicultural environment and exchange of best practices with colleagues around the world
- Varied social benefits, including sports, transportation, and health programs
- Work-life balance and a flexible schedule, team-building events, and sports opportunities
- Modern office/collaboration spaces (incl. new Infinity Tower business center, Sofia)
- Hybrid by Design: we provide the best productivity options from both worlds. Meet, socialize, and enjoy face-to-face time with your colleagues while working from EPAM's modern office a few days per week, and work remotely in EPAM's virtual working environment for the rest of the week