Introduction
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.
Your Role and Responsibilities
We are currently looking for skilled Platform Engineers to develop, maintain, and support container orchestration (kubernetes), machine learning workloads, network services, and storage layers across cloud and on-premise.
- Develop and maintain scalable distributed systems in IBM Cloud, AWS, and on-premise.
- Develop and maintain high performance k8s clusters across multiple regions.
- Develop and maintain telemetry infrastructure & service instrumentation (python) for metrics, distributed tracing, and logging.
- Support infrastructure for a petabyte scale data platform and stream analysis services.
- Work with Audio and Speech AI Engineers to accelerate development and deployment of heterogeneous analysis and training pipelines
- Participate in the definition and management of SLIs, SLOs and error budgets for infrastructure and production services.
- Design and implement infrastructure-as-code pipelines
Want more jobs like this?
Get jobs in Krakow, Poland delivered to your inbox every week.
PLP&T_24
Required Technical and Professional Expertise
- 4+ Years cloud development (IBM cloud preferred and AWS) experience designing, implementing, and support cloud-based infrastructure
- 3+ Years experience architecting, deploying, and supporting kubernetes in cloud and on-prem environments.
- 2+ years experience designing and supporting distributed systems.
- Experience writing production code in one of more languages such as Python (preferred), Java, Go in a microservices environments.
- 2+ Years Linux experience configuring, supporting, and optimizing. Bonus for Redhat
Preferred Technical and Professional Expertise
- Familiarity running distributed ML workloads in cluster orchestrated environments
- Experience building and supporting telemetry and related infrastructure (Open Telemetry, Jaeger, Grafana, Prometheus)
- Experience with k8s ecosystem tooling like helm, deployment tools such as ArgoCD
- Experience designing and implementing infrastructure as code pipelines
- Experience designing and implementing traffic routing strategies in edge and microservices environments.
- Competitive salary
- Employee Capital Plan
- Private medical care
- Group life insurance
- Contributions to your Kafeteria MyBenefit account
- Home internet allowance
- Free snacks, and drinks in the office
- Gym membership and tuition reimbursement
- Hands-on career development
- Free parking