This is Adyen
Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.
For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and support to ensure they are enabled to truly own their careers. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team. Together, we deliver innovative and ethical solutions that help businesses achieve their ambitions faster.
Site Reliability Engineer - Data Platform
Want more jobs like this?
Get jobs in Amsterdam, Netherlands delivered to your inbox every week.
This role blends software and systems engineering to develop the essential tools and processes that power our on-premise Big Data Platforms. These platforms support dozens of products, hundreds of developers, and thousands of daily jobs, reinforcing Adyen’s industry-leading capabilities.
As a Data Platform SRE, you will be responsible for managing one of the largest data platforms in the world. Your focus will be on ensuring that data, data services, and infrastructure are reliable, fault-tolerant, efficiently scalable, and cost-effective.
Beyond operations, you’ll have the opportunity to design, build, and deliver scalable systems as a Platform engineer. If you thrive on automation and reducing manual toil, you’ll play a strategic role in shaping the future of automation for our Big Data Platforms.
You will collaborate with data and ML scientists and engineers to develop and roll out tools that enhance platform performance while operating and scaling multiple big data platforms. This includes managing a fleet of a few thousand nodes, tens of thousands of cores, hundreds of terabytes of RAM, and tens of petabytes of storage. This is a unique opportunity to work at scale and influence the infrastructure behind Adyen’s cutting-edge data capabilities.
What you’ll do
- Design, develop, operate, and maintain scalable, reliable, fault-tolerant, and high-performance big data platforms.
- Work with distributed systems in all shapes and flavors (databases, file systems, compute, etc.).
- Shape and maintain continuous improvements and deployment of new clusters.
- Implement observability solutions (logging, monitoring, alerting) to ensure system reliability and performance.
- Optimize performance of large-scale distributed data platforms with appropriate cluster configurations, and resource management.
- Practice sustainable incident response and blameless postmortems, with a focus on automation and root cause analysis.
- Improve security and compliance by implementing access controls, encryption, and best practices for data governance.
- Design and build automation tools to reduce toil and enhance the reliability of data platforms.
- Self-service infrastructure to empower data teams while maintaining platform stability.
Who you are
- Familiar with infrastructure automation and configuration management tools (e.g. Ansible, Terraform, Puppet).
- Strong system administrator/platform engineer background with experience in large-scale distributed systems.
- Experienced with infrastructure and private cloud systems (on-premise).
- A team player with strong communication skills, able to work closely with diverse stakeholders (analysts, data scientists, data engineers, infrastructure, and security teams).
Experience developing and maintaining:
- Distributed data and compute systems (Hadoop, Druid, Trino, etc).
- Kubernetes (k8s, Docker)
- Hadoop ecosystems (Hive, YARN, HDFS, Kerberos).
Good to have
- Experience as a Data Platform Engineer or Site Reliability Engineer for Data Platforms
- Fluency in Python. Java, Golang, or Rust are also appreciated.
- Experience in observability & monitoring tools (Prometheus, Grafana, OpenTelemetry).
- Experience with on-call rotations or incident management in high-scale distributed systems
Our Diversity, Equity and Inclusion commitments
Our unique approach is a product of our diverse perspectives. This diversity of backgrounds and cultures is essential in helping us maintain our momentum. Our business and technical challenges are unique, and we need as many different voices as possible to join us in solving them - voices like yours. No matter who you are or where you’re from, we welcome you to be your true self at Adyen.
Studies show that women and members of underrepresented communities apply for jobs only if they meet 100% of the qualifications. Does this sound like you? If so, Adyen encourages you to reconsider and apply. We look forward to your application!
What’s next?
Ensuring a smooth and enjoyable candidate experience is critical for us. We aim to get back to you regarding your application within 5 business days. Our interview process tends to take about 4 weeks to complete, but may fluctuate depending on the role. Learn more about our hiring process here. Don’t be afraid to let us know if you need more flexibility.
This role is based out of our Amsterdam office. We are an office-first company and value in-person collaboration; we do not offer remote-only roles.