Company Overview
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients achieve transformational business outcomes.
Financial technology is a high-growth industry as change and innovation continue to disrupt the status-quo and prompt major transformation. Arcesium is at a particularly interesting time in our own growth as we look to leverage our successfully established market position and expand operations in pursuit of strategic new business opportunities. We value intellectual curiosity, proactive ownership, and collaboration with colleagues, and we empower you to meaningfully contribute from day one and accelerate your professional development.
Want more jobs like this?
Get jobs in Lisbon, Portugal delivered to your inbox every week.
Position Summary
We are looking for an SRE to join our Corporate Technology team. The ideal candidate will be involved in planning, designing, and implementing various applications and infrastructure used by our staff. Strong focus will be on developing and managing applications built with cloud native and serverless technologies leveraging Azure & AWS Services.
The ideal candidate is an excellent Site Reliability Engineer with experience in cloud-based tech and a firm understanding of how to solve business needs using emerging technologies with emphasis on building applications that are cost friendly and support zero-touch operations. You'll also need to analyze various reports and statistical data to measure productivity levels and identify root causes for underperforming areas, develop customized reporting to measure and track operational statistics, data and results, oversee weekend activities across various office spaces such as user migrations to newer platforms, software & hardware upgrades and audits etc.s
Responsibilities
- Build integrations with third party SaaS applications that will include custom user provisioning, SSO, automation for migrating data and custom integrations with other applications
- Use MS Azure for managing operations in Windows Compute and Solutioning domain.
- Write good code, catch bugs, and style issues in code reviews, ship small features independently
- Participate in all aspects of the software development life cycle for AWS/Azure solutions, including planning, requirements, development, testing, and quality assurance
- Ensure the applications have optimal observability, monitoring and alerts that help identity the problems before they affect business productivity.
- You may also be involved in supporting our existing Corporate Tech applications and infrastructure like - Azure, AD, M365, Slack, Outlook/Exchange, AWS Workspaces/desktop infrastructure and other enterprise SaaS products.
- Handle operation issues for both Portugal and London office and act as Escalation Engineer for both the sites.
- 2+ years of solid Site Reliability Engineering skills, with a proven track record in developing quality software solutions and passion for technology.
- Hands on experience in diagnosing and troubleshooting operational issues, including root cause analysis (RCA) documentation.
- Strong programming skills, with proficiency in Python (preferred) or Java.
- Good understanding of the Linux operating system and TCP/IP suite of networking protocols, DNS, DHCP, VLANs, routing and switching.
- Experience managing and scaling distributed systems, including configuration management, in public, private, or hybrid cloud environments.
- Excellent verbal and written communication in English and Portuguese. Flexibility to collaborate across global time zones.
- Strong sense of ownership and integrity, demonstrated through clear communication and effective teamwork.
- Exceptional problem-solving abilities, adaptability, and a proactive approach to learning and development.
- Have a valid work permit to work in the country and travel across Europe.
- Willingness to travel as required to provide on-site support.
- Experience in any other object-oriented languages is a plus.
- Experience in cloud native and/or serverless architecture, Slack apps, and Azure/AWS certifications are a plus.
- Experience with CI/CD pipelines, container orchestration tools (e.g., Kubernetes), and monitoring and logging tools (e.g., Prometheus, Grafana) is highly desirable.