Date Posted:
2024-12-11
Country:
United States of America
Location:
HTX99: Field Office - TX Remote Location, Remote City, TX, 73301 USA
Position Role Type:
UnspecifiedAre you interested in building and maintaining the infrastructure to power the world's largest flight tracking platform?FlightAware, part of the Connected Aviation Solutions (CAS) unit of Collins Aerospace, has built the world's leading aviation software platform, processing over 300+ million incoming messages an hour from almost 40,000 individual data feeds-more than 5 terabytes a day and growing! We provide the best, most complete, and most accurate real-time flight-tracking service and are proud to have built a wide variety of successful products on this foundation that have become central to the aviation industry at large.FlightAware is searching for an enthusiastic and process-driven, Site Reliability Engineer (SRE) to "automate themselves out of a job." The FlightAware Site Reliability Engineering team embraces infrastructure automation, release engineering, and continuous delivery. As part of the Operations team, our SREs work alongside highly effective and talented counterparts, interacting closely with all facets of the Engineering org. This role requires a fusion of skills in development, analytics and hardware to solve problems in an exciting and demanding environment.FlightAware Engineering consists of teams that develop and deliver an array of services powering its commercial products. These teams tackle a diverse set of technologies and solve challenging technical and product problems daily. From collecting and interpreting aviation datasets to enriching them with procedural and AI/ML solutions and delivering them through APIs, web interfaces, and reporting products, our engineers work in a dynamic environment that tests their ability to marshal over 100 resources to achieve the company's vision. Regardless of role, we expect excellent interpersonal and communication skills across all hires at FlightAware. We look for candidates who will thrive here, meaning they demonstrate clear communication, embrace open feedback, trust their colleagues, and are driven to execute, deliver, and complete projects independently and efficiently.Learn more about the history of our reliability team and the FlightAware engineering interview process.What You Will Do:
Want more jobs like this?
Get jobs that are Remote delivered to your inbox every week.
- Spend your days working to automate and improve reliability and continue to push FlightAware's infrastructure forward, ensuring it is resilient and reproducible.
- Be responsible for service availability, performance, monitoring, incident response, and capacity planning.
- Create, improve, and manage environments to ensure decisions on resource allocation, problem identification, and capacity planning are based on accurate data-driven insights.
- Maintain a physical infrastructure using Kubernetes, Linux, & Ceph, and a cloud infrastructure in AWS as part of the Site Reliability Engineering team.
- Impact technology decision and direction to grow and support the FlightAware platform.
- Collaborate closely with fellow SREs on your team and extend your collaboration across other FlightAware teams and disciplines to design dependable and scalable solutions and services.
- Identify, implement, and champion process improvements to enhance productivity, collaboration, and delivery efficiency, while ensuring alignment with company goals and industry best practices.
- Gain a deep understanding of the systems and infrastructure that support FlightAware's applications and services; this includes networking, operating systems, cloud platform, databases, and other relevant technologies.
- Learn the intricacies of handling and processing real-time flight data, including ensuring the reliability of systems dealing with dynamic and time-sensitive information.
- Gain expertise in designing and maintaining high-availability architectures for critical systems, ensuring continuous availability and performance for FlightAware's global user base.
- Further develop skills in automation and scripting to streamline operational tasks and improve efficiency.
- Will participate in shared on-call rotation with SRE team.
- Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and minimum 8 years prior relevant experience or an Advanced Degree in a related field and minimum 5 years of experience or in absence of a degree, 12 years of relevant experience.
- Must be authorized to work in the U.S. without sponsorship now or in the future. RTX will not offer sponsorship for this position.
- Experience as a SRE, Platform Engineer, or related position within a Linux or UNIX environment working on large, complex infrastructures and/or projects using Docker and Kubernetes solutions.
- Experience automating configuration and infrastructure with tools such as Saltstack, Ansible, Terraform or other declarative languages.
- Experience with hardware; including servers, network switches, & cabling.
- Experience managing Kubernetes clusters using GitOps with continuous delivery (CD) pipelines such as Flux or Argo.
- Experience deploying and maintaining large, distributed storage solutions, such as Ceph.
- Established proficiency in at least one (ideally more) of the following: Python, Go, Rust, or Shell (bash, awk, sed).
- Experience with PostgreSQL, or equivalent RDBMS and SQL in general.
- Experience working with Nix or NixOS.
- Familiarity with Cloud infrastructure, ideally AWS.
- Understanding of SRE principles including building observability solutions and exposing metrics to inform SLO's and KPI's.
- Understanding of how IT infrastructure services work, including: DNS, DHCP, PXE, LDAP, NFS.
- Understanding of network segmentation, routing and VPNs.
- You are a private pilot; you are looking to pursue your private pilot license or have a passion for aviation.
- Medical, dental, and vision insurance
- Three weeks of vacation for newly hired employees
- Generous 401(k) plan that includes employer matching funds and separate employer retirement contribution, including a Lifetime Income Strategy option
- Tuition reimbursement program
- Student Loan Repayment Program
- Life insurance and disability coverage
- Optional coverages you can buy: pet insurance, home and auto insurance, additional life and accident insurance, critical illness insurance, group legal, ID theft protection
- Birth, adoption, parental leave benefits
- Ovia Health, fertility, and family planning
- Adoption Assistance
- Autism Benefit
- Employee Assistance Plan, including up to 10 free counseling sessions
- Healthy You Incentives, wellness rewards program
- Doctor on Demand, virtual doctor visits
- Bright Horizons, child and elder care services
- Teladoc Medical Experts, second opinion program
- And more!
This role is also eligible for the Re-Empower Program. The Re-Empower Program helps support talented and committed professionals as they rebuild their capabilities, enhance leadership skills, and continue their professional journey. Over the course of the 14-week program, experienced professionals will gain paid, on-the-job experience, have an opportunity to participate in sessions with leadership, develop personalized plans for success and receive coaching to guide their return-to-work experience. Upon completion of the program, based on performance and contributions participants will be eligible for a career at RTX.Minimum Program Qualifications
- Be on a career break of one or more year at time of application
- Have prior experience in functional area of interest
- Have interest in returning in either a full-time or part-time position
Our Connected Aviation Solutions team provides advanced information management systems, products and services that enable the connected ecosystem by bringing together Collins' unique breadth of aviation products with our smart digital solutions to help us enhance every aspect of the end-to-end travel experience. We help airlines, airports and business aircraft turn data into value to streamline operations, increase efficiency and reduce cost, enhance the passenger experience and contribute to sustainable flight. By combining the best networks, connectivity and data/analytics solutions, we're solving big problems for our customers and the world, while enhancing the security and connectivity of systems both on and off the aircraft, to help operators and passengers stay more connected and informed and create a more sustainable, efficient, reliable and enjoyable travel experience. Aviation connects the world. Our Connected Aviation Solutions team connects aviation. Sustainably. Seamlessly. Securely.Please ensure the role type (defined below) is appropriate for your needs before applying to this role.
Remote: Employees who are working in Remote roles will work primarily offsite (from home). An employee may be expected to travel to the site location as needed.
- Position is remote; however, if you live within a reasonable commute of a Collins site with other colleagues you interact with, your manager will discuss whether there is a degree of onsite presence associated with this role.
Click on this link to read the Policy and Terms