Description
Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team and develop reusable solutions that support our customers in any environment. You will have the opportunity to contribute to the design and implementation of Continuous Integration and Continuous Delivery (CI/CD) pipelines that accelerate the secure delivery of software to production. You will automate the buildout of infrastructure in cloud and on-premises environments to operate Kubernetes clusters and microservices deployments. In this role, you will join dynamic Agile software teams that are singularly focused on providing world-class solutions to our customers in an exciting, collaborative, and inclusive atmosphere. You will be intellectually challenged and provided with a tremendous opportunity for growth in a fast-paced and fun environment.
You'll learn, master, and improve the Continuous Integration and Continuous Delivery (CI/CD) processes and tools we use to develop, test, integrate, and deploy our cloud-based and on-premises solutions into multiple hosting environments, such as AWS, Azure, VMware, and others. You'll learn new technologies and tools and apply what you've learned to overcome technological challenges with innovative solutions. You'll collaborate with other software engineers and SREs to share your knowledge with the team and the organization to make us all better at what we do. You'll perform technical spikes and develop prototypes to help test product concepts and achieve customer validation.
Primary Responsibilities:
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding of a microservice enterprise system (cloud and on-premises)
- Partner with development teams to improve services through rigorous testing and release procedures
- Participate in system design consulting, platform management, and capacity planning
- Create sustainable systems and services through service automation
- Design, develop, troubleshoot, and debug mission-critical infrastructure
- Manage on-premises and private/public cloud environments via infrastructure-as-code (IaC).
- Participate in the design of reusable infrastructure components for scalable, highly available, secure architectures for cloud native applications.
- Enable the continuous integration and continuous delivery of our diverse suite of software products by applying best practices for infrastructure provisioning, configuration and automated software deployments.
- Continually evaluate fielded system deployments and apply best practices to facilitate continuous improvement that can be applied across teams.
- Work closely with other engineers to develop the best technical design and approach for new product installation and field service activities (software patches, cyber updates, etc.)
- Develop solutions to complex technical issues and problems that impact multiple areas or disciplines.
- Communicate with internal team members across multiple areas and coordinate completion of key deliverables across teams.
- Liaise with external and internal customer stakeholders on technical design decisions and trade-offs and ensure software solution will meet required functional, performance, and SLA thresholds.
- Mentor other SREs in the art of building, deploying, and maintaining mission-critical production microservice enterprise systems.
- Resolve roadblocks for the field service team, working collaboratively with product engineering, technical leadership, and others.
Basic Qualifications:
- Bachelor's degree in computer science or computer engineering with 4+ years of experience in a relevant field
- Experience delivering entire projects or processes spanning multiple technical areas.
- Experience serving as a technical lead managing large projects or processes.
- Working knowledge of Agile Development and continuous integration and continuous delivery methodologies and tools.
- Expertise with Linux and Windows operating systems, network administration, and networking protocols/functions (e.g., HTTP, HTTPS, SSL/TLS, SMTP, DNS)
- Expertise provisioning and managing resources within IaaS/Cloud infrastructures (e.g., Azure, AWS, Google Cloud Platform, etc.)
- Experience with Terraform, Ansible, Helm, Bash scripting, CloudFormation, Chef, Puppet, or similar technologies
- Expertise with container technologies such as Docker and container orchestration tools like Kubernetes
- Expertise with Kubernetes kubectl
- Expertise with a version control system (e.g., Git).
- Strong, self-motivated desire to learn new tools, frameworks, and techniques.
- Ability to complete tasking independently with minimal direct supervision.
- Ability to work and collaborate effectively within a multi-disciplined engineering team.
Preferred Qualifications:
- Experience with Enterprise Event Brokers Technologies (Kafka, NATS)
- Experience with monitoring and alerting tools such as Grafana, Prometheus
- Experience with API gateways such as Istio
- Experience with GitOps tools such as Argo CD, Flux CD, Fleet or similar
- Professional cybersecurity certification such as Security+, or similar.
- Knowledge of Agile Development methodologies.
- Familiarity with at least one Relational Database Management System (Oracle, MySQL, PostgreSQL, SQL Server, etc.).
The Security Enterprise Solutions (SES) Operation is the cornerstone of Leidos' comprehensive suite of fully integrated security solutions for aviation, ports, borders, and critical infrastructure customers around the world. With our new, combined portfolio, our operation has more than 24,000 products deployed across 120 countries. Leveraging this portfolio, our core technical strengths, and robust R&D initiatives, we are positioned to address emerging and evolving threats through rapid development of innovative solutions for our global customers. Travel to various customer sites, domestically and internationally, is required as needed.
SES is comprised of three divisions to align with our customers' missions and needs:
- Aviation Solutions
- Ports & Borders
- Global Services
SPECIFIC DUTIES, ACTIVITIES AND RESPONSIBILITIES INCLUDE BUT ARE NOT LIMITED TO:
- Performing customer support activities involving the installation, modification, and repair of complex equipment and systems
- Conducting on-site installation and testing of equipment to ensure proper working order.
- Isolating equipment start-up malfunctions and taking corrective action.
- Performing ad-hoc and predetermined work assignments with instructions.
- Following all established guidelines, procedures and policies.
- Working on assignments which are semi-routine in nature where ability to recognize deviation from accepted practice is required
- Managing resources, maintaining schedules, and coordinating all repair activities in priority sequence to ensure adherence to contractual service level agreements (SLAs).
- Working in a customer environment with systems operated in a 24/7 environment which require immediate response times (either while on-shift or after hours on-call) and driving long distances to sites.
- Accepting and updating work orders in CRM systems such as Salesforce, Maximo, and GURU for our customers.
Basic Qualifications:
- Related Trade experience (electrical, electronic, mechanical)/Military Training (electrical, mechanical, electronics). (Equivalence achieved through comparative work and life experience of 2 years of electrical or mechanical field service role). Computer literacy; competency in use of all programs within MS Office Suite and aptitude for learning specialized software programs.
- Minimum of 2 years' experience directly involved in troubleshooting and field repair of electrical and electronic systems and equipment.
- Individuals shall possess excellent communications skills and have a strong orientation for customer focus and teamwork. Must be responsive to all customer issues at all times. Must be willing & able to travel at short notice.
- Must be able to pass an in-depth background check (TSA eQIP).
Preferred Qualifications:
- Possess excellent organizational, communication, and interpersonal skills with the ability to manage several projects at once.
- Excellent customer service skills and the ability to handle stressful situations.
- Self-motivated, reliable, and accountable individual
- Must be able to lift/carry 50 lbs.
- Must be able to push/pull 200 lbs.
- Must be able to move/ manipulate equipment weighing up to 1000 lbs. with the assistance of carts, hoists, davit cranes, pallet jacks or other devices as defined in the manuals and Technical Advisory documentation.
- Job requires frequent bending, stooping, twisting, turning, and working in unusual positions requiring full body mobility.
- Must be able to work safely and follow safety precautions in extreme environments (temperature, humidity, noise, confined spaces, etc.) around dangerous industrial equipment.
Salary Range for this position: $90,000 to $100,000
Original Posting:
March 5, 2025
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:
$85,150.00 - $153,925.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
#Remote