Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
The Hartford

AVP Reliability Engineer

Hartford, CT

AVP & Reliability Engineering - IE05HE

We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future.

The AVP & Reliability Engineer will have end-to-end accountability for the reliability of IT Services within Group Benefits IT Portfolio, The Hartford. This person will lead a team of RE's who will work together to drive reliability, resiliency and productivity across the domain of control. He/She will ensure the implementation of IT Security and service hardening requirements and contribute to the long-term strategic evolution of the portfolio. Working closely with Software Engineering and Enterprise Architecture leadership, he/she will drive the sustained advancement of the RE practice within Group Benefits. Key measures of success will include service reliability (such as availability, latency, quality), feature velocity and deployment quality, as well as technical debt reduction and cost efficiency.

Want more jobs like this?

Get jobs in Hartford, CT delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


Job Duties:

- Create and drive the use of best-in-class software engineering standards and design practices for instrumenting code/application technology stack to enable the generation of relevant metrics on overall technology health - availability, performance, quality, currency and resiliency.

- Serve as key partner with the architecture and software engineering leadership to help determine the technical strategy for the organization, keeping in mind its cross-functional impacts, integration across the organization, and architecture rationalization.

- HR Manager responsibilities for T6-T8 Reliability Engineers and other applicable roles within the RE Function

- DevSecOps Solution Responsibilities:

  • Design effective tooling, alerts, and response mechanisms to identify and address reliability risks leveraging automation to support problem prevention, detection, mitigation, and resolution.
  • Drive the enhancement of the delivery flow by engineering the appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.
  • Progressively design preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines.
  • Promote and implement innovative solutions.

- IT Engineering Responsibilities:

  • Champion the migration of applications to open-source platforms, PaaS and use of containers and other cloud technology standards for cloud-enablement and platform agility.
  • Design and drive simplification across the stack, responsible for ensuring that all technical designs can be effectively operated without adding technical complexity.
  • Design and drive inner- and open-sourcing practices to accelerate the development of self-service enterprise capabilities (platform, infrastructure, security, etc.).
  • Critical thinking to evaluate and optimize data systems and processes, identifying areas for improvement and developing solutions to enhance the reliability and scalability of these systems.
  • Ability to evaluate and recommend designs and patterns that can be leveraged to support environments at scale
  • Ability to institute practices that balance Agility & Speed with a FinOps mindset to ensure the organization can move quickly while maintaining data integrity

- IT Ops Responsibilities:

  • Ensure operational excellence. Ensure the SRE teams are able to independently drive the triaging and service restoration of all high impact incidents in order to minimize the mean time to service restoration and impact to the business. Demonstrate end-to-end ownership.
  • Partner with infrastructure product teams to design and implement intelligent incident routing, enhanced monitoring/alerting capabilities and automated service restoration processes. Take proactive measures to prevent high impactful incidents.
  • Ensure that the continuity of Hartford and third-party assets that support a business function are achieved and maintained. Ensure the RE teams keep the IT application and infrastructure metadata repositories current.

Knowledge/Skills:

  • System Thinking end-to-end - Broad and deep understanding of enterprise architectures and complex (backend) systems
  • Strong solution architecture orientation to enable expedient troubleshooting, issue-resolution and root-cause removal in a hybrid cloud environment.
  • Highly collaborative, partners with peers, stakeholders with a passion about delighting customers.
  • Deep software and systems engineering expertise. Hands-on development experience with Java stack - JSP, JSF, Spring, JMS, JDBC, Web Services, Weblogic and JBOSS, Apache, .NET, SQL, Oracle, Dynamo, NoSQL, Angular.
  • Experience with XML and XSLT. Knowledge of ACORD and WTX
  • Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, Jira, SonarQube, Azure DevOps, AWS Code Pipeline.
  • Expert with Performance and Observability tools such as DynaTrace, SumoLogic, TrueSight, CloudWatch, CloudTrail, AWS X-Ray, and related tools.
  • Expert experience with performance monitoring and exposure to Dynatrace and infrastructure observability tools is required.
  • Expert on new market technologies and adept at learning and adopting new models. Promotes and applies continuous learning.
  • Proven experience working with complex traditional and modern enterprise architectures and systems (understand more than the component itself).
  • Expert hybrid cloud experience (private and public) across various service delivery models - IaaS, PaaS, SaaS.
  • Strong communication (verbally and written) / collaboration / negotiation skill, working in a diverse team cross business units
  • People Leader and Strategic Partner - Develop and maintain a sustainable organization through the hiring, performance management, coaching and development of a multi-discipline organization, including managing strategic business and vendor partner relationships to create value for the organization.
  • Lead transformational change management by championing the adoption of automation capabilities built and foster a culture of continuous learning and improvement mindset for the organization.

Minimum Qualifications: Degree in Computer Science or related discipline with a minimum of 15 years of work experience in IT systems analysis, design, application development, IT Operations, and tech leadership.

Preferred Qualifications: 10+ years experience in an SRE or Multi Stack / Data Engineer Manager role

Certifications/Licenses: AWS Certified Solution Architect, AWS Certified DevOps Engineer, AWS Certified Developer, Microsoft Certified Azure Solution Architect, Microsoft Certified Azure DevOps Engineer, Microsoft Certified Azure Developer, Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), EXIN DevOps Master, Finops.

Compensation

The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:

Equal Opportunity Employer/Females/Minorities/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age

About Us | Culture & Employee Insights | Diversity, Equity and Inclusion | Benefits

Client-provided location(s): Hartford, CT, USA
Job ID: hartford-R2417149
Employment Type: Full Time

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA With Employer Contribution
    • HSA With Employer Contribution
    • On-Site Gym
    • Mental Health Benefits
    • Virtual Fitness Classes
    • Fitness Subsidies
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
    • Adoption Leave
  • Work Flexibility

    • Hybrid Work Opportunities
    • Remote Work Opportunities
    • Flexible Work Hours
  • Office Life and Perks

    • Commuter Benefits Program
    • Casual Dress
    • On-Site Cafeteria
    • Company Outings
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Volunteer Time Off
    • Personal/Sick Days
  • Financial and Retirement

    • 401(K) With Company Matching
    • Stock Purchase Program
    • Performance Bonus
    • Relocation Assistance
    • Financial Counseling
    • Profit Sharing
  • Professional Development

    • Internship Program
    • Leadership Training Program
    • Associate or Rotational Training Program
    • Tuition Reimbursement
    • Promote From Within
    • Mentor Program
    • Shadowing Opportunities
    • Access to Online Courses
    • Lunch and Learns
    • Learning and Development Stipend
  • Diversity and Inclusion

    • Employee Resource Groups (ERG)
    • Diversity, Equity, and Inclusion Program