Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Site Reliability Engineer

AT TikTok
TikTok

Site Reliability Engineer

London, United Kingdom

Responsibilities

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked to ensure the traffic services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, global load balancer that handles Tbps of traffic etc..

Responsibilities
- Build, expand and operate Bytedance's global traffic platform, including large-scale systems in public and private clouds, edge data centers.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform.

Want more jobs like this?

Get Software Engineering jobs in London, United Kingdom delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

- Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
- Help improve the whole lifecycle of infrastructure services from inception and design throughout development, to deployment, user support and refinement

Qualifications

Minimum Qualifications
- Master's degree (or Bachelor's degree with 3+) years of experience in Computer Engineering, Electrical Engineering, Computer Science or related major
- 3+ years experience working with Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
- 3+ years experience in one or more programming languages such as Go, Python and Shell script.
- Familiar with Cloud and CI/CD framework/Tools, such as GIT, Docker, Kubernetes, etc.
- Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
- Strong in analytical skills and the ability to solve real world problems in a fast moving environment.

Preferred Qualifications
- Experience in designing, analyzing and building automation and tools for large scale systems
- Experience in building solutions with AWS, Google, Azures and other cloud services.
- Experience in networking technologies such TCP/IP, HTTP, DNS, etc. in a carrier-grade environment.
- Experience in developing and operating one or more of following systems: Kubernetes, Nginx, ipvs, ELK stack, etc.

Client-provided location(s): London, UK
Job ID: TikTok-7483551161053251848
Employment Type: Other

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Dental Insurance
    • Vision Insurance
    • HSA
    • Life Insurance
    • Fitness Subsidies
    • Short-Term Disability
    • Long-Term Disability
    • On-Site Gym
    • Mental Health Benefits
    • Virtual Fitness Classes
  • Parental Benefits

    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
  • Work Flexibility

    • Flexible Work Hours
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Casual Dress
    • Snacks
    • Pet-friendly Office
    • Happy Hours
    • Some Meals Provided
    • Company Outings
    • On-Site Cafeteria
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
    • Leave of Absence
  • Financial and Retirement

    • 401(K) With Company Matching
    • Performance Bonus
    • Company Equity
  • Professional Development

    • Promote From Within
    • Access to Online Courses
    • Leadership Training Program
    • Associate or Rotational Training Program
    • Mentor Program
  • Diversity and Inclusion

    • Diversity, Equity, and Inclusion Program
    • Employee Resource Groups (ERG)

Company Videos

Hear directly from employees about what it is like to work at TikTok.