Site Reliability Engineer - USDS

at  TikTok

Sydney, New South Wales, Australia -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate14 Aug, 2024Not Specified15 May, 20243 year(s) or aboveDistributed Systems,Computer Science,Database Modeling,Mysql,Openstack,Information Technology,Spark,Strategic Thinking,Programming Languages,Redis,Kubernetes,Network Architecture,Java,Hadoop,Communication Skills,Creativity,Operating Systems,DockerNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

Responsibilities
About TikTok U.S. Data Security
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (“USDS”) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.
Why Join Us
Creation is the core of TikTok’s purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That’s how we drive impact - for ourselves, our company, and the communities we serve.
Join us.
Site Reliability Engineering(SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you’ll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of diversity, intellectual curiosity, openness, and problem-solving. We encourage close collaboration while promoting self-direction.

Responsibilities

  • Develop and maintain automation procedures to maximize system efficiency and minimize human intervention.
  • Work closely with software engineering teams to design, deploy and operate elements to ensure that systems are functionally robust.
  • Ensure system scalability to handle growth in web traffic and data.
  • Implement monitoring tools and set up metrics to keep track of system health and performance.
  • Participate in on-call rotations, assist with incident management, and diagnose, resolve, and prevent production issues.
  • Conduct performance tests to find and address system bottlenecks.
  • Collaborate with teams across the organization to define Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs).
  • Practice sustainable user support, incident response, and blameless postmortems.

Qualifications

MINIMUM QUALIFICATIONS:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field with 3+ years of experience
  • Proven work experience as a Site Reliability Engineer, Systems Engineer, or similar software engineering role.
  • Proficient knowledge of high-level programming languages (e.g. Python, Go, Java, and Shell script).
  • Experience in network architecture, database modeling, cloud systems and large-scale distributed systems.
  • Strong understanding of Linux operating systems and open-source technologies.

PREFERRED QUALIFICATIONS:

  • Experience in MySQL, Redis, Ngnix, Kubernetes, Docker, OpenStack, Hadoop, Spark, etc
  • [Preferred] Knowledge of monitoring tools and methodologies (such as Prometheus, Grafana).
  • Excellent problem-solving skills, strategic thinking, and a strong ability to debug complex systems.
  • Exceptional communication skills and the ability to effectively collaborate with cross-functional teams.
    TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
    In the spirit of reconciliation, TikTok acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their Elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today

Responsibilities:

  • Develop and maintain automation procedures to maximize system efficiency and minimize human intervention.
  • Work closely with software engineering teams to design, deploy and operate elements to ensure that systems are functionally robust.
  • Ensure system scalability to handle growth in web traffic and data.
  • Implement monitoring tools and set up metrics to keep track of system health and performance.
  • Participate in on-call rotations, assist with incident management, and diagnose, resolve, and prevent production issues.
  • Conduct performance tests to find and address system bottlenecks.
  • Collaborate with teams across the organization to define Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs).
  • Practice sustainable user support, incident response, and blameless postmortems


REQUIREMENT SUMMARY

Min:3.0Max:8.0 year(s)

Information Technology/IT

IT Software - System Programming

Software Engineering

Graduate

Computer science information technology or a related field with 3 years of experience

Proficient

1

Sydney NSW, Australia