Site Reliability Engineer at Forman Technology Group
Austin, TX 78799, USA -
Full Time


Start Date

Immediate

Expiry Date

13 Nov, 25

Salary

98000.0

Posted On

13 Aug, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Health Insurance, Python, Docker, Azure, Information Technology, Dental Insurance, Aws, Computer Science, Communication Skills, Automation Tools, Bash

Industry

Information Technology/IT

Description

We are looking for a Site Reliability Engineer (SRE) to help build and maintain reliable, scalable, and high-performance systems. This role is perfect for someone passionate about automation, infrastructure, and solving operational challenges with code. As part of our engineering team, you’ll collaborate across departments to ensure our systems are secure, resilient, and highly available.

REQUIREMENTS

  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
  • 1–2 years of relevant experience in systems engineering, DevOps, or SRE roles.
  • Basic knowledge of Linux/Unix systems administration and networking fundamentals.
  • Familiarity with cloud platforms (AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes).
  • Understanding of CI/CD pipelines and automation tools.
  • Proficiency in at least one scripting language (Python, Bash, or similar).
  • Strong problem-solving skills and ability to work collaboratively in cross-functional teams.
  • Excellent communication skills and a proactive learning mindset.
    Job Type: Full-time
    Pay: $82,000.00 - $98,000.00 per year

Benefits:

  • 401(k)
  • Dental insurance
  • Health insurance
  • Paid time off

Work Location: In perso

Responsibilities
  • Monitor, maintain, and improve the availability, scalability, and performance of critical systems and services.
  • Collaborate with software engineering teams to design and implement robust infrastructure solutions.
  • Automate operational processes using scripts, tools, and modern DevOps practices.
  • Identify, troubleshoot, and resolve production issues promptly to minimize downtime.
  • Implement and manage observability tools for system monitoring, logging, and alerting.
  • Participate in incident response, root cause analysis, and post-mortem reviews.
  • Support deployment processes and continuous integration/continuous delivery (CI/CD) pipelines.
  • Contribute to the creation and maintenance of technical documentation and runbooks.
Loading...