Site Reliability Engineer at Pluribus Digital
, , -
Full Time


Start Date

Immediate

Expiry Date

18 Dec, 25

Salary

130000.0

Posted On

19 Sep, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Site Reliability, System Administration, Troubleshooting, Problem Solving, AWS, Java, Python, Ruby, Linux, Windows, Infrastructure Management, Deployment Automation, Configuration Management, System Monitoring, Security Practices, Database Management

Industry

IT Services and IT Consulting

Description
About Pluribus Digital: Join us and do work that matters: use your skills to improve how your government serves the public! Pluribus Digital partners with our government customers to design, develop, and deliver useful and impactful digital products. We are a hands-on digital services consultancy – part technologists, part change agents, and all heart. We employ modern best practices in all that we do as we work to solve problems in public health, financial industry regulation, granting citizenship and asylum, and identity and access management. About the Role: We are looking for a Site Reliability Engineer to support the General Services Administration (GSA) on the ITRegs program. This role is part of the Operations & Maintenance (O&M) team, maintaining mission-critical federal systems, and migration support. The engineer will contribute to infrastructure management, deployment automation and system monitoring to ensure availability, security and performance as the program transitions into a modernized, cloud-hosted environment. What you will do: Support infrastructure management across hybrid environments (Linux/Red Hat, Windows). Assist with deployment automation and configuration management tasks to improve system efficiency and reliability Perform system monitoring, troubleshooting, and log analysis to identify and resolve performance or security issues Contribute to patching, updates, and vulnerability remediation in line with federal security standards Document system changes, deployments, and operational procedures to support compliance and audits Participate in incident response activities, assisting senior engineers in root cause analysis and corrective actions Collaborate with O&M team members to support continuity of operations (COOP/DR) and modernization-related migration activities What you will bring: 4+ years of relevant professional experience in system administration, site reliability, or operations support. Bachelor’s degree or higher in Computer Science, Information Systems, or a related technical field. Ability to obtain and maintain a Public Trust clearance (U.S. citizenship required). Hands-on experience with system monitoring, logging and performance management in enterprise environments Familiarity with federal IT security practices (patching, vulnerability scanning, NIST standards) or a willingness to learn Strong troubleshooting and problem-solving skills with the ability to follow standard operating procedures Experience with AWS and a deep understanding of AWS networking and containerization tools Development experience with Java, Python or Ruby Familiarity with federal IT security practices (patching, vulnerability scanning, NIST standards) or a willingness to learn Strong troubleshooting and problem-solving skills with the ability to follow standard operating procedures Ability to work as part of a team supporting mission-critical systems in a compliance-driven environment Experience with Documentum or Oracle database environments is a plus Why Pluribus May Be a Fit for You We are purpose driven. We support missions and products that serve the public good, and where our focused capabilities positively impact those mission outcomes. We bring a consultative approach to partner with our government customers and help them succeed as change makers. Pluribus is a calm company. We are knowledge workers. People do their best work when they are not rushed by artificial urgency or drained by a culture of facetime and workaholism. By having confidence in our people, we can get more done at better quality. When real crunch time comes, we are not already stretched to the limit. We are stronger because of the variety of skills and personal backgrounds of our team. We hold ourselves accountable and we publish our workforce statistics annually. Compensation and benefits: Pluribus Digital offers a competitive salary that is determined at the time of offer. Compensation will be based on experience and qualifications, with salary ranges aligned accordingly. If a candidate is a strong fit at a more junior or senior level to what is outlined here, we will assess them accordingly and apply the appropriate salary range during the hiring process. The range for this specific role is from $120,000 to $130,000 depending on experience. Salary is augmented with opportunity to earn annual bonus and medical/dental/vision benefits, PTO, company paid life insurance and a generous 401k match program. Details on benefits can be found here: https://pluribusdigital.com/content/join/benefits.
Responsibilities
The Site Reliability Engineer will support infrastructure management across hybrid environments and assist with deployment automation and configuration management tasks. They will also perform system monitoring, troubleshooting, and contribute to patching and updates in line with federal security standards.
Loading...