Site Reliability Engineer at stackArmor Inc

Reston, VA 20190, USA -

Full Time

Start Date

Immediate

Expiry Date

07 Sep, 25

Salary

160000.0

Posted On

08 Jun, 25

Experience

3 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Automation, Computer Science, Logging, Docker, Health Insurance, Vision Insurance, Python, Bash, Jenkins, Flexible Schedule, Life Insurance, Nist, Dental Insurance, Azure, Scripting, Devops, Aws, Kubernetes, Reliability Engineering

Industry

Information Technology/IT

Description

ABOUT US:

stackArmor is a fast-growing cloud services company focused on secure and compliant digital transformation for government, healthcare, and regulated industries. We deliver DevSecOps, cloud engineering, and cybersecurity solutions aligned with frameworks like FedRAMP, CMMC, and NIST.

POSITION OVERVIEW:

We’re looking for a Site Reliability Engineer (SRE) to help build and maintain secure, reliable, and scalable infrastructure that powers mission-critical workloads. You’ll work closely with cloud engineers, security analysts, and developers to drive automation, performance, and resilience across our platforms.

REQUIRED QUALIFICATIONS:

Bachelor’s degree in Computer Science, Engineering, or equivalent experience
3+ years in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles
Solid hands-on experience with AWS, Azure, or GCP (AWS preferred)
Experience with Kubernetes, Docker, and container orchestration in production environments
Proficient with scripting and automation (Python, Bash, Go, or similar)
Experience with CI/CD tools (GitHub Actions, Jenkins, GitLab CI, or similar)
Strong understanding of Linux administration, networking, and cloud security principles
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK, Datadog)

PREFERRED QUALIFICATIONS:

Hands-on experience working in compliance-driven environments (FedRAMP, NIST, CMMC, HIPAA)
Familiarity with continuous compliance and cloud security posture management (CSPM) tools
Experience with serverless architecture and service mesh (e.g., AWS Lambda, Istio)
Experience implementing backup, disaster recovery, and business continuity plans
Knowledge of zero-trust architecture, IAM policies, and security hardening
Cloud certifications such as AWS Certified DevOps Engineer, GCP Professional SRE, or Azure DevOps Engineer
Exposure to incident management platforms (e.g., PagerDuty, Opsgenie)
The ideal candidate is a proactive problem-solver with a passion for building reliable and secure cloud infrastructure. They thrive in fast-paced environments, have hands-on experience with modern DevOps and SRE tools, and are comfortable working independently or as part of a cross-functional team. A strong foundation in cloud platforms (especially AWS), container orchestration (Kubernetes), and automation is essential. Candidates who have worked in compliance-driven environments and bring a security-first mindset will stand out.
Job Type: Full-time
Pay: $120,000.00 - $160,000.00 per year

Benefits:

401(k)
Dental insurance
Flexible schedule
Flexible spending account
Health insurance
Life insurance
Paid time off
Vision insurance

Compensation Package:

Performance bonus

Schedule:

8 hour shift

Work Location: Hybrid remote in Reston, VA 2019

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities

Design, deploy, and manage highly available, distributed systems in cloud environments (AWS, Azure, or GCP)
Implement and manage Infrastructure as Code (IaC) using tools like Terraform and Ansible
Build, maintain, and optimize CI/CD pipelines to ensure safe and efficient application delivery
Monitor application and system performance, availability, and security using modern observability stacks
Develop automated remediation workflows for common operational issues
Participate in capacity planning, cost optimization, and cloud resource management
Maintain documentation for systems, processes, and best practices
Lead and support incident response efforts, root cause analysis, and postmortems
Champion reliability-focused practices such as chaos engineering and failure injection
Collaborate with compliance and security teams to meet FedRAMP, CMMC, and NIST 800-53 requirements
Provide mentorship and technical guidance to junior engineers