Site Reliability Engineer at stackArmor Inc
Reston, VA 20190, USA -
Full Time


Start Date

Immediate

Expiry Date

07 Sep, 25

Salary

160000.0

Posted On

08 Jun, 25

Experience

3 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Automation, Computer Science, Logging, Docker, Health Insurance, Vision Insurance, Python, Bash, Jenkins, Flexible Schedule, Life Insurance, Nist, Dental Insurance, Azure, Scripting, Devops, Aws, Kubernetes, Reliability Engineering

Industry

Information Technology/IT

Description

ABOUT US:

stackArmor is a fast-growing cloud services company focused on secure and compliant digital transformation for government, healthcare, and regulated industries. We deliver DevSecOps, cloud engineering, and cybersecurity solutions aligned with frameworks like FedRAMP, CMMC, and NIST.

POSITION OVERVIEW:

We’re looking for a Site Reliability Engineer (SRE) to help build and maintain secure, reliable, and scalable infrastructure that powers mission-critical workloads. You’ll work closely with cloud engineers, security analysts, and developers to drive automation, performance, and resilience across our platforms.

REQUIRED QUALIFICATIONS:

  • Bachelor’s degree in Computer Science, Engineering, or equivalent experience
  • 3+ years in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles
  • Solid hands-on experience with AWS, Azure, or GCP (AWS preferred)
  • Experience with Kubernetes, Docker, and container orchestration in production environments
  • Proficient with scripting and automation (Python, Bash, Go, or similar)
  • Experience with CI/CD tools (GitHub Actions, Jenkins, GitLab CI, or similar)
  • Strong understanding of Linux administration, networking, and cloud security principles
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK, Datadog)

PREFERRED QUALIFICATIONS:

  • Hands-on experience working in compliance-driven environments (FedRAMP, NIST, CMMC, HIPAA)
  • Familiarity with continuous compliance and cloud security posture management (CSPM) tools
  • Experience with serverless architecture and service mesh (e.g., AWS Lambda, Istio)
  • Experience implementing backup, disaster recovery, and business continuity plans
  • Knowledge of zero-trust architecture, IAM policies, and security hardening
  • Cloud certifications such as AWS Certified DevOps Engineer, GCP Professional SRE, or Azure DevOps Engineer
  • Exposure to incident management platforms (e.g., PagerDuty, Opsgenie)
    The ideal candidate is a proactive problem-solver with a passion for building reliable and secure cloud infrastructure. They thrive in fast-paced environments, have hands-on experience with modern DevOps and SRE tools, and are comfortable working independently or as part of a cross-functional team. A strong foundation in cloud platforms (especially AWS), container orchestration (Kubernetes), and automation is essential. Candidates who have worked in compliance-driven environments and bring a security-first mindset will stand out.
    Job Type: Full-time
    Pay: $120,000.00 - $160,000.00 per year

Benefits:

  • 401(k)
  • Dental insurance
  • Flexible schedule
  • Flexible spending account
  • Health insurance
  • Life insurance
  • Paid time off
  • Vision insurance

Compensation Package:

  • Performance bonus

Schedule:

  • 8 hour shift

Work Location: Hybrid remote in Reston, VA 2019

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities
  • Design, deploy, and manage highly available, distributed systems in cloud environments (AWS, Azure, or GCP)
  • Implement and manage Infrastructure as Code (IaC) using tools like Terraform and Ansible
  • Build, maintain, and optimize CI/CD pipelines to ensure safe and efficient application delivery
  • Monitor application and system performance, availability, and security using modern observability stacks
  • Develop automated remediation workflows for common operational issues
  • Participate in capacity planning, cost optimization, and cloud resource management
  • Maintain documentation for systems, processes, and best practices
  • Lead and support incident response efforts, root cause analysis, and postmortems
  • Champion reliability-focused practices such as chaos engineering and failure injection
  • Collaborate with compliance and security teams to meet FedRAMP, CMMC, and NIST 800-53 requirements
  • Provide mentorship and technical guidance to junior engineers
Loading...