Job Title: Senior Systems Engineer - SRE
Compensation: up to $75/hr W2
Work Location: Bethesda, MD
Work Mode: Hybrid (prefer one to three days a week onsite)
POSITION SUMMARY
The Senior Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem services that support millions of customers globally. This role involves overseeing incident management, driving automation efforts, and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. The engineer partners closely with Product teams, Applications teams, Infrastructure, and the broader Applications and Infrastructure Delivery teams to develop key metrics and KPIs that improve application stability, availability, and performance. The ideal candidate will bring strong communication skills, collaborating with key stakeholders across the company to optimize cloud infrastructure and uphold the highest standards of operational excellence in a dynamic, fast-paced environment.
SKILLS AND EXPERIENCE
- 8-10 years’ experience in information technology process and/or technical project management including:
- 4+ years of experience as a Site Reliability Engineer (SRE), building and managing highly available, mission-critical systems, with 3+ years of experience on public cloud, preferably AWS
- Expertise in enterprise storage platforms (e.g., NetApp, Dell EMC, Isilon, Unity, Pure Storage, Pure Cloud Block Store)
- Expertise in cloud storage platforms (e.g., EBS, S3, Azure Blob, Amazon FSx, FSx for NetApp ONTAP)
- Deep knowledge of enterprise backup technologies (e.g., Commvault, Rubrik, Veeam, Veritas)
- Deep knowledge of cloud-native backups (e.g., AWS Backup, Azure Backup)
- Strong scripting skills (Python, Shell, PowerShell).
- Familiarity with Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
- Monitoring and observability experience using Prometheus, Grafana, ELK Stack, or similar.
- Proven automation and programming experience in one or more of the following languages: Java, Python, Go, Perl, Bash.
- Deep understanding of SRE practices such as Service Level Objectives, Error Budgets, Toil Management, Observability & Monitoring, Blameless Postmortems, Incident Response Process, Capacity Planning.
- Exposure to cloud-native, relational, and NoSQL databases such as RDS, MySQL, PostgreSQL, Cassandra, or Couchbase preferred.
- Experience with deploying, monitoring, and troubleshooting large-scale, distributed applications in cloud environments such as AWS.
- Experience in vulnerability assessment, patching, and security compliance of infrastructure, storage, and backup.
- Experience in setting up disaster recovery (DR) using approved storage and backup technologies.
- Familiarity with security frameworks such as ISO 27001, SOC 2, PCI DSS, and/or HIPAA.
- Experience working with SaaS, IaaS, and PaaS offerings.
- Ability to work with global teams located in the US and India.
- 6+ years of experience in a technical discipline role, including planning, implementing, and evaluating processes, systems, and/or initiatives.
- Broad technical acumen across multiple disciplines and applications, with a solid understanding of current technologies.
- Experience applying measurement processes/methods for assessing program outputs and outcomes or progress toward goals and objectives.
- Extremely high level of analytical ability with complex problems.
- Ability to work across organizational boundaries, to help lead and influence change.
- Ability to command the process across all levels to ensure customer focus, including being assertive and self-starting.
- Demonstrated leadership experience in influencing and garnering alignment from external organizations.
- Ability to align change management strategies with projects.
- Skilled in conceptualizing creative solutions, documenting them, and presenting/selling them to senior management.
- Very high level of interpersonal skills to work effectively with others, motivate employees, and elicit work output in a team environment.
EDUCATION AND CERTIFICATIONS
- Undergraduate degree in Computer Science or related technical field or equivalent experience/certification
Job Types: Full-time, Contract
Pay: $60.78 - $75.00 per hour
Expected hours: 40 per week
Benefits:
- 401(k)
- Dental insurance
- Health insurance
- Vision insurance
Experience:
- Senior Systems Engineer: 8 years (Required)
Work Location: In person