Site Reliability Engineer II - CTJ - Top Secret at Microsoft
Redmond, Washington, United States -
Full Time


Start Date

Immediate

Expiry Date

19 Feb, 26

Salary

0.0

Posted On

21 Nov, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Site Reliability Engineering, Automation, Deployment, Compliance, Security, Incident Response, Postmortems, Cloud Systems, Distributed Systems, Software Engineering, Network Engineering, Systems Administration, Cross-Team Collaboration, Continuous Learning, Engineering Best Practices, Service Health Monitoring

Industry

Software Development

Description
Live Site Operations: Serve as a Designated Responsible Individual (DRI) in a 24x7 on-call rotation, monitoring service health and responding to incidents within SLA timelines. Automation & Deployment: Contribute to automation efforts and validate code functionality in non-production environments to ensure smooth deployments. Compliance & Security: Support compliance processes by verifying security, privacy, and accessibility standards during onboarding of new technologies. Continuous Learning: Stay current with industry trends and internal tools to improve reliability, performance, and observability at scale. Engineering Best Practices: Apply proven development and scaling practices to meet performance and customer requirements. Cross-Team Collaboration: Communicate effectively with engineering partners to align on goals and deliver user-centric solutions. Incident Response & Postmortems: Address complex live site issues, implement mitigations, and document learnings through postmortems. Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience. Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience. 2+ years technical experience working with large-scale cloud or distributed systems.
Responsibilities
Serve as a Designated Responsible Individual (DRI) in a 24x7 on-call rotation, monitoring service health and responding to incidents. Contribute to automation efforts and validate code functionality to ensure smooth deployments and compliance with security standards.
Loading...