Principal Software Engineer at Microsoft
Atlanta, Georgia, United States -
Full Time


Start Date

Immediate

Expiry Date

20 Feb, 26

Salary

0.0

Posted On

22 Nov, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Distributed Systems, Reliability Engineering, Observability, Telemetry, Incident Management, Change Management, Automation, Cloud Systems, Microservices, Containers, Kubernetes, Linux, Programming, AI Technologies, Data Technologies, Azure

Industry

Software Development

Description
Design and implement world-class distributed systems to support billions of users worldwide. Improve the reliability and resilience of key Azure products. Define and maintain system reliability goals through Service Level Objectives (SLOs). Enhance production systems with improvements in observability, telemetry, alerting, incident and change management, and deployment safety. Build reusable automation and scalable processes to support multiple engineering teams in achieving their reliability goals. Influence product architecture and roadmap to embed reliability as a core design principle. Contribute directly to product code to drive reliability improvements. Leverage AI technologies to detect anomalies, predict incidents, and automate operational workflows at scale. Design and develop secure, modular, reliable, testable, and observable distributed services and solutions. Collaborate with internal and external stakeholders to align efforts and deliver cohesive outcomes. Drive continuous improvement in engineering processes and codebases. Develop automation solutions to prevent or resolve service issues before they impact users. Apply AI tools and techniques to reduce operational toil and scale practices across complex environments. Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience. 5+ years of experience analyzing modern distributed software design patterns and cloud systems architecture, including microservices, containers, load-balancing, queuing, caching. 5+ years of experience in building, shipping and operating reliable solutions. Familiarity with modern distributed software design patterns and cloud systems architecture, including microservices, containers, load balancing, queuing, caching. Experience as a technical lead or engineering manager. Experience working on large and unfamiliar codebases (millions of lines of code). Experience with open-source projects, Kubernetes, Linux and containers is desired. Proven track record in building, shipping, and operating reliable solutions. Proficiency in programming languages like C#/Java/Python. Experience with data technologies (SQL/NoSQL/etc. ). Experience with Azure is a plus. Experience in AI adoption with tools like GitHub Copilot, Azure OpenAI and custom copilots to streamline development and reduce toil.
Responsibilities
Design and implement distributed systems to support billions of users while improving the reliability of Azure products. Collaborate with stakeholders to enhance production systems and drive continuous improvement in engineering processes.
Loading...