Principal Software Engineer - AFD at Microsoft
Sydney, New South Wales, Australia -
Full Time


Start Date

Immediate

Expiry Date

24 Feb, 26

Salary

0.0

Posted On

26 Nov, 25

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Software Engineering, Distributed Systems, Networking, C/C++, Rust, Go, TCP/UDP, DNS, TLS, HTTP, Kubernetes, CI/CD, DDoS Mitigation, Cloud Platforms, Observability, Cryptography

Industry

Software Development

Description
Architect and build internet-scale, low-latency edge services (proxies, load balancers, TLS offload, routing pipelines, caching layers) across hundreds of global sites and thousands of nodes. Design and build services that provide L4/L7 DDoS protection, HTTP-level CDN, global load balancing, and WAF capabilities. Lead reliability by design: champion SLOs, error budgets, and graceful degradation patterns; instrument systems end-to-end (metrics/traces/logs), drive telemetry-driven engineering and automated mitigations. Lead identification of dependencies and development of design documents for products, applications, services, or platforms. Mentor engineers and lead by example to produce extensible and maintainable code used across products. Own live-site for AFD services: participate in DRI/on-call, guide incident response, lead post-incident reviews, and convert findings into systemic fixes and automation. Proactively seek new knowledge and adapt to trends, technical solutions, and patterns that improve availability, reliability, efficiency, observability, and performance at scale. Required Bachelor's degree in computer science, or related technical discipline AND 10+ years building and operating distributed systems or networking platforms in production; track record of delivering high‑throughput, low‑latency services. Strong systems programming proficiency in C/C++ and/or Rust (Go acceptable) with deep understanding of memory, concurrency, async I/O, and performance profiling (perf/eBPF/flamegraphs). Expertise in networking & protocols: TCP/UDP, DNS, TLS, HTTP/1.1-3, QUIC; load balancing algorithms; congestion control; connection pooling; keep‑alive; retry/backoff. Linux fundamentals (kernel & networking stack), containerization/orchestration (Kubernetes), CI/CD, safe releases, and observability (metrics/traces/logs). Proven live‑site leadership: incident command, debugging in production, mitigation design, and preventative engineering at scale. Experience building services on cloud platforms such as Azure or AWS. Master's Degree in Computer Science or related technical field AND 8+ years of technical engineering experience with coding in languages including C, C++, C#, Rust, Java, JavaScript, or Python. OR Bachelor's Degree in Computer Science or related technical field AND 10+ years of technical engineering experience with coding in the above languages. CDN/edge proxy experience (e.g., NGINX/Envoy/ATS) or custom proxy pipelines. DDoS mitigation (L3/4 and L7), rate limiting, abuse detection, traffic shaping, and resiliency patterns (hedged requests, circuit breakers). At‑scale telemetry systems (e.g., Kusto/Geneva), capacity planning, and performance/COGS optimization. Cryptography/TLS internals, certificate lifecycle, HSM/KMS integrations. Experience with Azure networking/compute primitives and large multi‑tenant services.
Responsibilities
Architect and build internet-scale, low-latency edge services across global sites. Own live-site for AFD services, guiding incident response and leading post-incident reviews.
Loading...