Software Architect – AI Datacenter Orchestration Platform at DRIVENETS
Tel Aviv, Tel-Aviv District, Israel
Full Time


Start Date

Immediate

Expiry Date

23 May, 26

Salary

0.0

Posted On

22 Feb, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Architecture, Orchestration Platform, AI Datacenter, Backend Development, Python, REST APIs, GraphQL APIs, Distributed Systems, Scalability, Resilience, Performance, Observability, Databases, Schema Design, Authentication, Authorization

Industry

Software Development

Description
Software Architect – AI Datacenter Orchestration Platform
Location: Tel Aviv (Hybrid)

DriveNets is a leader in high-scale disaggregated networking solutions. Founded in 2015, DriveNets modernizes the way service providers, cloud providers and hyperscalers build networks. DriveNets supports the largest network in the world: more than half of AT&T’s backbone traffic runs on DriveNets’ Network Cloud open disaggregated architecture. Having raised $587 million in three funding rounds, DriveNets is disrupting the networking market from high-scale architecture to AI platforms, and is bringing onboard the most talented people.

The Role
We’re building a new orchestration platform for AI datacenters: software that turns a complex, multi-vendor stack of compute, networking, and accelerators into something customers can deploy, operate, and optimize with confidence. You’ll join a small founding team (growing later) where every engineer ships. This is a hands-on architect role in the truest sense: you’ll own key architectural decisions and write production code 100% of the time, side-by-side with the team. If you like shaping systems at the “whiteboard-to-production” level, and you want your decisions to run at scale in real AI datacenters, this is that kind of role.

What You’ll Do
Design and build the core of an orchestration platform spanning the full datacenter stack.
Turn real customer needs into crisp system architecture: service boundaries, workflows, APIs, data models, and failure modes.
Implement critical platform components end-to-end: from prototypes and “spikes” to hardened production services.
Define and evolve platform standards through working code: patterns, libraries, templates, test strategy, CI/CD.
Make pragmatic tradeoffs (performance vs. simplicity, consistency vs. availability) and document the “why.”
Lead design and code reviews, raising quality while keeping velocity high.

Requirements
6+ years (or equivalent) building production software, including ownership of architecture/design for non-trivial systems.
Strong backend development skills (preferably Python) and experience building REST and/or GraphQL APIs.
Distributed systems thinking: scalability, resilience, performance, observability, and operational readiness.
Working knowledge of modern web architecture (TypeScript + React preferred), enough to guide and contribute when needed.
Experience with databases (SQL and/or NoSQL), schema design, and performance considerations.
Familiarity with authentication/authorization patterns (OAuth/JWT, RBAC).
Strong communication, high ownership, and a bias toward shipping.

Nice to Have
Cloud platforms (AWS/GCP/Azure), containers, and Kubernetes.
Observability tooling (OpenTelemetry, Prometheus, Grafana, Datadog, etc.).
Security mindset (OWASP Top 10 awareness and secure coding practices).
Experience with orchestration domains (cluster managers, provisioning pipelines, schedulers).
Familiarity with AI/GPU infrastructure and/or high-performance networking.

Why This Role Is Different
You won’t be handed a mature system to maintain. You’ll help create the foundation (the architecture, the patterns, and the first production-grade components) alongside a team that moves fast, owns outcomes, and builds for real-world scale.

Responsibilities
The role involves designing and building the core of a new orchestration platform for AI datacenters, spanning the full datacenter stack, and turning customer needs into crisp system architecture, including service boundaries, workflows, APIs, data models, and failure modes. Responsibilities also include implementing critical platform components end-to-end, defining platform standards through working code, and leading design and code reviews.