Software Development Snr Director at Oracle Risk Management Services
Seattle, Washington, United States -
Full Time


Start Date

Immediate

Expiry Date

11 Jun, 26

Salary

338500.0

Posted On

13 Mar, 26

Experience

10 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Software Development, Systems Specialist, Low Level Drivers, Firmware, Repair Systems, System Architecture, Agile Environment, Provisioning, Monitoring, Maintenance, Configuration, Validation, GPU Clusters, High Availability, Data Plane

Industry

IT Services and IT Consulting

Description
Here at OCI we’re building the world’s largest AI clusters and we’re the fastest at bringing them to market.  The AI Infrastructure organization at OCI is leading this effort.      As part of this focus on AI workloads and customers we’re building provisioning, repair, monitoring, maintenance, configuration and validation systems that enable us to deliver high quality GPU clusters to our customers and operate them with high availability.  These systems are responsible for firmware management of the GPUs, the high speed backend NICs and automated validation and for GPU servers.  Together they build the GPU specialization on top of the general compute Data plane which enables us to deliver high performing and high availability offerings to our customers.    In this role you would lead the software development organization building out and operating these systems and work with some of the largest players in the AI space building systems that operate at unprecedented speed, scale and reliability. You should be a systems specialist with exposure to low level drivers and firmware, knowledge of repair systems, able to architect broad systems interactions, while being very hands-on, able to dive deep into any part of the stack and lower-level system interactions. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn.  Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives. True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs. We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com [accommodation-request_mb@oracle.com] or by calling 1-888-404-2494 in the United States. Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities
This role involves leading the software development organization responsible for building and operating systems that manage provisioning, repair, monitoring, maintenance, configuration, and validation for high-quality GPU clusters. The focus is on enabling high-performing and high-availability offerings for AI workloads operating at unprecedented speed and scale.
Loading...