Software Engineer, Staff (Systems)

at  dMatrix

Santa Clara, California, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate20 Jan, 2025USD 230000 Annual21 Oct, 2024N/AEnterprise,Mobile Operators,Google,Intel,Mixed Signal,Facebook,Computing,Cisco,Microsoft,NokiaNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

d-Matrix has fundamentally changed the physics of memory-compute integration with our digital in-memory compute (DIMC) engine. The “holy grail” of AI compute has been to break through the memory wall to minimize data movements. We’ve achieved this with a first-of-its-kind DIMC engine. Having secured over $154M, $110M in our Series B offering, d-Matrix is poised to advance Large Language Models to scale Generative inference acceleration with our chiplets and In-Memory compute approach. We are on track to deliver our first commercial product in 2024. We are poised to meet the energy and performance demands of these Large Language Models. The company has 100+ employees across Silicon Valley, Sydney and Bengaluru.

LOCATION:

Working onsite at our Santa Clara, CA headquarters 3 days per week - Hybrid.
We are seeking a skilled and experienced Staff Software Engineer to contribute to the architecture and development of our next-generation AI inference runtime. The ideal candidate will have a strong background in C++ development, with experience in building distributed systems or high-performance computing (HPC) applications. Familiarity with PyTorch internals or similar machine learning frameworks is a significant advantage.

Responsibilities:

  • Architect and Develop: Lead the design and implementation of a high-performance inference runtime that leverages d-Matrix’s advanced hardware capabilities.
  • Integrate Frameworks: Integrate the inference runtime with PyTorch to enable upstream software capabilities like inference and finetuning.
  • Collaborate: Work closely with cross-functional teams including hardware engineers, data scientists, and product managers to define requirements and deliver integrated solutions.
  • Optimize Performance: Develop and implement optimization techniques to ensure low latency and high throughput in distributed and HPC environments.
  • Code Quality: Ensure the code quality, and performance through rigorous testing and code reviews.
  • Documentation: Create technical documentation to support development, deployment, and maintenance activities.


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Computer Software/Engineering

IT Software - System Programming

Software Engineering

Graduate

Proficient

1

Santa Clara, CA, USA