Software Engineer, Staff (Systems)
at dMatrix
Santa Clara, California, USA -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 20 Jan, 2025 | USD 230000 Annual | 21 Oct, 2024 | N/A | Enterprise,Mobile Operators,Google,Intel,Mixed Signal,Facebook,Computing,Cisco,Microsoft,Nokia | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
d-Matrix has fundamentally changed the physics of memory-compute integration with our digital in-memory compute (DIMC) engine. The “holy grail” of AI compute has been to break through the memory wall to minimize data movements. We’ve achieved this with a first-of-its-kind DIMC engine. Having secured over $154M, $110M in our Series B offering, d-Matrix is poised to advance Large Language Models to scale Generative inference acceleration with our chiplets and In-Memory compute approach. We are on track to deliver our first commercial product in 2024. We are poised to meet the energy and performance demands of these Large Language Models. The company has 100+ employees across Silicon Valley, Sydney and Bengaluru.
LOCATION:
Working onsite at our Santa Clara, CA headquarters 3 days per week - Hybrid.
We are seeking a skilled and experienced Staff Software Engineer to contribute to the architecture and development of our next-generation AI inference runtime. The ideal candidate will have a strong background in C++ development, with experience in building distributed systems or high-performance computing (HPC) applications. Familiarity with PyTorch internals or similar machine learning frameworks is a significant advantage.
Responsibilities:
- Architect and Develop: Lead the design and implementation of a high-performance inference runtime that leverages d-Matrix’s advanced hardware capabilities.
- Integrate Frameworks: Integrate the inference runtime with PyTorch to enable upstream software capabilities like inference and finetuning.
- Collaborate: Work closely with cross-functional teams including hardware engineers, data scientists, and product managers to define requirements and deliver integrated solutions.
- Optimize Performance: Develop and implement optimization techniques to ensure low latency and high throughput in distributed and HPC environments.
- Code Quality: Ensure the code quality, and performance through rigorous testing and code reviews.
- Documentation: Create technical documentation to support development, deployment, and maintenance activities.
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Computer Software/Engineering
IT Software - System Programming
Software Engineering
Graduate
Proficient
1
Santa Clara, CA, USA