Solutions Engineer at NeuReality
San Jose, California, United States -
Full Time


Start Date

Immediate

Expiry Date

01 Aug, 26

Salary

0.0

Posted On

03 May, 26

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Linux, Docker, Kubernetes, Distributed systems, System-level debugging, Networking fundamentals, Cloud infrastructure, On-prem infrastructure, AWS, Azure, GCP, AI/ML infrastructure, Observability tools, Prometheus, Grafana, GPU-based systems

Industry

Semiconductor Manufacturing

Description
At NeuReality, we’re redefining AI deployment with a purpose-built inference platform that unifies silicon, system software, and orchestration enabling unmatched efficiency, performance, and scale for AI inference. We are looking for a hands-on Solutions Engineer / FAE to lead customer deployments and technical engagements, ensuring the successful integration, operation, and performance of a complex distributed platform in real-world environments. Requirements Customer-facing technical experience, including the ability to lead POCs, run demos, deploy systems, and troubleshoot live customer environments end-to-end 3+ years of hands-on experience with Linux, containers (Docker), and system-level debugging Experience with Kubernetes and distributed systems, including deployment and troubleshooting in production environments Ability to analyze system performance and troubleshoot bottlenecks, with a solid understanding of networking fundamentals (latency, throughput, resource utilization) Experience working with cloud or on-prem infrastructure environments (AWS, Azure, GCP, or data center setups) Candidates must be located in California to support collaboration within the Pacific Time Zone. How to stand out: Exposure to AI / ML infrastructure or inference workloads Hands-on experience with observability tools (e.g., Prometheus, Grafana) Familiarity with GPU-based systems or high-performance environments
Responsibilities
Lead customer deployments and technical engagements for a distributed AI inference platform. Ensure successful integration, operation, and performance of the platform in real-world environments.
Loading...