Start Date
Immediate
Expiry Date
25 Nov, 24
Salary
0.0
Posted On
29 Aug, 24
Experience
0 year(s) or above
Remote Job
No
Telecommute
No
Sponsor Visa
No
Skills
Tuning, RHEL, Troubleshooting, RDMA, Visio, Network Technologies, Security, Cluster Management, Design, PowerPoint, InfiniBand, CentOS, Network Architecture, Ethernet, Excel, Cisco, Interpersonal Skills, Linux, Network Performance, Working Experience, Technical Services
Industry
Information Technology/IT
WHAT YOU’LL BRING/POSITION REQUIREMENTS:
The candidate is expected to provide onsite technical services for HPC/AI cluster solutions and network architecture. This person will be responsible for designing and implementing HPC solutions at customer sites, with a strong focus on network technologies and additional coverage of storage, power and cooling, OS, and cluster management.
Knowledge and working experience in the following technologies/areas:
Strong focus on network technology: design, deployment, and configuration.
Demonstrated ability to optimize network performance, including tuning and troubleshooting.
Strong understanding of networking concepts and technologies relevant to HPC, including InfiniBand, Ethernet, and RDMA.
Familiarity with security best practices to protect these environments.
In-depth knowledge of network and HPC hardware.
The job responsibilities also involve providing knowledge transfer, troubleshooting, resolving, and advising on the mentioned infrastructure and technologies.
The applicant must possess strong customer interaction skills and the ability to make technical decisions. The role involves collaborating on projects with several organizational verticals, partners, and customers, and developing training and knowledge base documentation for technical skills throughout their career.
SKILLS REQUIRED:
Network benchmarking experience on both Ethernet and InfiniBand (TCP/IP and verbs).
Strong knowledge of Cisco and Mellanox Ethernet switches.
Understanding of routing protocols such as BGP or OSPF.
Deep understanding of InfiniBand, including different InfiniBand topologies, partitioning, and QoS.
Ability to automate and script deployment processes for repeatability and standardization.
Strong knowledge of Linux (e.g. SUSE, RHEL, and CentOS).
Knowledge of HPC cluster managers and job schedulers.
Experience in HPC system troubleshooting and support.
Good knowledge of Visio, Excel and PowerPoint.
Very good hands-on technical skills and problem-solving skills.
Excellent communication and interpersonal skills.
Proficiency in English, both spoken and written.
Please refer to the job description for details.