Senior Engineer - Cloud Development

at  graphcore

Bristol, England, United Kingdom -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate24 Dec, 2024Not Specified27 Sep, 2024N/AResource Management,Kubernetes,Docker,System Administration,Presentation Skills,Critical Infrastructure,Virtual Networks,Cloud ServicesNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

ABOUT GRAPHCORE

How often do you get the chance to build a technology that transforms the future of humanity? Graphcore products have set the standard in made-for-AI compute hardware and software, gaining global attention and industry acclaim. Now we are developing the next generation of artificial intelligence compute with systems that will allow AI researchers to develop more sophisticated models, help scientists unlock exciting new discoveries, and power companies around the world as they put AI at the heart of their business. We recently joined SoftBank Group, bringing large and ongoing investment from one of the world’s leading backers of innovative AI companies.

SKILLS AND EXPERIENCE

The ideal candidate will bring extensive software engineering experience with a proven track record of delivering technical output as an individual contributor. Proven Linux scripting ability and system administration, as well as a hands-on understanding of the technologies underpinning cloud services, virtual networks, resource management and monitoring are essential. They will have experience with OpenStack deployments or the technologies they rely on, as well as container deployment and management using Docker, Podman or Kubernetes. We are looking to find a candidate who is a confident user of version control system, who brings experience with Continuous Integration or testing pipelines and solutions. They should also have some knowledge of continuous system management concepts and tools.
On top of these technical skills, we would like to identify candidates who are able to work independently on critical infrastructure who maintain a focus on end-user availability. They should understand how to prioritise, as well as assess risk, issues, impacts and constraints, and have strong communication and presentation skills.

Responsibilities:

THE ROLE

We are looking for a Senior Engineer to join our Cloud Development Team. Working closely with our colleagues in Platform Engineering, Datacentre Operations and Product Development, you will optimise our fleet of groundbreaking AI systems. As part of our Platform Engineering organisation, you will be involved in the cloud integration, validation, performance benchmarking, optimisation, and development of our high-performance AI solutions. These include in-house AI systems alongside off-the-shelf high-performance servers, switches and storage solutions. This is a hand-on role requiring a proven background in infrastructure, cloud deployment using Infrastructure-as-Code, OpenStack and high-performance networking and storage systems. The successful candidate may have been working in an IT organisation, a datacentre, a cloud provider or as a developer of orchestration or cloud components.
The Platform Engineering team at Graphcore builds Graphcore products into large-scale AI solutions for our customers and within that, this team is responsible for providing such systems to our internal users via private clouds. Often these internal systems will be using and developing pre-release hardware and software.

RESPONSIBILITIES

  • Partner with the system architecture and engineering teams to develop complete cloud-ready AI solutions based on Graphcore’s next-generation AI products.
  • Work with our Datacentre Operations Engineers to maintain the fleet of AI systems at peak performance in our private clouds.
  • Operate and extend existing OpenStack cloud services and contribute to the deployment and development of new ones, support internal end-users with application services on our private clouds.
  • Configure and test new Graphcore AI hardware and systems using Infrastructure-as-code as they are deployed in internal and external datacentres.
  • Drive corrective actions for systems that are not operating accurately, working with DC operations, Engineering and datacentres as required.
  • Develop tested and optimised configurations for our AI Cloud Reference Design.


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Computer Software/Engineering

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Proficient

1

Bristol, United Kingdom