Senior Engineer - Cloud Development at Graphcore
Bristol, England, United Kingdom
Full Time


Start Date

Immediate

Expiry Date

07 Dec, 25

Salary

0.0

Posted On

08 Sep, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

System Administration, Docker, Resource Management, Code, IT, Virtual Networks, Cloud Services, Kubernetes

Industry

Computer Software/Engineering

Description

ABOUT GRAPHCORE

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.

JOB SUMMARY

We are looking for a Senior Engineer to join our Cloud Development Team. Working closely with our colleagues in Platform Engineering, Datacentre Operations and Product Development teams, you will help us provide services on our fleet of cutting-edge AI systems. As part of our Platform Engineering organisation, you will be involved in the cloud integration, validation, performance benchmarking, optimisation, and development of our high-performance AI solutions. These include in-house AI systems alongside off-the-shelf high-performance servers, switches and storage solutions. This is a hands-on role requiring a solid background in infrastructure, cloud deployment using Infrastructure-as-Code, and high-performance networking and storage systems. You may have been working in an IT organisation, a datacentre, a cloud provider or as a developer of orchestration or cloud components.

SKILLS AND EXPERIENCE

The ideal candidate will be experienced in software engineering or IT, with a proven track record of delivering technical output as an individual contributor. Proven Linux scripting and system administration ability is essential, as is a hands-on understanding of the technologies underpinning cloud services, virtual networks, resource management and monitoring. They will have experience with, or working knowledge of, Infrastructure-as-Code automation and deployments, as well as container deployment and management using Docker, Podman or Kubernetes.
We are looking for a candidate who brings experience with version control systems, monitoring and observability, and Continuous Integration or testing pipelines. They should also have some knowledge of continuous system management concepts and tools.

Responsibilities
  • Work with the system architecture and engineering teams to develop complete cloud-ready AI solutions based on Graphcore’s next-generation AI products.
  • Work with our Datacentre Operations Engineers to maintain the fleet of AI systems at peak performance in our private clouds.
  • Operate and extend existing OpenStack cloud services and contribute to the deployment and development of new ones.
  • Support internal end-users with application services on our private clouds.
  • Configure and test new Graphcore AI hardware and systems using Infrastructure-as-Code as they are deployed in internal and external datacentres.
  • Provide performance statistics for internal systems and report any issues clearly. Work with users to pass clear information about any issues to the Engineering and QA departments.
  • Drive corrective actions for systems that are not operating correctly, working with DC operations, Engineering and datacentres as required.