Site Reliability Engineer

at Capital on Tap

London, England, United Kingdom

Start Date: Immediate
Expiry Date: 11 Aug, 2024
Salary: GBP 250 Annual
Posted On: 12 May, 2024
Experience: N/A
Skills: Good communication skills
Telecommute: No
Sponsor Visa: No

Description:

Running a business is hard; we make it easier. We provide an all-in-one business credit card and spend management platform, built for SMEs. Over 200,000 small businesses have spent more than £8 billion on their Capital on Tap Business Credit Cards.

Responsibilities:

THE ROLE

Our Site Reliability Engineers work closely with our Platform and Engineering teams to ensure that our applications are designed and built with reliability and speed in mind, and that our application infrastructure is robust and scalable.
As a Site Reliability Engineer at Capital on Tap, you will be responsible for designing, building, and monitoring systems to maximise platform uptime and efficiency for the best possible end-user experience. You will also be tasked with identifying and resolving potential outages and performance issues before they become a problem.

RESPONSIBILITIES

  • Manage Azure services and resources, Cloudflare edge security, and traffic management in code.
  • Create, manage, and monitor development resources within Kubernetes clusters and serverless services (e.g. Function Apps, Automation Accounts) for Product Engineering teams.
  • Own Terraform / Ansible / Pulumi Infrastructure as Code for each Product Engineering team.
  • Continuously identify opportunities for improvement in systems, processes, and technologies, and implement changes to improve the overall reliability and performance of the platforms.
  • Improve monitoring to provide insights into uptime and availability, and work towards the agreed SLO.
  • Work with the Product team to identify the company SLA and objectives for all core services/applications.
  • Work with Platform Engineers to deliver end-to-end automated solutions and pipelines.
  • Work with software developers and stakeholders to improve the user experience through pipeline management and infrastructure improvements.
  • Proactively support Platform services and tooling (TeamCity, Octopus, Azure DevOps, and more to come).
  • Improve the reliability, quality, and time-to-market of our suite of software solutions through practices such as load testing, chaos engineering, and improved deployment strategies.
  • Own and lead the troubleshooting of incidents that impact the customer experience.


REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Proficient

1

London, United Kingdom