Site Reliability Engineer
at Capital on Tap
London, England, United Kingdom -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 11 Aug, 2024 | GBP 250 Annual | 12 May, 2024 | N/A | Good communication skills | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
Running a business is hard, we make it easier. We provide an all-in-one business credit card & spend management platform, built for SMEs. Over 200,000 small businesses have spent more than £8 billion on their Capital on Tap Business Credit Cards.
Responsibilities:
THE ROLE
Our Site Reliability Engineers work closely with our Platform and Engineering teams to ensure our applications are designed and built with reliability and speed in mind as well as ensuring our application infrastructure is robust and scalable.
As a Site Reliability Engineer at Capital on Tap you will be responsible for designing, building, and monitoring systems to maximise platforms uptime and efficiency for the best possible end-user experience. You are also tasked with identifying and resolving potential outages and performance issues before they become a problem.
RESPONSIBILITIES
- Manage Azure services and resources, Cloudflare edge security, traffic management in code.
- Create, manage and monitor development resources within Kubernetes clusters and Serverless (i.e. Function Apps, Automation Accounts) for Product Engineering Teams.
- Own Terraform / Ansible / Pulumi Infrastructure as Code for each Product Engineering team.
- Continuously identify opportunities for improvement in systems, processes, and technologies, and implement changes to improve the overall reliability and performance of the platforms.
- Improve monitoring to provide insights to uptime and availability, and work towards the agreed SLO.
- Work with the Product team to identify the company SLA and objectives for all core services/applications.
- Work with Platform Engineers to deliver end-to-end automated solutions and pipelines.
- Work with software developers and stakeholders to improve the user experience through pipeline management and infrastructure improvements.
- Proactively support Platform services and tooling (TeamCity, Octopus, Azure DevOps & more to come)
- Improve reliability, quality, and time-to-market of our suite of software solutions. Through solutions such as load testing, chaos engineering and improved deployment strategies.
- Own and lead the troubleshooting of incidents that impact the customer experience.
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Application Programming / Maintenance
Software Engineering
Graduate
Proficient
1
London, United Kingdom