Software Engineer - SRE Focused at Alpheya

Bengaluru, karnataka, India -

Full Time

Start Date

Immediate

Expiry Date

22 Mar, 26

Salary

0.0

Posted On

22 Dec, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Node.js, Go, Python, Site Reliability Engineering, Debugging, APIs, Microservices, Kubernetes, Docker, CI/CD, GitOps, PostgreSQL, MySQL, Problem-Solving, Observability, OpenTelemetry

Industry

Financial Services

Description

We are a B2B WealthTech startup based in Abu Dhabi and backed by BNY Mellon (America’s oldest bank and first company to list on NYSE) and Lunate (a new $50B AUM alternative asset management firm based in Abu Dhabi, UAE). The company has raised $300M to build a state of the art wealth technology platform. Our mission is to power and grow our clients’ Wealth franchises through differentiated experiences, financial solutions, and insights. Our digital wealth management platform- will enable banks and other financial institutions in the Middle East to grow and further penetrate affluent, HNW and UHNW investor segments. While still leveraging the capabilities and knowledge of large organizations, our fintech is a startup with truly cross-functional and agile teams. For more information, please visit www.alpheya.com Role We are looking for Software Engineers with a strong foundation in backend development (Node.js, Go, or Python) and an interest in Site Reliability Engineering. You will work on debugging production application issues, adding observability, and collaborating with Engineering to ensure system reliability. Debug and resolve production issues in APIs, workers, and data processors. Read and understand existing Node.js / Go codebases to trace errors. Contribute small features, bug fixes, and configs to improve application reliability. Add observability hooks (metrics, logging, tracing) with OpenTelemetry. Work closely with engineers to deploy fixes and enhancements via GitOps. Participate in on-call rotation for production support (Alinma, Pershing clients). Document and build runbooks for recurring production issues. Proficiency in Node.js, Go, or Python (at least one strong). Understanding of REST/GraphQL APIs, microservices architecture. Ability to debug stack traces, logs, runtime errors, SQL queries. Familiarity with Kubernetes, Docker. Exposure to CI/CD (GitHub Actions, Jenkins) and GitOps (FluxCD, ArgoCD). Knowledge of PostgreSQL / MySQL. Strong debugging and problem-solving skills. Good to Have Experience with Temporal workflows or distributed systems. Prior exposure to observability stacks (Prometheus, Grafana, Loki, Tempo). Interest in transitioning towards SRE/Platform engineering.

Responsibilities

The role involves debugging production application issues, adding observability, and collaborating with engineering teams to ensure system reliability. You will also contribute to small features and bug fixes to improve application reliability.