Senior DevOps Engineer at Credit Genie

New York, New York, USA -

Full Time

Start Date

Immediate

Expiry Date

05 Jul, 25

Salary

225000.0

Posted On

05 Apr, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Azure, Devops, Docker, Compliance Regulations, Cloud Computing, Bash, Automation, Containerization, Python, Root, Incident Response, Logging, Aws

Industry

Information Technology/IT

Description

COMPANY

Credit Genie is a mobile-first financial wellness platform designed to help individuals take control of their financial future. We leverage artificial intelligence to provide personalized insights and are building a financial ecosystem by offering tools and services that provide instant access to cash, and building credit. Our goal is to empower every customer to achieve long-term financial stability.
Founded in 2019 by Ed Harycki, former Swift Capital Founder (acquired by PayPal 2017). Backed by Khosla Ventures and led by industry pioneers from companies such as; PayPal, Square, and Cash App, we are well positioned to build the future of inclusive finance through cutting-edge technology and customer-centric solutions.
We are seeking a DevOps / Site Reliability Engineer (SRE) to design, build, and maintain scalable, reliable, and secure cloud infrastructure. This role will be instrumental in automating deployments, optimizing performance, and improving system resilience, working closely with engineering, security, data and AI teams to enable seamless operations.

What you’ll do

Design, implement, and manage cloud-based infrastructure (AWS) to ensure scalability and resilience.
Implement robust incident response strategies.
Define and monitor SLOs, and SLAs to ensure alignment with business goals and user expectations; leverage insights from these metrics to improve reliability and inform strategic decisions.
Monitor and improve system reliability, availability, and performance, implementing best practices for high uptime.
Build and maintain CI/CD pipelines to enable fast and secure deployments for our engineering, data and AI/ML Teams.
Implement observability tools, including monitoring, logging, and alerting solutions, to proactively identify and resolve issues.
Automate infrastructure provisioning, configuration management, and deployments using AWS CDK and Terraform, or similar tools.
Collaborate with our security team to enforce best practices across infrastructure, including IAM, encryption, vulnerability scanning, and incident response planning.
Work with security and compliance teams to ensure adherence to regulatory requirements.
Conduct disaster recovery and business continuity planning, ensuring rapid response and system recovery.

REQUIREMENTS

5+ years of experience in DevOps, SRE, or cloud infrastructure roles.
Strong expertise in cloud computing (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and automation (Terraform, AWS CDK or equivalent).
Strong knowledge of Linux systems, networking, and security best practices.
Proficiency in monitoring, logging, and alerting tools such as Prometheus, Grafana, ELK, or Datadog.
Experience with incident response, root cause analysis, and performance optimization.
Familiarity with fintech security and compliance regulations is a plus.
Strong scripting or programming skills (Python, Go, or Bash) for automation.

Responsibilities

Design, implement, and manage cloud-based infrastructure (AWS) to ensure scalability and resilience.
Implement robust incident response strategies.
Define and monitor SLOs, and SLAs to ensure alignment with business goals and user expectations; leverage insights from these metrics to improve reliability and inform strategic decisions.
Monitor and improve system reliability, availability, and performance, implementing best practices for high uptime.
Build and maintain CI/CD pipelines to enable fast and secure deployments for our engineering, data and AI/ML Teams.
Implement observability tools, including monitoring, logging, and alerting solutions, to proactively identify and resolve issues.
Automate infrastructure provisioning, configuration management, and deployments using AWS CDK and Terraform, or similar tools.
Collaborate with our security team to enforce best practices across infrastructure, including IAM, encryption, vulnerability scanning, and incident response planning.
Work with security and compliance teams to ensure adherence to regulatory requirements.
Conduct disaster recovery and business continuity planning, ensuring rapid response and system recovery