Site Reliability Engineer at Smiley Technologies

Little Rock, Arkansas, United States -

Full Time

Start Date

Immediate

Expiry Date

11 May, 26

Salary

0.0

Posted On

10 Feb, 26

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Reliability, Observability, Incident Response, CI/CD, Dynatrace, Azure, AWS, Containers, Terraform, Helm, Bash, PowerShell, REST APIs, Python, Incident Command, SLIs/SLOs/SLAs

Industry

Financial Services

Description

Description Site Reliability Engineer (SRE) Reliability, observability, and calm leadership when it matters most. Smiley Technologies powers core banking platforms used by banks and credit unions across the United States. When something goes wrong, it’s not just an alert - it’s real customers, real transactions, and real impact. We’re hiring a Site Reliability Engineer to replace a former senior SRE who recently moved into a leadership role. This is a mid-level SRE position, ideal for someone who already has hands-on SRE or Platform experience and is ready to grow into broader ownership over time. This role sits within our Platform / DevOps Engineering team and works across nearly every technical group in the organization. You’ll be hands-on with observability, incident response, CI/CD, and reliability practices - and you’ll help make life easier for engineers across Smiley. What Makes This Role Different This SRE is a key Incident Command contributor. Critical incidents are rare (only a few per year), but when they happen: You’ll run the call with clients Make sure the right people are present Keep communication clear, calm, and structured Capture notes and drive strong post-incident learning If you’re someone who stays composed under pressure and communicates clearly - this role will suit you well. What You’ll Be Doing Work cross-functionally with Network, SecOps, DevSecOps, Platform, Developers, and Support Own and evolve observability and monitoring, primarily using Dynatrace Dashboards, alerts, reporting, and adoption across teams Help teams improve root cause analysis, retrospectives, and reliability practices Support and improve CI/CD pipelines (GitHub Actions / Azure DevOps) Maintain standards and documentation across the SDLC Monitor and optimize cloud costs, infrastructure tiers, and capacity Participate in Incident Command on-call rotation Define and document SLIs, SLOs, SLAs, KPIs, and OKRs Promote both Shift Left and Shift Right reliability thinking Help Smiley continue its DevOps and Platform transformation The Technical Environment Hybrid infrastructure (on-prem + cloud) Azure-first (AWS experience welcomed) Containers: Docker, ACR, AKS Infrastructure as Code and automation-driven workflows Regulated, high-availability financial systems What We’re Looking For We care more about real experience + communication + teachability than checking every box. Core Experience 2+ years in an SRE, Platform Engineer, or DevOps role Hands-on experience with APM / observability tools Dynatrace strongly preferred Datadog, New Relic, Prometheus/Grafana also relevant Experience with Azure or AWS Experience supporting CI/CD pipelines Experience with containers (Docker, AKS) Working knowledge of Git, Terraform, Helm, Bash, PowerShell Experience supporting REST APIs Experience with .NET or Python (or similar) Additional Strengths (Nice to Have) Linux/Unix administration fundamentals Performance troubleshooting across: Applications Databases (SQL Server, DB2, PostgreSQL) Familiarity with WAFs, networking, and OWASP concepts Experience with developer portals (Backstage.io a plus) Financial services or regulated environments (helpful, not required) Why would you like to work at Smiley Technologies? Work on core banking systems where reliability truly matters Real ownership without being thrown in the deep end Strong culture of mentorship and growing people A team that values communication, learning, and integrity Opportunity to shape what SRE looks like at Smiley If you’re an SRE who wants meaningful responsibility, strong mentorship, and the chance to grow - we’d love to talk. Apply today and our recruiter will reach out with more details

Responsibilities

The Site Reliability Engineer will own and evolve observability and monitoring, primarily using Dynatrace, and will help teams improve root cause analysis, retrospectives, and overall reliability practices. This role also involves supporting and improving CI/CD pipelines and participating in the Incident Command on-call rotation.