Software Engineer (SRE) at Jobgether
United States
Full Time


Start Date

Immediate

Expiry Date

22 Feb, 26

Salary

0.0

Posted On

24 Nov, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Software Engineering, Site Reliability Engineering, Cloud Platforms, Kubernetes, Automation, AI, Incident Response, Root Cause Analysis, Microservices, Security, Compliance, Mentorship, Collaboration, Observability, Developer Experience, Healthcare Technology

Industry

Internet Marketplace Platforms

Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Software Engineer (SRE) in the United States.

This role offers the opportunity to combine software engineering expertise with site reliability principles to build highly resilient, scalable, and secure systems. You will play a central role in shaping the infrastructure that powers critical AI-driven applications, ensuring optimal performance, availability, and reliability. The position involves driving automation, leveraging AI for proactive monitoring, and managing cloud-native microservices platforms at scale. You will collaborate closely with engineering, product, and operations teams, mentoring peers and contributing to best practices in SRE. This is a high-impact, hands-on role that allows you to innovate while ensuring that systems operate smoothly and efficiently. You will have the chance to directly influence the reliability of platforms that improve workflows for end-users.

Accountabilities:
Lead the design, implementation, and operation of scalable, cloud-native infrastructure and microservices platforms.
Develop automation, tooling, and services to enhance operational efficiency, system observability, and developer experience.
Drive AI-powered SRE initiatives for anomaly detection, predictive capacity planning, incident response, and automated remediation.
Take ownership of production incidents, perform root cause analysis, and implement preventative measures to improve uptime and performance.
Manage Kubernetes-based deployments, ensuring reliable resource utilization, seamless scaling, and robust system resilience.
Embed security and compliance best practices into infrastructure, supporting HIPAA, SOC 2, and other regulatory requirements.
Collaborate with cross-functional teams, mentor junior engineers, and promote a culture of reliability, automation, and shared ownership.

Requirements:
Strong backend software engineering experience in Python, Go, Java, or similar languages.
Hands-on experience with cloud platforms, particularly GCP, and production Kubernetes environments.
Demonstrated passion for Site Reliability Engineering principles and an automation-first mindset.
Proven ability to troubleshoot complex distributed systems and drive incident resolution effectively.
Knowledge of AI applications in operational workflows is a plus.
Excellent communication, collaboration, and mentorship skills.
Familiarity with security, compliance, and monitoring standards in enterprise environments.
Preferred: experience in healthcare technology, microservices architecture, and AI-driven operational tooling.

Benefits:
Competitive base salary with additional variable targets and equity opportunities.
Fully remote work flexibility with potential for hybrid arrangements.
Opportunities to lead high-impact infrastructure and reliability projects.
Professional growth, mentorship, and knowledge-sharing within a collaborative team.
Comprehensive benefits package including medical, dental, vision, and retirement plans.
Culture of innovation, inclusion, and mission-driven impact.

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching. When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the three candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias, focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team. Thank you for your interest!