Principal Systems Engineer at Harris Computer
Singapore, , Singapore -
Full Time


Start Date

Immediate

Expiry Date

08 Sep, 26

Salary

0.0

Posted On

10 Jun, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Site Reliability Engineering, Azure, SQL Server, Windows Server, Infrastructure as Code, Monitoring & Observability, PowerShell, Python, Kubernetes, CI/CD, ITIL, Incident Management

Industry

Software Development

Description
Site Reliability Engineer (SRE) - Remote Overview As a Site Reliability Engineer (SRE) at Altera, you will be responsible for ensuring the reliability, scalability, and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability, automate operations, and improve the customer experience. You will act as a technical leader in monitoring, troubleshooting, incident response, and continuous improvement across our cloud and hybrid environments. Key Responsibilities Maintain and improve the reliability, availability, and performance of our production environments. Lead the investigation and resolution of complex application, database, and infrastructure issues. Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences. Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments. Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers. Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency. Partner with engineering and cloud teams to refine deployment, monitoring, and support processes. Provide technical leadership during major incidents and act as a key escalation point for critical issues. Qualifications Experience: 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments. Monitoring & Observability: Strong experience with APM tools such as LogicMonitor, AppDynamics, Azure Monitor, SentryOne, Dynatrace, Datadog, or New Relic. Microsoft Stack: Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon. Database Skills: Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups. Cloud & Networking: Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls). ITSM & ITIL: Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles. Preferred Skills: Scripting with PowerShell, Python, or similar languages. Infrastructure as Code (Terraform, ARM Templates, Bicep). CI/CD pipelines and deployment automation (Azure DevOps, GitHub Actions). Experience with Kubernetes and containerized workloads. Experience implementing SLOs, SLIs, and Error Budgets. Experience in a healthcare technology or patient care environment. Education: Bachelor's Degree in Computer Science, Information Technology, or Engineering is preferred; equivalent professional experience will be considered. Working Arrangements This is a remote position open to candidates within the United States. You will participate in an on-call rotation to support our 24x7 healthcare environment. Occasional after-hours work is required for activations, upgrades, and major incidents. Travel Travel is not a requirement for this role. Why Altera? At Altera Digital Health, you will have the opportunity to profoundly impact the lives of patients by empowering healthcare providers to deliver superior care. You will join a passionate and gifted team committed to innovation and excellence. We offer a competitive compensation and benefits package and the opportunity to work in a fast-paced and dynamic environment. At Harris, we believe great people build great software. We offer an environment where employees are empowered to make a real impact, grow their skills, and shape their careers. Our teams enjoy a supportive, award‑winning culture, a casual and collaborative workplace, and opportunities to learn from a diverse group of businesses and industries. We are financially strong and proudly part of Constellation Software Inc. (CSI), the largest software company in Canada, providing long‑term stability alongside entrepreneurial autonomy. In addition to a competitive compensation and benefits package, we offer meaningful perks, flexibility, and—most importantly—a culture that values people, curiosity, and having fun while doing great work. Follow us on LinkedIn to learn more about our culture, values, and career opportunities. Harris is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability. Applicants who require a reasonable accommodation due to a disability may contact us by email at HarrisTalentAcquisition@harriscomputer.com. Accommodation requests may be made at any time. This email address is dedicated solely to accommodation requests and cannot be used to inquire about application status. Know Your Rights Poster EO 13496: Notification of Employee Rights under Federal Labor Laws Our commitment to fair and equitable hiring. As part of our recruitment process, we use artificial intelligence (AI) tools during the initial screening phase to help identify candidates whose qualifications most closely align with the requirements of the role. This technology supports efficiency and consistency in the early stages, but it never replaces human judgement. All subsequent evaluations and final hiring decisions are made by our recruitment professionals. AI does not make final hiring decisions.
Responsibilities
Ensure the reliability, scalability, and performance of hosted healthcare platforms through automation and proactive monitoring. Lead incident response, root cause analysis, and the definition of SLIs and SLOs to maintain service availability.
Loading...