Site Reliability Engineer
at LoadSpring Solutions
Massachusetts, Massachusetts, USA -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 17 Dec, 2024 | USD 90000 Annual | 18 Sep, 2024 | 4 year(s) or above | Good communication skills | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
Description:
LoadSpring is expanding beyond hosting into the world of predictive transformation. At LoadSpring, we bridge innovation and transformation with our LoadSpring Cloud Platform and the integrated data capabilities we provide through LoadSpring INSIGHTS. Our technology solutions provide a secure hosting platform to run the project and capital-intensive industries’ most crucial project applications, delivering a reporting and analytical database of clean, accurate, relevant, and structured data.
LoadSpring’s innovative, tenacious, and driven professionals benefit from a unique working environment where our teams blend varying perspectives, experiences, and technologies to solve complex problems. In our value-filled environment, you’ll feel supported with workplace flexibility, commitment to health and wellness, and varied professional growth opportunities. We are excited to invite you to apply for our Site Reliability Engineer (SRE) position and see how you can help top companies around the globe unlock the power of their data and position them to make the best strategic business decisions!
ABOUT THE SITE RELIABILITY ENGINEER POSITION:
We are seeking a highly skilled Site Reliability Engineer (SRE) who will be responsible for designing, optimizing, implementing, and maintaining our cloud environment for customers. This includes both front-end (applications), back-end infrastructure, and delivery systems.
Responsibilities:
Site Reliability
- Help design, deploy, and maintain application monitoring to ensure all applications and sites are monitored.
- Respond to application, CPU, and memory alerts, find root causes of the alerts, and if possible, provide permanent solutions.
- Educate users on application usage that causes performance degradation.
- Help create and maintain a culture of continuous improvement within the SRE and broader organization.
- QA newly deployed applications to ensure consistency and a great customer experience.
- Leverage the Dev-Ops model to manage the deployment of new platform releases.
- Analyze and recommend solutions for production performance and availability issues.
Documentation
- Create knowledgebase articles to record and document permanent fixes to customer challenges.
- Collaborate with Software Implementation team to update deployment documentation with any new best practices.
- Demonstrates excellent verbal and written communication skills with customers and internal teams.
Professional Development
- Effectively complete training within the timeframe required by the business.
- Maintains current knowledge of technological innovations and trends.
Process
- Take a lead role in any site outages, lead the Incident response and the postmortem process.
- Develop automated recovery plans for sites and applications.
- Develop automated quality assurance processes to provide a consistent environment.
- Follow Change Management processes to implement configuration changes.
- Follow Problem Management processes to troubleshoot and resolve recurring issues.
- Participates in the on-call rotation to ensure 24 x 7 support of IT operations.
Mentoring:
- Act as a mentor within the SRE team and broader organization, providing guidance, training, and knowledge sharing.
Requirements:
REQUIREMENT SUMMARY
Min:4.0Max:6.0 year(s)
Information Technology/IT
IT Software - Other
Software Engineering
Graduate
Proficient
1
Massachusetts, USA