Site Reliability Engineer

at  ISG Search Inc

Toronto, ON, Canada -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate23 Nov, 2024Not Specified29 Aug, 2024N/AData Structures,Functionality,Technical Documentation,Software Design,Data Architects,Daily Operations,Service Orientation,C++,Technical Leadership,Software Engineering Practices,Automation Tools,Workstation,Automation,Engineers,Ansible,Career DevelopmentNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity, and are responsible for mentoring and leading less experienced SRE’s.

KEY ACCOUNTABILITIES:

  1. Interface with systems analysts, solution designers, data architects, and business stakeholders, to understand the business-site reliability requirements.
  • Assist with analysis and profiling of existing site reliability issues and resolve in a timely, competent manner.
  • Document project requirements and articulate necessary technical specifications to align technical and business goals.
  • Ensure all forms of communication (verbal, email, business documents, and technical documents) are clear, concise, and professional.
  • Drive partnership with security engineers to develop plans and automation strategies too aggressively and safely respond to new risks and vulnerabilities.2. Design, develop, test, and document advanced site reliability solutions within a complex site reliability environment to align technical and business goals.
  • Analyze systems problems/issues for projects and/or business areas and develop/support systems solutions.
  • Lead the design, and architect operational solutions for managing applications with the specific goal of increasing the automation, repeatability, and consistency of operational tasks.
  • Drive defining system objectives based on user needs, industry requirements, and an understanding of business rules.
  • Sponsor healthy software development practices.
  • Use strong technical skills, knowledge, and experience to implement appropriate site reliability solutions to resolve a variety of simple or complex issues and problems within established guidelines and drive business strategy.
  • Provide architectural and practical leadership to software development to improve resiliency, efficiency, performance, and costs.
  • Build deployment and site provisioning automation and move the organizations toward total infrastructure as code paradigm by utilizing more and more scripting and automation from environment provisioning to failure detection and recovery.
  • Drive the complete automation of the pipeline.3. Work with a team of analysts, engineers, management, and business stakeholders to identify site reliability solutions while assessing technical and business suitability, and its impact to the enterprise’s systems.
  • Develop and implement solutions according to site reliability project needs, while assessing current systems and solutions to establish best practices for project.
  • Provide technical leadership to a growing team focused on applying software engineering practices to operations at scale.
  • Drive dynamic, cross-functional and multi-stakeholder projects.
  • Resolve or escalate issues and/or project changes to the Team Lead, as well as provide status updates to the Team Lead.
  • Provide input on best site reliability applications and processes.5. Manage daily operations and functionality of site reliability solutions and applications.
  • Collaborate with analysts, support staff, to determine site reliability issues and resolve in a timely, competent manner while escalating if necessary.
  • Collaborate with internal teams to ensure that operational development solutions meet business requirements.
  • Develop, communicate, and monitor standard processes to promote the long-term health of sustainability and health of operational development tasks.
  • Persistent testing of application and infrastructure resiliency over a variety of error conditions.
  • Support, monitor site reliability environment for issues, inefficiencies, and potential areas for improvement.
  • Monitor and report on service level objectives for a given applications services. Work with business and product owners to establish key performance indicators.
  • Create and maintain monitoring technologies and processes that improve the visibility of the enterprise’s applications’ performance and business metrics, as well as helping keep operational workload reasonable.6. Research, evaluate, and recommend operational procedures and best practices to optimize processes and workflow efficiencies.
  • Manage new and existing account underwriting workflows.
  • Assist in the development, production, and maintenance of procedures, reports, and documentation as required.
  • Conduct knowledge transfer sessions with less experienced staff members; ensure that all technical documentation is produced for ease of transferring skills and knowledge on projects.7. Assist in the training and development of staff, and provide technical expertise and advice regarding products, systems, and processes.
  • Ongoing training, career development, and mentorship of team members.
  • Act as a resource on risks outside of the authority level for less experienced staff and authorize risks within own established authority levels.
  • Participate in conducting technical training events, game day scenarios, and focused engineering spikes.
  • Provide training, development, and technical guidance to colleagues regarding products, systems, and processes as required.8. Develop and maintain a knowledge and understanding of company operations/procedures, and insurance industry trends/developments, to enhance the security process.
  • Collaborate, participate in, “feature/service” oriented development teams.9. Other duties as assigned.SKILL/BEHAVIOUR REQUIREMENTS:
  • Exceptional professional communication skills.
  • Exceptional interpersonal skills with the ability to work as a team member.
  • Exceptional customer service orientation.
  • Exceptional ability to debug and optimize code and automate routine tasks.
  • Systematic problem-solving approach.
  • Exceptional ability to manage multiple, complex desktop projects.
  • Documented experience in developing process improvement initiatives.
  • An intimate understanding of industry practices and how to best leverage them.
  • Proficiency in one or more of the following: C, C++, Java, Python, Go, Perl or Ruby.
  • Documented experience with algorithms, data structures, complexity analysis and software design.
  • Documented experience working with configuration management and deployment automation tools like Chef, Terraform, Puppet or Ansible.
  • Documented experience within operations, administering both Window (2003/2008/2012) and Linux machines.
  • Exceptional knowledge of the Software Development Life Cycle with experience working in QA and beta environments.
  • Proficiency in cloud computing concepts, both remotely and on-site.
  • Exceptional knowledge and experience working with Azure cloud infrastructure and services is preferred.
  • Exceptional knowledge and experience with virtualization and container solutions (ie., Docker, Kubernetes, ECS).
  • Documented experience and knowledge of scripting tools such as Terraform and Ansible.
  • Expert-level knowledge and understanding of diverse infrastructure platforms and infrastructure concepts
  • Extensive understanding and experience of the Agile software development methodology.
  • Expert-level ability to debug and optimize code and automate routine tasks.QUALIFICATIONS:
  • Bachelor’s Degree in Computer Science, Computer Engineering, Software Engineering, MIS, or other related discipline required.
  • 7+ years’ software development / architecture experience.
  • Strong, documented hands-on experience in a DevOps or System Administration role.
  • Prior experience building and supporting cloud-based solutions.WORKING CONDITIONS:
  • Indoors in normal office environment with little exposure to excessive noise, dust, fumes, vibrations, and temperature.
  • Frequent computer use at workstation.

Responsibilities:

Please refer the Job description for details


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Computer science computer engineering software engineering mis or other related discipline required

Proficient

1

Toronto, ON, Canada