Site Reliability / Systems Engineer

at Dakota Software Corporation

Remote, Oregon, USA

Start Date: Immediate
Expiry Date: 08 Jul, 2024
Salary: Not Specified
Posted On: 09 Apr, 2024
Experience: 4 year(s) or above
Skills: Python, Linux, Flux, Computer Science, Infrastructure, Octopus, Ruby, JavaScript, Redis, PowerShell, Operating Systems, Scripting Languages, Kubernetes
Telecommute: No
Sponsor Visa: No
Required Visa Status:
Citizen, GC (Green Card), US Citizen, Student Visa, H1B, CPT, OPT, H4 Spouse of H1B
Employment Type:
Full Time, Part Time, Permanent, Independent - 1099, Contract - W2, C2H Independent, C2H W2, Contract - Corp 2 Corp, Contract to Hire - Corp 2 Corp

Description:

REQUIRED SKILLS AND QUALIFICATIONS

  • Bachelor’s degree (or equivalent) in computer science or a related discipline from an accredited U.S. university, or the foreign equivalent
  • 4 years of industry experience building, maintaining, and troubleshooting cloud computing systems and infrastructure for SaaS systems
  • Proficiency with automation and scripting languages and tools, such as Terraform, Flux, and PowerShell, and with Azure ARM templates
  • Experience with DevOps and deployment systems, such as Azure DevOps and Octopus
  • Experience with Azure cloud computing infrastructure, such as Azure Storage and Kubernetes
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement

PREFERRED SKILLS AND QUALIFICATIONS

  • Experience with Azure Monitor, Application Insights, and Prometheus/Grafana
  • Experience with containerized apps, message queueing, and Linux and Windows operating systems
  • Coding experience beyond simple scripts
  • Ability to program (structured and OOP) using one or more high-level languages, such as C#, Python, Ruby and JavaScript
  • Understanding of Redis (Cache)
  • Professional certification

Responsibilities:

OBJECTIVES OF THIS ROLE

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
  • Provide primary operational support and engineering for multiple large-scale distributed software applications

KEY RESPONSIBILITIES

  • Create sustainable systems and services through automation (IaC) and uplifts
  • For the dev, test, staging and production systems: automate their setup, maintain the systems, and debug and resolve issues
  • Ensure production systems adhere to industry best practices related to security and privacy
  • Manage and monitor all infrastructure and systems installations, including configuration, testing, and maintenance
  • Implement and maintain strategies for backup, security, and redundancy
  • Craft custom scripts that reduce the need for human interaction
  • Work on-call shifts and help prevent incidents before they happen
  • Build monitoring that alerts on symptoms rather than on outages
  • Work with the team to fine-tune automated deployments and develop progressive rollout and rollback strategies
  • Use the monitoring systems (for alerting and dashboards)
  • Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures


REQUIREMENT SUMMARY

Experience: Min 4.0, Max 9.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Information Technology

Graduate

Computer science or a related discipline from an accredited U.S. university, or the foreign equivalent

Remote, USA