Site Reliability Engineer (Temp to Perm) at Leidos
Vista, CA 92081, USA
Full Time


Start Date

Immediate

Expiry Date

16 Sep, 25

Salary

85150.0

Posted On

17 Jun, 25

Experience

4 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Oracle, Remote Locations, AWS, SQL Server, Operating Systems, Continuous Integration, Sustainment, Kubernetes, Google Cloud Platform, Network Administration, Security+, Fleet, Management System, MySQL, Computer Science, Computer Engineering, Docker, HTTPS, Large Projects

Industry

Information Technology/IT

Description

BASIC QUALIFICATIONS:

  • Typically requires a Bachelor’s degree in computer science or computer engineering with 4+ years of experience in a relevant field.
  • Must be able to pass an in-depth background check (CBP Public Trust BI).
  • Experience delivering entire projects or processes spanning multiple technical areas.
  • Experience serving as a technical lead managing large projects or processes.
  • Working knowledge of Agile Development and continuous integration and continuous delivery methodologies and tools.
  • Expertise with Linux and Windows operating systems, network administration, and networking protocols/functions (e.g., HTTP, HTTPS, SSL/TLS, SMTP, DNS).
  • Expertise provisioning and managing resources within IaaS/cloud infrastructures (e.g., Azure, AWS, Google Cloud Platform).
  • Experience with Terraform, Ansible, Helm, Bash scripting, CloudFormation, Chef, Puppet, or similar technologies.
  • Experience troubleshooting PLC end-device software and service-level computers on the edge that feed our on-prem and cloud-based server systems.
  • Expertise with container technologies such as Docker and container orchestration tools like Kubernetes.
  • Expertise with the Kubernetes kubectl command-line tool.
  • Expertise with a version control system (e.g., Git).
  • Strong, self-motivated desire to learn new tools, frameworks, and techniques.
  • Ability to complete tasking independently with minimal direct supervision.
  • Ability to work and collaborate effectively within a multi-disciplined engineering team.
  • Ability to travel up to 70% of the time to remote locations, mostly in the US along the border, to troubleshoot network and software bugs during initial deployment and sustainment.
  • U.S. Citizenship is required.

PREFERRED QUALIFICATIONS:

  • Experience with enterprise event broker technologies (e.g., Kafka, NATS)
  • Experience with monitoring and alerting tools such as Grafana and Prometheus
  • Experience with API gateways such as Istio
  • Experience with GitOps tools such as Argo CD, Flux CD, Fleet or similar
  • Professional cybersecurity certification such as Security+, or similar.
  • Knowledge of Agile Development methodologies.
  • Familiarity with at least one Relational Database Management System (Oracle, MySQL, PostgreSQL, SQL Server, etc.).
Salary range for this position: $90,000 - $105,000
Responsibilities
  • Gather and analyze metrics from operating systems and applications to assist in performance tuning and fault finding of a microservice enterprise system (cloud and on-premises)
  • Partner with development teams to improve services, diagnostics, and deployment tools through gap identification, concept development, and rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through service automation
  • Design, develop, troubleshoot, and debug mission-critical infrastructure on-prem and remote
  • Manage on-premises and private/public cloud environments via infrastructure-as-code (IaC) and hands-on/client site activities.
  • Participate in the concept design of reusable infrastructure components for scalable, highly available, secure architectures for cloud native applications.
  • Enable the continuous integration and continuous delivery of our diverse suite of software products by applying best practices for infrastructure provisioning, configuration and automated software deployments.
  • Continually evaluate fielded system deployments and apply best practices to facilitate continuous improvement that can be applied across teams.
  • Work closely with other engineers to develop the best technical design and approach for new product installation and field service activities (software patches, cyber updates, etc.)
  • Develop solutions to complex technical issues and problems that impact multiple areas or disciplines.
  • Communicate with internal team members across multiple areas and coordinate completion of key deliverables across teams.
  • Liaise with external and internal customer stakeholders on technical design decisions and trade-offs, and ensure the software solution meets required functional, performance, and SLA thresholds, especially with customer network interfaces.
  • Mentor other SREs in the art of building, deploying, and maintaining production mission-critical microservice enterprise systems.
  • Resolve roadblocks for the field service team, working collaboratively with product engineering, technical leadership, and others.