Site Reliability Engineer (Temp to Perm) at Leidos

Vista, CA 92081, USA -

Full Time

Start Date

Immediate

Expiry Date

16 Sep, 25

Salary

85150.0

Posted On

17 Jun, 25

Experience

4 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Oracle, Remote Locations, Aws, Sql Server, Operating Systems, Continuous Integration, Sustainment, Kubernetes, Google Cloud Platform, Network Administration, Security+, Fleet, Management System, Mysql, Computer Science, Computer Engineering, Docker, Https, Large Projects

Industry

Information Technology/IT

Description

BASIC QUALIFICATIONS:

Typically requires a Bachelor’s degree in computer science or computer engineering with 4+ years of experience in a relevant field.
Must be able to pass an in-depth background check (CBP Public Trust BI).
Experience delivering entire projects or processes spanning multiple technical areas.
Experience serving as a technical lead managing large projects or processes.
Working knowledge of Agile Development and continuous integration and continuous delivery methodologies and tools.
Expertise with Linux and Windows operating systems, network administration, and networking protocols/functions (e.g., HTTP, HTTPS, SSL/TLS, SMTP, DNS).
Expertise provisioning and managing resources within IaaS/Cloud infrastructures (e.g., Azure, AWS, Google Cloud Platform, etc).
Experience with Terraform, Ansible, Helm, BASH Scripting, CloudFormation, Chef, Puppet, Ansible or similar technologies.
Troubleshooting PLC end-device software and service-level computers on the edge feeding our on-prem and cloud-based server systems.
Expertise with container technologies such as Docker and container orchestration tools like Kubernetes.
Expertise with Kubernetes kubectl
Expertise of a version control system (e.g., Git).
Strong, self-motivated desire to learn new tools, frameworks, and techniques.
Ability to complete tasking independently with minimal direct supervision.
Ability to work and collaborate effectively within a multi-disciplined engineering team.
Ability to travel up to 70% of times to remote locations, mostly in the US along the border to troubleshoot network and software bugs during initial deployment and sustainment.
U.S. Citizenship is required.

PREFERRED QUALIFICATIONS:

Experience with Enterprise Event Brokers Technologies (Kafka, NATS)
Experience with monitoring and alerting tools such as Grafana, Prometheus
Experience with API Gateways such as ISTIO
Experience with GitOps tools such as Argo CD, Flux CD, Fleet or similar
Professional cybersecurity certification such as Security+, or similar.
Knowledge of Agile Development methodologies.
Familiarity with at least one Relational Database Management System (Oracle, MySQL, PostgreSQL, SQL Server, etc.).
*Salary Range for this position: $90,000 - $105,000 *

Responsibilities

Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding of an microservice enterprise system (cloud and on-premises)
Partner with development teams to improve services, diagnostics, and deployment tools through gap identification, concept development, and rigorous testing and release procedures
Participate in system design consulting, platform management, and capacity planning
Create sustainable systems and services through service automation
Design, develop, troubleshoot, and debug mission critical infrastructure on-prem and remote
Manage on-premises and private/public cloud environments via infrastructure-as-code (IaC) and hands-on/client site activites.
Participate in the concept design of reusable infrastructure components for scalable, highly available, secure architectures for cloud native applications.
Enable the continuous integration and continuous delivery of our diverse suite of software products by applying best practices for infrastructure provisioning, configuration and automated software deployments.
Continually evaluate fielded system deployments and apply best practices to facilitate continuous improvement that can be applied across teams.
Work closely with other engineers to develop the best technical design and approach for new product installation and field service activities (software patches, cyber updates, etc.)
Develop solutions to complex technical issues and problems that impact multiple area or disciplines.
Communicate with internal team members across multiple areas and coordinate completion of key deliverables across teams.
Liaise with external and internal customer stakeholders on technical design decisions and trade-offs and ensure software solution will meet required functional, performance, and SLA thresholds, especially with customer network interfaces.
Mentor other SREs in the art of building deploying and maintaining production mission critical microservice enterprise systems.
Resolve roadblocks for the field service team, working collaboratively with the product engineering, technical leadership, and others.