Lead Site Reliability Engineer (Cloud Engineering)

at  Tyson Foods

Torreón, Coah., Mexico -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate09 Oct, 2024Not Specified10 Jul, 20245 year(s) or aboveAws,Reliability Engineering,Code,Docker,Teams,Scripting,Computer Science,Azure,Communication Skills,Palantir,Operations,InfrastructureNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

Job Details:
Job Description – Lead Site Reliability Engineer (Cloud Engineering)
The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based analytics systems in GCP. The role will play a crucial part in designing and implementing robust, scalable solutions while providing technical leadership to our Cloud/Data Engineering team. We are seeking a seasoned Lead Site Reliability Engineer with over 6 years of technical expertise in cloud infrastructure and cloud operations space.
At Tyson Foods, the Lead Site Reliability Engineer will have the opportunity to work on cutting-edge technologies and collaborate with talented professionals in a dynamic environment. We offer a culture that values innovation, growth, and work-life balance, along with opportunities for career advancement and professional development.

Primary Job Responsibilities:

  • Design and implement highly available and scalable cloud infrastructure solutions in GCP/AWS.
  • Collaborate with cross-functional teams to optimize system performance, capacity, and cost-efficiency.
  • Architect and implement and manage monitoring, logging, and observability solutions to ensure proactive issue identification and resolution.
  • Lead incident response efforts, conduct root cause analysis, and implement corrective actions to prevent recurrence.
  • Develop automation scripts and tools to streamline operational processes and improve efficiency.
  • Implement and enforce security best practices and compliance requirements in cloud environments.
  • Provide technical leadership, mentorship, and guidance to a team of engineers to foster a culture of innovation and excellence.
  • Work closely with development teams to ensure alignment of infrastructure with application requirements and deployment pipelines.
  • Drive continuous improvement initiatives to enhance reliability, scalability, and performance of our cloud infrastructure.
  • Troubleshoot and resolve issues related to data migration, system updates etc
  • This role requires strong technical expertise as well as excellent problem-solving and communication skills.

Qualifications:

  • Minimum 10 years of experience in a Site Reliability Engineering or similar role, with a strong focus on cloud infrastructure, and operations.
  • 8+ years and deep expertise in cloud platforms such as GCP, AWS or Azure.
  • 5+ years and proficiency in infrastructure as code (IaC) tools such as Terraform, CloudFormation, or Ansible.
  • Strong knowledge of containerization and orchestration technologies (e.g., Kubernetes, Docker).
  • Experience with CI/CD pipelines and DevOps practices.
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack).
  • 5+ years in scripting and programming skills (e.g., Python, Bash).
  • Experience in mentoring technical teams and driving projects to successful completion.
  • Excellent communication skills with the ability to collaborate effectively across teams and influence stakeholders.
  • Strong analytical and troubleshooting skills, with the ability to analyze complex issues and drive solutions.
  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • Certification in relevant cloud platforms (GCP, AWS, AZURE or Palantir).
  • Familiar with Agile Methodology concepts.

Good to have:

  • Master’s degree in Computer science, engineering field.
  • Familiarity with ITIL or other service management frameworks.

TysonMXT

Relocation Assistance Eligible:
No
Work Shift:
Hourly Applicants ONLY -You must complete the task after submitting your application to provide additional information to be considered for employment.
Tyson is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will be considered without regard to race, national origin, color, religion, age, genetics, sex, sexual orientation, gender identity, disability or veteran status.
We provide our team members and their families with paid time off; 401(k) plans; affordable health, life, dental, vision and prescription drug benefits; and more.
CCPA Notice. If you are a California resident, and would like to learn more about what categories of personal information we collect when you apply for this job, and how we may use that information, please read our CCPA Job Applicant Notice at Collection

Responsibilities:

  • Design and implement highly available and scalable cloud infrastructure solutions in GCP/AWS.
  • Collaborate with cross-functional teams to optimize system performance, capacity, and cost-efficiency.
  • Architect and implement and manage monitoring, logging, and observability solutions to ensure proactive issue identification and resolution.
  • Lead incident response efforts, conduct root cause analysis, and implement corrective actions to prevent recurrence.
  • Develop automation scripts and tools to streamline operational processes and improve efficiency.
  • Implement and enforce security best practices and compliance requirements in cloud environments.
  • Provide technical leadership, mentorship, and guidance to a team of engineers to foster a culture of innovation and excellence.
  • Work closely with development teams to ensure alignment of infrastructure with application requirements and deployment pipelines.
  • Drive continuous improvement initiatives to enhance reliability, scalability, and performance of our cloud infrastructure.
  • Troubleshoot and resolve issues related to data migration, system updates etc
  • This role requires strong technical expertise as well as excellent problem-solving and communication skills


REQUIREMENT SUMMARY

Min:5.0Max:10.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Computer science engineering or a related field

Proficient

1

Torreón, Coah., Mexico