Distributed Cloud | Azure Site Reliability Engineer, Hybrid

at  Devoteam

Lisboa, Área Metropolitana de Lisboa, Portugal -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate13 Oct, 2024Not Specified15 Jul, 20245 year(s) or aboveReliability,Powershell,Bash,Performance Metrics,Discrimination,Microsoft Azure,Maintenance,System Architecture,Root,Virtual Networks,Disabilities,Security,Operations,English,Analytical Skills,Scalability,Integration,Automation,Computer ScienceNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

  • At Devoteam, we believe that technology with strong human values can actively drive change for the better. Discover how Tech for People unlocks the future, creating a positive impact on the people and the world around us. We are a global leading player in Digital Transformation for leading organisations across EMEA, with a revenue of €1B. We believe in transforming technology to create value for our clients, partners and employees in a world where technology is developed for people. We are proud of the culture we have built together. We are proud of our people at the service of technology. We are proud of our diverse environment. Because we are #TechforPeople. Join our multidisciplinary team of Cloud experts, Designers, Business consultants, Security experts, Engineers, Developers and other extraordinary talents, spread across more than 20 EMEA countries. Become one of our +10.000 tech and business leaders on cloud, data and cyber security. Let’s fuse creativity with technology together and build innovative solutions that actively change things for the better.
    Our Devoteam Distributed Cloud Unit is looking for Azure Site Reliability Engineers to join our Infrastructure Microsoft team and work inside several projects within the banking sector.

We are seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) with Azure and Kubernetes certifications and extensive experience working with microservices. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based microservices architecture. You will collaborate closely with development and operations teams to design, implement, and maintain systems that are robust, resilient, and highly available.

  • Infrastructure Management: Design, implement, and manage scalable, secure, and reliable cloud infrastructure on Microsoft Azure and Google Cloud Platform.
  • Microservices Management: Design, implement, and manage a scalable and resilient microservices architecture on Microsoft Azure.
  • Kubernetes Administration: Deploy, manage, and optimize Kubernetes clusters, ensuring smooth operation and integration with existing systems.
  • Automation and Monitoring: Develop and maintain automation scripts and tools to enhance system efficiency and reduce manual intervention. Implement comprehensive monitoring and alerting systems to proactively identify and resolve issues.
  • Performance Optimization: Analyze system performance metrics and make recommendations for improvements. Implement performance tuning and optimization strategies.
  • Incident Response: Lead incident response efforts, including troubleshooting, root cause analysis, and post-incident reviews. Develop and implement strategies to minimize downtime and prevent future occurrences.
  • Collaboration: Work closely with development, operations, and security teams to ensure seamless integration of new features and technologies. Advocate for best practices in reliability, security, and performance.
  • Documentation: Create and maintain detailed documentation of system architecture, processes, and procedures. Ensure knowledge sharing across the team.
  • Mentorship: Provide guidance and mentorship to junior SREs and other team members. Foster a culture of continuous learning and improvement.
  • Bachelor’s degree in Computer Science, Information Technology, or a related field. Master’s degree is a plus.
  • Minimum of 5 years of experience in a Site Reliability Engineer or similar role, with a strong background in cloud infrastructure and container orchestration.
  • Proficiency in Microsoft Azure, including Azure Resource Manager (ARM), Virtual Networks, Azure Kubernetes Service (AKS), Azure DevOps, and Azure Monitor.
  • Extensive experience with Kubernetes, including deployment, scaling, and maintenance of clusters.
  • Strong understanding of microservices architecture and best practices.
  • Strong programming and scripting skills (e.g., Python, PowerShell, Go, Bash, Azure CLI).
  • Strong experience with IAC and configuration management tools (e.g., Ansible, Terraform).
  • Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
  • Knowledge of networking, security, and best practices in cloud environments.
  • Excellent problem-solving and analytical skills.
  • Strong communication and collaboration abilities.
  • Ability to work effectively in a fast-paced, dynamic environment.
  • Leadership skills with the ability to mentor and guide team members.
  • Good level of knowledge in English (mandatory).
  • Certifications: Microsoft Certified: Azure Solutions Architect Expert, Microsoft Certified: Azure DevOps Engineer Expert, Certified Kubernetes Administrator (CKA).
  • The Devoteam Group works for equal opportunities, promoting its employees based on merit and actively fights against all forms of discrimination. We are convinced that diversity contributes to the creativity, dynamism and excellence of our organization. All of our vacancies are open to people with disabilities.

Responsibilities:

  • Infrastructure Management: Design, implement, and manage scalable, secure, and reliable cloud infrastructure on Microsoft Azure and Google Cloud Platform.
  • Microservices Management: Design, implement, and manage a scalable and resilient microservices architecture on Microsoft Azure.
  • Kubernetes Administration: Deploy, manage, and optimize Kubernetes clusters, ensuring smooth operation and integration with existing systems.
  • Automation and Monitoring: Develop and maintain automation scripts and tools to enhance system efficiency and reduce manual intervention. Implement comprehensive monitoring and alerting systems to proactively identify and resolve issues.
  • Performance Optimization: Analyze system performance metrics and make recommendations for improvements. Implement performance tuning and optimization strategies.
  • Incident Response: Lead incident response efforts, including troubleshooting, root cause analysis, and post-incident reviews. Develop and implement strategies to minimize downtime and prevent future occurrences.
  • Collaboration: Work closely with development, operations, and security teams to ensure seamless integration of new features and technologies. Advocate for best practices in reliability, security, and performance.
  • Documentation: Create and maintain detailed documentation of system architecture, processes, and procedures. Ensure knowledge sharing across the team.
  • Mentorship: Provide guidance and mentorship to junior SREs and other team members. Foster a culture of continuous learning and improvement.
  • Bachelor’s degree in Computer Science, Information Technology, or a related field. Master’s degree is a plus.
  • Minimum of 5 years of experience in a Site Reliability Engineer or similar role, with a strong background in cloud infrastructure and container orchestration.
  • Proficiency in Microsoft Azure, including Azure Resource Manager (ARM), Virtual Networks, Azure Kubernetes Service (AKS), Azure DevOps, and Azure Monitor.
  • Extensive experience with Kubernetes, including deployment, scaling, and maintenance of clusters.
  • Strong understanding of microservices architecture and best practices.
  • Strong programming and scripting skills (e.g., Python, PowerShell, Go, Bash, Azure CLI).
  • Strong experience with IAC and configuration management tools (e.g., Ansible, Terraform).
  • Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
  • Knowledge of networking, security, and best practices in cloud environments.
  • Excellent problem-solving and analytical skills.
  • Strong communication and collaboration abilities.
  • Ability to work effectively in a fast-paced, dynamic environment.
  • Leadership skills with the ability to mentor and guide team members.
  • Good level of knowledge in English (mandatory).
  • Certifications: Microsoft Certified: Azure Solutions Architect Expert, Microsoft Certified: Azure DevOps Engineer Expert, Certified Kubernetes Administrator (CKA).
  • The Devoteam Group works for equal opportunities, promoting its employees based on merit and actively fights against all forms of discrimination. We are convinced that diversity contributes to the creativity, dynamism and excellence of our organization. All of our vacancies are open to people with disabilities


REQUIREMENT SUMMARY

Min:5.0Max:10.0 year(s)

Information Technology/IT

IT Software - Network Administration / Security

Software Engineering

Graduate

Computer science information technology or a related field

Proficient

1

Lisboa, Portugal