Distributed Cloud | Azure Site Reliability Engineer, Hybrid
at Devoteam

Lisboa, Área Metropolitana de Lisboa, Portugal -

Start Date	Expiry Date	Salary	Posted On	Experience	Skills	Telecommute	Sponsor Visa
Immediate	13 Oct, 2024	Not Specified	15 Jul, 2024	5 year(s) or above	Reliability,Powershell,Bash,Performance Metrics,Discrimination,Microsoft Azure,Maintenance,System Architecture,Root,Virtual Networks,Disabilities,Security,Operations,English,Analytical Skills,Scalability,Integration,Automation,Computer Science	No	No

Add to Wishlist Apply All Jobs

Required Visa Status:

Citizen	GC
US Citizen	Student Visa
H1B	CPT
OPT	H4 Spouse of H1B
GC Green Card

Employment Type:

Full Time	Part Time
Permanent	Independent - 1099
Contract – W2	C2H Independent
C2H W2	Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

At Devoteam, we believe that technology with strong human values can actively drive change for the better. Discover how Tech for People unlocks the future, creating a positive impact on the people and the world around us. We are a global leading player in Digital Transformation for leading organisations across EMEA, with a revenue of €1B. We believe in transforming technology to create value for our clients, partners and employees in a world where technology is developed for people. We are proud of the culture we have built together. We are proud of our people at the service of technology. We are proud of our diverse environment. Because we are #TechforPeople. Join our multidisciplinary team of Cloud experts, Designers, Business consultants, Security experts, Engineers, Developers and other extraordinary talents, spread across more than 20 EMEA countries. Become one of our +10.000 tech and business leaders on cloud, data and cyber security. Let’s fuse creativity with technology together and build innovative solutions that actively change things for the better.
Our Devoteam Distributed Cloud Unit is looking for Azure Site Reliability Engineers to join our Infrastructure Microsoft team and work inside several projects within the banking sector.

We are seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) with Azure and Kubernetes certifications and extensive experience working with microservices. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our cloud-based microservices architecture. You will collaborate closely with development and operations teams to design, implement, and maintain systems that are robust, resilient, and highly available.

Infrastructure Management: Design, implement, and manage scalable, secure, and reliable cloud infrastructure on Microsoft Azure and Google Cloud Platform.
Microservices Management: Design, implement, and manage a scalable and resilient microservices architecture on Microsoft Azure.
Kubernetes Administration: Deploy, manage, and optimize Kubernetes clusters, ensuring smooth operation and integration with existing systems.
Automation and Monitoring: Develop and maintain automation scripts and tools to enhance system efficiency and reduce manual intervention. Implement comprehensive monitoring and alerting systems to proactively identify and resolve issues.
Performance Optimization: Analyze system performance metrics and make recommendations for improvements. Implement performance tuning and optimization strategies.
Incident Response: Lead incident response efforts, including troubleshooting, root cause analysis, and post-incident reviews. Develop and implement strategies to minimize downtime and prevent future occurrences.
Collaboration: Work closely with development, operations, and security teams to ensure seamless integration of new features and technologies. Advocate for best practices in reliability, security, and performance.
Documentation: Create and maintain detailed documentation of system architecture, processes, and procedures. Ensure knowledge sharing across the team.
Mentorship: Provide guidance and mentorship to junior SREs and other team members. Foster a culture of continuous learning and improvement.
Bachelor’s degree in Computer Science, Information Technology, or a related field. Master’s degree is a plus.
Minimum of 5 years of experience in a Site Reliability Engineer or similar role, with a strong background in cloud infrastructure and container orchestration.
Proficiency in Microsoft Azure, including Azure Resource Manager (ARM), Virtual Networks, Azure Kubernetes Service (AKS), Azure DevOps, and Azure Monitor.
Extensive experience with Kubernetes, including deployment, scaling, and maintenance of clusters.
Strong understanding of microservices architecture and best practices.
Strong programming and scripting skills (e.g., Python, PowerShell, Go, Bash, Azure CLI).
Strong experience with IAC and configuration management tools (e.g., Ansible, Terraform).
Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
Knowledge of networking, security, and best practices in cloud environments.
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Ability to work effectively in a fast-paced, dynamic environment.
Leadership skills with the ability to mentor and guide team members.
Good level of knowledge in English (mandatory).
Certifications: Microsoft Certified: Azure Solutions Architect Expert, Microsoft Certified: Azure DevOps Engineer Expert, Certified Kubernetes Administrator (CKA).
The Devoteam Group works for equal opportunities, promoting its employees based on merit and actively fights against all forms of discrimination. We are convinced that diversity contributes to the creativity, dynamism and excellence of our organization. All of our vacancies are open to people with disabilities.

Responsibilities:

Infrastructure Management: Design, implement, and manage scalable, secure, and reliable cloud infrastructure on Microsoft Azure and Google Cloud Platform.
Microservices Management: Design, implement, and manage a scalable and resilient microservices architecture on Microsoft Azure.
Kubernetes Administration: Deploy, manage, and optimize Kubernetes clusters, ensuring smooth operation and integration with existing systems.
Automation and Monitoring: Develop and maintain automation scripts and tools to enhance system efficiency and reduce manual intervention. Implement comprehensive monitoring and alerting systems to proactively identify and resolve issues.
Performance Optimization: Analyze system performance metrics and make recommendations for improvements. Implement performance tuning and optimization strategies.
Incident Response: Lead incident response efforts, including troubleshooting, root cause analysis, and post-incident reviews. Develop and implement strategies to minimize downtime and prevent future occurrences.
Collaboration: Work closely with development, operations, and security teams to ensure seamless integration of new features and technologies. Advocate for best practices in reliability, security, and performance.
Documentation: Create and maintain detailed documentation of system architecture, processes, and procedures. Ensure knowledge sharing across the team.
Mentorship: Provide guidance and mentorship to junior SREs and other team members. Foster a culture of continuous learning and improvement.
Bachelor’s degree in Computer Science, Information Technology, or a related field. Master’s degree is a plus.
Minimum of 5 years of experience in a Site Reliability Engineer or similar role, with a strong background in cloud infrastructure and container orchestration.
Proficiency in Microsoft Azure, including Azure Resource Manager (ARM), Virtual Networks, Azure Kubernetes Service (AKS), Azure DevOps, and Azure Monitor.
Extensive experience with Kubernetes, including deployment, scaling, and maintenance of clusters.
Strong understanding of microservices architecture and best practices.
Strong programming and scripting skills (e.g., Python, PowerShell, Go, Bash, Azure CLI).
Strong experience with IAC and configuration management tools (e.g., Ansible, Terraform).
Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
Knowledge of networking, security, and best practices in cloud environments.
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Ability to work effectively in a fast-paced, dynamic environment.
Leadership skills with the ability to mentor and guide team members.
Good level of knowledge in English (mandatory).
Certifications: Microsoft Certified: Azure Solutions Architect Expert, Microsoft Certified: Azure DevOps Engineer Expert, Certified Kubernetes Administrator (CKA).
The Devoteam Group works for equal opportunities, promoting its employees based on merit and actively fights against all forms of discrimination. We are convinced that diversity contributes to the creativity, dynamism and excellence of our organization. All of our vacancies are open to people with disabilities

REQUIREMENT SUMMARY

Experience:Min:5.0Max:10.0 year(s)

Industry:Information Technology/IT

Functional area of job:IT Software - Network Administration / Security

Domain:Software Engineering

Qualifications:Graduate

Specialization:Computer science information technology or a related field

English Proficiency:Proficient

Number of posts:1

Address of job:Lisboa, Portugal

Distributed Cloud | Azure Site Reliability Engineer, Hybrid
at Devoteam

Required Visa Status:

Employment Type:

REQUIREMENT SUMMARY

INDIA

AUSTRALIA

UNITED ARAB EMIRATES

Distributed Cloud | Azure Site Reliability Engineer, Hybridat Devoteam

Required Visa Status:

Employment Type:

REQUIREMENT SUMMARY

Distributed Cloud | Azure Site Reliability Engineer, Hybrid
at Devoteam