Job Title: Systems Applications Administrator IV
Number of Positions: 1
Assignment Type: ~10-month contract to hire
Location: Frisco, TX or Omaha, NE or Alpharetta, GA
Summary of Position:
What does Senior Application Monitoring Engineer do?
You will be a senior member and integral part of the Digital Solutions’ Application Monitoring and Automation team, which is responsible for monitoring the availability and performance of the online banking service platforms we host in Client’s data centers/cloud and the clients on those platforms. In this capacity, your mission is to adeptly harness our arsenal of monitoring and automation tools, thereby enabling the expeditious dispatch of meticulous alerts. These alerts, finely honed through your vigilance, empower our Support teams to execute timely, effective responses. Whenever feasible, the team endeavors to introduce automated solutions to rectify anomalies, obviating the need for manual intervention.
Your role assumes paramount significance in guaranteeing the efficacy, dependability, and security of our applications. Your proficiency in the deployment of monitoring tools, scripting, and cloud-based integrations is indispensable in upholding the vitality of our digital ecosystem. The scope of your involvement extends across a diverse array of systems and technologies, necessitating collaboration with multiple teams across Client to ensure comprehensive and robust monitoring coverage.
The Application Monitoring and Automation team does not perform traditional incident response but assistance with high priority incidents is sometimes necessary and after-hours work is occasionally needed to support certain client go-lives and system changes. You will also be part of a On-call support rotation.
The key factors to success in this role are:
- A passion for learning and technology. This role works with various technology platforms and scripting/programming languages. Successful candidates are excited about opportunities to learn about and work with new technologies.
- Enjoys working with people and relationship building. This role must coordinate with many different teams to achieve our goals. Having strong and productive relationships with these teams is critical for driving projects across the finish line.
- Has strong organizational skills. This team typically has several efforts going at any time such as large strategic efforts, shorter term tactical efforts, and efforts to address immediate issues/fires. A successful candidate will be comfortable switching from one effort to another when they are blocked, or when priorities change, and then return to that effort when appropriate.
What will you do:
- Tools Proficiency: Leverage your deep expertise in tools like Splunk, ExtraHop, and Dynatrace to monitor and analyze application performance, troubleshoot issues, and provide actionable insights.
- Scripting Mastery: Utilize your scripting background, including Python, Selenium, Webdriver, and JMeter, to automate monitoring processes and enhance efficiency.
- Cloud Expertise: Apply your working knowledge of cloud-based solutions to effectively monitor applications hosted in cloud environments, optimizing performance and resource utilization.
- API Integration: Understand and implement APIs in a distributed environment, ensuring seamless integration with monitoring tools and systems.
- Proactive Alerting: Collaborate with the team to reduce the volume of alerts by conducting in-depth analysis and fine-tuning alerting mechanisms, eliminating false positives, and ensuring alerts are highly relevant.
- Incident Response: Assist in high-severity outage investigations, providing critical insights and expertise to minimize downtime and maintain service availability.
- Continuous Learning: Maintain a never-say-no attitude, staying up-to-date with the latest industry trends, tools, and best practices. Share knowledge and mentor team members to foster a culture of learning and growth.
What you will need to have:
- Bachelor’s degree required. Major in Computer Science, Engineering preferred.
- 5+ years of experience in Windows Systems Engineering and/or Application Support
- 5+ years of experience in Application Monitoring and Observability for multiple products at large organizations
- Solid experience with and knowledge of Splunk for monitoring and troubleshooting
- Experience in supporting IIS and Windows environments
- Expert level in PowerShell scripts and/or Python. Any similar scripting language will also be considered.
- Experience in supporting multi-tiered applications
- Experience in working with web logs, HTTP headers, and web services
- Experience with API integration in distributed environments
- Excellent verbal and written communication abilities and strong interpersonal skills
- Strong analytical and problem-solving skills
- Detail oriented, organized, thorough, and self-driven
- Exceptional problem-solving skills and the ability to work independently
- Eagerness to continuously learn and adapt to emerging technologies
- A proactive mindset with a “can-do " attitude
What would be great to have:
- Experience with monitoring tools such as Extrahop, DynaTrace, Moogsoft, etc.
- Experience/certifications with Azure cloud platform
- Proficiency with Selenium, Webdriver, and JMeter
- Experience integrating cloud technologies with SIEM and monitoring tools
TB_E