Remote - Senior Manager, Software Engineering

at Green Dot Corporation

Remote, Oregon, USA

Start Date: Immediate
Expiry Date: 24 Jun, 2024
Salary: USD 206000 Annual
Posted On: 25 Mar, 2024
Experience: N/A
Skills: UX, AWS, Docker, Interpersonal Skills, Change Management, Teams, Computer Science, Automation, Azure, Microservices, Java, SQL Server, Programming Languages, Root Cause, Software Development, Test Methodologies, Python, Customer Experience, Transactional Systems
Telecommute: No
Sponsor Visa: No
Required Visa Status: Citizen, GC, US Citizen, Student Visa, H1B, CPT, OPT, H4 Spouse of H1B, GC Green Card
Employment Type: Full Time, Part Time, Permanent, Independent - 1099, Contract – W2, C2H Independent, C2H W2, Contract – Corp 2 Corp, Contract to Hire – Corp 2 Corp

Description:

We’re looking for talented professionals, anywhere in the United States, to join us in bringing smart money management and payment solutions to everyone’s fingertips.
At Green Dot, we are evolving to a new and permanent “Work from Anywhere” model designed to maximize the benefits of remote work, promote and enable a strong culture of performance and connectedness, and attract the best and brightest talent who align with our entrepreneurial spirit and mission.

JOB DESCRIPTION

The Sr. Manager of Site Reliability Engineering (SRE) will oversee the reliability functions within the engineering organization, including but not limited to the Core Platform, Risk, Customer Care, IVR, and Customer Notifications stacks for Green Dot. Reporting to the Sr. Director of Engineering, the Sr. Manager of SRE will build, lead, and retain a geographically diverse team of engineers covering the above functions.
The engineering leader will be experienced in managing multiple engineering platforms simultaneously and in handling multiple incidents of varying priority. The leader should also be skilled at resource planning, building cross-functional business partnerships, and driving strategic planning and execution. This leader is expected to be highly technical, able to drive operations both proactively and reactively, with an understanding of business functions or the ability to prototype a proof of concept (POC). The leader is also expected to build a world-class engineering team in the SRE domain. The leader is expected to be a fast learner who quickly grasps the business, product, and customer demographics and touchpoints, and who can partner with the Product, Operations, and Engineering teams not only to optimize the customer experience but also to proactively reduce customer issues through self-service means such as alerts and dashboards. The leader is also expected to be a good communicator who can interface easily with internal and external stakeholders and customers. Finally, the leader is expected to have an operational mindset and be able to maintain and monitor the system after its production rollout.

Job Responsibilities

Success in this role will be determined by the leader’s ability to deliver on the following:
  • Lead SRE teams responsible for reliability and performance of on-prem and cloud services.
  • Lead and drive troubleshooting of complex technical issues and come up with effective solutions quickly.
  • Build operational dashboards by gathering and analyzing metrics from operating systems, database systems, network, storage, and applications to assist in performance tuning and root cause analysis.
  • Partner with development teams to improve services through rigorous testing and release procedures.
  • Participate in systems design consulting, platform management and capacity planning.
  • Create sustainable systems and services through automation using various programming languages.
  • Lead the discussion on balancing feature development speed against reliability, using well-defined service level objectives.
  • Help drive the SLAs for system uptime and stability.
  • Partner with the engineering team to migrate on-prem services to the cloud platform of choice, such as Azure or AWS.
  • Introduce new tools for monitoring the system effectively and ensure the retention of data in Datadog and other monitoring tools.
  • Display strong communication skills at both the engineering and business levels.
  • Proactively define the shelf life of systems based on their growth in accounts and transactions and on system load.

Qualifications:

  • Degree in computer science or relevant experience.
  • Should have 7-10 years of experience leading a team of site reliability engineers.
  • Experience in modern infrastructure services, large scale distributed systems, microservices, and software design at a high level.
  • Experience providing services on very high-volume transactional systems.
  • Strong emphasis on SRE as an engineering subject matter expert, with proficiency in at least one programming language such as Python, Go, or Java.
  • Solid foundation in SRE principles, including monitoring, alerting, error handling, fault/incident analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil through code and process improvements.
  • Experience in software development lifecycle, test methodologies and tools.
  • Experience building and leading engineering teams, ideally SRE or Production Engineering.
  • Passion for eliminating repetitive manual processes through automation and for improving that automation over repeated iterations.
  • Experience in monitoring tools and practices for ensuring system health and performance.
  • Experience with public cloud platforms such as AWS and Azure.
  • Experience with SQL databases (preferably SQL Server) and Azure-based NoSQL solutions.
  • Experience with cloud-native stacks and tools such as Jenkins, Docker, containers, and container management systems.
  • Superb interpersonal skills, capable of working with multi-functional technical and business teams and varying levels of management, influencing decision making.
  • Experience building strong partnerships with other job functions, like Product, Customer Experience, Marketing, and UX, to keep teams collaborating smoothly and working together to improve the product.
  • Strong desire to understand the root cause of issues and details of systems, get hands-on with data and analysis to evaluate how the team and the product are growing.
  • Experience with service-oriented and event-driven system architectures, building high-performance, highly transactional distributed systems on the order of 5,000 TPS.
  • Should be adaptable to a flexible work schedule.
  • Should be an effective communicator and a driven, passionate, and assertive leader.
  • Must be knowledgeable about industry trends in SRE, best practices, and change management.


REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Computer Science

Proficient

1

Remote, USA