Site Reliability Senior Engineer at City National Bank
San Francisco, CA 94104, USA
Full Time


Start Date

Immediate

Expiry Date

01 Aug, 25

Salary

172355.0

Posted On

02 May, 25

Experience

8 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Technology, Automation, Dynatrace, Visualization, Creativity, Software Systems, Kibana, Metrics Collection, Appdynamics, Aggregation

Industry

Computer Software/Engineering

Description

SKILLS AND KNOWLEDGE

  • Minimum 2 years of experience with log analytics and management solutions such as Splunk or Elasticsearch and Kibana
  • Minimum 2 years of experience with monitoring tools such as Datadog, AppDynamics, Dynatrace, etc.
  • Creativity, energy, and passion for leveraging technology to transform our industry; the belief that automation is the only way
  • A good understanding of modern, cloud-centric architectures and DevOps principles
  • Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting
  • Providing standardized offerings that ensure the operational health of stacks throughout their lifecycle, including metrics collection, aggregation, and visualization; inventory; capacity; and billing/tag management
  • You are competitive and passionate. You thrive on challenge and have a proven ability to set ambitious but achievable goals and surpass them
  • Demonstrate a team-player attitude with a growth mindset, open to learning and adapting to the changing landscape of the industry

COMPENSATION

Starting base salary: $101,231 - $172,355 per year. Exact compensation may vary based on skills, experience, and location. This job is eligible for bonus and/or commissions.

  • To be considered for this position you must meet at least these basic qualifications

The preceding job description has been designed to indicate the general nature and level of work performed by employees within this classification. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities, and qualifications required of employees assigned to this job.

Responsibilities
  • Design, develop and implement solutions that improve stability, security, scalability and availability of CNB’s software platforms
  • Design mechanisms for proactive alerts and responses to identify and address reliability risks
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, planning, and reviews
  • Design, build and manage SLIs, SLOs and error budgets for availability, performance/latency and throughput for critical services running in production. Be a proponent of using core SRE principles to drive product velocity
  • Create educational how-to documentation and blog about use cases and architectures that relate to cloud platforms and observability. Coordinate hackathons and code reviews with the goal of continuous improvement in design, build and architectural practices
  • Liaise with the team managing our public cloud environments, including setup, management, and troubleshooting
  • Design and create solutions to test application resiliency using chaos engineering, failover scenarios and capacity analysis to reduce MTTR (mean time to resolve) and increase MTBF (mean time between failures), minimizing client impact
  • Coach and mentor the junior team members to nurture team productivity and professional development
  • All other appropriate duties as required.