Site Reliability Engineer, Senior

20 Feb 2025

Vacancy expired!

JOB DESCRIPTION

Your Role The Site Reliability Engineering team provides support to design, develop, and improve services, platforms and processes that result in improved end-to-end reliability and maintainability to our mission critical application services The Site Reliability Engineer Senior will report to the Senior Manager, Technical Engineering. As stewards of the four golden signals, you will proactively seek out system weaknesses and remediate discovered issues before production issues occur using observability principles, trend analysis, and test resiliency using Chaos Engineering.

Your Work In this role, you will:

  • Design, build and support the application stack in an operationally reliable and cost-effective manner
  • Maintain and measure reliability, latency, and scalability for complex systems
  • Automate and orchestrate workflows through tools, scripting, and programming
  • Troubleshoot, manage, and resolve issues in production environments and collaborate with IT and business teams to implement strategies to eliminate them
  • Deliver and support monitoring of business service health including Service Level Agreements
  • Address production issues both during and outside of working hours in an on-call capacity
  • Support senior staff in the effort to drive architectural consolidation and simplification
  • Perform proactive daily system monitoring including reviewing system and application logs as well as responding to, triaging, troubleshooting and remediating incidents
  • Participate in post incident-reviews to find out what's working and what's not and improving them by filling the gaps in the process

QUALIFICATIONS

Your Knowledge and Experience

  • Requires a bachelor's degree in computer science or equivalent field
  • Requires at least five years of prior relevant experience
  • Coding/Scripting experience with Java, JavaScript, Python, SQL, PL/SQL, MySQL, Shell Script, or Powershell. Experience using APIs - Rest, SOAP
  • Required skills leveraging self-healing for monitoring abrasion - use of desired state within tools such as Ansible Automation Platform
  • Demonstrates a high level of enthusiasm and curiosity to learn new technologies and keep abreast of latest technologies
  • Strong grasp of Windows/Unix/Linux systems & networking
  • Strong ability to measure and meet SLA/SLOs focusing on availability, performance, incidents, and chronic quality issues. Support the effort to arm developers with deeper insights into application performance and service health issues towards reducing MTTA & MTTR
Pay Range:

The pay range for this role is: 96800.00 to 145200.00 for California.

Note:

Please note that this range represents the pay range for this and many other positions at Blue Shield that fall into this pay grade. Blue Shield salaries are based on a variety of factors, including the candidate's experience, location (California, Bay area, or outside California), and current employee salaries for similar roles.

  • ID: #49320587
  • State: California Oakland 94601 Oakland USA
  • City: Oakland
  • Salary: USD TBD TBD
  • Job type: Permanent
  • Showed: 2023-02-20
  • Deadline: 2023-04-20
  • Category: Et cetera