Site Reliability Engineer, Senior job vacancy

Vacancy expired!

JOB DESCRIPTION

Your Role The Site Reliability Engineering team provides support to design, develop, and improve services, platforms and processes that result in improved end-to-end reliability and maintainability to our mission critical application services The Site Reliability Engineer Senior will report to the Senior Manager, Technical Engineering. As stewards of the four golden signals, you will proactively seek out system weaknesses and remediate discovered issues before production issues occur using observability principles, trend analysis, and test resiliency using Chaos Engineering.

Your Work In this role, you will:

Design, build and support the application stack in an operationally reliable and cost-effective manner
Maintain and measure reliability, latency, and scalability for complex systems
Automate and orchestrate workflows through tools, scripting, and programming
Troubleshoot, manage, and resolve issues in production environments and collaborate with IT and business teams to implement strategies to eliminate them
Deliver and support monitoring of business service health including Service Level Agreements
Address production issues both during and outside of working hours in an on-call capacity
Support senior staff in the effort to drive architectural consolidation and simplification
Perform proactive daily system monitoring including reviewing system and application logs as well as responding to, triaging, troubleshooting and remediating incidents
Participate in post incident-reviews to find out what's working and what's not and improving them by filling the gaps in the process

QUALIFICATIONS

Your Knowledge and Experience

Requires a bachelor's degree in computer science or equivalent field
Requires at least five years of prior relevant experience
Coding/Scripting experience with Java, JavaScript, Python, SQL, PL/SQL, MySQL, Shell Script, or Powershell. Experience using APIs - Rest, SOAP
Required skills leveraging self-healing for monitoring abrasion - use of desired state within tools such as Ansible Automation Platform
Demonstrates a high level of enthusiasm and curiosity to learn new technologies and keep abreast of latest technologies
Strong grasp of Windows/Unix/Linux systems & networking
Strong ability to measure and meet SLA/SLOs focusing on availability, performance, incidents, and chronic quality issues. Support the effort to arm developers with deeper insights into application performance and service health issues towards reducing MTTA & MTTR

Pay Range:

The pay range for this role is: 96800.00 to 145200.00 for California.

Note:

Please note that this range represents the pay range for this and many other positions at Blue Shield that fall into this pay grade. Blue Shield salaries are based on a variety of factors, including the candidate's experience, location (California, Bay area, or outside California), and current employee salaries for similar roles.

ID: #49320587
State: California Oakland 94601 Oakland USA
City: Oakland
Salary: USD TBD TBD
Job type: Permanent
Showed: 2023-02-20
Deadline: 2023-04-20
Category: Et cetera