Senior Site Reliability Engineer

13 Apr 2024

Vacancy expired!

Company Federal Reserve Bank of Richmond

The Richmond Fed is the proud home of the Federal Reserve's National IT organization-a nationwide team delivering technology solutions and support across the Federal Reserve System. Many National IT employees are located in Richmond, while others are based across the U.S. at other Federal locations.

When you join our team, you'll become part of a culture that welcomes differences, cares about our communities, and empowers each other to lead from where we are to make things better. Bring your passion and we'll provide challenging and purposeful careers in a variety of fields, opportunities to grow and a wide range of benefits and perks that support your health and wealth. It's all part of what makes #MyRichmondFed a great place to work!

About the Opportunity:

As a Site Reliability Specialist, you will be part of the Technical Operations (TechOps) department that has the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the infrastructure for the FedNow Program, as well as the transition to production support and operations. This team interfaces with internal stakeholders, customers for planning, delivery, and service management. It owns ongoing ITIL processes and the implementation and driving of continuous improvement initiatives. You will architect, implement, and leverage monitoring and tooling to be used for capacity planning, utilization reporting, and scaling. The ideal candidate is someone who loves building and maintaining reliable and scalable systems, CI/CD tooling, and automating cloud-based highly available high performing applications.

What You Will Do:
  • Provide technical functional expertise to the Architecture, Engineering, DevOps, and QA teams
  • Leverage SRE best practices - own responsibility for the availability and performance of the cloud infrastructure/platform
  • Focus on solving problems through software
  • Define SLIs/SLOs
  • Work with CI and CD tools, and source control such as GIT and SVN
  • Implement Performance monitoring and capacity management - detect and automatically resolve
  • Lead the team through continuous improvement of production operations
  • Offer technical support where needed and developing automation software to speed incident resolution
  • Building and maintaining tools, services, and automations associated with deployment and operations platform
  • Actively troubleshoot any issues that arise in production with the the goal of providing permanent fixes - conduct root cause analysis of problems to prevent future occurrence
  • Maintain effective knowledge base and documentation
  • Drive innovation and platform evolution - identify potential breakdowns and drive improvements
  • Automate our operational processes as needed, with accuracy, and compliant with security standards.
  • Be a champion of operational excellence
  • Develop and maintain health dashboards
  • Provide rotational on-call support

Qualifications:
  • Extensive knowledge and understanding of working in AWS environments & services
  • Familiarity with networking, security and cloud engineering concepts
  • Experience supporting infrastructure for large multi-services applications.
  • Proficiency in scripting languages.
  • Experience with Performance tools
  • Experience working with configuration management tools
  • Working knowledge of databases
  • Ability to develop and maintain environment documentation and support procedures
  • Knowledge of technology project and secure coding standards
  • SRE experience on the on-premise and cloud technologies
  • Experience with Terraform

Education And Experience Requirements
  • Bachelor's degree in computer science or computer engineering
  • Minimum 10 years of hands-on experience in application and technical support role
  • Minimum 5 years of SRE experience
  • Minimum of 5 years hybrid cloud infrastructure experience

Other Requirements and Considerations:
  • A requirement of this position is that the employee must be fully vaccinated against COVID-19; individuals who are unable to be vaccinated due to a medical condition or sincerely held religious belief may request an accommodation from the Bank.
  • Candidates should review the Bank's Employee Code of Conduct to ensure compliance with conflict of interest rules and personal investment restrictions.
  • If you need assistance or an accommodation due to a disability, please notify rich.recruitment@rich.frb.org .
  • Sponsorship is not available for this role. The selected candidate will be subject to a government security investigation and must meet eligibility requirements for access to classified information. Eligibility for this specific position requires U.S. Citizenship.
  • Salaries for this position range from $131,200 - 164,000. For candidates outside Richmond, VA, listed hiring and salary ranges may be adjusted upward based on your geographic location.
  • Salary offered will be based on the job responsibilities and the individual's knowledge, skills, and experience as defined in the job qualifications.

Full Time / Part Time Full time

Regular / Temporary Regular

Job Exempt (Yes / No) Yes

Job Category Information Technology

Work Shift First (United States of America)

The Federal Reserve Banks believe that diversity and inclusion among our employees is critical to our success as an organization, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

Privacy Notice