Vacancy expired!
Company Federal Reserve Bank of Richmond
The Richmond Fed is the proud home of the Federal Reserve's National IT organization-a nationwide team delivering technology solutions and support across the Federal Reserve System. Many National IT employees are located in Richmond, while others are based across the U.S. at other Federal locations.When you join our team, you'll become part of a culture that welcomes differences, cares about our communities, and empowers each other to lead from where we are to make things better. Bring your passion and we'll provide challenging and purposeful careers in a variety of fields, opportunities to grow and a wide range of benefits and perks that support your health and wealth. It's all part of what makes #MyRichmondFed a great place to work!About the Opportunity This position requires U.S. Citizenship.As a Site Reliability Engineering Leader, you will have the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the infrastructure for the FedNow Program as well as the transition to production support and operations. The team is accountable for ensuring that we have the right processes, people, and tools in place to keep our environment running 24x7. You will lead a team of Site Reliability, Infra-Ops, & Sec-Ops engineers who are focused on SLI/SLO development, Automation, Toil Elimination, Incident Response/Resolution, Monitoring Implementation, Addressing Vulnerabilities, and Resiliency and Chaos Testing. The ideal candidate is someone who has led and is very passionate about building and maintaining reliable and scalable systems, CI/CD tooling, and automating cloud-based highly available platforms and applications. You will own the responsibility for the availability and performance of the cloud infrastructure/platformWhat You Will Do:- Create SRE strategy and jointly develop roadmaps with the Product Owners.
- Collaborate with Architects and Engineers on work related to the operational health, security, performance, and observability.
- Educate and mature the team on SRE best practices and leverage the same.
- Drive the operational and observability capabilities
- Drive innovation and platform evolution - identify potential breakdowns and drive improvements
- Automate our operational processes as needed, with accuracy, and compliant with security standards
- Provide thought leadership and be a champion of operational excellence
- Bachelor's degree in computer science or computer engineering
- Minimum 10 years of hands-on experience in application and technical support role
- Minimum of 5 years hybrid cloud infrastructure experience
- Minimum 3 years of SRE leader experience including establishment and maturing of the practice.
- Extensive knowledge and understanding of working in AWS environments & services
- Familiarity with basic networking, security and cloud engineering concepts
- Experience supporting infrastructure for large applications.
- Experience with SAFe Agile
- Experience with Performance tools, Jira, Remedy, & ServiceNow
- Infrastructure as Code experience (i.e., Terraform/CloudFormation)
- Code versioning experience (i.e., GitLab)
- Experience with Live Production Deployment
- Great medical benefits
- Pension and 401(k) with employer match
- Paid time off
- Tuition reimbursement
- Employee resource networks
- Paid volunteer leave
- Flexible work options
- Onsite amenities that make working here fun
- Candidates should review the Bank's Employee Code of Conduct to ensure compliance with conflict of interest rules and personal investment restrictions.
- Sponsorship is not available for this role. Selected candidate is subject to special background check procedures.
- The hiring range for this position in the Richmond market is $166,500 - 200,000. For locations outside of Richmond, adjustments may be made to account for market costs.