Site Reliability Engineer or Sr. Site Reliability Engineer - T2C

01 Jul 2024

Vacancy expired!

Company: Federal Reserve Bank of Cleveland

All employees must be fully vaccinated against COVID-19 which includes receiving a COVID-19 vaccine booster or qualifying for an accommodation from the Bank's COVID-19 Vaccination Policy; individuals who are unable to be vaccinated due to a medical condition or sincerely held religious belief may request an accommodation from the Bank.

To be considered for this position, candidates must be U.S. citizens.

Essential Accountabilities
  • Ensures that Treasury application services are highly available, reliable, and performant through monitoring and alerting.
  • Serves as the primary subject matter expert for Treasury application services, both proactively preventing and reactively troubleshooting and mitigating service availability and performance issues.
  • Develops tools to improve our ability to effectively monitor application services in a large-scale and complex environment. Evaluates and suggests improvements to existing tools and monitoring thresholds.
  • Provides technical assistance and operational guidelines for business operations and application development to ensure applications are running optimally in production, test, and development environments.
  • Designs, implements, and maintains SRE dashboards, bots, and other automation based on current operational needs and release changes. Evaluates and suggests improvements to the dashboards, bots, and other automation.
  • Identifies repetitive, manual, and scalable tasks and potentially automates them using scripting/programming languages or tools.
  • Identifies key operational metrics and follows through by defining and designing methods to programmatically capture the data necessary to create them.
  • Functions as the subject matter expert for coordinating and managing the deployment process and support of the full lifecycle of applications in the data center and Amazon Web Services.
  • Understands and evaluates current application release changes to identify any needed additions or modifications to the current SRE program.
  • Serves as a technical resource to internal and external IT groups. May provide subject matter expertise for third-party products and utilities used to support enterprise-wide applications.
  • Consults with developers on issues related to the impact of development on the infrastructure, works with system engineers and developers to define server configuration settings, leads the migration of code through staging environments to production, and provides assistance to software quality assurance technicians during system acceptance testing.
  • Works directly with users, such as technicians at federal agencies, to resolve technical issues with third-party interfacing systems.
  • Influences new application and infrastructure designs and architectures, and creates standards and guidelines for large-scale distributed systems with a focus on operability.
  • Provides rotational on-call support.
  • Performs other duties as assigned or requested.
  • Adheres to the Bank's attendance policies through regular and prompt attendance.

Problem Solving Skills
  • Logical analysis: Requires thinking through and solving problems step by step, completing root cause analysis, often looking beyond the obvious solution to problems and digging deeper for the best solutions.
  • Requires following vaguely defined procedures. Decisions are consistently made within reason and affect the work group or department.
  • Working in a group environment: Requires working as part of a group to solve issues and problems.
  • Technical and business expertise: Ability to develop the expertise necessary to work with business stakeholders and customers to provide 24/7 functional and technical application support of multiple complex financial business application systems, processes, and system-to-system interfaces in crisis situations.

Qualifications
  • Bachelor's degree and 3+ years of experience OR Associate degree and/or Technical Bootcamp Certificate with 5+ years of experience for Site Reliability Engineer
  • Bachelor's degree and 5+ years of experience OR Associate degree and/or Technical Bootcamp Certificate with 7+ years of experience for Sr. Site Reliability Engineer
  • Ability to read, comprehend, and create complex technical documentation.
  • Ability to comprehend business operational requirements.
  • Demonstrated ability to perform complex technical analysis and communicate it to technical and non-technical audiences.
  • Strong verbal and written communication skills. Ability to articulate clear and concise instructions and resolutions.
  • Excellent problem-solving, organizational, and analytical skills.

Knowledge Areas Preferred
  • AWS, Azure, or Google Cloud Platform technologies, infrastructure, and practices, including Production environments (Production Required)
  • Traditional and Cloud infrastructure components and techniques in Production and Lower environments, including virtualization, elasticity, networking, and load balancing
  • Development, QA, and Production deployment patterns and version control (e.g., zero downtime, blue/green deployments, canary releases)
  • Cloud Operating Console commands, administration, and configuration
  • Experience in coding languages, such as Python, .NET, Java, Terraform, TypeScript
  • GitHub, Docker, Splunk, Grafana, AWS CloudWatch, AWS Lambda, AWS X-Ray
  • CI/CD software deployment & configuration using Ansible/Nexus/Bamboo/GitLab
  • Understanding of Agile and DevOps practices

Full Time / Part Time: Full time

Regular / Temporary: Regular

Job Exempt (Yes / No): Yes

Job Category: Information Technology

Work Shift: First (United States of America)

The Federal Reserve Banks believe that diversity and inclusion among our employees is critical to our success as an organization, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

  • ID: #43748477
  • State: Ohio (Cleveland, 44102, USA)
  • City: Cleveland
  • Salary: TBD (USD)
  • Job type: Permanent
  • Showed: 2022-07-01
  • Deadline: 2022-08-29
  • Category: Et cetera