Site Reliability Engineer / SRE

02 Jul 2024

Vacancy expired!

Site Reliability Engineer / SRE

Contract (Hybrid 2 days a week onsite)Locations: NJ / New York, NY / Wilmington, DE / Columbus, OH

Responsibilities:
  • Design, code, test, and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Coach or manage teams as applicable
  • Participate in the 24x7 support coverage as needed

Qualifications:
  • Bachelor’s degree or equivalent experience in an software engineering discipline
  • Expertise in at least one technology stack designing, coding, testing, and delivering software – Java Stack, Kubernetes, GAIA, Microservices, Oracle, Kafka
  • Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
  • Excellent debugging and trouble shooting skills
  • Software Engineering background with a focus on Systems Engineering.
  • Some of the SRE tools we need are Splunk, Dynatrace, Prometheus, Grafana.
  • Scripting - Ansible, Puppet, Chef

  • ID: #43773058
  • State: Texas Plano 75023 Plano USA
  • City: Plano
  • Salary: Depends on Experience
  • Job type: Contract
  • Showed: 2022-07-02
  • Deadline: 2022-08-30
  • Category: Software/QA/DBA/etc