Site Reliability Engineer (SRE): Hiring at all levels!

04 Jul 2024

Vacancy expired!

"We make magic." That's our motto at Walt Disney Parks and Resorts. And it permeates everything we do. At Disney, you will help inspire that magic by allowing our teams to push the limits of entertainment and create the never-before-seen!

Do you want to be part of a team that creates magic for millions of guests? Behind the scenes, the Retail Technology Operations team helps provide magical digital and physical experiences applying the latest technology; and our Site Reliability Engineers provide expert engineering services in the cloud, automation, and reliability to support the innovation and operation of The Walt Disney Company. We are passionate about ensuring our systems provide the best guest experience! You will protect and improve the automation and systems that run Disney's experiences and services with a focus on availability, latency, and automation while embracing a DevOps culture.

Objectives of this Role
  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for multiple large distributed software applications

Responsibilities : Daily and Monthly Responsibilities
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service level objectives

Basic Qualifications : Required Skills and Qualifications
  • UNIX/Linux administration, troubleshooting, performance tuning, & security
  • Strong technical knowledge of digital environment full stack including Mobile, Web, APIs, Messaging, Databases, Networks and their interaction
  • Understanding of observability principles (monitoring, logging, tracing, alerting), tools and practices that promote observability
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Perl, Ruby, Java, Go, Rust, C/C
  • Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)
  • Experience with continuous integration tools (e.g.Gitlab, AWS CodeBuild, CodeDeploy, CodePipeline, Azure DevOps) (3 years)
  • Trouble-shooting skills that span systems, network, and code
  • Configuration management and orchestration (e.g. Terraform, Cloud Formation, Ansible, Chef)
  • Experience with container technologies (i.e. Docker, Kubernetes)
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks

Required Education :
  • Bachelor's degree in computer science (or other highly technical, scientific discipline) or related work experience.

Additional Information : #DISNEYTECH #LI-AF2

  • ID: #43817846
  • State: Florida Orlando 32801 Orlando USA
  • City: Orlando
  • Salary: USD TBD TBD
  • Job type: Permanent
  • Showed: 2022-07-04
  • Deadline: 2022-09-01
  • Category: Et cetera