Site Reliability Engineer (SRE) Hiring at all levels job vacancy

Vacancy expired!

"We make magic." That's our motto at Walt Disney Parks and Resorts. And it permeates everything we do. At Disney, you will help inspire that magic by allowing our teams to push the limits of entertainment and create the never-before-seen!

Do you want to be part of a team that creates magic for millions of guests? Behind the scenes, the Retail Technology Operations team helps provide magical digital and physical experiences applying the latest technology; and our Site Reliability Engineers provide expert engineering services in the cloud, automation, and reliability to support the innovation and operation of The Walt Disney Company. We are passionate about ensuring our systems provide the best guest experience! You will protect and improve the automation and systems that run Disney's experiences and services with a focus on availability, latency, and automation while embracing a DevOps culture.

Objectives of this Role

Run the production environment by monitoring availability and taking a holistic view of system health
Build software and systems to manage platform infrastructure and applications
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Provide primary operational support and engineering for multiple large distributed software applications

Responsibilities : Daily and Monthly Responsibilities

Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
Partner with development teams to improve services through rigorous testing and release procedures
Participate in system design consulting, platform management, and capacity planning
Create sustainable systems and services through automation and uplifts
Balance feature development speed and reliability with well-defined service level objectives

Basic Qualifications : Required Skills and Qualifications

UNIX/Linux administration, troubleshooting, performance tuning, & security
Strong technical knowledge of digital environment full stack including Mobile, Web, APIs, Messaging, Databases, Networks and their interaction
Understanding of observability principles (monitoring, logging, tracing, alerting), tools and practices that promote observability
Ability to program (structured and OO) with one or more high level languages, such as Python, Perl, Ruby, Java, Go, Rust, C/C
Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)
Experience with continuous integration tools (e.g.Gitlab, AWS CodeBuild, CodeDeploy, CodePipeline, Azure DevOps) (3 years)
Trouble-shooting skills that span systems, network, and code
Configuration management and orchestration (e.g. Terraform, Cloud Formation, Ansible, Chef)
Experience with container technologies (i.e. Docker, Kubernetes)
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks

Required Education :

Bachelor's degree in computer science (or other highly technical, scientific discipline) or related work experience.

Additional Information : #DISNEYTECH #LI-AF2

ID: #43817846
State: Florida Orlando 32801 Orlando USA
City: Orlando
Salary: USD TBD TBD
Job type: Permanent
Showed: 2022-07-04
Deadline: 2022-09-01
Category: Et cetera

Site Reliability Engineer (SRE): Hiring at all levels!