Vacancy expired!
Description
- Primary Technology: Python, PHP, AWS, Gremlin, Dynatrace, DataDog, Ansible, Puppet, Shell Scripting
- Secondary Technology (Nice to Have): resilience4j, Jaeger, Nagios, Prometheus, NewRelic, Google Cloud Platform, Azure
- Mandatory Skillset: Resiliency, Incident, Change and Observability Best Practices, Troubleshooting
- Nice to Have Skillset: Telemetry Best Practices

Individuals will be responsible for participating in the early phases of an application's lifecycle, starting from architecture and design, with a focus on resiliency, fault tolerance, failure scenarios, dependencies, observability and scalability. They will also be responsible for the setup, configuration and management of cloud resources and for operational processes such as vulnerability remediation, change management and repave activities. They will identify toil and find ways to eliminate it through automation. They will participate in the incident management process as an escalation point for major incidents. They will proactively identify improvements to the observability, resiliency, fault tolerance and availability of the applications they support, with a focus on self-healing, auto-failover, elasticity and AIOps.
- ID: #49511889
- State: New Jersey, Jersey City 07097, USA
- City: Jersey City
- Salary: $60,000 - $120,000
- Job type: Permanent
- Posted: 2023-03-21
- Deadline: 2023-05-13
- Category: Et cetera