Site Reliability Engineer

14 Oct 2024

Vacancy expired!

This is a hybrid work model with 3 days onsite and 2 days working from home.

Our client is seeking multiple Site Reliability Engineers with 2 to 8 years of solid experience in Java and production support.

JOB SUMMARY
At COMPANY, the SRE RunOps and Monitoring team is looking for individuals who can join the team and have an immediate impact. COMPANY uses many cutting-edge processes, and we need someone who is willing and eager to learn. As an SRE team member you will be the front line for all issues related to the platform and its connected devices. This role gives you the opportunity to learn about IoT and serverless technology stacks and to broaden your expertise in a new and growing area. As an SRE you will interface with all the other technical teams and work closely with them to troubleshoot issues that arise. Clear, concise verbal and written communication is key to success in this role.

KEY RESPONSIBILITIES AND DUTIES
· Handle incoming calls and tickets to support COMPANY stores.
· Work in a fast-paced 24x7 role that requires rotating shifts (a night shift and a weekend shift every 1-2 months).
· Monitor our dashboards, reports and alerts to ensure the highest availability.
· Work closely with other SRE members to improve our alerting and dashboards.

“MUST HAVE” SPECIFIC KNOWLEDGE AND SKILLS
· Exceptional verbal and written communication skills
· Strong technical troubleshooting skills
· Bachelor's degree or equivalent work experience
· 5+ years of experience building complex distributed systems
· 2+ years of experience managing public cloud-based infrastructure (AWS or Azure)
· 3+ years of experience running and/or managing large infrastructure services across multiple availability regions in a public cloud (AWS, Google Cloud Platform, Azure)
· Ability to work cross-functionally with the various teams in the organization, help establish SLOs, and help teams consistently achieve those SLOs
· Working experience with IoT devices
· Experience with RCAs, monitoring and alarming in all environments, and familiarity with tools like Datadog, New Relic, Dynatrace, Splunk and the ELK stack
· Experience leading and participating in Scrum/Kanban and Agile workflows, and using JIRA, Confluence and OneDrive
· Ability to develop validations for end-to-end system verification and configuration management
· Experience managing IoT devices
· Experience using Postman

ADDITIONAL SKILLS AND OTHER REQUIREMENTS
· Experience managing IoT devices in Microsoft Intune
· DevOps experience
· Knowledge of the Google SRE handbook
· Experience working with Kibana, New Relic, CloudFront and other monitoring tools

  • ID: #46449370
  • State: Texas, Dallas / Fort Worth 75201, USA
  • City: Dallas / Fort Worth
  • Salary: $50 - $75
  • Job type: Contract
  • Showed: 2022-10-14
  • Deadline: 2022-12-11
  • Category: Et cetera