Senior Site Reliability Engineer

10 May 2024

Vacancy expired!

Job Description

The specific responsibilities of an SRE managing a large, distributed application built on microservices, Spring Boot, and Google Cloud may include:
  • Strong background in software development and systems administration, as well as excellent problem-solving and communication skills.
  • Running the production environment by monitoring availability and taking a holistic view of system health.
  • Developing, improving, and operating the deployment and orchestration of a complex distributed system.
  • Improving the reliability, quality, and time-to-market of our suite of software solutions.
  • Collaborating with development teams to design, build, and operate scalable and resilient software systems.
  • Automating deployment, monitoring, and incident response processes.
  • Performing root cause analysis of production incidents and implementing preventive measures.
  • Conducting performance analysis and optimization of the system.
  • Implementing and maintaining disaster recovery processes.
  • Participating in an on-call rotation for incident response and support.

Qualifications
  • Four-year college degree in Computer Science or equivalent.
  • 4+ years' experience developing multi-tier applications with Java, J2EE, NoSQL/SQL datastores, Spring Boot, Google Cloud Platform/AWS/Azure, and Docker/Kubernetes.
  • Programming skills (Perl, Python, Ruby, Java/Scala or C).
  • Experience with RESTful APIs and microservices platforms is a must.
  • Working knowledge of the TCP/IP stack, internet routing, and load balancing.
  • 2-3 years of experience with APM or other monitoring tools such as Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog, PagerDuty.
  • Experience working with product and development teams to establish error budgets by identifying the right SLOs (service level objectives), SLIs (service level indicators), and KPIs (key performance indicators), and effectively driving the use of the budget to ensure maximum domain availability/uptime.
  • Debug production issues across services and levels of the stack.
  • Thorough understanding of the software development cycle and agile programming environments.
  • Architect, design, and develop automation to reduce toil and improve the recoverability, availability, latency, and scalability of supported applications.
  • Triage, analyze, and provide solutions to critical and high-priority technical issues occurring in the ecosystem; optimize incident management processes.
  • Respond, react, and communicate as per the ITSM incident management process. This process involves detection of the incident, timely communication to leadership during the life of the incident, and service restoration, followed by root cause analysis to prevent the incident from recurring.
  • Practice destructive testing to discover vulnerabilities in environments powered by distributed software systems.

What you'll receive in return: As part of the Ford family, you'll enjoy excellent compensation and a comprehensive benefits package that includes generous PTO, retirement, savings and stock investment plans, incentive compensation and much more. You'll also experience exciting opportunities for professional and personal growth and recognition.

Candidates for positions with Ford Motor Company must be legally authorized to work in the United States permanently. Verification of employment eligibility will be required at the time of hire. Visa sponsorship is available for this position.

We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status.

For information on Ford's salary and benefits, please visit:

At Ford, the health and safety of our employees is our top priority. Vaccination has been proven to play a critical role in combating COVID-19. As a result, Ford has made the decision to require U.S. salaried employees to be fully vaccinated against COVID-19, unless employees require an accommodation for religious or medical reasons. Being fully vaccinated means that an individual is at least two weeks past their final dose of an authorized COVID-19 vaccine regimen. As a condition of employment, newly hired employees will be required to provide proof of their COVID-19 vaccination or an approved medical or religious exemption.

  • ID: #49902470
  • State: Michigan Dearborn 48126 Dearborn USA
  • City: Dearborn
  • Salary: TBD (USD)
  • Job type: Permanent
  • Showed: 2023-05-10
  • Deadline: 2023-07-09
  • Category: Et cetera