Reliability Engineer

13 Jan 2025

Vacancy expired!

Must Have Skills:
  • Software Development (understand how to read and write code)
  • Troubleshooting Skills
  • Scripting Knowledge

Nice to Have Skills:
  • Previous experience with Splunk, Dynatrace, or network tools

Job Description:
The Enterprise Monitoring team offers robust event management, correlation, automation and runbook solutions to help MasterCard deliver continuous uptime of business services and applications and optimize operations costs and efficiency.

This individual will work closely with technology support teams and application teams to build monitoring and automation solutions that improve application and infrastructure availability.

Key questions for viable candidates:
  1. Have you resolved a complex application availability issue with monitoring and automation?
  2. Are you a self-starter who works with minimal direction and guidance?
  3. Have you ever been part of a team with diverse skills and experience located in different geographical locations/time zones?

Role
  • Represent the Enterprise Monitoring team in project meetings and provide advice, status, training and technical support
  • Work with our customers to understand the monitoring and automation requirements and implement solutions using available tool sets and scripting languages
  • Administer, support and maintain enterprise monitoring tools in a multi-tier environment
  • Build automation solutions using available tool sets and scripting languages
  • Maintain design and support documents for all built solutions and processes
  • Troubleshoot networking, Unix/Linux systems, and applications to identify and correct malfunctions and other operational problems using the associated Linux and UNIX command line and management tools
  • On-call administration and tools support
  • As new technologies emerge and impact our environment, learn these technologies very quickly and resolve any problems involved in integrating them
  • Maintain a broad knowledge of state-of-the-art technology, equipment, and/or systems
  • Self-driven and flexible, with the initiative to learn in adjacent areas
  • Thorough, adhering to critical processes even under stress
  • Support business disaster recovery procedures for assigned areas of responsibility
  • Accurately document duties and procedures to aid the department in cross-training and absentee coverage
  • Work with technical engineering teams to manage and improve processes
  • Ability to solve problems quickly and completely
  • Ability to identify tasks that should be automated and then develop and implement automation

All About You
  • Advanced user level expertise in UNIX and/or Red Hat Linux
  • Networking experience from basics to advanced, along with security knowledge
  • Proficient with scripting or programming languages such as SQL, Perl and shell scripting
  • Experience with enterprise systems management/monitoring tools such as IBM Tivoli products, Microsoft System Center Operations Manager, Zabbix, Nagios, etc. is highly desirable
  • Experience developing web applications on a Linux/Apache/MySQL/PHP stack is a strong plus
  • Strong analytical, troubleshooting and problem solving skills
  • Ability to manage multiple projects simultaneously under pressure without direct supervision
  • Ability to manage multiple activities and work with a strong sense of urgency
  • Ability and motivation to learn new technologies quickly and with minimal support and guidance
  • Evening, weekend and shift on-call required to meet deadlines and correct system failures or for patch upgrades
  • Strong people skills and the ability to understand business needs and translate them into technical solutions
  • Excellent verbal and written skills, organization, project prioritization, and time management skills

  • ID: #48531223
  • State: Missouri, St. Louis 63011, USA
  • City: St. Louis
  • Salary: Depends on Experience
  • Job type: Contract
  • Showed: 2023-01-13
  • Deadline: 2023-03-05
  • Category: Et cetera