Performance Monitoring Support Engineer, St. Louis, MO

19 Nov 2024

Vacancy expired!

Performance Monitoring Support Engineer, St. Louis, MO 6+ Months Phone + Skype Senior Application Support Specialist / Performance Monitoring Specialist Manager Notes App Support experience MUST include experience supporting Java internal business applications

  • "What did the app they support do for the company they were at?
  • How did it effect the business if the app they supported went down"
  • "How visible was the app to the sr. level management".
App Dynamics and Splunk are required They must be an extra strong communicator. They must provide constant/consistent feedback to our development and test teams who depend on these environments to do their work / Able to convey information to leadership and technical audiences. Providing technical mentoring and day-to-day direction of technical staff responding to alarms, triaging issues, and escalating appropriately. Constantly striving to improve team response. Preferred Skills & Experience:
  • 4+ minimum years' experience in a support related role.
  • With some significant Linux support (minimum 1 year)
  • and application support both technically but also with knowledge of how the application supported works within the business.
  • Strong communication skills with the ability to capture and articulate technical and non-technical details.
  • Expertise using logging, ITOA, NPM, and APM products such as SolarWinds, Splunk, or AppDynamics.
  • Proficient with a wide array of data mining and messaging tools and techniques.
  • Experience using source control and deployment tools, such as Automic, Jenkins, Git, and/or Puppet is a plus.
Required Skills Advanced Java application troubleshooting including thread and heap analysis. Understanding complex multi-application environments, which interact in a firewalled environment. Splunk log analysis for application troubleshooting. Must be able to develop adhoc queries on the fly to pinpoint application issues. Needs to be able to articulate search parameters and commands. Simple usage of prewritten queries and dashboards is not sufficient. This is not Splunk administration like rolling out agents or administering Splunk, there is more to it. Usage of AppDynamics (or equivalent application performance monitoring (APM) tool, such as DynaTrace or New Relic) to troubleshoot and monitor application health and performance. This would include investigation of business transactions, information points, and health rules to pinpoint application problems. Comprehensive understanding of Application to Database interaction and troubleshooting. This would include Oracle and Mongo databases. Areas of understanding like JDBC connection utilization and troubleshooting, connection pool/query/cache optimization, ability to analyze DB reports like AWR and make recommendations. Supporting applications running in a Linux environment. Shell scripting for automation of administration tasks. Understanding of OS setting for app performance optimization, administration of Microservices, troubleshooting logging/forwarding issues. Plus Skills E-Commerce or Oracle Webcommerce or Akamai or HaProxy (working with heavy payment systems/transactions) is a HUGE plus! RESPONSIBILITIES Create and maintain data flow diagrams, access policies, and application catalog documentation. Maintain various system health monitoring and reporting tools. Write and maintain scripts for system maintenance and administration. Create reports, dashboards, and processes to monitor and analyze availability and performance metrics. Work with architects, technologists, and others for future system enhancements and scalability. Actively participate in decisions to develop and improve security-based standards and practices. Perform application changes, deployments, and DR tasks on a predefined schedule. Leverage CI/CD tools and DevOps principles. Perform advanced troubleshooting techniques in a high-pressure environment. Provide after-hours support on an "as needed" or "on call" basis. Serve as a technical SME for other internal and external groups. Document systems and procedures. Train others in the use of tools, functions, and techniques. Role: Quickly responds to incident alerts and support calls. Identifies, and coordinates with, teams critical to the event's resolution, sets up bridge calls, and escalates when necessary. Proactively monitors highly available applications/systems. Interprets and disseminates the information to a variety of audiences, including support teams and internal leadership. Performs key tasks to hot-fix and enhancement deployments to production and non-production environments. Executes automated and manual smoke tests following maintenance and outage recovery periods. Self-educates on new systems, applications, and tools, and requests guidance and training proactively, as needed, to ensure optimum job performance. Assists in elevating the performance of other support team members. More on the position / challenges Two teams 24-7 staffed. Handle 50% of all systems. Half of system exposed. Work with the day team, monitors and alerts who to go to. This team works remote. All with IT SOX group. Expire certificates is one of the biggest problems they deal with. One side is not updated in a timely manner. Network outages, network problem, cabling. When it does need to how to engage the right people. IT SOX are the ones who are expected to facilitate the bridges, they are also 24/7 staff, Day shift stronger than night shift. If someone in this is stronger than someone on the SOX team then they would step up and handle to problem themselves

  • ID: #23011188
  • State: Missouri Saintlouis 63104 Saintlouis USA
  • City: Saintlouis
  • Salary: USD TBD TBD
  • Job type: Contract
  • Showed: 2021-11-19
  • Deadline: 2022-01-18
  • Category: Et cetera