Site Reliability Engineer - Raleigh, NC

02 Jul 2024

Vacancy expired!

Title: Site Reliability Engineer

Length of the project: 10 months (can be converted to full-time)

Location: Raleigh, NC (Remote)

Job Description:
  • Skills equivalent to 6-10 years in a comparable position.
  • We are looking to hire a Senior Site Reliability Engineer who will be responsible for supporting production.
  • Other responsibilities include developing automated solutions to address common challenges and reduce manual churn, and supporting provisioning, infrastructure, and the needs of technology and business partners.
  • Our ideal candidate is a confident self-starter, loves identifying and solving problems, communicates well, and is passionate about cloud services, Web services, and remote administration.
  • We are improving the availability and performance of our client’s experience with a variety of solutions using .NET (Core, C#, PowerShell), Splunk, Microservice Architecture, Pivotal Cloud Foundry (aka Tanzu), Aerospike, MongoDB and Google Cloud Platform. It is an exciting time to join our team and be part of this new system development opportunity.

What you’ll do:
  • Evangelize SRE mindset and solve problems through systematization.
  • Support production environment and keep our shared environments available for customers.
  • Triage alerts, diagnose and resolve critical issues, and handle implementation of changes
  • Troubleshoot critical application workflows in real time and incorporate feedback into product development
  • Perform hands-on enterprise systems administration, monitoring, and deployment activities
  • Identify opportunities to build innovative tools and solve unique operations problems for large enterprise, mission-critical applications
  • Build scripts to automate operational tasks and incorporate the solutions into infrastructure
  • Coordinate and collaborate with release management on infrastructure and other critical changes.
  • Develop and support automation and processes to enable teams to deploy, manage, configure, test, and monitor their applications
  • Create and review documentation and processes for recurring issues, new standard operating procedures, knowledge transfer material, etc.
  • Collaborate with Engineering, Scrum and Ops resources to provide technical expertise and support on key initiatives for system availability and reliability.
  • Bring a passion to stay on top of tech trends, experiment with and learn new technologies, participate in internal technology communities, and mentor other members of the team
  • Review programming and environment changes and raise awareness of potential impacts

REQUIRED
  • 6 - 10 years of experience troubleshooting and supporting .NET/.NET Core production applications
  • Experience in Cloud application configuration, deployment, support and migration – Google Cloud Platform/PCF (Tanzu) is a plus
  • Familiarity with large scale distributed systems and high-availability architectures

What you have:

Required Skills:
  • 6 - 10 years of experience with enterprise level administration and support
  • Solid experience with the PowerShell scripting language and Windows administration
  • Experience with Windows Server and IIS web server administration
  • Experience with and knowledge of NoSQL database systems – Aerospike is a plus
  • Familiarity with logging/application monitoring tools (AppDynamics, Splunk, Zabbix, Nagios, etc.)
  • Knowledge of one or more message brokers, such as Kafka or RabbitMQ
  • SaltStack or similar experience (for example, Ansible) is a plus
  • Flexibility to operate in an environment with changing demands and priorities
  • Ability to effectively engage subject matter experts and understand technical topics

  • ID: #43765085
  • State: North Carolina, Raleigh / Durham / CH 27601, USA
  • City: Raleigh / Durham / CH
  • Salary: Depends on Experience
  • Job type: Contract
  • Showed: 2022-07-02
  • Deadline: 2022-08-28
  • Category: Et cetera