Description

We are seeking a Site Reliability Engineer to join our team. In this role, you will be a key contributor in enhancing system reliability and automation processes to deliver operational insights via analytics. You will work closely with DevOps and application development teams to ensure the highest level of availability, reliability, and security of our products and services.

Responsibilities:

- Design and implement highly available, scalable, and fault-tolerant infrastructure.
- Collaborate with engineering teams to define and implement reliability standards and best practices.
- Automate infrastructure provisioning, configuration, and deployment processes to streamline operations.
- Work with software engineers to design and implement deployment strategies using automated continuous integration and delivery pipelines.
- Monitor system performance and proactively identify potential issues to ensure uptime and optimal performance.
- Collaborate with software engineering teams to improve system reliability through automated testing, fault tolerance, and disaster recovery planning.
- Lead incident management efforts, overseeing response processes and coordinating with cross-functional teams.
- Design and implement incident response playbooks and escalation procedures for timely and effective resolution.
- Conduct post-incident reviews to identify root causes and implement preventative measures.
- Develop and implement robust observability solutions to gain deeper insights into system performance.

Requirements

- Proficient in Continuous Integration / Continuous Delivery (CI/CD)
- Strong knowledge of Python programming language
- Experience with Infrastructure as Code
- Familiarity with Computer Security Incident Response Team operations
- Solid understanding of Disaster Recovery strategies
- Proficiency in using Ansible for configuration management
- Experience with Splunk for log management and analysis
- Ability to use Grafana for data visualization
- Practical knowledge of Terraform for infrastructure management
- Understanding of DevOps methodologies
- Experience in DevOps Engineering and using DevOps Tools
- Ability to work collaboratively in a team and independently
- Excellent problem-solving skills and attention to detail
- Strong verbal and written communication skills
- Bachelor’s degree in Computer Science or a related field, or equivalent work experience
- Relevant industry certifications would be a plus.

Technology Doesn't Change the World, People Do.®

Robert Half is the world’s first and largest specialized talent solutions firm that connects highly qualified job seekers to opportunities at great companies. We offer contract, temporary and permanent placement solutions for finance and accounting, technology, marketing and creative, legal, and administrative and customer support roles.

Robert Half works to put you in the best position to succeed. We provide access to top jobs, competitive compensation and benefits, and free online training. Stay on top of every opportunity - whenever you choose - even on the go. Download the Robert Half app (https://www.roberthalf.com/us/en/mobile-app) and get 1-tap apply, notifications of AI-matched jobs, and much more.

All applicants applying for U.S. job openings must be legally authorized to work in the United States.
Benefits are available to contract/temporary professionals, including medical, vision, dental, and life and disability insurance. Hired contract/temporary professionals are also eligible to enroll in our company 401(k) plan. Visit roberthalf.gobenefits.net for more information.

© 2024 Robert Half. An Equal Opportunity Employer. M/F/Disability/Veterans. By clicking “Apply Now,” you’re agreeing to Robert Half’s Terms of Use (https://www.roberthalf.com/us/en/terms).
Full-time