Vacancy expired!
Are you an engineer interested in new media, social activities and television? Paramount+ is developing the next generation of multimedia and cross-platform entertainment! The team at Paramount+ values software which exemplifies simplicity of design, maintainability and foundational robustness.
Role Details: Paramount Streaming seeks a Senior Site Reliability Engineer for our online television and media-focused web properties. In this role, you will support our Kubernetes platform that serves our streaming products in the cloud. Our team seeks to produce Observability infrastructure that's fast, self-healing, and operates at a global scale. We aim to produce a platform that is both opinionated to reliability best practices, while also providing best-in-class tooling for our engineering organization. This a great opportunity for a seasoned Site Reliability Engineer to build systems that have that global reach, and which impact millions of users. About You: You have a passion for data, and seek to quantify all things! You thrive on designing systems with an eye towards scale, self-healing, and automation as your guiding principles. You love the challenges of monitoring at large scales, and are compelled by problems of analysis, and large-scale data collection. You are at home with system-engineering challenges and service-based architecture. You have experience with being on-call, and seek-out ways to improve the on-call rotation for the team. You can plan project lifecycles and can evangelize best practices across teams. You are passionate about mentorship and seek to promote a culture of collaboration. Required Qualifications:- Experience with Thanos, ArgoCD, Kafka, and/or Kibana
- Provide support and guidance of the Observability platform, integrations, and best practices across multiple engineering teams.
- Build and manage Observability infrastructure for a global scale.
- Build self-healing and automated systems on Kubernetes.
- Design and build systems to collect, visualize, and store service health indicators.
- Design Observability tooling utilizing a hybrid of open-source and enterprise solutions.
- Additional other duties and responsibilities, as assigned.
- Implementing log collection and storage via Elasticsearch.
- Building visualizations for multiple services, utilizing different types of data sources.
- Working with Prometheus time-series data, producing metrics and integrations.
- Building and supporting robust event queues via Kafka.
- Work with our development teams to instrument their applications and capture events that support our global product deployment.
- Bachelor's degree or equivalent experience
- 5+ years managing and monitoring Linux systems
- 2+ years leading the design and implementation of Cloud systems in AWS/Google Cloud Platform using tools like Terraform, and Kubernetes.
- CI/CD tooling such as Jenkins.
- 4+ years? experience working with monitoring, logging, and visualization tooling, such as Prometheus, Elasticsearch, and Grafana.
- 2+ years? experience programming in a programming language such as Java, Python, Go
- On call experience
- Ability to manage the lifecycle of multiple projects
- Ability to collaborate across teams
- Experience mentoring junior engineers and writing onboarding documentation Pay: $82.00 - $95.00 per hour No Corp to Corp contracts.
- Apply Now
- ID: #49219124
- State: California Sanfrancisco 94105 Sanfrancisco USA
- City: Sanfrancisco
- Salary: USD TBD TBD
- Job type: Contract
- Showed: 2023-02-15
- Deadline: 2023-04-15
- Category: Et cetera