Site Reliability Engineering (AWS)

24 Nov 2024

Vacancy expired!

Position: Site Reliability Engineering (AWS)

Location: San Jose, CA // Costa Mesa, CA // Allen, TX

Duration: 12+ Months

About the Role: We are seeking a Software Engineer for our SRE team to help us continuously improve how we build, monitor, secure, and run our rapidly growing cloud platform. Much of our software development focuses on building infrastructure and eliminating manual work through automation. On the SRE team, you will have the opportunity to use your coding expertise, system design thinking, and analytical skills to provide reliable cloud infrastructure and observability tools for the product development teams.

What You’ll Do Here
  • Build our platforms, systems and infrastructure using your solid expertise in coding.
  • Work closely with product development teams, providing hands-on engagement to develop, direct, and implement reliable, secure, and cost-effective cloud solutions.
  • Participate in a cross-departmental, cross-functional team to establish a cloud operational governance framework.
  • Handle service requests, which often involve routine grunt work, to assist other teams with platform services.
The following experience is mandatory for the Experian SRE role:
  • Strong Terraform experience, building modules in AWS
  • Automation (Bash/Python)
  • Python coding proficiency of at least 6 on a scale of 1 to 10. If the team asks the candidate to write code, they should be able to do so without hesitation
  • System admin experience and shell scripting
  • 3+ years of experience
What You’ll Need To Succeed

    Must Have Skills:
    • Deep understanding of Linux, networking, cloud design patterns, APIs, and security.
    • Solid professional coding experience with at least one scripting language, such as Shell or Python.
    • At least 3 years of experience working with AWS infrastructure services, with emphasis on IAM, networking, EC2, Lambda, S3, CloudWatch, CloudTrail, and security in general.
    • Strong knowledge and implementation history of Terraform, Packer, Ansible, Chef, Jenkins, or similar tooling.
    • Excellent knowledge of and working experience in implementing one or more observability platforms, such as Prometheus, InfluxDB, Dynatrace, Grafana, or Splunk, to collect telemetry data such as logs, metrics, and traces.

    Nice to have skills:
    • Previous experience with running containers (Docker/LXC) in a production environment using one of the container orchestration services (Kubernetes, Docker Swarm, AWS ECS, AWS EKS).
    • Experience with other public cloud platforms like Azure and Google Cloud Platform is a bonus.
    • Solid professional coding experience in at least one programming language, preferably Java.
    • Experience with big data platforms such as AWS EMR, Databricks, Cloudera, or Hortonworks.
    • Experience with open-source technologies such as Hadoop, Hive, Presto, Spark, or Airflow.

    • ID: #23384840
    • State: California Costa Mesa 92626 Costa Mesa USA
    • City: Costa Mesa
    • Salary: Depends on Experience
    • Job type: Contract
    • Showed: 2021-11-24
    • Deadline: 2022-01-21
    • Category: Et cetera