ETL data engineer opportunity to work with a Digital healthcare company(100% remote considered)

25 May 2024

Vacancy expired!

ETL Data EngineerFull-time (100% remote considered)Office located in San Francisco, CA Our client is a growing healthcare technology company with a mission to ensure that patients are on the safest and most effective medication regimens. The RoleThe ETL Data Engineer will be responsible for various data pipelines and data ingesting and integration platform. The role will work in close collaboration with technical stakeholders to optimize processes to ingest large enterprise healthcare datasets and build novel healthcare data pipelines. This position will support, maintain, and develop software using a variety of different tools, including AWS stacks, Python, Pandas and Pyspark. ResponsibilitiesParticipate in all aspects of company data platform, which include:Writing production level Python and Pyspark for ETL pipelinesData processing, validation, cleaning, and debuggingUsing AWS services and technologies for application deployments to data APIs Qualifications 5-7 years of experience in building end-to-end data pipelinesProven on the job experience with building data pipelines using Python, PySparkExperience with optimizing Spark clusters and Spark jobsExperience with AWS or other cloud providers (Azure, Google Cloud Platform)Experience in developing and maintaining data pipelines in production environmentsExperience with writing native Python scripts Nice to HaveExperience processing EHR and healthcare claims dataExperience in AWS EMR, Glue, Athena

  • ID: #41540746
  • State: California Sanfrancisco 94101 Sanfrancisco USA
  • City: Sanfrancisco
  • Salary: Depends on Experience
  • Job type: Permanent
  • Showed: 2022-05-25
  • Deadline: 2022-07-12
  • Category: Et cetera