Cloud Data Platform Solution Architect - 2402

02 Dec 2024

Vacancy expired!

Cloud Data Platform Solution Architect - 100% REMOTE

Job Profile Summary
The ideal candidate will have industry-leading programming skills and established knowledge of implementing, designing, deploying, and maintaining big data analytics platforms in a cloud environment. The solution architect will use knowledge of healthcare data to influence the implementation and governance of our data architecture. Your designs will account for data movement, storage, compute and BI consumption, and will comply with security and data governance standards. You will work closely with a variety of partners within the Data and Analytics organization.

Essential Job Functions
  • Provide thought leadership and technical direction to the data engineering team in building analytic data products
  • Understand business requirements and translate them into data strategies that align with the overall technology vision
  • Design, develop and enforce standards for data storage, processing and governance across all environments
  • Work closely with enterprise and application architects to align the data engineering team to the overall company SDLC standards, practices, and data access patterns
  • Develop, document and maintain an overall view of data platform architecture, data acquisition, data quality, and data retention
  • Provide formal and informal training for data engineers, platform engineers and ETL developers
  • Maintain knowledge of emerging technologies and architectures
  • Document and publish technical principles and standards, and mentor the data engineering team to incorporate them into their daily practices
  • Champion and present the technical vision to the executive team and business stakeholders

Basic Qualifications
  • Bachelor's degree with a preferred area of study in information technology, computer science, computer engineering or related fields
  • 8+ years of overall experience in big data, database and enterprise data architecture and delivery
  • 8+ years of programming proficiency in one or more of Python, Java, and Scala
  • 5+ years of hands-on experience building solutions on distributed processing frameworks such as Spark, Hadoop or Databricks
  • 5+ years of experience architecting, developing, releasing, and maintaining enterprise data lake platforms
  • 3+ years of experience implementing cloud-based systems; AWS and/or Databricks preferred
  • Strong SQL skills to create and maintain database objects and to query and load required data, plus experience using data governance tools (e.g. business glossary, data dictionary, data catalog, data quality, master data management) and visualization tools to bring data literacy to the organization
  • Practical experience with workload management, monitoring, and performance tuning of Apache Spark jobs
  • Broad knowledge of data technologies, tools, and disciplines including data modeling, dimensional models, third-normal-form structures, ETL/ELT, change data capture and slowly changing dimensions
  • Experience with healthcare data is a big plus
  • Experience with Machine Learning and MLOps is a big plus

Other Qualifications
  • Strong presentation and written communication skills
  • Ability to develop professional relationships
  • Ability to be a high-impact player on multiple simultaneous engagements

  • ID: #23741801
  • State: Texas (Irving, 75014, USA)
  • City: Irving
  • Salary: TBD (USD)
  • Job type: Permanent
  • Posted: 2021-12-02
  • Deadline: 2022-01-30
  • Category: Architect/engineer/CAD