Data Engineer, python, data lake

18 Feb 2025

Vacancy expired!

We have been retained by our client in Houston, Texas to deliver a

Data Engineer on a long term

contract basis. This team is experiencing growth and this big data practice is evolving/improving quickly. We are seeking data engineering candidates who are in search for Python data framework design and development opportunities.

Low turnover at this company. Nice culture. This role is part of a multi-faceted, multi-disciplined 7-person team, where you are the data engineer talking with data scientists with models needing the right data streams. Your team is made part of a much larger big data practice. Your contributions, accomplishments are tied to specific, and measurable goals and results. The problems are real world problems, and the results are real world results. We seek a

data engineer to create python data frameworks.

The candidates we seek will have

3+ years of focused work experience programming data frameworks or full stack applications development with

Python and will have experience with

some of these:

Parquet, Delta Lake, Apache Iceberg, data lake, data lakehouse, datalake formats, Dremio

Python Pandas, Numpy, Pytest, Scikit-Learn, Tensorflow, Keras, Matplotlib, SpaCy, NLTK, Theano, Pytorch, Caffe, Caffe2

Apache Airflow, Kubernetes, Distributed File Systems, and Massively Parallel Processing (MPP)

PySpark, Apache Spark

big data analytics, machine learning (ML), Artificial Intelligence (AI)

You will be treated as a first class citizen/data engineer and as a valuable data engineering team player, as you provide valuable analytical and technical work including, delivering data streams needed by data scientists; a huge amount of sensor data to work with on this big data practice.

  • Design and implement reliable python data pipelines to integrate disparate data sources into a single Data Lakehouse or data lake
  • Design and implement data quality pipelines to ensure data correctness and building trusted data sets

  • Design and implement a Data Lakehouse solution to accurately reflect business operations

  • Assist with data platform performance tuning and physical data model design and support including partitioning and compaction or compression of data
  • Provide guidance in data visualizations and reporting efforts to ensure solutions are aligned to business objectives

We seek a

data engineer to create python data frameworks. This role will join our client s team to work on python data framework software design and development. This person will apply their analytical and technical strengths to this very collaborative python data engineering practice, and engineering teams. You will work with the engineering domain experts and data scientists needing your python data framework designs to deliver on the right data streams at the right time.

Requirements:

An articulate and collaborative candidate who seeks collaboration, data lake design and brings:

  • 3+ years of experience as a Data Engineer designing data pipeline architectures with Python, not necessarily only Python, but other languages, albeit 3 years of Python is required, heavier Python preferred.
  • A vast experience in SQL, any SQL, but any of ANSI SQL, PL/SQL, or TSQL, Transact-SQL, stored procedures (Oracle, or SQL Server)
  • Experience in various data integration patterns including ETL, ELT, Pub/Sub(publish/subscribe), and Change Data Capture
  • Experience in data management practices including data catalog, data lineage, and master data management
  • Experience in business analysis and defining business performance metrics
  • Experience in software development practices such as Software Design Principles and Software Design Patterns, Testing, CI/CD, and version control
  • Knowledgeable of common data visualization tools such as Power BI and Tibco Spotfire or Tableau or other
  • Experience in implementing any data lakes is a big plus; or any Data Lake design, Data Lakehouse, Data Lake Use Cases, Data Lake Formats, Dremio or other Apache Iceberg, or Delta Lake

There may be ways for you to contribute in a technical leadership fashion and there is definitely room for technological advancement within this big data engineering practice / department. Grow with the big data engineering practice and the entire organization as a whole, as s data engineer delivering on the right data streams at the right time. Long term contract.

Employment Type: Contract

hourly, Full-Time M-F, 40 hours per week, flex schedule, onsite Houston.

Hourly Rate:
$65 110 /hr

w2 employment

Location: Houston, Texas

(central Houston area)

Immigration: US citizens and those authorized to work in the US are encouraged to apply. We are unable to sponsor H1b candidates at this time. No third parties. No consulting firms. Principals only.

Please apply with resume.

Houston contract opportunity:

Data Engineer, python, data lake

;/a>

Call or text to inquire:

Please take a look at the interview scheduling link below and select a day and time on the calendar to discuss via Microsoft Teams web cam call or a telephone call.

This interview scheduling link can streamline finding a time to meet and discuss, and place a meeting on both our calendars:

;/a>

  • ID: #49276341
  • State: Texas Houston 77007 Houston USA
  • City: Houston
  • Salary: USD 65 - 110 /hour 65 - 110 /hour
  • Job type: Contract
  • Showed: 2023-02-18
  • Deadline: 2023-04-17
  • Category: Et cetera