Projects with this topic
DataRider block for ETL streaming with Spark and Scala
Commercial dashboard built from scratch, with relational modeling in DBeaver (SQL), a transformation pipeline in Power Query, and interactive visualizations in Power BI. A portfolio study project.
Configuration and data workflows for an Apache Airflow instance serving the DDRplatform
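The repository itself isn't shown here, but a minimal sketch of the kind of DAG such an instance would run might look like the following; the DAG id, schedule, and task callables are illustrative assumptions, not taken from the project.

```python
# Minimal illustrative Airflow DAG; names and schedule are assumptions.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract():
    # Placeholder extract step; the real project defines its own workflows.
    print("extracting")


def load():
    print("loading")


with DAG(
    dag_id="ddr_example_etl",  # hypothetical DAG id
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)
    extract_task >> load_task
```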
Python async video metadata processing pipeline with multi-source ingestion and ETL transforms.
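As a sketch of the pattern rather than the project's actual code, a multi-source async pipeline in Python typically fans sources into an asyncio.Queue and drains it with worker tasks; all names below are hypothetical.

```python
# Illustrative asyncio fan-in pipeline; sources and transform are made up.
import asyncio


async def source(name: str, queue: asyncio.Queue) -> None:
    # Stand-in for one ingestion source (e.g. an API or a filesystem walker).
    for i in range(3):
        await queue.put({"source": name, "id": i})
    await queue.put(None)  # per-source end-of-stream marker


async def worker(queue: asyncio.Queue, n_sources: int) -> None:
    done = 0
    while done < n_sources:
        item = await queue.get()
        if item is None:
            done += 1
            continue
        # Stand-in ETL transform: normalize the record into one schema.
        print({"video_id": f"{item['source']}-{item['id']}"})


async def main() -> None:
    queue: asyncio.Queue = asyncio.Queue()
    sources = [source("youtube", queue), source("vimeo", queue)]
    await asyncio.gather(*sources, worker(queue, n_sources=len(sources)))


asyncio.run(main())
```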
End-to-end AWS data lake pipeline for fleet telemetry data using S3, Spark, and Athena. Includes partitioned Parquet ETL, vehicle safety analytics, and SQL queries for overspeed and harsh braking detection.
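A hedged sketch of the partitioned-Parquet-plus-SQL pattern the description names; the bucket paths, column names, and the 120 km/h threshold are assumptions, not the project's values.

```python
# Illustrative PySpark job: partitioned Parquet ETL plus an overspeed query.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("fleet-telemetry").getOrCreate()

telemetry = spark.read.json("s3://example-bucket/raw/telemetry/")  # hypothetical path
(telemetry
    .write
    .mode("overwrite")
    .partitionBy("event_date")  # partition column is an assumption
    .parquet("s3://example-bucket/curated/telemetry/"))

telemetry.createOrReplaceTempView("telemetry")
overspeed = spark.sql("""
    SELECT vehicle_id, COUNT(*) AS overspeed_events
    FROM telemetry
    WHERE speed_kmh > 120  -- assumed threshold
    GROUP BY vehicle_id
    ORDER BY overspeed_events DESC
""")
overspeed.show()
```

The same query shape would run unchanged in Athena once the curated Parquet is registered as an external table.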
Plain text boilerplate removal using character n-gram frequency across a corpus. Builds a template model from a sample, filters files in a single linear pass, and validates automatically. Includes an obfuscated mode where the model is a set of integers and output filenames are hashed: the operator never reads the content. AWK handles character processing, Bash handles orchestration, and a Lisp layer is planned for positional classification.
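The project itself is AWK/Bash, but the core idea is easy to sketch in Python as a rough analogue: the "model" is the set of (hashed) frequent character n-grams from a boilerplate sample, and a line is filtered when enough of its n-grams hit that set. The n=3, the frequency cutoff, and the overlap threshold below are arbitrary illustrative choices.

```python
# Rough Python analogue of the char n-gram boilerplate model; parameters are
# arbitrary, and the real project implements this in AWK with Bash glue.
from collections import Counter


def ngrams(text: str, n: int = 3):
    return (text[i:i + n] for i in range(len(text) - n + 1))


def build_model(boilerplate_sample: str, n: int = 3) -> set:
    # The model is just a set of integers (hashes of frequent n-grams),
    # mirroring the obfuscated mode described above.
    counts = Counter(ngrams(boilerplate_sample, n))
    return {hash(g) for g, c in counts.items() if c >= 2}


def looks_like_boilerplate(line: str, model: set, n: int = 3,
                           cutoff: float = 0.6) -> bool:
    grams = [hash(g) for g in ngrams(line, n)]
    if not grams:
        return False
    overlap = sum(1 for g in grams if g in model)
    return overlap / len(grams) >= cutoff


model = build_model("Unsubscribe here. Sent from my mailer. Unsubscribe here.")
for line in ["Unsubscribe here.", "The quarterly numbers improved."]:
    print(line, "->", looks_like_boilerplate(line, model))
```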
This project/library contains common elements related to ETL processes...
This is a study project built using FastAPI to practice microservice architecture, data normalization techniques, and clean API design.
The service receives raw payloads from different simulated sources and transforms them into a standardized and validated structure.
It centralizes normalization logic and demonstrates how to build a scalable, maintainable, and test-friendly data processing layer.
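As a minimal sketch of that normalization layer (the route, field names, and simulated source formats are invented for illustration, not the study project's actual API):

```python
# Minimal illustrative FastAPI normalization service; the route and field
# names are assumptions.
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()


class NormalizedRecord(BaseModel):
    source: str
    user_id: str
    email: str


@app.post("/normalize", response_model=NormalizedRecord)
def normalize(payload: dict) -> NormalizedRecord:
    # Each simulated source uses its own keys; map them onto one schema.
    if "uid" in payload:  # hypothetical "source A" style payload
        return NormalizedRecord(source="a", user_id=str(payload["uid"]),
                                email=payload["mail"].strip().lower())
    return NormalizedRecord(source="b", user_id=str(payload["id"]),
                            email=payload["email"].strip().lower())
```

Keeping the per-source mapping behind one validated response model is what makes this layer easy to test: each source format becomes a single branch with a known output shape.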
End-to-end solution for data migration and analysis using Python, FastAPI, Kafka, and PostgreSQL. Implements an asynchronous data pipeline and a RESTful analytics API, fully containerized with Docker Compose for easy, reproducible deployment.
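A hedged sketch of the asynchronous consumer side of such a pipeline, using aiokafka; the topic, broker address, and record handling are assumptions.

```python
# Illustrative async Kafka consumer; topic, broker, and handling are assumed.
import asyncio
import json

from aiokafka import AIOKafkaConsumer


async def consume() -> None:
    consumer = AIOKafkaConsumer(
        "migrations",                        # hypothetical topic
        bootstrap_servers="localhost:9092",  # hypothetical broker
        value_deserializer=lambda v: json.loads(v.decode("utf-8")),
    )
    await consumer.start()
    try:
        async for msg in consumer:
            # In a real pipeline this would upsert into PostgreSQL.
            print(msg.topic, msg.value)
    finally:
        await consumer.stop()


asyncio.run(consume())
```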
Unified project demonstrating both batch analytics and real-time streaming pipelines with Apache Spark:
Batch (PySpark/Jupyter): Processed S&P 500 stock data, applied transformations, and ran distributed computations.
Streaming (Spark + Kafka): Built a streaming pipeline that consumes Kafka topics, processes messages in real time, and visualizes outputs; a sketch follows below.
Deployed using Docker and Jupyter for reproducibility.
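As a sketch of the streaming half, a minimal Structured Streaming job reading from Kafka looks roughly like this; the broker address, topic name, and console sink are assumptions.

```python
# Illustrative Spark Structured Streaming job reading from Kafka; the broker,
# topic, and console sink are assumptions for the sketch.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("kafka-stream-demo").getOrCreate()

stream = (spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "localhost:9092")  # hypothetical broker
    .option("subscribe", "stocks")                        # hypothetical topic
    .load())

# Kafka rows carry binary key/value columns; decode the value for processing.
messages = stream.selectExpr("CAST(value AS STRING) AS message")

query = (messages.writeStream
    .format("console")
    .outputMode("append")
    .start())
query.awaitTermination()
```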
Analyzed decades of historical weather station data (1920–1940) using Hadoop MapReduce. Filtered operable stations, computed descriptive statistics (min, max, mean, median), and produced reports/graphs. Designed modular MRJobs to chain tasks together for scalable processing.
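A minimal mrjob sketch of one such statistic; the CSV layout (station id, date, temperature) and the class name are assumptions, and the real project chains several jobs like this together.

```python
# Illustrative MRJob computing per-station min/max temperature; the input
# format is an assumption.
from mrjob.job import MRJob


class MRStationMinMax(MRJob):

    def mapper(self, _, line):
        station_id, _date, temp = line.split(",")
        yield station_id, float(temp)

    def reducer(self, station_id, temps):
        temps = list(temps)
        yield station_id, {"min": min(temps), "max": max(temps)}


if __name__ == "__main__":
    MRStationMinMax.run()
```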
Advanced data synchronization framework.
Reporting for MIT Club of Northern California
This project contains two tasks, written in Python, that implement the execution of task chains (DAGs) in an Airflow environment.
Crawl Home Depot and extract its schema.org/Product data.
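A hedged sketch of the extraction step: schema.org Product data is usually embedded in pages as JSON-LD, so once a page has been fetched it can be pulled out like this. The URL is a placeholder, the page structure is an assumption, and real crawling must respect the site's terms and robots.txt.

```python
# Illustrative JSON-LD extraction for schema.org/Product; the URL is a
# placeholder and the page structure is an assumption.
import json

import requests
from bs4 import BeautifulSoup

html = requests.get("https://www.example.com/some-product", timeout=30).text
soup = BeautifulSoup(html, "html.parser")

products = []
for script in soup.find_all("script", type="application/ld+json"):
    try:
        data = json.loads(script.string or "")
    except json.JSONDecodeError:
        continue
    # JSON-LD blocks may hold a single object or a list of objects.
    for item in data if isinstance(data, list) else [data]:
        if isinstance(item, dict) and item.get("@type") == "Product":
            products.append({"name": item.get("name"),
                             "offers": item.get("offers")})

print(products)
```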