data-pipeline

Here are 14 public repositories matching this topic...

dataflint / spark

Performance Observability for Apache Spark

emr big-data apache-spark etl optimization data-pipelines databricks observability data-pipeline dataproc spark-operator

Updated Apr 6, 2025
TypeScript

jvalue / jayvee

Star

Jayvee is a domain-specific language and runtime for automated processing of data pipelines

data-science typescript data-engineering domain-specific-language data-pipeline etl-pipeline

Updated Apr 25, 2025
TypeScript

aeksco / aws-pdf-textract-pipeline

Sponsor

Star

🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript

pdf aws lambda cloudformation typescript serverless jest dynamodb s3 sns webscraping textract data-pipeline cdk puppeteer aws-cdk aws-textract

Updated Jun 5, 2024
TypeScript

scopashq / typestream

Star

⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first

typescript data-transformation data-extraction developer-experience auto-reload data-pipeline

Updated Apr 12, 2022
TypeScript

instill-ai / console

Star

📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core

console ui computer-vision deep-learning frontend image-classification object-detection structured-data data-pipeline no-code model-serving vdp unstructured-data data-connector vision-ai versatile-data-pipeline

Updated Apr 23, 2025
TypeScript

montara-io / dbt-command-center

Star

Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

python bigquery etl data-validation orchestration data-warehouse data-engineering dataops data-catalog data-analysis redshift dbt elt data-pipelines data-pipeline data-lineage analytics-engineering dbt-packages data-observability

Updated Apr 20, 2025
TypeScript

splicing-ai / splicing

Star

Splicing: Gen-AI Copilot for Data Engineering

agent ai data-engineering developer-tools gpt data-pipeline llm generative-ai anthropic

Updated Nov 9, 2024
TypeScript

AqueductHub / aqueductcore

Star

Aqueduct Core is responsible for the core functionality of Aqueduct, an experiment management system.

quantum-computing software data-pipeline experiment-control

Updated Feb 17, 2025
TypeScript

gradientsandgrit / langsync

Star

Sync your team's data to your LLM applications in real-time

etl haystack data-pipeline llm langchain llamaindex

Updated Sep 14, 2023
TypeScript

Indexical-Metrics-Measure-Advisory / watchmen

Star

Watchmen Platform is a low code data platform for data pipeline, meta data management , analysis, indicator objective analysis and quality management

visualization charts pipeline metrics data-visualization indicator data-pipeline data-quality low-code data-quality-monitoring

Updated Apr 28, 2025
TypeScript

nickw444 / budget-bot

Star

An extensible pipelining tool to build data pipelines from your bank account to any destination.

banking budget hacktoberfest data-pipeline banking-api aspire-budget

Updated Jul 7, 2022
TypeScript

funinkina / whatsappchatanalyzer

Star

A next JS app that analysis your whatsapp chats and gives useful quirky insights

analysis nextjs webapp data-pipeline fastapi

Updated Apr 22, 2025
TypeScript

BeameryEdge / querycraft-pipelines

Star

Create Database agnostic aggregations base on data pipelines

data-pipeline querycraft querycraft-filter-builder querycraft-pipelines

Updated Apr 19, 2023
TypeScript

jorgermduarte / real-time-data-architecture-kafka-flink-dw-k8s

Star

Real-time data processing architecture using Apache Kafka, Flink, and Kubernetes. This project demonstrates how to build a scalable and resilient pipeline for streaming data, performing ETL with Flink, and storing the processed data in a Data Warehouse for analysis.

kubernetes distributed-systems streaming node real-time kafka big-data etl grafana apache prometheus data-warehouse data-dictionary flink data-pipeline data-warehousing bussiness-intelligence

Updated Jan 10, 2025
TypeScript

Improve this page

Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-pipeline

Here are 14 public repositories matching this topic...

dataflint / spark

jvalue / jayvee

aeksco / aws-pdf-textract-pipeline

scopashq / typestream

instill-ai / console

montara-io / dbt-command-center

splicing-ai / splicing

AqueductHub / aqueductcore

gradientsandgrit / langsync

Indexical-Metrics-Measure-Advisory / watchmen

nickw444 / budget-bot

funinkina / whatsappchatanalyzer

BeameryEdge / querycraft-pipelines

jorgermduarte / real-time-data-architecture-kafka-flink-dw-k8s

Improve this page

Add this topic to your repo