ETL framework for .NET (Parser / Writer for CSV, Flat, XML, JSON, Key-Value, Parquet, YAML, Avro formatted files)...
SmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!!
Set of .NET libraries written in C# to create Listeners, Extractors, Writers and possibly more. These libraries allow you to (a) listen for events, (b) load data i...
ETL & Data Enrichment with Spark.NET and ML.NET Automated (Auto) ML
Laughing Waffle is a helper library for doing bulk insert and update (read: upsert) work with SQL Server. Specifically providing help and code generation around the...
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted...
Apache Doris is an easy-to-use, high performance and unified analytics database.
An orchestration platform for the development, production, and observation of data assets.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
The event stream processing platform for developers. Unified experience for real-time data ingestion, stream processing, and low-latency serving. Best-in-class per...
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
Privacy and Security focused Segment-alternative, in Golang and React
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, Pos...
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments....
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents