How to design data processing with Confluent for replacing ETL?

rununrath · 8 May 2023 09:33

Hello team,

I want to consult with you about how to design Confluent platform to do data processing, maybe with ksqlDB or KStream, to replace existing ETL tools.

Details of logic on existing ETL tools:

Get data from on-prem database systems then enrich them together before updating to another database system on cloud.
Logics for data enrichment as a huge point to joining data around 20-30 tables together and that mixed conditional for joining like

SELECT * FROM TABLE_A A
LEFT JOIN TABLE_B B WITH (NOLOCK) on A.column1 = B.column1 and A.column2 = B.column2
INNER JOIN TABLE_C C WITH (NOLOCK) on A.column3 = C.column3
...
LEFT JOIN TABLE_Z Z WITH (NOLOCK) on A.column20 = Z.column20 and A.column21 = Z.column21 and A.column22 = Z.column22

.

My Questions
1 Are able to move existing data enrichment on ETL that joins data around 20-30 tables to processing on ksqlDB or KStream?
2 If the answer 1 is yes, which is more suitable between ksqlDB and KStream?
3 If the answer 1 is no, how do you design a solution for supporting data enrichment with multiple tables with Confluent platform? Or needs 3rd tools to help for joining 20-30 tables.

Thanks
May

Topic	Replies	Views
Online Talk: Develop a Streaming ETL pipeline from MongoDB to Snowflake with Apache Kafka Events	3011	7 May 2021
✍️ Announcing ksqlDB 0.18.0 News and Blogs	3155	26 May 2021
✍️ Serverless Stream Processing with Apache Kafka, Azure Functions, and ksqlDB News and Blogs	2712	10 August 2022
✍️ How ksqlDB Works: Internal Architecture & Advanced Features News and Blogs	3165	25 August 2021
Building and Deploying a Real-Time Stream Processing ETL Engine with Kafka and ksqlDB ksqlDB	3364	15 December 2020

How to design data processing with Confluent for replacing ETL?

Related topics