How to remove deduplication upon struct in ksqldb

Anusha · 19 January 2022 08:49

Hi ,
Step 1 i am creating stream with create Type with struct format as json

CREATE TYPE contact AS STRUCT<"otherPhone" VARCHAR, "workPhone" VARCHAR, "address1" VARCHAR, "city" VARCHAR, "country" VARCHAR, "zipCode" VARCHAR>;

CREATE STREAM event_load(KEY VARCHAR KEY,"eventType" VARCHAR, "platform" VARCHAR, "billingAccount" BILLINGACCOUNT, "soldToContact" CONTACT, "billToContact" CONTACT, "subscription" "subscription") WITH (KAFKA_TOPIC = 'topic1.in', VALUE_FORMAT = 'JSON',PARTITIONS=1);
upon this i am applying  select * 

CREATE STREAM dsc_events_supply WITH (KAFKA_TOPIC = 'subscription.in, VALUE_FORMAT = 'JSON',PARTITIONS=1) AS select * from event_load where "eventType" in ('SubscriptionCreated') ;

so here is the problem: when I alter the stream event_load with add column and dropping stream dsc_events_supply and recreating it for addition of new column from event_load in my subscription.in topic data is loaded twice: 1st set of data which was already existing and second set of data with new added column, so duplication is happening. I am not able use this How to find distinct values in a stream of events using ksqlDB because my data is struct making group by columns into primary key. So how to fix this issue

system · 18 February 2022 08:49

This topic was automatically closed after 30 days. New replies are no longer allowed.

Topic		Replies	Views
How to deduplicate records in a kstream or ktable Kafka Streams	5	7735	13 September 2021
Duplicate key in KSQL Table ksqlDB	5	5531	12 May 2021
ksqlDB Table - Append new values to existing column as output of query ksqlDB	3	3933	29 September 2021
KsqlDB Table-Table join on STRUCT Type Primary Key ksqlDB	1	1090	29 February 2024
Can't access the data from the left join table ksqlDB	2	1996	26 April 2023

How to remove deduplication upon struct in ksqldb

Related topics