Cache/Query latest value of each key in a topic?

DrCouchi · 2 June 2021 00:45

Hi all,

I’m very new to Kafka and am looking at it as a potential data pipeline for a system. I’ve been very interested in what ksqldb can do but I haven’t managed to figure this out:

Can you create a table based on a particular topic and use it to cache the latest value of each key, such that you can perform ad hoc (pull) queries on it to get the value of a key. A simple example would be to listen to a stock quote stream and store the most recent price for a ticker symbol and then look that up by symbol with a ksql query.

From what I’ve tried so far, it seems like you can only query a materialized view, which needs to be based on an aggregate function. Am I misunderstanding this, as that seems a bit limited. How should I go about doing this?

Thanks in advance!

mjsax · 3 June 2021 16:55

Your observation is correct, and we are working on improvements already.

The “old” workaround is to issue an aggregation query using LATEST_BY_OFFSET aggregation function, so you can query the result table of the aggregation.

In 0.17 (and 0.18) release, it is actually simpler now, and instead of doing an aggregation query, you can do a CREATE TABLE materialized AS SELECT * FROM input_table; and you should be able to issue pull queries against materialized (cf Announcing ksqlDB 0.17.0 - New Features and Updates and ksqlDB 0.18: Newest Features and Updates).

We are actually also working on a direct materialization of source tables to avoid the need to write a query to make the data available for pull queries: https://github.com/confluentinc/ksql/pull/7474

system · 2 July 2021 00:45

This topic was automatically closed after 30 days. New replies are no longer allowed.

Topic		Replies	Views
Query with rolling values from last record ksqlDB	3	3585	15 March 2022
NULL values are retuned after JOIN in ksqlDB ksqlDB	1	1771	20 August 2023
Select events from a topic between two dates without emitting changes ksqlDB	2	3449	9 March 2021
Duplicate key in KSQL Table ksqlDB	5	5606	12 May 2021
Ksql stream from a Kafka topic ksqlDB	4	3379	13 March 2022

Cache/Query latest value of each key in a topic?

Related topics