Compaction of low-traffic topics

We have a low-traffic topic with around 500k events that has been configured with a compaction strategy. In practice this topic never actually compacts, which means that rerunning from earliest replays all “versions” of each key rather than just the latest one. I realize that it’s not “wrong”, but is there really no setting in Confluent Cloud that will cause a low-traffic topic to periodically run compaction?

You can configure max.compaction.lag.ms to control this but keep in mind the minimum allowed values:

Minimum value: 21,600,000 ms (6 hours) for Dedicated clusters and 604,800,000 ms (7 days) for Basic, Standard, and Enterprise clusters.

If these time periods are too long, you may contact support with your requirements to see if this can be changed for your account. From here:

All Confluent Cloud resources have hard thresholds that cannot be exceeded, but many of the default quotas can be increased based on your changing requirements. To request an increase for a quota, contact Confluent Support.
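If you prefer to set this programmatically rather than through the Console, here’s a minimal sketch using the Python confluent-kafka AdminClient. The bootstrap server, API key/secret, and topic name are placeholders, and note that the legacy alter_configs call is non-incremental, so any other dynamic overrides on the topic may need to be re-specified in the same request:

```python
from confluent_kafka.admin import AdminClient, ConfigResource

# Placeholder Confluent Cloud connection details.
admin = AdminClient({
    "bootstrap.servers": "<BOOTSTRAP_SERVER>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<API_KEY>",
    "sasl.password": "<API_SECRET>",
})

# Make records eligible for compaction no later than 6 hours after they are
# written (the Dedicated-cluster minimum quoted above).
resource = ConfigResource(ConfigResource.Type.TOPIC, "my-compacted-topic")
resource.set_config("max.compaction.lag.ms", "21600000")

futures = admin.alter_configs([resource])
futures[resource].result()  # raises if the broker rejected the change
```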

We did already adjust this, but we still see multiple events for the same key when consuming the topic from earliest, even for events from before the 7-day limit. Is that expected?

I forgot to mention that segment.ms also matters here, because active segments don’t get cleaned:

This configuration controls the period of time after which Kafka will force the log to roll even if the segment file is not full to ensure that retention can delete or compact old data

What is this set to? You can edit this setting after topic creation in the Confluent Cloud Console by going to the topic, then Configuration, then Edit settings, then Switch to expert mode. From there change the value for segment_ms. Be aware of this:

You can set segment.ms as low as 600000 (10 minutes), but the minimum of 14400000 (4 hours) is still enforced.
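As a quick way to verify what the topic is actually running with, you could read the relevant settings back with the AdminClient (same placeholder connection details as the earlier sketch):

```python
from confluent_kafka.admin import AdminClient, ConfigResource

admin = AdminClient({
    "bootstrap.servers": "<BOOTSTRAP_SERVER>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<API_KEY>",
    "sasl.password": "<API_SECRET>",
})

# Fetch the effective topic configuration and print the compaction-related keys.
resource = ConfigResource(ConfigResource.Type.TOPIC, "my-compacted-topic")
configs = admin.describe_configs([resource])[resource].result()

for name in ("cleanup.policy", "segment.ms", "max.compaction.lag.ms"):
    entry = configs.get(name)
    if entry is not None:
        print(f"{name} = {entry.value}")
```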

It’s been set to 600000 for some time without any effect on compaction.

How about after 4 hours? Remember the caveat above: 4 hours is the enforced minimum even though you can enter lower values.

It’s been like that for some weeks without compaction running.

I got a bit tricked by the offset/lag calculation, so there is actually some compaction going on, but not every 4 hours. It looks more like a weekly schedule?
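(One way the offset math can mislead here: compaction removes records but leaves gaps in the offset sequence, so high watermark minus low watermark is only an upper bound on how many messages remain. A rough sketch of comparing the two with the Python client; connection details, topic, and partition are placeholders:)

```python
from confluent_kafka import Consumer, TopicPartition

consumer = Consumer({
    "bootstrap.servers": "<BOOTSTRAP_SERVER>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<API_KEY>",
    "sasl.password": "<API_SECRET>",
    "group.id": "compaction-check",
    "auto.offset.reset": "earliest",
    "enable.auto.commit": False,
})

partition = TopicPartition("my-compacted-topic", 0)
low, high = consumer.get_watermark_offsets(partition, timeout=10)
print(f"offset range suggests at most {high - low} messages")

# Count what is actually still in the log. Compaction leaves offset gaps,
# so this count can be much smaller than high - low.
consumer.assign([TopicPartition("my-compacted-topic", 0, low)])
count = 0
while True:
    msg = consumer.poll(5.0)
    if msg is None:
        break  # nothing more within the timeout
    if msg.error():
        continue
    count += 1
    if msg.offset() >= high - 1:
        break  # reached the end of the partition
print(f"messages actually remaining: {count}")
consumer.close()
```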

@kristoffer I tested this out on Confluent Cloud with max.compaction.lag.ms = 21600000 (6 hours) and segment.ms = 14400000 (4 hours), and I’m observing that older messages do get deleted by the compaction process as expected.

There’s an edge case to keep in mind that you might be observing. Because the compaction process only considers inactive segments, you can wind up with multiple events for the same key even after compaction: the latest event per key in the inactive segments won’t be deleted by the compaction process, and the same keys may also appear in the active segment. Example with the 6-hour max.compaction.lag.ms and the segment.ms settings from the test above:

  1. produce k:1, k:2, k:3 (k is the key, number is the value)
  2. wait a day
  3. produce k:4

The k:4 event will trigger the segment with the first three messages to roll if segment rolling wasn’t already triggered. The compaction process will delete k:1 and k:2 but leave k:3 alone since it’s the latest event with key k. You wind up with k:3 and k:4 post compaction even though k:3 is older than 6 hours.
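To make that walkthrough concrete, here’s roughly what the produce side looks like with the Python confluent-kafka client (connection details and topic name are placeholders, and the day-long wait is only indicated by a comment):

```python
from confluent_kafka import Producer

producer = Producer({
    "bootstrap.servers": "<BOOTSTRAP_SERVER>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<API_KEY>",
    "sasl.password": "<API_SECRET>",
})

# Step 1: three events for the same key, all landing in the active segment.
for value in ("1", "2", "3"):
    producer.produce("my-compacted-topic", key="k", value=value)
producer.flush()

# Step 2: wait roughly a day so the segment is old enough to roll.

# Step 3: this event lands in a new active segment and lets the old one roll.
# Compaction then removes k:1 and k:2 but keeps k:3 (the latest value for
# key "k" in the cleanable, inactive segments) and k:4 (active segment).
producer.produce("my-compacted-topic", key="k", value="4")
producer.flush()
```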

Follow-up question: if nothing else happens on this topic, will k:3 be deleted at some point? Or will the segment with k:4 stay active and never be considered for compaction?

In this case, the segment with k:4 wouldn’t roll, so it would remain this way if the cleanup policy were strictly compact. If the cleanup policy were compact,delete and retention.ms weren’t set to -1 (no time-based retention), then segment cleanup would kick in and delete the segment with k:3.
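If aging out those trailing records matters, the compact,delete route could be configured the same way as the earlier sketch; the 30-day retention.ms below is just an example value:

```python
from confluent_kafka.admin import AdminClient, ConfigResource

admin = AdminClient({
    "bootstrap.servers": "<BOOTSTRAP_SERVER>",
    "security.protocol": "SASL_SSL",
    "sasl.mechanisms": "PLAIN",
    "sasl.username": "<API_KEY>",
    "sasl.password": "<API_SECRET>",
})

# With compact,delete plus a finite retention.ms, whole inactive segments
# older than the retention window are deleted outright, so the segment
# holding k:3 would eventually go away even with no further traffic.
resource = ConfigResource(ConfigResource.Type.TOPIC, "my-compacted-topic")
resource.set_config("cleanup.policy", "compact,delete")
resource.set_config("retention.ms", str(30 * 24 * 60 * 60 * 1000))  # 30 days (example)

admin.alter_configs([resource])[resource].result()
```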
