Connect thread limit

barakme · 14 July 2021 06:40

I am working on a system that will load files from various sources (S3, Azure Blob, SFTP, etc.) into Kafka topics, and I am evaluating Kafka Connect as the underlying framework.
The number of sources and their configurations is dynamic and will change at runtime: new configurations will be added and old ones removed.
Each configuration will have its own polling interval, number of tasks, target topic, etc.
The number of active configurations at any one time is at least 10,000.

Here is my question: assuming I have 10,000 connectors, each running 1 task, will there be a total of 10,000 task threads running across all JVMs in my distributed Connect worker pool? Does Connect create a dedicated thread per task, or does each worker have a shared thread pool?

My concern here is resource usage: statically assigning one thread per task is wasteful if the polling interval is long (which is likely), because each thread would sit idle for most of its lifetime.
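
To make the concern concrete, here is a minimal sketch of the shared-pool model being contrasted with thread-per-task. It is plain Java and does not use or describe Kafka Connect's API; the class name `SharedPollerSketch`, the method `distinctThreadsUsed`, and all the numbers are hypothetical and chosen only for illustration. Many infrequent pollers are scheduled onto a small fixed pool, and the sketch reports how many distinct threads actually ran them.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical illustration only: this is NOT how Kafka Connect is implemented.
class SharedPollerSketch {

    /**
     * Schedules `sources` dummy pollers on a pool of `poolSize` threads,
     * lets them run for `runMs` milliseconds, and returns the number of
     * distinct threads that executed at least one poll.
     */
    static int distinctThreadsUsed(int sources, int poolSize, long intervalMs, long runMs) {
        ScheduledExecutorService pool = Executors.newScheduledThreadPool(poolSize);
        Set<String> threadNames = ConcurrentHashMap.newKeySet();

        for (int i = 0; i < sources; i++) {
            // Each "source" only borrows a thread while it is polling;
            // between polls it costs a queue entry, not a parked thread.
            pool.scheduleWithFixedDelay(
                    () -> threadNames.add(Thread.currentThread().getName()),
                    0, intervalMs, TimeUnit.MILLISECONDS);
        }

        try {
            Thread.sleep(runMs);
            pool.shutdownNow();
            pool.awaitTermination(1, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt flag
            pool.shutdownNow();
        }
        return threadNames.size();
    }

    public static void main(String[] args) {
        // 10,000 pollers share 8 threads; the printed count cannot exceed 8.
        int used = distinctThreadsUsed(10_000, 8, 100, 500);
        System.out.println("Distinct threads used: " + used);
    }
}
```

With a thread-per-task model the same workload would need one thread per poller, which is the overhead the question is about.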

