Hey everyone,
What is the best configuration of apache kafka when preparing to build pipeline?
In my previous question I have like response 3 nodes clusters,
So what is best optimum configuration in apache kafka?
Going by your question, my assumption is you are currently exploring Kafka and don’t have a production system ready., is that right? Also since you mentioned Kafka Connect, it means you have some existing data source which you want to ingest into Kafka. There is no thing like best configuration, but probably most suitable configuration. To start with, you can consider a single cluster of n number of Kafka brokers, the “n” will be decided by the amount of data you are going to ingest and the server storage capacity. Number of Topics also depends what data you are ingesting, as topics will roughly correspond to Relational database tables or MongoDB collections. So it depends on your data source primarily for now.