Greetings all,
I have deployed KSQL using the official helm chart (upgraded to version 7.0.0), and selected 3 replicas. Next I have created a stream which references a Kafka topic as well as a table using the “CREATE TABLE AS SELECT” which groups by the timestamp within the topic and count, i.e. giving the rate of messages in per second over time.
Now, When i try to to a select on this table, i receive the following error:
Exception in thread “main” java.lang.IllegalStateException: KSQL error: {“@type”:“statement_error”,“error_code”:40001,“message”:“Unable to execute pull query. [Partition 1 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 2 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 4 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 5 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 7 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 8 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 10 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 11 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 13 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 14 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 16 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 17 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 19 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 20 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646], Partition 22 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-swmwg:8088 was not selected because Host is not alive as of time 1637527823646], Partition 23 failed to find valid host. Hosts scanned: [ksql-server-84cf6b7477-df7f2:8088 was not selected because Host is not alive as of time 1637527823646]]”,“statementText”:“SELECT ts, count FROM workload_input WHERE ts>=1637527864000;”,“entities”:}
When i check the cluster status i see that 2 of the servers are showing “aliveHost” as false. However, I cannot seem to find the reason for this in the logs or any further information online, how can I troubleshoot this? Are their any common reasons for this?