redpanda icon indicating copy to clipboard operation
redpanda copied to clipboard

CI Failure (DNS failure: communications error to 127.0.0.11#53) in `AlterTopicConfiguration.test_shadow_indexing_config`

Open vbotbuildovich opened this issue 11 months ago • 2 comments

https://buildkite.com/redpanda/redpanda/builds/45930#018e251b-8649-4be6-96d3-6c47a0c923bb

Module: rptest.tests.alter_topic_configuration_test
Class: AlterTopicConfiguration
Method: test_shadow_indexing_config
test_id:    AlterTopicConfiguration.test_shadow_indexing_config
status:     FAIL
run time:   31.851 seconds

RemoteCommandError({'ssh_config': {'host': 'docker-rp-1', 'hostname': 'docker-rp-1', 'user': 'root', 'port': 22, 'password': 'UNUSED', 'identityfile': '/root/.ssh/id_rsa'}, 'hostname': 'docker-rp-1', 'ssh_hostname': 'docker-rp-1', 'user': 'root', 'externally_routable_ip': 'docker-rp-1', '_logger': <Logger rptest.tests.alter_topic_configuration_test.AlterTopicConfiguration.test_shadow_indexing_config-184 (DEBUG)>, 'os': 'linux', '_ssh_client': <paramiko.client.SSHClient object at 0x7f36301bd300>, '_sftp_client': <paramiko.sftp_client.SFTPClient object at 0x7f3630694b80>, '_custom_ssh_exception_checks': None}, 'host ;; communications error to 127.0.0.11#53: connection refused', 2, b'')
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 182, in _do_run
    self.setup_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 260, in setup_test
    self.test.setup()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/test.py", line 91, in setup
    self.setUp()
  File "/root/tests/rptest/tests/redpanda_test.py", line 38, in setUp
    self.__redpanda.start()
  File "/root/tests/rptest/services/redpanda.py", line 2528, in start
    self.for_nodes(to_start, start_one)
  File "/root/tests/rptest/services/redpanda.py", line 1407, in for_nodes
    return list(executor.map(cb, nodes))
  File "/usr/lib/python3.10/concurrent/futures/_base.py", line 621, in result_iterator
    yield _result_or_cancel(fs.pop())
  File "/usr/lib/python3.10/concurrent/futures/_base.py", line 319, in _result_or_cancel
    return fut.result(timeout)
  File "/usr/lib/python3.10/concurrent/futures/_base.py", line 458, in result
    return self.__get_result()
  File "/usr/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result
    raise self._exception
  File "/usr/lib/python3.10/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/root/tests/rptest/services/redpanda.py", line 2520, in start_one
    self.start_node(node,
  File "/root/tests/rptest/services/redpanda.py", line 2813, in start_node
    self.write_node_conf_file(
  File "/root/tests/rptest/services/redpanda.py", line 3639, in write_node_conf_file
    fqdn = self.get_node_fqdn(node)
  File "/root/tests/rptest/services/redpanda.py", line 3607, in get_node_fqdn
    fqdn = node.account.ssh_output(
  File "/usr/local/lib/python3.10/dist-packages/ducktape/cluster/remoteaccount.py", line 41, in wrapper
    return method(self, *args, **kwargs)
ducktape.cluster.remoteaccount.RemoteCommandError: root@docker-rp-1: Command 'host ;; communications error to 127.0.0.11#53: connection refused' returned non-zero exit status 2.

JIRA Link: CORE-1873

vbotbuildovich avatar Mar 14 '24 00:03 vbotbuildovich

Kind of weird, looks like the earlier dig command failed (or at least the first line out ouptut was a failure) and we pass that on to the next command. Maybe a temporary blip in DNS resolution?

I guess DNS is implemented by docker daemon here and maybe this is very early in the container lifetime? There seem to be some handwavy explanations that it may fail at this point.

Probably we can add some logging of the full output and maybe a retry.

travisdowns avatar Mar 14 '24 03:03 travisdowns

Using "kafka" as I guess this was added for kerberos support.

travisdowns avatar Mar 14 '24 03:03 travisdowns

Closing as stale.

piyushredpanda avatar Jul 10 '24 04:07 piyushredpanda

*https://buildkite.com/redpanda/vtools/builds/15928

vbotbuildovich avatar Jul 25 '24 00:07 vbotbuildovich

Automatically closing issue to match current state of CORE-1873

michael-redpanda avatar Jul 26 '24 03:07 michael-redpanda

*https://buildkite.com/redpanda/vtools/builds/15944 *https://buildkite.com/redpanda/vtools/builds/15952

vbotbuildovich avatar Jul 26 '24 04:07 vbotbuildovich

*https://buildkite.com/redpanda/vtools/builds/15980 *https://buildkite.com/redpanda/vtools/builds/16003 *https://buildkite.com/redpanda/vtools/builds/16016 *https://buildkite.com/redpanda/vtools/builds/16030

vbotbuildovich avatar Jul 30 '24 21:07 vbotbuildovich

Closing older-bot-filed CI issues as we transition to a more reliable system.

piyushredpanda avatar Sep 24 '24 04:09 piyushredpanda