DIRAC icon indicating copy to clipboard operation
DIRAC copied to clipboard

CS Slave refreshing from itselfs

Open hanx-hep opened this issue 3 months ago • 3 comments

We have one master server prod-dirac.ihep.ac.cn and only one slave server dirac05.ihep.ac.cn

and log in dirac05 shows:

2025-09-10 23:47:18 UTC Framework [140521675170880] DEBUG: Refreshing configuration...
2025-09-10 23:47:18 UTC Framework [140521675170880] DEBUG: Refreshing from list ['dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server', 'dips://dirac05.ihep.ac.cn:9135/Configuration/Server']
2025-09-10 23:47:18 UTC Framework [140521675170880] DEBUG: Randomized server list is dips://dirac05.ihep.ac.cn:9135/Configuration/Server, dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:18 UTC Framework [140521675170880] DEBUG:  Trying to refresh from dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:18 UTC Framework/DIRAC.Core.Tornado.Client.ClientSelector [140521675170880] DEBUG: Trying to autodetect client for dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:18 UTC Framework [140521675170880] DEBUG: Already given a valid url dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:19 UTC Framework [140521675170880] DEBUG: Trying to connect to: dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:19 UTC Framework [140521675170880] WARN: Issue getting socket: <DIRAC.Core.DISET.private.Transports.M2SSLTransport.SSLTransport object at 0x7fcdc0485f10> : ('dips', 'dirac05.ihep.ac.cn', 9135, 'Configuration/Server') : [Errno 111] Connection refused:ConnectionRefusedError(111, 'Connection refused')
2025-09-10 23:47:19 UTC Framework [140521675170880] WARN: Non-responding URL temporarily banned dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:19 UTC Framework [140521675170880] INFO: Retry connection : 1 to dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:19 UTC Framework [140521675170880] INFO: Waiting 2.000000 seconds before retry all service(s)
2025-09-10 23:47:21 UTC Framework [140521675170880] DEBUG: Already given a valid url dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:21 UTC Framework [140521675170880] DEBUG: Trying to connect to: dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:21 UTC Framework [140521675170880] WARN: Issue getting socket: <DIRAC.Core.DISET.private.Transports.M2SSLTransport.SSLTransport object at 0x7fcdbdb62b10> : ('dips', 'dirac05.ihep.ac.cn', 9135, 'Configuration/Server') : [Errno 111] Connection refused:ConnectionRefusedError(111, 'Connection refused')
2025-09-10 23:47:21 UTC Framework [140521675170880] INFO: Retry connection : 2 to dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:21 UTC Framework [140521675170880] INFO: Waiting 2.000000 seconds before retry all service(s)
2025-09-10 23:47:23 UTC Framework [140521675170880] DEBUG: Already given a valid url dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:23 UTC Framework [140521675170880] DEBUG: Trying to connect to: dips://dirac05.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:23 UTC Framework [140521675170880] WARN: Issue getting socket: <DIRAC.Core.DISET.private.Transports.M2SSLTransport.SSLTransport object at 0x7fcdbdb8ab90> : ('dips', 'dirac05.ihep.ac.cn', 9135, 'Configuration/Server') : [Errno 111] Connection refused:ConnectionRefusedError(111, 'Connection refused')
2025-09-10 23:47:23 UTC Framework [140521675170880] WARN: Can't update from server Error while updating from dips://dirac05.ihep.ac.cn:9135/Configuration/Server: [Errno 111] Connection refused:ConnectionRefusedError(111, 'Connection refused')
2025-09-10 23:47:23 UTC Framework [140521675170880] DEBUG:  Trying to refresh from dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:23 UTC Framework/DIRAC.Core.Tornado.Client.ClientSelector [140521675170880] DEBUG: Trying to autodetect client for dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server
2025-09-10 23:47:23 UTC Framework [140521675170880] DEBUG: Already given a valid url dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server

hanx-hep avatar Sep 17 '25 05:09 hanx-hep

There has been quite some changes lately in URL resolution. Which version are you running ?

Also, what do you have locally in your slave server /opt/dirac/etc/dirac.cfg in /DIRAC/Configuration/Servers ?

chaen avatar Sep 17 '25 08:09 chaen

Hi, I apologize for the late reply. We are using v8.0.58.

And slave server /DIRAC/ in /opt/dirac/etc/dirac.cfg are:

DIRAC
{
  Setup = CAS_Production
  Hostname = dirac05.ihep.ac.cn
  Configuration
  {
    Master = no
    Servers = dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server # the Master server
  }
  Security
  {
  }
  Setups
  {
    CAS_Production
    {
      Configuration = Production
      Framework = Production
      Transformation = Production
      Monitoring = Production
      DataManagement = Production
      WorkloadManagement = Production
      Accounting = Production
    }
  }
}

hanx-hep avatar Nov 11 '25 06:11 hanx-hep

Hi, v8.0.58 is a quite old release, and there have been fixes for your issue. Can you update to the latest v8 and retry? It's at the moment v8.0.76.

At the same time, looking at your Configuration, you should have:

DIRAC
{
  Configuration
  {
    MasterServer = dips://prod-dirac.ihep.ac.cn:9135/Configuration/Server
    Servers = dips://dirac05.ihep.ac.cn:9135/Configuration/Server
  }

fstagni avatar Nov 11 '25 11:11 fstagni