
Visor is non-transportable after reinitiating stcpr - `STCPR rebinded in heartbeating process`

Open 0pcom opened this issue 1 month ago • 5 comments

Due to recent deployment issues, the visor appears to have gotten stuck in the rebinded heartbeat process. Here is the logging from my local visor:

[2025-11-01T16:40:06.024571161-05:00] DEBUG [visor]: Saving transport to 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845 via stcpr
[2025-11-01T16:40:06.024654545-05:00] DEBUG [transport_manager]: Initializing TP with ID f1af0a38-cffd-0552-b52a-1106a3c227e4
[2025-11-01T16:40:06.024746571-05:00] DEBUG [transport_manager]: Dialing transport to 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845 via stcpr
[2025-11-01T16:40:06.024781257-05:00] DEBUG [stcpr]: Dialing PK 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845
[2025-11-01T16:40:06.86150533-05:00] DEBUG [stcpr]: Resolved PK 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845 to visor data {143.42.59.213 false {42693 [127.0.0.1 143.42.59.213 ::1 2a01:7e01::f03c:93ff:fe99:8bd6]}}
[2025-11-01T16:40:06.999880913-05:00] DEBUG [stcpr]: Performing handshake with 143.42.59.213:42693
[2025-11-01T16:40:17.001104991-05:00] DEBUG [transport_manager]: Error dialing transport to 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845 via stcpr: mt.client.Dial: handshake failed: read tcp 192.168.1.118:46806->143.42.59.213:42693: i/o timeout

[2025-11-01T16:40:17.397482207-05:00] DEBUG [tp:02c1e0]: Error deleting transport error="404 Not Found: {"error":"transport not found"}" tp-id=f1af0a38-cffd-0552-b52a-1106a3c227e4

And from the remote / public visor:

[2025-11-01T21:38:40.929992939Z] INFO [stcpr]: STCPR rebinded in heartbeating process
[2025-11-01T21:38:50.36479234Z] DEBUG [dmsgC]: Updating entry.
 entry=  version: 0.0.1
  sequence: 3945
  registered at: 1762033070350109274
  static public key: 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845
  signature: afa506ec9720f172599ab2cef5a803383f2a37a154d98c0ead1846413f1f386b6df65d6cab9f0118cc2f9c4f8f3685595cc815bd93e93d9fc4cf796ab2f1c41501
  entry is registered as client. Related info: 
    delegated servers: 
      0326978f5a53aff537dbb47fed58b1f123af3b00132d365f1309a14db4168dcff7
      0281a102c82820e811368c8d028cf11b1a985043b726b1bcdb8fce89b27384b2cb
      03717576ada5b1744e395c66c2bb11cea73b0e23d0dcd54422139b1a7f12e962c4
      02a49bc0aa1b5b78f638e9189be4ed095bac5d6839c828465a8350f80ac07629c0
      02a2d4c346dabd165fd555dfdba4a7f4d18786fe7e055e562397cd5102bdd7f8dd


[2025-11-01T21:39:50.563655893Z] DEBUG [dmsgC]: Updating entry.
 entry=  version: 0.0.1
  sequence: 3946
  registered at: 1762033130364957152
  static public key: 02c1e0bf0defb8320bdbaeb262102f36819842b4faaf5e3a318ec45fd97c694845
  signature: 9d0726211dc51ed0ba02eebc1f532349ef1f53f0d21a81f955f8d4ba3c1c254c24f14096b03ff4966f02c392e5a20c060ac427dc93b109e929e36df4b2d5756b00
  entry is registered as client. Related info: 
    delegated servers: 
      0326978f5a53aff537dbb47fed58b1f123af3b00132d365f1309a14db4168dcff7
      0281a102c82820e811368c8d028cf11b1a985043b726b1bcdb8fce89b27384b2cb
      03717576ada5b1744e395c66c2bb11cea73b0e23d0dcd54422139b1a7f12e962c4
      02a49bc0aa1b5b78f638e9189be4ed095bac5d6839c828465a8350f80ac07629c0
      02a2d4c346dabd165fd555dfdba4a7f4d18786fe7e055e562397cd5102bdd7f8dd


[2025-11-01T21:40:09.279914628Z] DEBUG httpClient.BindSTCPR [address_resolver]: Address resolver binding with: [127.0.0.1 143.42.59.213 ::1 2a01:7e01::f03c:93ff:fe99:8bd6]
[2025-11-01T21:40:09.28785088Z] INFO [stcpr]: STCPR rebinded in heartbeating process
[2025-11-01T21:40:10.281185447Z] DEBUG httpClient.BindSTCPR [address_resolver]: Address resolver binding with: [127.0.0.1 143.42.59.213 ::1 2a01:7e01::f03c:93ff:fe99:8bd6]
[2025-11-01T21:40:10.286279006Z] INFO [stcpr]: STCPR rebinded in heartbeating process

I can't create an stcpr transport to the public visor.

After restarting, it worked. This may be an edge case, but it's one I've seen more than once. We need better handling of basically any non-transportable state, even if that is nothing more than a shutdown / restart.

0pcom, Nov 01 '25 22:11