Erratic Signal Disconnects and Errors Prevent Communication Between Peers #1265

Closed
opened 2025-11-20 05:27:08 -05:00 by saavagebueno · 6 comments
Owner

Originally created by @trbutler on GitHub (Sep 20, 2024).

Describe the problem

I frequently receive Signal: Disconnected, reason: rpc error: code = DeadlineExceeded desc = context deadline exceeded. on all of my Netbird clients. The issue appears to be degrading from something that caused intermittent communications problems to a situation where Netbird is almost completely non-functional to most of my clients. Inexplicably a few continue to work.

I've tried adapting my Netbird "quick start" self-hosted configuration to alleviate the issue. I moved from using Caddy to NGINX for reverse proxy. This sped things up a fair amount and reduced resource usage, but didn't fix the issue. I also tried directly exposing Signal (which I had Docker translate from 443 to port 30006) while giving it access to NGINX's SSL certificate, so that a reverse proxy was not involved at all. None of these three different arrangements resolved the issue.

When proxied through NGINX, the NGINX error log is filled with entries like this:

2024/09/20 09:02:12 [error] 616106#616106: *120768 upstream rejected request with error 0 while reading response header from upstream, client: [client IP address], server: anon1.anon-r6ORu.domain, request: "POST /signalexchange.SignalExchange/Send HTTP/2.0", upstream: "grpcs://127.0.0.1:30006", host: "cyprus.serverforest.com:443"

The Signal docker container doesn't show anything unusual, even when set to debug mode on the logs; it simply shows many messages being conveyed between peers.

To Reproduce

Steps to reproduce the behavior:

  1. Run netbird up
  2. Wait a moment and netbird status will report the issue.

Expected behavior

I'd expect Netbird to be able to connect to the Signal server without issue.

Are you using NetBird Cloud?

I'm using self-hosted netbird.

NetBird version

netbird version

NetBird status -dA output:

Peers detail:
 washington.anon-DK9Lf.domain:
  NetBird IP: 100.91.0.186
  Public key: KQIjQLtUaZM9J30rBp2AxHC4nrvn8neHA7Vg1DURkFg=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 jacques.anon-DK9Lf.domain:
  NetBird IP: 100.91.2.189
  Public key: zAEXmNz59Fr5IF+bLp+z5rc0THc/bhJJ+o4U8jHl+3U=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 juniper.anon-DK9Lf.domain:
  NetBird IP: 100.91.12.9
  Public key: 0BwIRdWYsyZxQJHdy/GxODrwRzfesIOI0t5JtvDoWRg=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): host/prflx
  ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.0:51820
  Relay server address: 
  Last connection update: 4 minutes, 26 seconds ago
  Last WireGuard handshake: 13 seconds ago
  Transfer status (received/sent) 95.6 KiB/17.4 KiB
  Quantum resistance: true
  Routes: -
  Latency: 226.080667ms

 iphone-admin.anon-DK9Lf.domain:
  NetBird IP: 100.91.41.130
  Public key: 8y2qoR39K7K5Vv6hABNKqRVEdZ/FkHBZKRhWqssNGS0=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 little-hills-live-stream.anon-DK9Lf.domain:
  NetBird IP: 100.91.41.180
  Public key: yBhdVf0uxhuvaAr4tVbFDLUWnTWg/JCOviH5T3KmphM=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): host/prflx
  ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.1:51820
  Relay server address: 
  Last connection update: 4 minutes, 44 seconds ago
  Last WireGuard handshake: 30 seconds ago
  Transfer status (received/sent) 72.1 KiB/11.0 KiB
  Quantum resistance: true
  Routes: -
  Latency: 206.283666ms

 cyprus.anon-DK9Lf.domain:
  NetBird IP: 100.91.63.165
  Public key: gqSAS+yo0Qp3RhqWaWlY0qyhLYugQ0+6HFIJAJNQZ24=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): host/host
  ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.2:51820
  Relay server address: rels://cyprus.serverforest.com:443
  Last connection update: 4 minutes, 45 seconds ago
  Last WireGuard handshake: 31 seconds ago
  Transfer status (received/sent) 456 B/1.1 KiB
  Quantum resistance: true
  Routes: -
  Latency: 202.555375ms

 mastodon1.anon-DK9Lf.domain:
  NetBird IP: 100.91.83.133
  Public key: gc7H34F3uuqW1oodfgHy5VyOU80AWPyiMZKPTWAoeV0=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 rosalind.anon-DK9Lf.domain:
  NetBird IP: 100.91.87.250
  Public key: 8ulzaG4yTm9RqIYMwRQXkw4LB7LDdhXy1ocdNCuEqBA=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): srflx/prflx
  ICE candidate endpoints (Local/Remote): 198.51.100.3:51820/198.51.100.4:48521
  Relay server address: 
  Last connection update: 4 minutes, 17 seconds ago
  Last WireGuard handshake: Now
  Transfer status (received/sent) 1008 B/804 B
  Quantum resistance: true
  Routes: -
  Latency: 202.577958ms

 little-hills-slides.anon-DK9Lf.domain:
  NetBird IP: 100.91.99.242
  Public key: PZEt9DoVoL3qataY9Oc0uyBtbbmk0Z7KgjslUGDoslk=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 touchstone.anon-DK9Lf.domain:
  NetBird IP: 100.91.112.131
  Public key: PAzQjGnO5xftL4rgeX9SdkajCjEJA3A+iViMbXoPgXE=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): host/srflx
  ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.5:1053
  Relay server address: rels://cyprus.serverforest.com:443
  Last connection update: 5 minutes, 23 seconds ago
  Last WireGuard handshake: 1 minute, 14 seconds ago
  Transfer status (received/sent) 276 B/924 B
  Quantum resistance: true
  Routes: -
  Latency: 26.766667ms

 independence.anon-DK9Lf.domain:
  NetBird IP: 100.91.122.117
  Public key: wwROJuAi9t5d7W8DnF78sdMTm13iDZ9YcrtjjHtIYDM=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): srflx/prflx
  ICE candidate endpoints (Local/Remote): 198.51.100.3:51820/198.51.100.5:51820
  Relay server address: rels://cyprus.serverforest.com:443
  Last connection update: 4 minutes, 45 seconds ago
  Last WireGuard handshake: 1 minute, 51 seconds ago
  Transfer status (received/sent) 360 B/716 B
  Quantum resistance: true
  Routes: -
  Latency: 26.469916ms

 spruce.anon-DK9Lf.domain:
  NetBird IP: 100.91.147.59
  Public key: 0sA1GjrlFs+yPKlh7CARYIoFA/Ydsa4Tq/jnpLw1axk=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): host/prflx
  ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.6:51820
  Relay server address: 
  Last connection update: 4 minutes, 26 seconds ago
  Last WireGuard handshake: 20 seconds ago
  Transfer status (received/sent) 97.9 KiB/12.7 KiB
  Quantum resistance: true
  Routes: -
  Latency: 812.671042ms

 franklin.anon-DK9Lf.domain:
  NetBird IP: 100.91.150.140
  Public key: v9F8qsB+L4fpvuTv9B8NiD27cx6h6dzVMC0XBwtw4WA=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 mesquite.anon-DK9Lf.domain:
  NetBird IP: 100.91.155.86
  Public key: vp6GLJc22GQXj2Ht5deowZp0OA8kG7XJS1kYl3zc6lI=
  Status: Connected
  -- detail --
  Connection type: P2P
  ICE candidate (Local/Remote): host/prflx
  ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.7:51820
  Relay server address: rels://cyprus.serverforest.com:443
  Last connection update: 4 minutes, 45 seconds ago
  Last WireGuard handshake: 1 minute, 50 seconds ago
  Transfer status (received/sent) 392 B/716 B
  Quantum resistance: true
  Routes: -
  Latency: 347.723708ms

 miranda.anon-DK9Lf.domain:
  NetBird IP: 100.91.170.233
  Public key: D2k3MtkmfFLj9ZuJC/3KWEW1XhMesLNpHHz8P/86q2Q=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 ipad-admin.anon-DK9Lf.domain:
  NetBird IP: 100.91.178.26
  Public key: a+sg6th5zv4wl9zCN5/q5C3O8sZQh2SwgC/8gJZuyjQ=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 boaz.anon-DK9Lf.domain:
  NetBird IP: 100.91.182.98
  Public key: ydFumIBVUwCGBjx5Xh0pZPW1G6kFq2v+8DPNz1XYkRE=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 rahab.anon-DK9Lf.domain:
  NetBird IP: 100.91.203.23
  Public key: hxczQ9TIXjpDAFHDVzwjH6aDPlC5l5GcTj0LEmhgfRQ=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

windowspc.anon-DK9Lf.domain:
  NetBird IP: 100.91.212.35
  Public key: eDP33MB5NvltMsSq9XEoxYQXoBfJjLgX9BkA3/FjKnY=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 phebe.anon-DK9Lf.domain:
  NetBird IP: 100.91.224.117
  Public key: 0Bi8tUwaKffJVD69HXxQ6RbG+wdI1npXViS4Crw+yls=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

 ipad-admin-1.anon-DK9Lf.domain:
  NetBird IP: 100.91.251.74
  Public key: nheiiB0C3H5uYy+cDvWj34o9nKotHZwNTZ1lHjCB4UQ=
  Status: Disconnected
  -- detail --
  Connection type: 
  ICE candidate (Local/Remote): -/-
  ICE candidate endpoints (Local/Remote): -/-
  Relay server address: 
  Last connection update: -
  Last WireGuard handshake: -
  Transfer status (received/sent) 0 B/0 B
  Quantum resistance: false (remote didn't enable quantum resistance)
  Routes: -
  Latency: 0s

OS: darwin/arm64
Daemon version: 0.29.2
CLI version: 0.29.2
Management: Connected to https://anon1.anon-r6ORu.domain:443
Signal: Connected to https://anon1.anon-r6ORu.domain:30006
Relays: 
  [stun:anon1.anon-r6ORu.domain:3478] is Available
  [turn:anon1.anon-r6ORu.domain:3478?transport=udp] is Available
  [rels://anon1.anon-r6ORu.domain:443] is Available
Nameservers: 
FQDN: falstaff.anon-DK9Lf.domain
NetBird IP: 100.91.122.186/16
Interface type: Userspace
Quantum resistance: true (permissive)
Routes: -
Peers count: 8/21 Connected

Do you face any (non-mobile) client issues?

Yes, the issue prevents clients from functioning. Presently most clients cannot connect, although a few consistently do connect. There is no rhyme or reason I've been able to discern: with two clients in the same location, one consistently connects and one does not; the variation does not appear to relate to platform (some of what works is MacOS, some are running Debian Linux). Reauthorizing the clients with a new setup key doesn't seem to change things for the worse or better -- it is like they are "stuck" either working or not.

(Although all clients show the Signal error given above at least part of the time.)

Additional context

I'm using a modified version of the docker-compose.yml that was available back in December 2023. It's been upgraded to add the new relay container, remove Caddy (as noted above as part of troubleshooting), expose the NGINX SSL cert to Signal, etc. Because it is from last year, it uses CockroachDB instead of PostgreSQL. I've wondered about finding a way to migrate cleanly to PostgreSQL, though I don't know if that'd materially affect this problem or not.

My docker-compose.yml:

version: "3.4"
services:
  # Caddy reverse proxy
#  caddy:
#    image: caddy
#    restart: unless-stopped
#    networks: [ netbird ]
#    #ports:
#    #  - '443:443'
#    #  - '80:80'
#    #  - '8080:8080'
#    volumes:
#      - netbird_caddy_data:/data
#      - ./Caddyfile:/etc/caddy/Caddyfile
  relay:
    image: netbirdio/relay:latest
    restart: unless-stopped
    networks: [netbird]
    ports:
      - '30005:80'
    env_file:
      - ./relay.env
    logging:
      driver: "json-file"
      options:
        max-size: "500m"
        max-file: "2"

  #UI dashboard
  dashboard:
    image: netbirdio/dashboard:latest
    restart: unless-stopped
    networks: [netbird]
    ports: 
      - '30001:80'
    env_file:
      - ./dashboard.env
  # Signal
  signal:
    image: netbirdio/signal:latest
    restart: unless-stopped
    networks: [netbird]
    ports:
      - '30002:80'
      - '30006:443'
    command: [ "--log-file", "console","--log-level","debug","--cert-file","/ssl/fullchain.pem","--cert-key","/ssl/privkey.pem" ]
    volumes:
      - /etc/letsencrypt/live/anon1.anon-r6ORu.domain/fullchain.pem:/ssl/fullchain.pem:ro
      - /etc/letsencrypt/live/anon1.anon-r6ORu.domain/privkey.pem:/ssl/privkey.pem:ro
  # Management
  management:
    image: netbirdio/management:latest
    restart: unless-stopped
    networks: [netbird]
    ports:
      - '30003:80'
    volumes:
      - netbird_management:/var/lib/netbird
      - ./management.json:/etc/netbird/management.json
    command: [
      "--port", "80",
      "--log-file", "console",
      "--log-level", "info",
      "--disable-anonymous-metrics=false",
      "--single-account-mode-domain=anon2.domain",
      "--dns-domain=anon2.domain",
      "--idp-sign-key-refresh-enabled",
    ]
  # Coturn, AKA relay server
  coturn:
    image: coturn/coturn
    restart: unless-stopped
    domainname: netbird.relay.selfhosted
    volumes:
      - ./turnserver.conf:/etc/turnserver.conf:ro
    network_mode: host
    command:
      - -c /etc/turnserver.conf
  # Zitadel - identity provider
  zitadel:
    restart: 'always'
    networks: [netbird]
    image: 'ghcr.io/zitadel/zitadel:v2.31.3'
    command: 'start-from-init --masterkeyFromEnv --tlsMode external'
    ports:
      - '30004:8080'
    env_file:
      - ./zitadel.env
    depends_on:
      crdb:
        condition: 'service_healthy'
    volumes:
      - ./machinekey:/machinekey
      - netbird_zitadel_certs:/crdb-certs:ro
  # CockroachDB for zitadel
  crdb:
    restart: 'always'
    networks: [netbird]
    image: 'cockroachdb/cockroach:v22.2.2'
    command: 'start-single-node --advertise-addr crdb'
    volumes:
      - netbird_crdb_data:/cockroach/cockroach-data
      - netbird_crdb_certs:/cockroach/certs
      - netbird_zitadel_certs:/zitadel-certs
    healthcheck:
      test: [ "CMD", "curl", "-f", "http://localhost:8080/health?ready=1" ]
      interval: '10s'
      timeout: '30s'
      retries: 5
      start_period: '20s'



volumes:
  netbird_management:
  netbird_caddy_data:
  netbird_crdb_data:
  netbird_crdb_certs:
  netbird_zitadel_certs:

networks:
  netbird:
Originally created by @trbutler on GitHub (Sep 20, 2024). **Describe the problem** I frequently receive `Signal: Disconnected, reason: rpc error: code = DeadlineExceeded desc = context deadline exceeded.` on all of my Netbird clients. The issue appears to be degrading from something that caused intermittent communications problems to a situation where Netbird is almost completely non-functional to most of my clients. Inexplicably a few continue to work. I've tried adapting my Netbird "quick start" self-hosted configuration to alleviate the issue. I moved from using Caddy to NGINX for reverse proxy. This sped things up a fair amount and reduced resource usage, but didn't fix the issue. I also tried directly exposing Signal (which I had Docker translate from 443 to port 30006) while giving it access to NGINX's SSL certificate, so that a reverse proxy was not involved at all. None of these three different arrangements resolved the issue. When proxied through NGINX, the NGINX error log is filled with entries like this: ``` 2024/09/20 09:02:12 [error] 616106#616106: *120768 upstream rejected request with error 0 while reading response header from upstream, client: [client IP address], server: anon1.anon-r6ORu.domain, request: "POST /signalexchange.SignalExchange/Send HTTP/2.0", upstream: "grpcs://127.0.0.1:30006", host: "cyprus.serverforest.com:443" ``` The Signal docker container doesn't show anything unusual, even when set to debug mode on the logs; it simply shows many messages being conveyed between peers. **To Reproduce** Steps to reproduce the behavior: 1. Run `netbird up` 2. Wait a moment and `netbird status` will report the issue. **Expected behavior** I'd expect Netbird to be able to connect to the Signal server without issue. **Are you using NetBird Cloud?** I'm using self-hosted netbird. **NetBird version** `netbird version` **NetBird status -dA output:** ``` Peers detail: washington.anon-DK9Lf.domain: NetBird IP: 100.91.0.186 Public key: KQIjQLtUaZM9J30rBp2AxHC4nrvn8neHA7Vg1DURkFg= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s jacques.anon-DK9Lf.domain: NetBird IP: 100.91.2.189 Public key: zAEXmNz59Fr5IF+bLp+z5rc0THc/bhJJ+o4U8jHl+3U= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s juniper.anon-DK9Lf.domain: NetBird IP: 100.91.12.9 Public key: 0BwIRdWYsyZxQJHdy/GxODrwRzfesIOI0t5JtvDoWRg= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): host/prflx ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.0:51820 Relay server address: Last connection update: 4 minutes, 26 seconds ago Last WireGuard handshake: 13 seconds ago Transfer status (received/sent) 95.6 KiB/17.4 KiB Quantum resistance: true Routes: - Latency: 226.080667ms iphone-admin.anon-DK9Lf.domain: NetBird IP: 100.91.41.130 Public key: 8y2qoR39K7K5Vv6hABNKqRVEdZ/FkHBZKRhWqssNGS0= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s little-hills-live-stream.anon-DK9Lf.domain: NetBird IP: 100.91.41.180 Public key: yBhdVf0uxhuvaAr4tVbFDLUWnTWg/JCOviH5T3KmphM= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): host/prflx ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.1:51820 Relay server address: Last connection update: 4 minutes, 44 seconds ago Last WireGuard handshake: 30 seconds ago Transfer status (received/sent) 72.1 KiB/11.0 KiB Quantum resistance: true Routes: - Latency: 206.283666ms cyprus.anon-DK9Lf.domain: NetBird IP: 100.91.63.165 Public key: gqSAS+yo0Qp3RhqWaWlY0qyhLYugQ0+6HFIJAJNQZ24= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): host/host ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.2:51820 Relay server address: rels://cyprus.serverforest.com:443 Last connection update: 4 minutes, 45 seconds ago Last WireGuard handshake: 31 seconds ago Transfer status (received/sent) 456 B/1.1 KiB Quantum resistance: true Routes: - Latency: 202.555375ms mastodon1.anon-DK9Lf.domain: NetBird IP: 100.91.83.133 Public key: gc7H34F3uuqW1oodfgHy5VyOU80AWPyiMZKPTWAoeV0= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s rosalind.anon-DK9Lf.domain: NetBird IP: 100.91.87.250 Public key: 8ulzaG4yTm9RqIYMwRQXkw4LB7LDdhXy1ocdNCuEqBA= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): srflx/prflx ICE candidate endpoints (Local/Remote): 198.51.100.3:51820/198.51.100.4:48521 Relay server address: Last connection update: 4 minutes, 17 seconds ago Last WireGuard handshake: Now Transfer status (received/sent) 1008 B/804 B Quantum resistance: true Routes: - Latency: 202.577958ms little-hills-slides.anon-DK9Lf.domain: NetBird IP: 100.91.99.242 Public key: PZEt9DoVoL3qataY9Oc0uyBtbbmk0Z7KgjslUGDoslk= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s touchstone.anon-DK9Lf.domain: NetBird IP: 100.91.112.131 Public key: PAzQjGnO5xftL4rgeX9SdkajCjEJA3A+iViMbXoPgXE= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): host/srflx ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.5:1053 Relay server address: rels://cyprus.serverforest.com:443 Last connection update: 5 minutes, 23 seconds ago Last WireGuard handshake: 1 minute, 14 seconds ago Transfer status (received/sent) 276 B/924 B Quantum resistance: true Routes: - Latency: 26.766667ms independence.anon-DK9Lf.domain: NetBird IP: 100.91.122.117 Public key: wwROJuAi9t5d7W8DnF78sdMTm13iDZ9YcrtjjHtIYDM= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): srflx/prflx ICE candidate endpoints (Local/Remote): 198.51.100.3:51820/198.51.100.5:51820 Relay server address: rels://cyprus.serverforest.com:443 Last connection update: 4 minutes, 45 seconds ago Last WireGuard handshake: 1 minute, 51 seconds ago Transfer status (received/sent) 360 B/716 B Quantum resistance: true Routes: - Latency: 26.469916ms spruce.anon-DK9Lf.domain: NetBird IP: 100.91.147.59 Public key: 0sA1GjrlFs+yPKlh7CARYIoFA/Ydsa4Tq/jnpLw1axk= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): host/prflx ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.6:51820 Relay server address: Last connection update: 4 minutes, 26 seconds ago Last WireGuard handshake: 20 seconds ago Transfer status (received/sent) 97.9 KiB/12.7 KiB Quantum resistance: true Routes: - Latency: 812.671042ms franklin.anon-DK9Lf.domain: NetBird IP: 100.91.150.140 Public key: v9F8qsB+L4fpvuTv9B8NiD27cx6h6dzVMC0XBwtw4WA= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s mesquite.anon-DK9Lf.domain: NetBird IP: 100.91.155.86 Public key: vp6GLJc22GQXj2Ht5deowZp0OA8kG7XJS1kYl3zc6lI= Status: Connected -- detail -- Connection type: P2P ICE candidate (Local/Remote): host/prflx ICE candidate endpoints (Local/Remote): 192.168.0.48:51820/198.51.100.7:51820 Relay server address: rels://cyprus.serverforest.com:443 Last connection update: 4 minutes, 45 seconds ago Last WireGuard handshake: 1 minute, 50 seconds ago Transfer status (received/sent) 392 B/716 B Quantum resistance: true Routes: - Latency: 347.723708ms miranda.anon-DK9Lf.domain: NetBird IP: 100.91.170.233 Public key: D2k3MtkmfFLj9ZuJC/3KWEW1XhMesLNpHHz8P/86q2Q= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s ipad-admin.anon-DK9Lf.domain: NetBird IP: 100.91.178.26 Public key: a+sg6th5zv4wl9zCN5/q5C3O8sZQh2SwgC/8gJZuyjQ= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s boaz.anon-DK9Lf.domain: NetBird IP: 100.91.182.98 Public key: ydFumIBVUwCGBjx5Xh0pZPW1G6kFq2v+8DPNz1XYkRE= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s rahab.anon-DK9Lf.domain: NetBird IP: 100.91.203.23 Public key: hxczQ9TIXjpDAFHDVzwjH6aDPlC5l5GcTj0LEmhgfRQ= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s windowspc.anon-DK9Lf.domain: NetBird IP: 100.91.212.35 Public key: eDP33MB5NvltMsSq9XEoxYQXoBfJjLgX9BkA3/FjKnY= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s phebe.anon-DK9Lf.domain: NetBird IP: 100.91.224.117 Public key: 0Bi8tUwaKffJVD69HXxQ6RbG+wdI1npXViS4Crw+yls= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s ipad-admin-1.anon-DK9Lf.domain: NetBird IP: 100.91.251.74 Public key: nheiiB0C3H5uYy+cDvWj34o9nKotHZwNTZ1lHjCB4UQ= Status: Disconnected -- detail -- Connection type: ICE candidate (Local/Remote): -/- ICE candidate endpoints (Local/Remote): -/- Relay server address: Last connection update: - Last WireGuard handshake: - Transfer status (received/sent) 0 B/0 B Quantum resistance: false (remote didn't enable quantum resistance) Routes: - Latency: 0s OS: darwin/arm64 Daemon version: 0.29.2 CLI version: 0.29.2 Management: Connected to https://anon1.anon-r6ORu.domain:443 Signal: Connected to https://anon1.anon-r6ORu.domain:30006 Relays: [stun:anon1.anon-r6ORu.domain:3478] is Available [turn:anon1.anon-r6ORu.domain:3478?transport=udp] is Available [rels://anon1.anon-r6ORu.domain:443] is Available Nameservers: FQDN: falstaff.anon-DK9Lf.domain NetBird IP: 100.91.122.186/16 Interface type: Userspace Quantum resistance: true (permissive) Routes: - Peers count: 8/21 Connected ``` **Do you face any (non-mobile) client issues?** Yes, the issue prevents clients from functioning. Presently most clients cannot connect, although a few consistently _do_ connect. There is no rhyme or reason I've been able to discern: with two clients in the same location, one consistently connects and one does not; the variation does not appear to relate to platform (some of what works is MacOS, some are running Debian Linux). Reauthorizing the clients with a new setup key doesn't seem to change things for the worse or better -- it is like they are "stuck" either working or not. (Although all clients show the Signal error given above at least part of the time.) **Additional context** I'm using a modified version of the docker-compose.yml that was available back in December 2023. It's been upgraded to add the new relay container, remove Caddy (as noted above as part of troubleshooting), expose the NGINX SSL cert to Signal, etc. Because it is from last year, it uses CockroachDB instead of PostgreSQL. I've wondered about finding a way to migrate cleanly to PostgreSQL, though I don't know if that'd materially affect this problem or not. My docker-compose.yml: ``` version: "3.4" services: # Caddy reverse proxy # caddy: # image: caddy # restart: unless-stopped # networks: [ netbird ] # #ports: # # - '443:443' # # - '80:80' # # - '8080:8080' # volumes: # - netbird_caddy_data:/data # - ./Caddyfile:/etc/caddy/Caddyfile relay: image: netbirdio/relay:latest restart: unless-stopped networks: [netbird] ports: - '30005:80' env_file: - ./relay.env logging: driver: "json-file" options: max-size: "500m" max-file: "2" #UI dashboard dashboard: image: netbirdio/dashboard:latest restart: unless-stopped networks: [netbird] ports: - '30001:80' env_file: - ./dashboard.env # Signal signal: image: netbirdio/signal:latest restart: unless-stopped networks: [netbird] ports: - '30002:80' - '30006:443' command: [ "--log-file", "console","--log-level","debug","--cert-file","/ssl/fullchain.pem","--cert-key","/ssl/privkey.pem" ] volumes: - /etc/letsencrypt/live/anon1.anon-r6ORu.domain/fullchain.pem:/ssl/fullchain.pem:ro - /etc/letsencrypt/live/anon1.anon-r6ORu.domain/privkey.pem:/ssl/privkey.pem:ro # Management management: image: netbirdio/management:latest restart: unless-stopped networks: [netbird] ports: - '30003:80' volumes: - netbird_management:/var/lib/netbird - ./management.json:/etc/netbird/management.json command: [ "--port", "80", "--log-file", "console", "--log-level", "info", "--disable-anonymous-metrics=false", "--single-account-mode-domain=anon2.domain", "--dns-domain=anon2.domain", "--idp-sign-key-refresh-enabled", ] # Coturn, AKA relay server coturn: image: coturn/coturn restart: unless-stopped domainname: netbird.relay.selfhosted volumes: - ./turnserver.conf:/etc/turnserver.conf:ro network_mode: host command: - -c /etc/turnserver.conf # Zitadel - identity provider zitadel: restart: 'always' networks: [netbird] image: 'ghcr.io/zitadel/zitadel:v2.31.3' command: 'start-from-init --masterkeyFromEnv --tlsMode external' ports: - '30004:8080' env_file: - ./zitadel.env depends_on: crdb: condition: 'service_healthy' volumes: - ./machinekey:/machinekey - netbird_zitadel_certs:/crdb-certs:ro # CockroachDB for zitadel crdb: restart: 'always' networks: [netbird] image: 'cockroachdb/cockroach:v22.2.2' command: 'start-single-node --advertise-addr crdb' volumes: - netbird_crdb_data:/cockroach/cockroach-data - netbird_crdb_certs:/cockroach/certs - netbird_zitadel_certs:/zitadel-certs healthcheck: test: [ "CMD", "curl", "-f", "http://localhost:8080/health?ready=1" ] interval: '10s' timeout: '30s' retries: 5 start_period: '20s' volumes: netbird_management: netbird_caddy_data: netbird_crdb_data: netbird_crdb_certs: netbird_zitadel_certs: networks: netbird: ```
saavagebueno added the waiting-feedbacktriage-needed labels 2025-11-20 05:27:08 -05:00
Author
Owner

@trbutler commented on GitHub (Sep 22, 2024):

I still haven't been able to solve this, but I did setup a second Netbird server and moved the peers over to it. So far, I've not been seeing the same issue. So it makes me think perhaps it is something to do with the upgrade path to the latest containers? It still worries me, though, since it took a clean slate with all the peers being manually reconnected to a new installation of the server to get things up and running again. I'm going to wipe the old server, but have left it up for the moment if you have any debug data about it you'd like before I wipe it.

@trbutler commented on GitHub (Sep 22, 2024): I still haven't been able to solve this, but I did setup a second Netbird server and moved the peers over to it. So far, I've not been seeing the same issue. So it makes me think perhaps it is something to do with the upgrade path to the latest containers? It still worries me, though, since it took a clean slate with all the peers being manually reconnected to a new installation of the server to get things up and running again. I'm going to wipe the old server, but have left it up for the moment if you have any debug data about it you'd like before I wipe it.
Author
Owner

@ser commented on GitHub (Nov 3, 2024):

I also started to observe recently random disconnections and it's getting extremely annoying, as the only help is to 'netbird down && netbird up' manually or deploying a bot which does it for me.

@ser commented on GitHub (Nov 3, 2024): I also started to observe recently random disconnections and it's getting extremely annoying, as the only help is to 'netbird down && netbird up' manually or deploying a bot which does it for me.
Author
Owner

@mathiash98 commented on GitHub (Nov 6, 2024):

I was about to write an issue on this, but found this one.

My setup is quite similar, Nginx reverse proxy using the new relay server hosted in Docker.

  • See https://github.com/netbirdio/netbird/issues/536#issuecomment-2409099492 for my config files

  • 69 peers connected

  • Works fine for some days with around 700-1000 web connections to my Virtual Machine (I am also running openvpn as a backup on this server)

  • CPU and RAM usage on the virtual machine is very low

  • After some time I see my mac is unable to communicate with peers. Solved by netbird down && netbird up

  • After some days I see mac mac is unable to connect to the server at all

    • Other Linux clients are also losing connections to around 50% of the peers or more.
  • Web connections on virtual machine spikes to around 2500 sudo netstat -tupn | grep ESTABLISHED | wc -l

  • var/log/nginx/error.log is filled with 2024/11/06 09:19:37 [error] 6082#6082: *9625812 connect() failed (111: Connection refused) while connecting to upstream, client: 118.127.90.38, server: vpn.redacted.com, request: "GET /relay HTTP/1.1", upstream: "http://127.0.0.1:33080/relay", host: "vpn.redacted.com:443"

  • Netbird relay docker container has not logged anything the last 12 hours

netbird-relay-1  | 2024-11-05T20:03:03Z ERRO [peer_id: sha-/tJDJpROT5wcmiuAn75Fd48IKWfsw28/56S7xsujmr8=] relay/server/peer.go:61: failed to read message: failed to get reader: failed to read frame header: EOF
netbird-relay-1  | 2024-11-05T20:03:03Z DEBG [peer_id: sha-/tJDJpROT5wcmiuAn75Fd48IKWfsw28/56S7xsujmr8=] relay/server/relay.go:137: relay connection closed
netbird-relay-1  | 2024-11-05T20:03:03Z ERRO [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/peer.go:61: failed to read message: failed to get reader: failed to read frame header: EOF
netbird-relay-1  | 2024-11-05T20:03:03Z DEBG [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/relay.go:137: relay connection closed

All netbird docker containers are running 0.31.0

  • Most of the clients run either 0.30.1 or 0.31.0

This is fixed temporarily by restarting the relay docker container and after some minutes all connections are back to normal.

Relay server logs after restart:

netbird-relay-1  | 2024-11-05T20:03:03Z ERRO [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/peer.go:61: failed to read message: failed to get reader: failed to read frame header: EOF
netbird-relay-1  | 2024-11-05T20:03:03Z DEBG [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/relay.go:137: relay connection closed
netbird-relay-1  | 2024-11-06T09:22:00Z INFO relay/server/listener/ws/listener.go:57: stop WS listener
netbird-relay-1  | 2024-11-06T09:22:00Z INFO relay/server/listener/ws/listener.go:61: WS listener stopped
netbird-relay-1  | 2024-11-06T09:22:00Z INFO relay/server/relay.go:151: close connection with all peers
netbird-relay-1  | 2024-11-06T09:22:00Z INFO relay/cmd/root.go:167: shutting down metrics server
netbird-relay-1  | 2024-11-06T09:22:02Z INFO relay/cmd/root.go:124: running metrics server: :9090/metrics
netbird-relay-1  | 2024-11-06T09:22:02Z INFO relay/cmd/root.go:149: server will be available on: rels://vpn.redacted.com:443
netbird-relay-1  | 2024-11-06T09:22:02Z INFO relay/server/listener/ws/listener.go:39: WS server listening address: :33080
netbird-relay-1  | 2024-11-06T09:22:02Z INFO [peer_id: sha-LCgx/QPtPB8h7mVLG/pf7yHbnQLcQY5rovFiwGh+uL0=] relay/server/relay.go:129: peer connected from: 172.23.0.1:42026
netbird-relay-1  | 2024-11-06T09:22:03Z INFO [peer_id: sha-qKRnLyszhUTWfhKUe+tGoU8ihsjDUQMtr3Qrs3ZHyUk=] relay/server/relay.go:129: peer connected from: 172.23.0.1:42022
netbird-relay-1  | 2024-11-06T09:22:03Z INFO [peer_id: sha-/tJDJpROT5wcmiuAn75Fd48IKWfsw28/56S7xsujmr8=] relay/server/relay.go:129: peer connected from: 172.23.0.1:42062

Edit:

I have discussed this topic with collse which had some unrelated issues: https://github.com/netbirdio/netbird/issues/2043#issuecomment-2384470230 and have now removed my timeout settings in my nginx config. I will revert back to see if the default 60s nginx timeout is better than my 3600s timeout

Edit2:

My issue was fixed in issue https://github.com/netbirdio/netbird/issues/2880
Now everything is working smoothly for us with 90 peers

@mathiash98 commented on GitHub (Nov 6, 2024): I was about to write an issue on this, but found this one. My setup is quite similar, Nginx reverse proxy using the new relay server hosted in Docker. - See https://github.com/netbirdio/netbird/issues/536#issuecomment-2409099492 for my config files - 69 peers connected - Works fine for some days with around 700-1000 web connections to my Virtual Machine (I am also running openvpn as a backup on this server) - CPU and RAM usage on the virtual machine is very low - After some time I see my mac is unable to communicate with peers. Solved by `netbird down && netbird up` - After some days I see mac mac is unable to connect to the server at all - Other Linux clients are also losing connections to around 50% of the peers or more. - Web connections on virtual machine spikes to around 2500 `sudo netstat -tupn | grep ESTABLISHED | wc -l` - `var/log/nginx/error.log` is filled with `2024/11/06 09:19:37 [error] 6082#6082: *9625812 connect() failed (111: Connection refused) while connecting to upstream, client: 118.127.90.38, server: vpn.redacted.com, request: "GET /relay HTTP/1.1", upstream: "http://127.0.0.1:33080/relay", host: "vpn.redacted.com:443"` - Netbird relay docker container has not logged anything the last 12 hours ``` netbird-relay-1 | 2024-11-05T20:03:03Z ERRO [peer_id: sha-/tJDJpROT5wcmiuAn75Fd48IKWfsw28/56S7xsujmr8=] relay/server/peer.go:61: failed to read message: failed to get reader: failed to read frame header: EOF netbird-relay-1 | 2024-11-05T20:03:03Z DEBG [peer_id: sha-/tJDJpROT5wcmiuAn75Fd48IKWfsw28/56S7xsujmr8=] relay/server/relay.go:137: relay connection closed netbird-relay-1 | 2024-11-05T20:03:03Z ERRO [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/peer.go:61: failed to read message: failed to get reader: failed to read frame header: EOF netbird-relay-1 | 2024-11-05T20:03:03Z DEBG [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/relay.go:137: relay connection closed ``` All netbird docker containers are running 0.31.0 - Most of the clients run either 0.30.1 or 0.31.0 This is fixed temporarily by restarting the relay docker container and after some minutes all connections are back to normal. Relay server logs after restart: ``` netbird-relay-1 | 2024-11-05T20:03:03Z ERRO [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/peer.go:61: failed to read message: failed to get reader: failed to read frame header: EOF netbird-relay-1 | 2024-11-05T20:03:03Z DEBG [peer_id: sha-hlFl1WW6IRwCpdKih6R6v1d3NdUfZihNfaIMTPrZK9Q=] relay/server/relay.go:137: relay connection closed netbird-relay-1 | 2024-11-06T09:22:00Z INFO relay/server/listener/ws/listener.go:57: stop WS listener netbird-relay-1 | 2024-11-06T09:22:00Z INFO relay/server/listener/ws/listener.go:61: WS listener stopped netbird-relay-1 | 2024-11-06T09:22:00Z INFO relay/server/relay.go:151: close connection with all peers netbird-relay-1 | 2024-11-06T09:22:00Z INFO relay/cmd/root.go:167: shutting down metrics server netbird-relay-1 | 2024-11-06T09:22:02Z INFO relay/cmd/root.go:124: running metrics server: :9090/metrics netbird-relay-1 | 2024-11-06T09:22:02Z INFO relay/cmd/root.go:149: server will be available on: rels://vpn.redacted.com:443 netbird-relay-1 | 2024-11-06T09:22:02Z INFO relay/server/listener/ws/listener.go:39: WS server listening address: :33080 netbird-relay-1 | 2024-11-06T09:22:02Z INFO [peer_id: sha-LCgx/QPtPB8h7mVLG/pf7yHbnQLcQY5rovFiwGh+uL0=] relay/server/relay.go:129: peer connected from: 172.23.0.1:42026 netbird-relay-1 | 2024-11-06T09:22:03Z INFO [peer_id: sha-qKRnLyszhUTWfhKUe+tGoU8ihsjDUQMtr3Qrs3ZHyUk=] relay/server/relay.go:129: peer connected from: 172.23.0.1:42022 netbird-relay-1 | 2024-11-06T09:22:03Z INFO [peer_id: sha-/tJDJpROT5wcmiuAn75Fd48IKWfsw28/56S7xsujmr8=] relay/server/relay.go:129: peer connected from: 172.23.0.1:42062 ``` ## Edit: I have discussed this topic with collse which had some unrelated issues: https://github.com/netbirdio/netbird/issues/2043#issuecomment-2384470230 and have now removed my timeout settings in my nginx config. I will revert back to see if the default 60s nginx timeout is better than my 3600s timeout ## Edit2: My issue was fixed in issue https://github.com/netbirdio/netbird/issues/2880 Now everything is working smoothly for us with 90 peers
Author
Owner

@mgarces commented on GitHub (Nov 12, 2024):

hello, are you able to validate if release 0.31.1 fixes your issue?

@mgarces commented on GitHub (Nov 12, 2024): hello, are you able to validate if release [0.31.1](https://github.com/netbirdio/netbird/releases/tag/v0.31.1) fixes your issue?
Author
Owner

@nazarewk commented on GitHub (Apr 28, 2025):

Hello @trbutler,

We're currently reviewing our open issues and would like to verify if this problem still exists in the latest NetBird version.

Could you please confirm if the issue is still there?

We may close this issue temporarily if we don't hear back from you within 2 weeks, but feel free to reopen it with updated information.

Thanks for your contribution to improving the project!

@nazarewk commented on GitHub (Apr 28, 2025): Hello @trbutler, We're currently reviewing our open issues and would like to verify if this problem still exists in the [latest NetBird version](https://github.com/netbirdio/netbird/releases). Could you please confirm if the issue is still there? We may close this issue temporarily if we don't hear back from you within **2 weeks**, but feel free to reopen it with updated information. Thanks for your contribution to improving the project!
Author
Owner

@mlsmaycon commented on GitHub (Jun 1, 2025):

closing issue due to no recent feedback. Feel free to open a new one if the issue persist or reopen if this was a feature request.

@mlsmaycon commented on GitHub (Jun 1, 2025): closing issue due to no recent feedback. Feel free to open a new one if the issue persist or reopen if this was a feature request.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: SVI/netbird#1265