Per-route latency measurement for client route pick #1464

Open
opened 2025-11-20 05:31:02 -05:00 by saavagebueno · 1 comment

Originally created by @mohamed-essam on GitHub (Nov 30, 2024).

Is your feature request related to a problem? Please describe.
Currently the client connects routes to peers based only on the latency between the client and the peer; this doesn't take into account that some routes' targets could be geographically much further from one peer than another.

For example, if I have a route pointing to a resource in western Europe, two peers that can handle this route (one in western Europe and one in the western US), and the client is in the eastern US, the fastest path would be through the peer in Europe.

Describe the solution you'd like
Calculate the latency of each route from each routing peer. This information could be communicated either peer-to-peer or through the management service, which would keep a cache of per-peer, per-route latency as reported by each routing peer.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
I understand that this feature could be difficult to implement, since peers know nothing about which ports are open on a route's target or how to check its latency, but this could be configured by the user per route.

For example, when creating the route, the user can choose to use TCP 443 to check latency.

saavagebueno added the feature-request, client, routes labels 2025-11-20 05:31:02 -05:00

@mohamed-essam commented on GitHub (Dec 4, 2024):

I suggest the following approach:

  1. [Management] Add a new field to Route specifying the latency-check method (Protocol, IP/Domain, Port).
    1. [Dashboard] Add the necessary UI.
    2. [Management] Add the necessary APIs.
  2. [Client] ServerRouter calculates latency per route based on its configured latency-check settings.
  3. [Client] Store latency measurements and send latency reports with sync requests.
  4. [Management] Receive and store latency reports from peers.
  5. [Management] Send the available latency reports along with Route objects in pb.
  6. [Client] clientNetwork includes the peer-reported route latency plus the p2p latency in route selection.

Notes:

  1. Should the latencies be kept in an in-memory cache on the management side, or persisted in the Store?
  2. If I understand correctly, sync requests are only sent on the very first connection to management, or when the connection is interrupted and restored. Would there be a way for the client to send updates to management periodically, or, in this case, whenever a route's latency changes significantly?

Draft data structure diff (note: proto3 has no 16-bit scalar, so the port field uses `uint32`):

```diff
diff --git a/management/proto/management.proto b/management/proto/management.proto
index fe6a828b..5a0dd74c 100644
--- a/management/proto/management.proto
+++ b/management/proto/management.proto
@@ -59,6 +59,12 @@ message EncryptedMessage {
 message SyncRequest {
   // Meta data of the peer
   PeerSystemMeta meta = 1;
+  repeated LatencyReport latencyReport = 2;
+}
+
+message LatencyReport {
+  string RouteID = 1;
+  float Latency = 2;
 }
 
 // SyncResponse represents a state that should be applied to the local peer (e.g. Wiretrustee servers config as well as local peer and remote peers configs)
@@ -351,6 +357,16 @@ message Route {
   string NetID = 7;
   repeated string Domains = 8;
   bool keepRoute = 9;
+  LatencyCheck latencyCheck = 10;
+}
+
+message LatencyCheck {
+  bool Enabled = 1;
+  string Protocol = 2;
+  string Domain = 3;
+  string IP = 4;
+  uint32 Port = 5; // proto3 has no uint16; validate the range in code
+  float Latency = 6;
 }
 
 // DNSConfig represents a dns.Update
diff --git a/route/route.go b/route/route.go
index e23801e6..71bcbe72 100644
--- a/route/route.go
+++ b/route/route.go
@@ -45,10 +45,18 @@ const (
        DomainNetwork
 )
 
+const (
+       LatencyICMP LatencyProtocol = "ICMP"
+       LatencyTCP  LatencyProtocol = "TCP"
+       LatencyUDP  LatencyProtocol = "UDP"
+)
+
 type ID string
 
 type NetID string
 
+type LatencyProtocol string
+
 type HAMap map[HAUniqueID][]*Route
 
 // NetworkType route network type
@@ -101,6 +109,15 @@ type Route struct {
        Enabled             bool
        Groups              []string `gorm:"serializer:json"`
        AccessControlGroups []string `gorm:"serializer:json"`
+       LatencyCheck        LatencyCheck
+}
+
+type LatencyCheck struct {
+       Enabled  bool
+       Protocol LatencyProtocol
+       Domain   string
+       IP       netip.Addr
+       Port     uint16
 }
 
 // EventMeta returns activity event meta related to the route
@@ -125,6 +142,7 @@ func (r *Route) Copy() *Route {
                Enabled:             r.Enabled,
                Groups:              slices.Clone(r.Groups),
                AccessControlGroups: slices.Clone(r.AccessControlGroups),
+               LatencyCheck:        r.LatencyCheck,
        }
        return route
 }
@@ -150,7 +168,8 @@ func (r *Route) IsEqual(other *Route) bool {
                other.Enabled == r.Enabled &&
                slices.Equal(r.Groups, other.Groups) &&
                slices.Equal(r.PeerGroups, other.PeerGroups) &&
-               slices.Equal(r.AccessControlGroups, other.AccessControlGroups)
+               slices.Equal(r.AccessControlGroups, other.AccessControlGroups) &&
+               r.LatencyCheck == other.LatencyCheck
 }
```

Reference: SVI/netbird#1464