Add Maintenace Mode for the Router Peers #1795

Open
opened 2025-11-20 06:06:56 -05:00 by saavagebueno · 4 comments
Owner

Originally created by @danieleblanc87 on GitHub (Apr 7, 2025).

Is your feature request related to a problem? Please describe.
I often face situations where a router peer needs to undergo maintenance, but there is no easy way to automatically reroute traffic to a backup peer. This results in service interruptions or requires manual intervention, leading to potential delays and disruptions.

Describe the solution you'd like
I would like the ability to place a router peer into 'maintenance mode' and automatically notify clients to reconnect to a backup router peer that is not in maintenance mode. This would ensure high availability and provide a seamless user experience during maintenance, minimizing any potential impact.

Describe alternatives you've considered
Accept the impacts.

Additional context
This feature would enhance the flexibility and reliability of the network infrastructure, making maintenance operations smoother and reducing the risk of service disruptions.

Originally created by @danieleblanc87 on GitHub (Apr 7, 2025). **Is your feature request related to a problem? Please describe.** I often face situations where a router peer needs to undergo maintenance, but there is no easy way to automatically reroute traffic to a backup peer. This results in service interruptions or requires manual intervention, leading to potential delays and disruptions. **Describe the solution you'd like** I would like the ability to place a router peer into 'maintenance mode' and automatically notify clients to reconnect to a backup router peer that is not in maintenance mode. This would ensure high availability and provide a seamless user experience during maintenance, minimizing any potential impact. **Describe alternatives you've considered** Accept the impacts. **Additional context** This feature would enhance the flexibility and reliability of the network infrastructure, making maintenance operations smoother and reducing the risk of service disruptions.
saavagebueno added the feature-requestclientclient-ui labels 2025-11-20 06:06:56 -05:00
Author
Owner

@jordantrujillo-hl commented on GitHub (Apr 7, 2025):

Can't you just change the metric of your router peers? Lower number is the priority.

@jordantrujillo-hl commented on GitHub (Apr 7, 2025): Can't you just change the metric of your router peers? Lower number is the priority.
Author
Owner

@mlsmaycon commented on GitHub (Apr 8, 2025):

@danieleblanc87 In HA mode, the route should change to another routing peer once the peer goes offline. Is this not the case for your peers?

@mlsmaycon commented on GitHub (Apr 8, 2025): @danieleblanc87 In HA mode, the route should change to another routing peer once the peer goes offline. Is this not the case for your peers?
Author
Owner

@1nerdyguy commented on GitHub (Apr 8, 2025):

I can confirm the route changes. I believe what they are asking for is the ability to change it PRIOR to dropping a peer?

@1nerdyguy commented on GitHub (Apr 8, 2025): I can confirm the route changes. I believe what they are asking for is the ability to change it PRIOR to dropping a peer?
Author
Owner

@danieleblanc87 commented on GitHub (Apr 9, 2025):

@jordantrujillo-hl My configuration is self-hosted, and for routing, I use a group of peers with at least two routing peers. The metric is applied to the group, at least as I understand it.

@mlsmaycon That’s correct, but there is no any control over failover process. The feature request would enable better control of routing peers during maintenance activities, reducing the impact on users.
Specifically, I would like to prevent clients from connecting to a routing peer that is about to be restarted for maintenance, thus avoiding disruptions to user operations.

@danieleblanc87 commented on GitHub (Apr 9, 2025): @jordantrujillo-hl My configuration is self-hosted, and for routing, I use a group of peers with at least two routing peers. The metric is applied to the group, at least as I understand it. @mlsmaycon That’s correct, but there is no any control over failover process. The feature request would enable better control of routing peers during maintenance activities, reducing the impact on users. Specifically, I would like to prevent clients from connecting to a routing peer that is about to be restarted for maintenance, thus avoiding disruptions to user operations.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: SVI/netbird#1795