Bug #39669 for dhcp-public: [ISC-support #8226] Failover load balancing delay looks for secs

Mon, 01 Jun 2015 09:43:30 -0400 Brian Conry <bconry@isc.org> - Ticket created

Subject:

Failover load balancing delay looks for secs > configured value...

... while the non-failover load balancing delay looks for secs >= the configured value. This may be a bug, may have been intentional, may be too late to change, or any combination of the three. As it is right now, if someone needed to disable the failover load balance delay completely they have no way to do so. On Wed Mar 18 13:11:46 2015, bconry wrote: > On Wed Mar 18 07:11:37 2015, robert.willmann@commerzbank.com wrote: > > Do > > I understand it correctly that when " load balance max seconds" is > > set to zero (default), then it that case the problem wouldn't > > occour and both dhcp-servers would answer? > > Hi Robert, > > From looking at the code and consulting with our primary DHCP engineer, > a value of '0' for 'load balance max seconds' does *not* disable this > behavior. > > This is because the code looks for the value in the 'secs' field to be > greater than the configured value. So for a configured value of '0' > both peers will answer any time the 'secs' field is non-zero, but only > the peer responsible for the client MAC will respond when the 'secs' > field is exactly zero. > > Thanks, > Brian On Wed Mar 18 15:15:07 2015, bconry wrote: > On Wed Mar 18 14:02:44 2015, robert.willmann@commerzbank.com wrote: > > If so, then how do I get them to answer both even if the secs-field is > > zero? > > > > Mit freundlichen Grüßen > > Robert Willmann > > As the code currently stands, you can't. > > This has been evaluated as possibly being a bug, e.g. using '>' instead > of '>=' in the test. > > I've reviewed the draft failover specification and it refers to RFC 3074 > for the load balancing logic. That RFC says: > > " > If the parameter is configured, the server that is not supposed to > serve a specific request (based on the HBA and the STID hash), is > allowed to respond, after S seconds have elapsed since the client > first attempted to get service. A server MAY use the secs field in > the BOOTP header for determining the time since the client has been > trying to get service, or it MAY track repeated requests some other > way. > " > > Further, looking at how we handle load balancing outside of failover, > which is also based on RFC 3074, we are using '>=' for the comparison > there. > > So I would say that there's a very strong case for this behavior being > a bug, and thus something that we will address in our next set of > releases. > > Assuming I haven't overlooked something. :) > > > Now, with regard to considering using an actual working '0' for this > value (if you could), understand that means that both DHCP servers will > respond to every request they see, every time. Aside from increasing > network traffic this has some other implications that are more subtle. > > First, it seems that currently you're relying on the client experience > to validate that you have your relays configured properly. If you > continue with this as your only validation method then you won't notice > an issue until the one server handling the load is down (whether planned > or unplanned) and no clients get addresses. > > Second, when a DHCP server makes an OFFER to a client, unless that > client already has an active lease one must be allocated to that client > temporarily. This allocation is for a duration that does not exceed > the MCLT (and is often less, depending on the failover configuration), > and so isnot communicated to the failover peer. With the load > balancing delay the normal case is that the lease association then gets > confirmed, committed, and then communicated. Without a load balancing > delay both peers will allocate a lease for the client. If the network > segment is busy and near capacity this could cause 'soft' outages where > each peer has leases tied up temporarily but appear free to the partner. > > Third, this decreases the effective number of clients that your DHCP > server pair can handle. Even though the lease associations are > temporary, they still need to be committed to the leases file before > they are sent to the client. In a properly configured network this > doubles the work done for a DISCOVER, albeit split over both peers. > Since the client can only accept one of the OFFERs, half of this work > will always be wasted. The magnitude of the impact of this will depend > on how long clients typically stay around on the network. In an > environment with mostly long-term clients it would be minimal, while in > an environment with mostly transient clients it could be significant. > > You probably ought to work on that first potential problem anyway, even > if you choose to retain a failover load balancing delay. > > Thanks, > Brian

Mon, 01 Jun 2015 09:47:44 -0400 Brian Conry <bconry@isc.org> - Reference to https://support.isc.org/Ticket/Display.html?id=8226 added

Fri, 26 Jun 2015 17:04:37 -0400 Brian Conry <bconry@isc.org> - Subject changed from 'Failover load balancing delay looks for secs > configured value...' to '[ISC-support #8226] Failover load balancing delay looks for secs > configured value...'

Fri, 26 Jun 2015 17:05:13 -0400 Brian Conry <bconry@isc.org> - AdminCc Not for use <support-comment@support.isc.org> added

Fri, 14 Aug 2015 00:17:28 -0400 Shawn Routhier <sar@isc.org> - Priority P2 Normal added

Fri, 14 Aug 2015 00:17:29 -0400 Shawn Routhier <sar@isc.org> - Versions Planned 4.3.4 added

Fri, 14 Aug 2015 00:17:31 -0400 Shawn Routhier <sar@isc.org> - Status changed from 'new' to 'open'

Thu, 08 Oct 2015 12:55:22 -0400 Shawn Routhier <sar@isc.org> - Versions Planned 4.3.4 changed to 4.4.0

Thu, 08 Oct 2015 12:55:23 -0400 Shawn Routhier <sar@isc.org> - FinalPriority changed from 'Medium' to 'Low'

Thu, 08 Oct 2015 12:55:23 -0400 Shawn Routhier <sar@isc.org> - Priority changed from 'Medium' to 'Low'

Fri, 11 Mar 2016 09:07:39 -0500 Brian Conry <bconry@isc.org> - AdminCc support-comment <support-comment@isc.org> added

Fri, 11 Mar 2016 09:07:40 -0500 Brian Conry <bconry@isc.org> - AdminCc Not for use <support-comment@support.isc.org> deleted

Fri, 06 May 2016 17:48:37 -0400 Shawn Routhier <sar@isc.org> - FinalPriority changed from 'Low' to 'Low'

Fri, 06 May 2016 17:48:38 -0400 Shawn Routhier <sar@isc.org> - Priority changed from 'Low' to 'Low'

Mon, 09 May 2016 17:37:21 -0400 Shawn Routhier <sar@isc.org> - TimeEstimated changed from (no value) to '240'

Mon, 09 May 2016 17:37:21 -0400 Shawn Routhier <sar@isc.org> - TimeLeft changed from (no value) to '240'

Thu, 16 Nov 2017 09:40:30 -0500 Thomas Markwalder <tmark@isc.org> - Taken

Thu, 16 Nov 2017 12:00:30 -0500 Thomas Markwalder <tmark@isc.org> - Area feature added

Thu, 16 Nov 2017 12:00:31 -0500 Thomas Markwalder <tmark@isc.org> - Worked 2 hours (120 minutes) 120 minutes

Thu, 16 Nov 2017 12:00:32 -0500 Thomas Markwalder <tmark@isc.org> - Status changed from 'open' to 'review'

Thu, 16 Nov 2017 12:00:32 -0500 Thomas Markwalder <tmark@isc.org> - Queue changed from #8 to dhcp-public

Thu, 16 Nov 2017 12:54:23 -0500 Cathy Almond <cathya@isc.org> - AdminCc support-comment <support-comment@isc.org> deleted

Fri, 17 Nov 2017 08:57:50 -0500 Thomas Markwalder <tmark@isc.org> - Untaken

Thu, 23 Nov 2017 11:09:48 -0500 Francis Dupont <Francis_Dupont@isc.org> - Taken

Thu, 23 Nov 2017 11:12:20 -0500 Francis Dupont <Francis_Dupont@isc.org> - Correspondence added

On Thu Nov 16 17:00:30 2017, tmark wrote: > Ticket is ready for review. => changed secs into seconds in RELNOTES so please pull and review this change. Code is OK.

Thu, 23 Nov 2017 11:12:21 -0500 Francis Dupont <Francis_Dupont@isc.org> - Given to Thomas Markwalder <tmark@isc.org>

Mon, 27 Nov 2017 07:20:48 -0500 Thomas Markwalder <tmark@isc.org> - Version Fixed 4.4.0 added

Mon, 27 Nov 2017 07:20:49 -0500 Thomas Markwalder <tmark@isc.org> - Status changed from 'review' to 'resolved'

Mon, 27 Nov 2017 07:21:43 -0500 Thomas Markwalder <tmark@isc.org> - Dependency by #24720: added

Bug #39669 for dhcp-public: [ISC-support #8226] Failover load balancing delay looks for secs > configured value...

This bug tracker is no longer active.

Created:	Mon, 01 Jun 2015 09:43:30 -0400
Updated:	Mon, 27 Nov 2017 07:21:43 -0500
Closed:	Mon, 27 Nov 2017 07:20:49 -0500