kb1_kanobe
02-04-2005, 03:21 PM
I've been trying to get some simple OSPF route passing working and involving my Wrap-based StarOS 2.01.0 boards. Everything works fine until the link goes quiet for an extended period, then it seems to be hit and miss if the Hellos make it over the air, generally resulting in a link-down propagating across the area.
The general arrangement is:
EndA(Quagga)--[100Mbps Ethernet]--RtrA(Zebra)--[100Mbps Ethernet]--WifiA(Zebra)--[802.11g]--WifiInterchange(Zebra)--[802.11g]--WifiB(Zebra)--[100Mbps Ethernet]--RtrB(Zebra)--[100Mbps Ethernet]--EndB(Quagga)
If I observe the local routing table on EndA with nothing else running on the network the routes associated with WifiInterchange and beyond flap quite regularly (I've lowered my hello interval and dead timers for testing). If I open an SSH session to WifiInterchange and leave it sitting there then the routes associated with WifiB and beyond continue flapping, but everything else stablilises. If I SSH to WifiB then everything stays up solid. I get the same effect if I push streams of pings across the wire to the various hosts.
My two 802.11g legs are on seperate CM-9 radios and use seperate channels (albeit hosted on the same Wrap board) and operate without problems under all other circumstances. Eg. losses seem minimal on the links: >3million pings at 10ms intervals lost 38 packets from end to end, with an average rtt of 5ms.
What I was wondering was if the radios are going into some sort of 'power save' or 'frame consolidation' mode that might cause excessive delays when transfering small infrequent traffic, such as Multicast OSPF Hello frames.
For those of you who might be more familiar with OSPF internals, consider this debug information from around link-down event (notice the apparently delayed hello at 01:28:48):
...
2000/03/04 01:28:41 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:41 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:42 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:42 OSPF: src [10.1.59.3],
2000/03/04 01:28:42 OSPF: dst [224.0.0.5]
2000/03/04 01:28:42 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:42 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:42 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:42 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:42 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:43 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:43 OSPF: src [10.1.59.3],
2000/03/04 01:28:43 OSPF: dst [224.0.0.5]
2000/03/04 01:28:43 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:43 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:43 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:43 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:43 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:44 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:44 OSPF: src [10.1.59.3],
2000/03/04 01:28:44 OSPF: dst [224.0.0.5]
2000/03/04 01:28:44 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:44 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:44 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:44 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:44 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:45 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:45 OSPF: src [10.1.59.3],
2000/03/04 01:28:45 OSPF: dst [224.0.0.5]
2000/03/04 01:28:45 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:45 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:45 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:45 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:45 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:46 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:46 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:47 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:47 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Timer (Inactivity timer expire)
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: State change Full -> Down
2000/03/04 01:28:48 OSPF: nsm_change_state(): scheduling new router-LSA origination
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: Backup 0.0.0.0
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: DR 10.1.59.200
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: neighbor deleted
2000/03/04 01:28:48 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:48 OSPF: src [10.1.59.3],
2000/03/04 01:28:48 OSPF: dst [224.0.0.5]
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Down (HelloReceived)
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: State change Down -> Init
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Init (2-WayReceived)
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: State change Init -> ExStart
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 10.1.59.3
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.200
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: ExStart (AdjOK?)
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 0.0.0.0
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.3
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: Backup 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: DR 10.1.59.3
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.3
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: ExStart (AdjOK?)
....
Any thoughts?
The general arrangement is:
EndA(Quagga)--[100Mbps Ethernet]--RtrA(Zebra)--[100Mbps Ethernet]--WifiA(Zebra)--[802.11g]--WifiInterchange(Zebra)--[802.11g]--WifiB(Zebra)--[100Mbps Ethernet]--RtrB(Zebra)--[100Mbps Ethernet]--EndB(Quagga)
If I observe the local routing table on EndA with nothing else running on the network the routes associated with WifiInterchange and beyond flap quite regularly (I've lowered my hello interval and dead timers for testing). If I open an SSH session to WifiInterchange and leave it sitting there then the routes associated with WifiB and beyond continue flapping, but everything else stablilises. If I SSH to WifiB then everything stays up solid. I get the same effect if I push streams of pings across the wire to the various hosts.
My two 802.11g legs are on seperate CM-9 radios and use seperate channels (albeit hosted on the same Wrap board) and operate without problems under all other circumstances. Eg. losses seem minimal on the links: >3million pings at 10ms intervals lost 38 packets from end to end, with an average rtt of 5ms.
What I was wondering was if the radios are going into some sort of 'power save' or 'frame consolidation' mode that might cause excessive delays when transfering small infrequent traffic, such as Multicast OSPF Hello frames.
For those of you who might be more familiar with OSPF internals, consider this debug information from around link-down event (notice the apparently delayed hello at 01:28:48):
...
2000/03/04 01:28:41 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:41 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:42 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:42 OSPF: src [10.1.59.3],
2000/03/04 01:28:42 OSPF: dst [224.0.0.5]
2000/03/04 01:28:42 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:42 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:42 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:42 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:42 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:43 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:43 OSPF: src [10.1.59.3],
2000/03/04 01:28:43 OSPF: dst [224.0.0.5]
2000/03/04 01:28:43 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:43 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:43 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:43 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:43 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:44 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:44 OSPF: src [10.1.59.3],
2000/03/04 01:28:44 OSPF: dst [224.0.0.5]
2000/03/04 01:28:44 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:44 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:44 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:44 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:44 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:45 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:45 OSPF: src [10.1.59.3],
2000/03/04 01:28:45 OSPF: dst [224.0.0.5]
2000/03/04 01:28:45 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (HelloReceived)
2000/03/04 01:28:45 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: nsm_ignore called
2000/03/04 01:28:45 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Full (2-WayReceived)
2000/03/04 01:28:45 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:45 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:46 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:46 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:47 OSPF: Hello sent to [224.0.0.5] via [eth0:10.255.255.6].
2000/03/04 01:28:47 OSPF: Hello sent to [224.0.0.5] via [wpci0:10.1.59.200].
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Timer (Inactivity timer expire)
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: State change Full -> Down
2000/03/04 01:28:48 OSPF: nsm_change_state(): scheduling new router-LSA origination
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: Backup 0.0.0.0
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: DR 10.1.59.200
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: neighbor deleted
2000/03/04 01:28:48 OSPF: Hello received from [135.132.23.1] via [wpci0:10.1.59.200]
2000/03/04 01:28:48 OSPF: src [10.1.59.3],
2000/03/04 01:28:48 OSPF: dst [224.0.0.5]
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Down (HelloReceived)
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: State change Down -> Init
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: Init (2-WayReceived)
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: State change Init -> ExStart
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 10.1.59.3
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.200
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: ExStart (AdjOK?)
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 0.0.0.0
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.3
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: Backup 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[2nd]: DR 10.1.59.3
2000/03/04 01:28:48 OSPF: DR-Election[1st]: Backup 10.1.59.200
2000/03/04 01:28:48 OSPF: DR-Election[1st]: DR 10.1.59.3
2000/03/04 01:28:48 OSPF: NSM[wpci0:10.1.59.200:135.132.23.1]: ExStart (AdjOK?)
....
Any thoughts?