OSPF not (re)starting properly after "apply changes"


bradg
01-14-2005, 07:55 PM
OK, the first time it happened, I thought it was a status screen refresh "glitch", but I've seen it three times tonight on two different boxes.

We've been having a ton of trouble with the cold tonight, trying to isolate the issues and get OSPF to behave, but the cold doesn't seem to be the big issue here.

Running Star-OS latest, WRAP boards, CM9 cards. 22dB grids, shots of 100 yards (yes - yards) and about 1/3 mile. "Uplink" units are indoors (above 50F), "relay" unit is outside in the cold. Signal quality on both links in excess of 25 at all times - most of the time well above 30.

Three times tonight I've seen this - twice on the same unit. Routes are a mess because the ring gets broken (associations or connectivity seem to drop for no good reason, other than, I assume, the cold). But I can see the remote unit associated (and sometimes even ping it from the reachable unit, which is odd).

So, in the case of no wireless connectivity, I get into it remotely from the ethernet port and "apply changes" to force a re-association (it seems to work). After connectivity is re-established, no routes come up. I wait several minutes, and finally chain-hop SSH into the unit and go to advanced routing - OSPF shows up as STOPPED, and a second later, shows up as STARTED. Lo and behold, about 30 seconds after I touch that menu, routes propagate, and things start to work.

No additional "activate changes", no "save changes", no reboot, nothing. Just touch the advanced routing menu, and OSPF goes from stopped to started, and voilà - it's working.

I also notice that if I enter the OSPF configuration terminal and exit, CPU load shoots up dramatically, and then settles down, and sometimes routes then propagate.

Does simply touching that menu magically kick or start what should be an already started routing process (especially after an apply changes)?

Anyone have similar experiences? Lonnie, Tony - thoughts?

I'm *REALLY* trying to figure out where the demons are in the OSPF parts of the Star-OS world (if you haven't been able to tell), and this is yet another one - amongst NBMA settings, neighbors, encryption, and other variables I have yet to make sense of.

Brad

bradg
01-14-2005, 09:10 PM
Other observations:

On one end (main uplink), I've seen "bmiss" log entries occasionally today.

On the same end, when routing breaks completely, OSPF no longer sees the subnet on the wireless card as being directly connected - or at all, for that matter. Activate changes kicks it back into working (temporarily).
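
One way to pin down that symptom without relying on the status screen is to look at the kernel routing table directly. Below is a minimal sketch in Python that parses saved `ip route show` output and reports, for one interface, which subnets are directly connected and which routes were installed by Zebra. It assumes the box has iproute2 and that you copy the output to a management machine; the interface name `ath0` and the addresses in the sample are made up for illustration and are not taken from the units above.

```python
def value_after(fields, key):
    """Return the token following `key` in a split route line, or None."""
    if key in fields:
        idx = fields.index(key) + 1
        if idx < len(fields):
            return fields[idx]
    return None


def summarize_routes(ip_route_output, iface):
    """Sort the routes on `iface` into directly connected and Zebra-installed."""
    connected, learned = [], []
    for line in ip_route_output.splitlines():
        fields = line.split()
        if not fields or value_after(fields, "dev") != iface:
            continue
        proto = value_after(fields, "proto")
        if proto == "kernel" and value_after(fields, "scope") == "link":
            connected.append(fields[0])   # subnet the kernel sees as on-link
        elif proto == "zebra":
            learned.append(fields[0])     # route pushed into the kernel by zebra
    return {"connected": connected, "learned": learned}


# Hypothetical sample output, not captured from a real Star-OS unit.
SAMPLE = """10.0.1.0/24 dev ath0  proto kernel  scope link  src 10.0.1.1
10.0.5.0/24 via 10.0.1.2 dev ath0  proto zebra  metric 20
default via 192.168.1.1 dev eth0
"""

if __name__ == "__main__":
    print(summarize_routes(SAMPLE, "ath0"))
```

If "connected" comes back empty for the wireless interface while the card still shows as associated, that would suggest the interface address or link state was lost underneath the routing daemon, rather than OSPF itself misbehaving.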

Does the wireless "media status" (connected/disconnected = associated/scanning) cause OSPF (Zebra) to drop that interface?


Brad

lonnie
01-14-2005, 10:07 PM
I am remaining quiet since I do not use OSPF. Don't get me wrong - I recommend OSPF since everything I have been told makes it seem to be the best choice, but we have not had the time to actually learn it and use it.

bminish
01-19-2005, 02:01 PM
bradg wrote:
> OSPF shows up as STOPPED, and a second later, shows up as STARTED. Lo-and-behold, about 30 seconds after I touch that menu, routes propagate, and things start to work.
>
> No additional "activate changes", no "save changes", no reboot, nothing. Just touch the advanced routing menu, and OSPF goes from stopped to started, and volia - it's working.
>
> Brad

Paul (OscarBravo) & I discovered one possible cause for this tonight whilst having another go at breaking my network.

Basically, if you have more than one ssh session open to a node, you may see this problem.

1. Open an ssh session to the staros router.
2. Open a second ssh session to the router.
3. In one of the sessions, enable ospf & quit out of the advanced routing dialogue box.
4. In the other session, open the advanced routing dialogue box. OSPF will go down.
5. Close the dialogue box, go back to the other session & select the advanced routing dialogue box, and OSPF will come back up again.
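
To make the sequence above easier to reason about, here is a toy model in Python of one way it could happen. This is pure guesswork, not how Star-OS is known to work: it assumes each menu session caches its own "OSPF enabled" flag at login and that opening the advanced routing dialogue re-applies that cached flag to the shared daemon. The class and method names are invented for the illustration.

```python
class Daemon:
    """Stand-in for the shared OSPF daemon on the router."""

    def __init__(self):
        self.running = False


class MenuSession:
    """Stand-in for one ssh menu session (hypothetical behaviour)."""

    def __init__(self, daemon):
        self.daemon = daemon
        # Assumption: the session snapshots the setting once, at login.
        self.ospf_enabled = daemon.running

    def enable_ospf(self):
        self.ospf_enabled = True
        self.daemon.running = True

    def open_advanced_routing(self):
        # Assumption: opening the dialogue re-applies this session's cached flag.
        self.daemon.running = self.ospf_enabled
        return "STARTED" if self.daemon.running else "STOPPED"


if __name__ == "__main__":
    ospfd = Daemon()
    first = MenuSession(ospfd)
    second = MenuSession(ospfd)             # logged in before OSPF was enabled
    first.enable_ospf()
    print(second.open_advanced_routing())   # stale flag takes the daemon down
    print(first.open_advanced_routing())    # first session brings it back up
```

Under those assumptions the model goes STOPPED and then STARTED in the same order as the steps, which is at least consistent with a stale per-session state being the culprit.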

Does that fit what was going on?
.Brendan

mmc1800
03-30-2005, 08:49 AM
I can verify that very strange things happen to the advanced routing screen when you have 2 ssh connections to the router.

This is inconvenient because I like to look at the static routes while configuring OSPF, but it seems not to cause too many problems if I get out of the second terminal before I go back to the staros screen from the OSPF configuration terminal.

I have not played with it very much because, as far as I can tell, this workaround works all the time, and for what I am doing I have just avoided this situation as much as possible.

I currently have OSPF routing but have not deleted all of my default routes. Earlier today one of my routers failed to bring OSPF back online after a restart and deleted the configuration, but I am guessing I just forgot to write the configuration.
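
A quick way to catch the "forgot to write the configuration" case before a restart is to compare the running configuration against the saved one. The sketch below, in Python, takes both as plain text (for example, pasted from `show running-config` and from the saved config file) and lists the lines that differ. Where Star-OS keeps the saved file is not something I know, so the function only works on strings; the sample config lines are illustrative.

```python
import difflib


def normalize(text):
    """Drop blank lines, '!' comment lines, and the running-config banner."""
    kept = []
    for raw in text.splitlines():
        line = raw.rstrip()
        stripped = line.strip()
        if not stripped or stripped.startswith("!"):
            continue
        if stripped == "Current configuration:":
            continue
        kept.append(line)
    return kept


def unsaved_changes(running, saved):
    """Return '+'/'-' diff lines showing what the running config has that the saved one lacks, and vice versa."""
    diff = difflib.unified_diff(
        normalize(saved), normalize(running), "saved", "running", lineterm=""
    )
    return [
        d for d in diff
        if d.startswith(("+", "-")) and not d.startswith(("+++", "---"))
    ]


# Illustrative configs, not taken from a real router.
RUNNING = "Current configuration:\n!\nrouter ospf\n network 10.0.1.0/24 area 0\n!\n"
SAVED = "!\nrouter ospf\n!\n"

if __name__ == "__main__":
    for change in unsaved_changes(RUNNING, SAVED):
        print(change)
```

An empty result means the two match; any "+" line is something that would be lost on restart unless the configuration is written first.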

OSPF sure looks like the promised land for me. I hope I can get it stable, as it should make use of the many redundant links I have built into my network.