Predictable
Stephen Hahn's blog at Sun Microsystems
All | Pastime | Person | Peruse | Position | Process | Product

« smf(5): sun.com, new... | Main | smf(5): the system... »
20040917 Friday September 17, 2004

smf(5): a view from the moon

One interesting aspect of smf(5) is that we have pulled apart many of the assumed interrelationships between system services, and made them explicit. Doing this makes building availability and failure models much easier, but it also lets us see one projection of Solaris's shambling shape. (There's another interesting technique for dynamic discovery of relationships via DTrace, but I'll let Bryan show the image from those experiments when he's ready.) Everyone wanted to visualize the service graph that results, so Dan Price and David Bustos came up with a way to generate one. Here's the result, generated on my two-way Opteron system earlier today:

Because we'll be tweaking the graph a bit more, I'm only showing this scaled down version, but we can take a bit of a tour just from the gross features:

As you might guess, we've had to write numerous graph-aware diagnosis algorithms to make a large structure like this one navigable. We're looking forward to further enhancing our reporting and visualization tools to make troubleshooting easier still.

(Of course, knowing all the dependencies in Solaris doesn't protect you from the occupational hazard of overdiagnosis, as John and I spent an hour poking at every possible aspect of his system, which ultimately required a new network cable. I figure once a year I still end up following an "all possible software causes" algorithm, ending up in mdb(1) poking around the kernel, rather than checking cable connections or a bad software install.)

(2004-09-17 00:02:00.0) Permalink Comments [4]
Comments:

[Trackback] Das Bild zeigt recht gut die vielen Abhaengigkeiten beim Booten eines Solaris 10 Systems:Predictable

Posted by c0t0d0s0.org on September 17, 2004 at 03:06 AM PDT #

It's so beautiful I think I'm gonna cry. No more init mess? No more wondering? No more hacking shell scripts when an init script isn't happy? And, even stranger yet, actually knowing wtf is running with confidence? I think I'm in love. We've needed this for so so so long. benr.

Posted by benr on September 17, 2004 at 11:59 AM PDT #

@benr: Thanks. I'll try and post a few more interesting examples to illustrate some of the other benefits to having explicit relationships. (But I have to confess: some of the services do use shell scripts to launch�they check whether they are correctly configured and exit with precise status values for various kinds of failure.) - Stephen

Posted by Stephen Hahn on September 17, 2004 at 02:38 PM PDT #

Whoa! Finally! This kinda stuff should get more exposure than it deserves.
great innovation indeed!!
BTW, is Sun Java Ent System R3 going to leverage SMF ?
thanks again.

Posted by e1_ang on December 18, 2004 at 04:15 AM PST #

Post a Comment:

Comments are closed for this entry.
Stephen Hahn
Sun Microsystems
sch@sun.com
17 Network Circle
MS MPK17-301
Menlo Park CA 94025 USA