Today's Page Hits: 108
I have more hair and it isn't so grey. :->
This page validates as XHTML 1.0, and will look much better in a browser that supports web standards, but it is accessible to any browser or Internet device. It was created using techniques detailed at glish.com/css/.
I just revamped connectathon.org. I added a style sheet - which let me get rid of the all encompassing table to do layouts. I've got a left, content, and right style defined.
I'm still using the grand table approach over at Serialized Science Fiction. But I had just created Blitz United '97 Red for my son's soccer team and I wanted to do away with the table. So I took the Greenline style from GPS Gazette" for WordPress.org and modified it to fit my needs. (I had been using it for Behind The Scenes and like the layouts.)
When I needed to update connectathon.org for 2009, I decided to further modify the style sheet for that site.
I know I can also probably use php or something to consistently wrap the header, left, right, and footers around the content, but I thought that was too heavy handed. So I feel back to pulling those parts out and using some simple scripts to rebuild the final html files. Basically I use cat and sed to modify the files. I didn't worry about changing the titles, which is a bit more complex, but rather just to change the year in the different talksXX/index.html files.
Like everyone else at Sun, we've cut over to Mercurial to manage our source code. We have our own gate and we want to be able to view the set of applied changes. hg serve will do that for you. And immediately, I asked the group if someone had a smf service for it.
The answer was no. So Robert Gordon kindly supplied one.
You place hg_serve.xml in /var/svc/manifest/network and hg-serve in /usr/local/smf_method (you can place it elsewhere, just modify the path in hg_serve.xml).
And then comes the fun part of activating the service. I had to read the man page. :)
# svccfg import hg_serve.xml # svcadm enable svc:/network/hg_serve
And now the hg serve will survive a reboot!
I'm trying to prove that I've fixed a bug and I happen to know that if a certain function returns an error code, then I might have hit the code in question. How do I quickly determine whether or not I have hit such a case?
[root@pnfs-3-11 ~/dtrace]> more mms.d
#! /usr/sbin/dtrace -Fs
:nfs:nfs4_trigger_mount:return
/args[1] != 0/
{
printf("rc1 = %d\n", args[1]);
}
:nfs:nfs4_ephemeral_umount:return
/args[1] != 0/
{
printf("rc1 = %d\n", args[1]);
}
[root@pnfs-3-11 ~/dtrace]> ./mms.d
dtrace: script './mms.d' matched 2 probes
CPU FUNCTION
0 <- nfs4_ephemeral_umount rc1 = 16
0 <- nfs4_ephemeral_umount rc1 = 16
So if the function name is nfs4_trigger_mount or nfs4_ephemeral_umount, then print the return code if it is not 0. Hmm, I should probably remove the '\n's.
[root@pnfs-3-11 ~/dtrace]> ./mms.d dtrace: script './mms.d' matched 2 probes CPU FUNCTION 1 <- nfs4_ephemeral_umount rc1 = 16 1 <- nfs4_ephemeral_umount rc1 = 16
I'm actually looking for an 11 (EAGAIN) coming back from nfs4_trigger_mount, but the abundance of 16 (EBUSY) from nfs4_ephemeral_umount tells me I've probably fixed another bug I was hitting.
Trying to run a test program which needs rsh, and I'm getting:
# rsh pnfs-3-11.central w pnfs-3-11.central.sun.com: Connection refused
Evidently rshd is not running. There is plenty of info on how to get it running pre-Solaris 10. Hmm, how do I get started with smf?
# svcadm enable svc:/network/shell:default # svcadm enable svc:/network/login:rlogin
I threw in rlogin for free!
Make sure to fill in your /etc/hosts and .rhosts files as needed.
# rsh pnfs-3-11.central w 1:58pm up 9 min(s), 1 user, load average: 0.00, 0.05, 0.05 User tty login@ idle JCPU PCPU what root pts/1 1:50pm 6 -sh
I have two slugs. One is working and the other is not. I can't discover it either on my home subnet or 192.168.1.0/24. nmap can't find it. I don't have a serial port on it.
upslug2 can find it in upgrade mode:
tdh@sneaker:~/sluggo/slugos-4.8-beta/nslu2be.tmp$ upslug2 [no NSLU2 machines found in upgrade mode] tdh@sneaker:~/sluggo/slugos-4.8-beta/nslu2be.tmp$ upslug2 LKG881183 00:0f:66:88:11:83 Product ID: 1 Protocol ID:0 Firmware Version: R23V63 [0x2363]
So I unloaded OpenSlug and put back the latest Linksys firmware. I still couldn't find it. I did the 10 second push of the reset button and I still couldn't find it.
I found a website which had the manual online. I did a 2 second push of the reset button and I could then find it at 192.168.1.77. I couldn't change the IP of the box though. So I've reflashed OpenSlug onto it and I'm going to see if I can get access at the 192.168.2.77 address.
Okay, I am seeing this problem: Error: Fail to get samba information. But that didn't work, so I'm off to try this one: NSLU2, power problem, samba not found.
Okay, why didn't I try this: FailSambaInformation?
Some notes:
There is no way of telling what firmware I used to have on that NSLU2. But now my system is working!
I'm trying to build SlugOS/BE on a Linux box (Fedora Core something or another). I kept on hitting a problem with fakeroot_1.8.3.tar.gz not being found on the Debian mirrors. Well, it is obsolete.
To fix the problem, I ftp'ed to one of the sites and found what the new version was and then I added some BitBake recipes:
[tdh@adept fakeroot]> mv fakeroot-1.8.3 fakeroot-1.9.5 [tdh@adept fakeroot]> mv fakeroot_1.8.3.bb fakeroot_1.9.5.bb [tdh@adept fakeroot]> mv fakeroot-native-1.8.3 fakeroot-native-1.9.5 [tdh@adept fakeroot]> mv fakeroot-native_1.8.3.bb fakeroot-native_1.9.5.bb
And then I let the make run along...
Down for routine maintenance - you can at least enjoy the following until then:
Cthon '08 went off without a hitch. It started out uneventfully as Kerberos worked right out of the box. Evidently Sun's Kerberos team have been working on making initial configuration being painless. And they succeeded.
The public talks were well received and we've started posting the slides as they are sent in. You can check them out on Talks 08.
I'll post more as they arrive.
Also, we videoed most of the talks this year. As that content becomes available, we will post it up as well.
Be sure to visit www.connectathon.org and see when the talks are scheduled. These are open to the public.
Sun Microsystems, Inc. is involved with 6 presentations and then NetApp has 5 of them. I'll be giving two of them, but I'm actually more excited about the one on nfsreplay by Shehjar Tikoo and the Linux development git one by Bruce Fields and Benny Halevy.
Normally we can't share images of the event, but here is one from before the other vendors setting up their gear:
Each of the Sun workstations is probably a node in a pNFS community.
I'm going to narrate a massive debug session I just had, in which I used my XDR dump routine to flush out bugs. I'm not going to give all of the code, and my approach is very free form. But that is very much like a massive debug session. Enjoy or not!
Some my upcall from the kernel to the sped daemon is working fine, but the return path is not. My 64 bit gpool-id is off:
# Apr 3 16:13:29 pnfs-3-15 ./sped[100781]: spedit: |35865356092523570| gives |35865356092523570| and |35865356092523570| 10, 16, 64000, 35865356092523570 uid == 200096 20, 32, 2000, 35865356092523571 uid == 1066 30, 64, 1000, 35865356092523572 uid == 0 40, 2, 2000, 35865356092523573 subnet == 10.1.233.0/24 Apr 3 16:13:29 pnfs-3-15 ./sped[100781]: spedit: |35865356092523571| gives |35865356092523571| and |35865356092523571| Apr 3 16:13:29 pnfs-3-15 ./sped[100781]: spedit: |35865356092523572| gives |35865356092523572| and |35865356092523572| Apr 3 16:13:29 pnfs-3-15 ./sped[100781]: spedit: |35865356092523573| gives |35865356092523573| and |35865356092523573| We got a policy 64, -17890057, 18444853557553841283 for /nippy/bogey4 Apr 3 16:13:34 pnfs-3-15 nfs: WARNING: sped: 2 0 2 200096 10 33554432 /nippy/bogey4 0 0 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 7: 0 0 0 2 0 0 0 0 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 15: 0 0 0 2 0 3 D A0 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 23: 0 0 0 A 2 0 0 0 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 31: 0 0 0 D 2F 6E 69 70 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 39: 70 79 2F 62 6F 67 65 79 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 47: 34 0 0 0 0 0 0 0 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 52: 0 0 0 0 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: We got a policy 64, -17890057, 18444853557553841283 for /nippy/bogey4 Apr 3 16:13:34 pnfs-3-15 ./sped[100781]: 4: 0 0 0 5
We can see it change from 35865356092523570 to 18444853557553841283. Hmm, it used to have one more change. Anyway, the value of -17890057 looks off. It should be one of 1000, 2000, or 64000. And perhaps the 64 is off as well.
Anyway, we can see that the input XDR looks okay. I've even added a byte counter for it.
And that lets us see that the last line must be the reply. The counter has started again and 4 bytes as a reply looks too small. Oh crud, what does a 5 mean as a status code?
/* * Return status codes */ #define SPED_OK 0 /* * Caller provided invalid arguments */ #define SPED_INVALID 1 /* * Internal error in daemon e.g. out of memory, can't return result */ #define SPED_INTERNAL 2 /* * Were layouts revoked? */ #define SPED_REVOKED 3 /* * Was there a matching rule, but not enough available gpools? */ #define SPED_MATCH_NOT_ENOUGH 4 /* * Was there no matching rules? */ #define SPED_NO_MATCH 5
Grin, so I thought we were going to find that the buffer was too small and something in the encoding was way off. Looks like the buffer is just right and now I need to debug why my input is not matching a policy. But, my debug code is correct here! What is being printed is the garbage in the local variable.
I added some code to zap the local variable and to also dump the input parameters:
Apr 3 16:37:20 pnfs-3-15 ./sped[100790]: 52: 0 0 0 0 Apr 3 16:37:20 pnfs-3-15 ./sped[100790]: Matching against: 200096 10 0 0 0 0 (N/A) (N/A) (N/A) (N/A) (N/A) (N/A) /nippy/bogey5 (N/A)
Hmm, no XDR dump of the output. That means we are erroring out earlier, probably due to zapping the local variable with a memset!
It is coring now somewhere in here:
if (sar.gpoolids)
u = sar.gpoolids[0];
else
u = (uint64_t)0xe4e5c7c8;
printf("We got a policy %d, %d, %llu for %s\n",
sar.lh_unit_size, sar.lh_stripe_count,
sar.gpoolids[0], saa.path);
syslog(LOG_ERR, "We got a policy %d, %d, %llu for %s",
sar.lh_unit_size, sar.lh_stripe_count,
sar.gpoolids[0], saa.path);
We should be concerned that we generate u but never use it. My guess is that sar.gpoolids is now NULL instead of random junk. If we fix that, we get back to:
Apr 3 21:38:41 pnfs-3-15 ./sped[100841]: Matching against: 200096 10 0 0 33554432 0 (N/A) (N/A) (N/A) (N/A) (N/A) (N/A) /nippy/bogey8 (N/A) Apr 3 21:38:41 pnfs-3-15 ./sped[100841]: We got a policy 0, 0, 3840264136 for /nippy/bogey8
Ignoring the need to fix the receiving code to detect a NULL array, what can we do to get a match?
Another unitialized variable in going from the stand alone policy debugger to the sped daemon. How did I find it?
(gdb) n
spe_eval_thunk (st={st_is_interior = B_FALSE, st_node = 0x8067af0},
pat=0x8047144, sa=0x8047078, prc=0x80470d8, bServer=B_TRUE)
at spedaemon.c:827
827 return (b);
(gdb) p b
$19 = B_TRUE
(gdb) n
spe_eval_attribute (si=0x80674e8, pat=0x8047144, prc=0x80470d8, bServer=B_TRUE)
at spedaemon.c:864
864 if (*prc != 0) {
(gdb) p *prc
$21 = -16785880
Why was *prc such an odd number, no error cases were hit.
int
spe_global_eval(policy_attributes_t *pat, uint64_t **gooies,
uint32_t *n_gooies, uint32_t *unit_size, uint32_t *stripes)
{
spe_policy_t *sp;
boolean_t b = FALSE;
int rc;
If we set rc = 0, it works.
Apr 3 22:54:24 pnfs-3-15 ./sped[100938]: Matching against: 200096 10 0 0 33554432 0 (N/A) (N/A) (N/A) (N/A) (N/A) (N/A) /nippy/bogeya (N/A) Apr 3 22:54:24 pnfs-3-15 ./sped[100938]: Considering rule 0 Apr 3 22:54:24 pnfs-3-15 ./sped[100938]: Considering 200096 vs 200096 Apr 3 22:54:24 pnfs-3-15 ./sped[100938]: 7: 0 0 0 0 8 6 F0 38
An 8 byte reply looks wrong. And we have another braindead problem. (Please realize this is the first time I am testing code I wrote over a month ago. I had to task switch to a P1 bug.):
result = spe_global_eval(&pat, &sar.gpoolids,
&sar.n_gooies, &sar.lh_unit_size, &sar.lh_stripe_count);
if (!result) {
sped_door_return(result, &sar, proc);
}
0 is a valid match, everything else is an error. Either I planned to make it a boolean (in which case spe_global_eval is wrong) or I messed up the check here. I'll assume the check:
Hey, the policy match info is correct, but why no payload?
sped_door_return(result, NULL, NULL); /* NOTREACHED */The two sped_door_return instances are mixed up. Flip them. Okay, back to a result and a 4 byte payoff.
And proc has a bad initialization:<
switch (command) { case (SPED_DS_ZPOOL): proc = xdr_spe_alloc_caller_t;Should be: xdr_spe_alloc_res_t
I think we finally have good results:
Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: Matching against: 200096 10 0 0 33554432 0 (N/A) (N/A) (N/A) (N/A) (N/A) (N/A) /nippy/bogeyd (N/A) Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: We got a policy 64000, 16, 35865356092523570 for /nippy/bogeyd Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: 7: 0 0 0 0 0 0 0 1 Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: 15: 0 7F 6B 59 F1 A3 44 32 Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: 23: 0 0 0 0 0 0 0 0 Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: 31: 0 0 0 0 0 0 FA 0 Apr 3 23:14:02 pnfs-3-15 ./sped[100957]: 39: 0 0 0 10 0 0 0 0 Apr 3 23:14:02 pnfs-3-15 nfssrv: WARNING: sped: 64000 16 0 35865356092523570There are some take home points here:
- Try to unit test/debug your code when you write it.
- You'll remember shortcuts you took.
- You'll remember the code.
- Write debugging tools if necessary.
- If I had to step through a debugger every time, I would have gone crazy.
- The XDR dump routine allowed me to focus on problems.
- Remember, sometimes the debugging tools introduce other bugs (see the stuff with u above.
- You didn't see it, but I created small focused test cases when necessary.
- Small one-off files allow you to eliminate large chunks of code.
- Allowed me to use gdb, in which case I fought the code and not the debugger.
I've been working on a kernel to user space door call which encodes the data in an XDR stream. Normally, I view XDR output as it goes across the wire with snoop. As this stuff is not exposed, I ended up writing a little piece of code to dump the bits on the console. I know I had some problems when I wrote it, but I'm just going to give you the current code:
void
sped_xdr_dump(XDR *x)
{
int i;
int j;
char buf[100];
char str[10];
char cbuf[4];
j = 0;
memset(&cbuf, '\0', 4);
buf[0] = '\0';
for (i = 0; i < x->x_handy; i++) {
cbuf[0] = x->x_private[i];
sprintf(str, " %X", cbuf[0]);
strcat(buf, str);
j++;
if (j == 8) {
syslog(LOG_ERR, "%s", buf);
j = 0;
buf[0] = '\0';
}
}
if (j != 0) {
syslog(LOG_ERR, "%s", buf);
}
}
The crap with cbuf is where I consider this code to be fat. It is a lazy hack because I felt the bytes were running together. It is lazy because I don't need it. And it is where I should be looking for bugs. Anyway, with the following input:
> fec3fce8::print XDR
{
x_op = 1 (XDR_DECODE)
x_ops = 0xfee45bd8
x_public = 0
x_private = 0xfec3fdd0
x_base = 0xfec3fdd0
x_handy = 0x30
}
> 0xfec3fdd0/48B
0xfec3fdd0: 0 0 0 2 0 0 0 0
0 0 0 2 0 3 d a0
0 0 0 a 2 0 0 0
0 0 0 a 2f 6e 69 70
70 79 2f 68 32 62 0 0
0 0 0 0 0 0 0 0
As shown by mdb, which I don't like, I get the following output:
Apr 3 12:04:06 pnfs-3-15 last message repeated 1 time Apr 3 12:04:22 pnfs-3-15 sped[100920]: 0 0 0 2 0 0 0 0 Apr 3 12:04:22 pnfs-3-15 sped[100920]: 0 0 0 2 0 3 D FFFFFFA0 Apr 3 12:04:22 pnfs-3-15 sped[100920]: 0 0 0 A 2 0 0 0 Apr 3 12:04:22 pnfs-3-15 sped[100920]: 0 0 0 A 2F 6E 69 70 Apr 3 12:04:22 pnfs-3-15 sped[100920]: 70 79 2F 68 32 62 0 0 Apr 3 12:04:22 pnfs-3-15 sped[100920]: 0 0 0 0 0 0 0 0
Where is FFFFFFA0 coming from? Okay, I took the code and stuck it in a driver:
> cat y.c
#include <sys/systm.h>
#include <sys/sdt.h>
#include <rpc/types.h>
#include <rpc/svc.h>
#include <rpc/xdr.h>
#include <rpc/auth_sys.h>
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <string.h>
void
sped_xdr_dump(char *jack, int len)
{
int i;
int j;
char buf[100];
char str[10];
char cbuf[4];
j = 0;
memset(&cbuf, '\0', 4);
buf[0] = '\0';
for (i = 0; i < len; i++) {
cbuf[0] = jack[i];
sprintf(str, " %X", cbuf[0]);
strcat(buf, str);
j++;
if (j == 8) {
printf("%s\n", buf);
j = 0;
buf[0] = '\0';
}
}
if (j != 0) {
printf("%s\n", buf);
}
}
int
main(int argc, char *argv[])
{
char jack[] = {
0x0, 0x0, 0x0, 0x2, 0x0, 0x0, 0x0, 0x0,
0x0, 0x0, 0x0, 0x2, 0x0, 0x3, 0xd, 0xa0,
0x0, 0x0, 0x0, 0xa, 0x2, 0x0, 0x0, 0x0,
0x0, 0x0, 0x0, 0xa, 0x2f, 0x6e, 0x69, 0x70,
0x70, 0x79, 0x2f, 0x68, 0x32, 0x62, 0x0, 0x0,
0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0
};
sped_xdr_dump(jack, sizeof(jack));
return (0);
}
> ./a.out
0 0 0 2 0 0 0 0
0 0 0 2 0 3 D FFFFFFA0
0 0 0 A 2 0 0 0
0 0 0 A 2F 6E 69 70
70 79 2F 68 32 62 0 0
0 0 0 0 0 0 0 0
So, I've captured the problem. Using gdb, I can get to the heart of the issue:
Breakpoint 3, sped_xdr_dump (jack=0x8047134 "", len=48) at x.c:32 32 cbuf[0] = jack[i]; 5: str = " D\000\000\000\000\000\000X\003" 4: j = 7 3: i = 15 2: jack[i] = -96 '?' 1: cbuf = "\r\000\000" (gdb) p jack[i-1] $1 = 13 '\r' (gdb) q
As tempting as it is to say that I'm grabbing a bit from the previous word, it is clear that the fat code is really working. And while I could do shenanigans with bit shifting, the problem isn't in the way I am explicitly pulling out the data. Nor is the " %X" an issue.
The real issue is that I've got the wrong type of data, I'm trying to print out signed data, and 'A0' has a high order bit set.
> diff y.c y1.c 21c21 < char cbuf[4]; --- > unsigned char cbuf[4]; > ./a.out 0 0 0 2 0 0 0 0 0 0 0 2 0 3 D A0 0 0 0 A 2 0 0 0 0 0 0 A 2F 6E 69 70 70 79 2F 68 32 62 0 0 0 0 0 0 0 0 0 0
Okay, we can do better than that. We need to get rid of cbuf:
> diff y.c y2.c 21d20 < char cbuf[4]; 24d22 < memset(&cbuf, '\0', 4); 28,29c26 < cbuf[0] = jack[i]; < sprintf(str, " %X", cbuf[0]); --- > sprintf(str, " %X", (unsigned char)jack[i]); > ./a.out 0 0 0 2 0 0 0 0 0 0 0 2 0 3 D A0 0 0 0 A 2 0 0 0 0 0 0 A 2F 6E 69 70 70 79 2F 68 32 62 0 0 0 0 0 0 0 0 0 0
And we need to clean-up the output:
> diff y2.c y3.c 26c26 < sprintf(str, " %X", (unsigned char)jack[i]); --- > sprintf(str, " %2X", (unsigned char)jack[i]); > ./a.out 0 0 0 2 0 0 0 0 0 0 0 2 0 3 D A0 0 0 0 A 2 0 0 0 0 0 0 A 2F 6E 69 70 70 79 2F 68 32 62 0 0 0 0 0 0 0 0 0 0
I might be able to make it even more either smarter or compact, but it is now throw away code. But even thow away code has to be functionally correct. When it isn't, look for areas where you are doing more work than you need to. If you remember you tried something and weren't convinced it would work, and it only worked partially, go back and question that code.
Okay, my snv 73 box is now a snv 85 box. Everything is working except for my punchin, but only because I need to bypass my Sun Ray 1G (and the Sun Ray Server 4.0 was dead easy to install). But the vnic_setup.sh script is still not working:
# ./vnic_setup.sh 0:1:4a:f2:31:34 Invalid link name: LINK # LD_LIBRARY_PATH=/opt/VirtualBox:/opt/VirtualBox/qtgcc/lib:. ; export LD_LIBRARY_PATH # ./vnic_setup.sh 0:1:4a:f2:31:34 Invalid link name: LINK # ./vnic_setup.sh 0:1:4a:f2:31:34 vnic1 vnic1 # ifconfig -a lo0: flags=2001000849mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 bge0: flags=201000843 mtu 1500 index 2 inet 192.168.2.130 netmask ffffff00 broadcast 192.168.2.255 ether 0:a:e4:34:2f:da lo0: flags=2002000849 mtu 8252 index 1 inet6 ::1/128
Well the VirtualBox manual tells me how to do it manually:
# /usr/lib/vna bge0 0:1:4a:f2:31:34 vnic0 # ifconfig vnic0 plumb # ifconfig -a lo0: flags=2001000849mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 bge0: flags=201000843 mtu 1500 index 2 inet 192.168.2.130 netmask ffffff00 broadcast 192.168.2.255 ether 0:a:e4:34:2f:da vnic0: flags=201000842 mtu 1500 index 3 inet 0.0.0.0 netmask 0 ether 0:1:4a:f2:31:34 lo0: flags=2002000849 mtu 8252 index 1 inet6 ::1/128 # /usr/lib/vna bge0 0:1:4a:f2:31:36 vnic1 # /usr/lib/vna bge0 0:1:4a:f2:31:38 vnic2 # ifconfig vnic1 plumb # ifconfig vnic2 plumb # ifconfig vnic0 192.168.2.150 destination 192.168.2.160 netmask 255.255.255.0 up # ifconfig vnic1 192.168.2.151 destination 192.168.2.161 netmask 255.255.255.0 up # ifconfig vnic2 192.168.2.152 destination 192.168.2.162 netmask 255.255.255.0 up # ifconfig -a lo0: flags=2001000849 mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 bge0: flags=201000843 mtu 1500 index 2 inet 192.168.2.130 netmask ffffff00 broadcast 192.168.2.255 ether 0:a:e4:34:2f:da vnic0: flags=201000851 mtu 1500 index 3 inet 192.168.2.150 --> 192.168.2.160 netmask ffffff00 ether 0:1:4a:f2:31:34 vnic1: flags=201000851 mtu 1500 index 4 inet 192.168.2.151 --> 192.168.2.161 netmask ffffff00 ether 0:1:4a:f2:31:36 vnic2: flags=201000851 mtu 1500 index 5 inet 192.168.2.152 --> 192.168.2.162 netmask ffffff00 ether 0:1:4a:f2:31:38 lo0: flags=2002000849 mtu 8252 index 1 inet6 ::1/128
And when the system comes up, it has an IP of 192.168.2.29. And I can't ping any of the 3.
It looks like I need to learn the CLI for VirtualBox. Here is a related article: Internal network does not work for OpenSolaris guests
As mentioned, I have a minimal install for my OpenSolaris test machines. Evidently I am missing something quite important for getting a host network up and running:
# ./vnic_setup.sh 0:1:4a:f2:31:34 ./vnic_setup.sh: line 42: /usr/lib/vna: No such file or directory
Hmm, I can either load the iso and get the real files or learn how to work with the NAT mode.So either I do a complete reinstall or I figure how to get just a couple of files over. Sounds like I should learn about NAT anyway.
With two machines and NAT, they are both getting the same address. I could use the Internal Network option, but I'm still going to have to reinstall. Hmm, when I do a machine, I select Core Configure and add the following:
BIND DNS Name server and tools BIND Name server Manifest Filebench Freeware Compression Tools Perl-Compatible Regular Expressions Freeware shells Freeware Other Utilities GLIB XCU4 Utilities GNU Autotools Secure Shell GNU which gcmn - Common GNU package gdb ggrep gtar Lint Libraries (root) Lint Libraries (usr) GNU binutils GNU diffutils Perl 5.6.1 (core) Perl 5.6.1 (non-core) GNU textinfo Libevent Get all of System and Network Admin Live Upgrade Software MDB (root) Programming Tools resource pool (root) Resource Pools in core software for resource pools Solaris Zones Vi IMproved autoconf bcc coreutils rsync ROCSEC_GSS Kerberos V5 KDC (root) Kerberos V5 Master KDC (root) Kerberos Version 5 support (kernel) NIS Server for Solaris (root) NIS Server for Solaris (usr) Interprocess Communication
Time to find what I need to add. First we need to look in the ISO image:
[tdh@warlock ~]> sudo lofiadm -a /zoo/isos/x86/snv85/solarisdvd.iso /dev/lofi/1 [tdh@warlock ~]> sudo mount -F hsfs /dev/lofi/1 /mnt [tdh@warlock Product]> cd /mnt/Solaris_11/Product/ [tdh@warlock Product]> grep lib/vna */pkgmap SUNWcsu/pkgmap:1 f none usr/lib/vna 0555 root bin 12592 13767 1204942578
Hmm, wait, I need to find that on my host system and not the guest machine. D'Oh!
[tdh@warlock lib]> uname -a SunOS warlock 5.11 snv_73 i86pc i386 i86pc [tdh@warlock lib]> sudo lofiadm -d /zoo/isos/x86/snv7 snv79/
I have the DVD, but 73 is ancient! But I'll check:
[tdh@warlock Product]> grep lib/vna SUNWcsu/pkgmap [tdh@warlock Product]>
Ugh, the biggest hassle is that I use this machine as a Sun Ray Server. Okay, time for a reinstall!
RTFM - at least the one which comes with the distribution. I think the online one does not have OpenSolaris support. And, I am right it does not.
Anyway, you do not need to retrieve the qt libraries and build, as I just did, it is supplied for you.
cd /opt/VirtualBox LD_LIBRARY_PATH=/opt/VirtualBox:/opt/VirtualBox/qtgcc/lib:. ./VirtualBox
Downloading and installing VirtualBox on my OpenSolaris box was a snap. Running and installing on it was a bit harder. First off, you need qt installed. Easy instructions are at VirtualBox on OpenSolaris. Then you need to tell it that something other than Right-Ctrl is your host key - you need to do this because Sun keyboards do not have that key . I chose my Right-Meta key. Imagine VirtualBox has your mouse and keyboard, your screensaver kicks in, and you can't get the system to understand you want to bring it to life.
Telling the tool how to load my OpenSolaris iso image was a bit counterintuitive. But the biggest problem I ended up having was only giving 8M to my graphics card. The Developer's install seemed okay with that, but the normal graphical install would puke on it. I was able to do a text install, until the point the install decided I was doing a NFS install and did not like the path I was giving. It also would not see the attached iso as a cdrom.
Kicking both tyres together - VirtualBox and Indiana was very helpful for this issue, Alan casually mentions you need 32M of Video Ram. I gave the system that and could then do the graphical install. And it correctly identified the iso as a cdrom. I'll bring that value down once I finish the install. I'm not going to install a GUI on this machine.