GIRI MANDALIKA's SCRATCHPAD
Solaris 10: Zone Creation for Dummies
(Reproducing the three and half year old blog entry, a top 5 one, "as is" from my other blog hosted on blogger. Source URL: http://technopark02.blogspot.com/2006/02/solaris-10-zone-creation-for-dummies.html)About Zones
In its simple form, a zone is a virtual operating system environment created within a single instance of the Solaris operating system. Efficient resource utilization is the main goal of this technology.
Solaris 10's zone partitioning technology can be used to create local zones that behave like virtual servers. All local zones are controlled from the system's global zone. Processes running in a zone are completely isolated from the rest of the system. This isolation prevents processes that are running in one zone from monitoring or affecting processes that are running in other zones. Note that processes running in a local zone can be monitored from global zone; but the processes running in a global zone or even in another local zone cannot be monitored from a local zone.
As of now, the upper limit for the number of zones that can be created/run on a system is 8192; of course, depending on the resource availability, a single system may or may not run all the configured zones effectively.
Global Zone
When we install Solaris 10, a global zone gets installed automatically; and the core operating system runs under global zone. To list all the configured zones, we can use zoneadm command:
% zoneadm list -v ID NAME STATUS PATH 0 global running /
Global zone is the only one:
- bootable from the system hardware
- to be used for system-wide administrative control, such as physical devices, routing, or dynamic reconfiguration (DR). ie., global zone is the only zone that is aware of all devices and all file systems
- from which a non-global zone can be configured, installed, managed, or uninstalled. ie., global zone is the only zone that is aware of the existence of non-global (local) zones and their configurations. It is not possible to create local zones, within a local zone
Steps to create a Local Zone
Prerequisites:
- Plenty of disk space to hold the newly installed zone. It needs at least 2G space to copy the essential files to the local zone, and of course the disk space needed by the application(s) you are planning to run, in this zone; and
- A dedicated IP for network connectivity
Basic Zone creation steps with examples:
- Check the disk space & network configuration
% df -h / Filesystem size used avail capacity Mounted on /dev/dsk/c1t1d0s0 29G 22G 7.1G 76% / % ifconfig -a lo0: flags=2001000849
mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 eri0: flags=1000843 mtu 1500 index 2 inet 192.168.74.217 netmask fffffe00 broadcast 192.168.75.255 - Since there is more than 5G free space, I've decided to install a local zone under
/zones.% mkdir /zones
- Next step is to define/create the zone root. This is the path to zone's root directory that is relative to the global zone's root directory. Zone root must be owned by root user with the mode 700. This will be used in setting the
zonepathproperty, during the zone creation process% cd /zones % mkdir appserver % chmod 700 appserver % ls -l total 2 drwx------ 2 root root 512 Feb 17 12:46 appserver
- Create & configure a new 'sparse root' local zone, with root privileges
% zonecfg -z appserv appserv: No such zone configured Use 'create' to begin configuring a new zone. zonecfg:appserv> create zonecfg:appserv> set zonepath=/zones/appserver zonecfg:appserv> set autoboot=true zonecfg:appserv> add net zonecfg:appserv:net> set physical=eri0 zonecfg:appserv:net> set address=192.168.175.126 zonecfg:appserv:net> end zonecfg:appserv> add fs zonecfg:appserv:fs> set dir=/repo2 zonecfg:appserv:fs> set special=/dev/dsk/c2t40d1s6 zonecfg:appserv:fs> set raw=/dev/rdsk/c2t40d1s6 zonecfg:appserv:fs> set type=ufs zonecfg:appserv:fs> set options noforcedirectio zonecfg:appserv:fs> end zonecfg:appserv> add inherit-pkg-dir zonecfg:appserv:inherit-pkg-dir> set dir=/opt/csw zonecfg:appserv:inherit-pkg-dir> end zonecfg:appserv> info zonepath: /zones/appserver autoboot: true pool: inherit-pkg-dir: dir: /lib inherit-pkg-dir: dir: /platform inherit-pkg-dir: dir: /sbin inherit-pkg-dir: dir: /usr inherit-pkg-dir: dir: /opt/csw net: address: 192.168.175.126 physical: eri0 zonecfg:appserv> verify zonecfg:appserv> commit zonecfg:appserv> exit
Sparse Root Zone Vs Whole Root Zone(Updated 05/07/2008)
In a Sparse Root Zone, the directories
/usr, /sbin, /lib and /platformwill be mounted as loopback file systems. That is, although all those directories appear as normal directories under the sparse root zone, they will be mounted as read-only file systems. Any change to those directories in the global zone can be seen from the sparse root zone.However if you need the ability to write into any of those directories listed above, you may need to configure a Whole Root Zone. For example, softwares like ClearCase need write permissions to
/usrdirectory. In that case configuring a Whole Root Zone is the way to go. The steps for creating and configuring a new 'Whole Root' local zone are as follows:% zonecfg -z appserv appserv: No such zone configured Use 'create' to begin configuring a new zone. zonecfg:appserv> create zonecfg:appserv> set zonepath=/zones/appserver zonecfg:appserv> set autoboot=true zonecfg:appserv> add net zonecfg:appserv:net> set physical=eri0 zonecfg:appserv:net> set address=192.168.175.126 zonecfg:appserv:net> end zonecfg:appserv> add inherit-pkg-dir zonecfg:appserv:inherit-pkg-dir> set dir=/opt/csw zonecfg:appserv:inherit-pkg-dir> end zonecfg:appserv> remove inherit-pkg-dir dir=/usr zonecfg:appserv> remove inherit-pkg-dir dir=/sbin zonecfg:appserv> remove inherit-pkg-dir dir=/lib zonecfg:appserv> remove inherit-pkg-dir dir=/platform zonecfg:appserv> info zonepath: /zones/appserver autoboot: true pool: inherit-pkg-dir: dir: /opt/csw net: address: 192.168.175.126 physical: eri0 zonecfg:appserv> verify zonecfg:appserv> commit zonecfg:appserv> exit
Brief explanation of the properties that I added:
*
zonepath=/zones/appserverLocal zone's root directory, relative to global zone's root directory. ie., local zone will have all the
bin, lib, usr, dev, net, etc, var, optetc., directories physically under /zones/appserver directory*
autoboot=trueboot this zone automatically when the global zone is booted
*
physical=eri0eri0card is used for the physical interface*
address=192.168.175.126192.168.175.126 is the IP address. It must have all necessary DNS entries
[Added 08/25/08] The whole
add fssection adds the file system to the zone. In this example, the file system that is being exported to the zone is an existing UFS file system.*
set dir=/repo2/repo2is the mount point in the local zone*
set special=/dev/dsk/c2t40d1s6 set raw=/dev/rdsk/c2t40d1s6Grant access to the block (
/dev/dsk/c2t40d1s6) and raw (/dev/rdsk/c2t40d1s6) devices so the file system can be mounted in the non-global zone. Make sure the block device is not mounted anywhere right before installing the non-global zone. Otherwise, the zone installation may fail withERROR: file system check </usr/lib/fs/ufs/fsck> of </dev/rdsk/c2t40d1s6> failed: exit status <33>: run fsck manually. In that case, unmount the file system that is being exported, uninstall the partially installed zone (zoneadm -z <zone> uninstall) then install the zone from the scratch (no need to re-configure the zone, just do a re-install).*
The file system is of type UFSset type=ufs*
set options noforcedirectioMount the file system with the option
noforcedirectio[/Added 08/25/08]*
dir=/opt/cswread-only path, will be lofs'd (loop back mounted) from global zone. Note: it works for sparse root zone only -- whole root zone cannot have any shared file systems
zonecfgcommandsverifyandcommit, verifies and commits the zone configuration for the zone, respectively. Note that it is not necessary to commit the zone configuration; it will be done automatically when we exit fromzonecfgtool.infodisplays information about the current configuration - Check the state of the newly created/configured zone
% zoneadm list -cv ID NAME STATUS PATH 0 global running / - appserv configured /zones/appserver
- Next step is to install the configured zone. It takes a while to install the necessary packages
% zoneadm -z appserv install /zones must not be group writable. could not verify zonepath /zones/appserver because of the above errors. zoneadm: zone appserv failed to verify % ls -ld /zones drwxrwxr-x 3 root root 512 Feb 17 12:46 /zones
Since /zones must not be group writable, let's change the mode to 700.
% chmod 700 /zones % ls -ld /zones drwx------ 3 root root 512 Feb 17 12:46 /zones % zoneadm -z appserv install Preparing to install zone
. Creating list of files to copy from the global zone. Copying <2658> files to the zone. Initializing zone product registry. Determining zone package initialization order. Preparing to initialize <1128> packages on the zone. Initialized <1128> packages on zone. Zone is initialized. Installation of these packages generated errors: Installation of <2> packages was skipped. Installation of these packages generated warnings: <CSWbdb3 CSWtcpwrap CSWreadline CSWlibnet CSWlibpcap CSWjpeg CSWzlib CSWcommon CSWpkgget SMCethr CSWxpm SMClsof SMClibgcc SMCossld OpenSSH SMCtar SUNWj3dmx CSWexpat CSWftype2 CSWfconfig CSWiconv CSWggettext CSWlibatk CSWpango CSWpng CSWtiff CSWgtk2 CSWpcre CSWlibmm CSWgsed CSWlibtool CSWncurses CSWunixodbc CSWoldap CSWt1lib CSWlibxml2 CSWbzip2 CSWlibidn CSWphp> The file contains a log of the zone installation. - Verify the state of the
appservzone, one more time% zoneadm list -cv ID NAME STATUS PATH 0 global running / - appserv installed /zones/appserver
- Boot up the
appservzone. Let's note down the ifconfig output to see how it changes after the local zone boots up. Also observe that there is no answer from the server yet, since it is not up% ping 192.168.175.126 no answer from 192.168.175.126 % ifconfig -a lo0: flags=2001000849
mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 eri0: flags=1000843 mtu 1500 index 2 inet 192.168.74.217 netmask fffffe00 broadcast 192.168.75.255 ether 0:3:ba:2d:0:84 % zoneadm -z appserv boot zoneadm: zone 'appserv': WARNING: eri0:1: no matching subnet found in netmasks(4) for 192.168.175.126; using default of 255.255.0.0. % zoneadm list -cv ID NAME STATUS PATH 0 global running / 1 appserv running /zones/appserver % ping 192.168.175.126 192.168.175.126 is alive % ifconfig -a lo0: flags=2001000849 mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 lo0:1: flags=2001000849 mtu 8232 index 1 zone appserv inet 127.0.0.1 netmask ff000000 eri0: flags=1000843 mtu 1500 index 2 inet 192.168.74.217 netmask fffffe00 broadcast 192.168.75.255 ether 0:3:ba:2d:0:84 eri0:1: flags=1000843 mtu 1500 index 2 zone appserv inet 192.168.175.126 netmask ffff0000 broadcast 192.168.255.255 Observe that the zone
appservhas it's own virtual instance oflo0, the system's loopback interface and the zone's IP address is also being served by theeri0network interface - Login to the Zone {console} and performing the internal zone configuration.
zloginutility can be used to enter a zone. The first time we log in to the console, we get a chance to answer a series of questions for the desired zone configuraton.-Coption ofzlogincan be used to log in to the Zone console.% zlogin -C -e [ appserv [Connected to zone 'appserv' console] Select a Language 0. English 1. es 2. fr Please make a choice (0 - 2), or press h or ? for help: 0 Select a Locale 0. English (C - 7-bit ASCII) 1. Canada (English) (UTF-8) 2. Canada-English (ISO8859-1) 3. U.S.A. (UTF-8) 4. U.S.A. (en_US.ISO8859-1) 5. U.S.A. (en_US.ISO8859-15) 6. Go Back to Previous Screen Please make a choice (0 - 6), or press h or ? for help: 0 ... Enter the host name which identifies this system on the network. The name must be unique within your domain; creating a duplicate host name will cause problems on the network after you install Solaris. A host name must have at least one character; it can contain letters, digits, and minus signs (-). Host name for eri0:1 appserv v440appserv ... ... System identification is completed. ... rebooting system due to change(s) in /etc/default/init [NOTICE: Zone rebooting] SunOS Release 5.11 Version snv_23 64-bit Copyright 1983-2005 Sun Microsystems, Inc. All rights reserved. Use is subject to license terms. Hostname: v440appserv v440appserv console login: root Password: Feb 17 15:15:30 v440appserv login: ROOT LOGIN /dev/console Sun Microsystems Inc. SunOS 5.11 snv_23 October 2007 %
That is all there is in the creation of a local zone. Now simply login to the newly created zone, just like connecting to any other system in the network.
[New 08/27/2008] Mounting file systems in a non-global zone
Sometimes it might be necessary to export file systems or create new file systems when the zone is already running. This section's focus is on exporting block devices and the raw devices in such situations i.e., when the local zone is already configured.
Exporting the Raw Device(s) to a non-global zone
If the file system does not exist on the device, raw devices can be exported as they are, so the file system can be created inside the non-global zone using the normal newfs command.
The following example shows how to export the raw device to a non-global zone when the zone is already configured.
# zonecfg -z appserv zonecfg:appserv> add device zonecfg:appserv:device> set match=/dev/rdsk/c5t0d0s6 zonecfg:appserv:device> end zonecfg:appserv> verify zonecfg:appserv> commit zonecfg:appserv> exit
In this example /dev/rdsk/c5t0d0s6 is being exported.
After the zonecfg step, reboot the non-global zone to make the raw device visible inside the non-global zone. After the reboot, check the existence of the raw device.
# hostname v440appserv # ls -l /dev/rdsk/c5t0d0s6 crw-r----- 1 root sys 118, 126 Aug 27 14:33 /dev/rdsk/c5t0d0s6
Now that the raw device is accessible within the non-global zone, we can use the regular Solaris commands to create any file system like UFS.
eg.,# newfs -v c5t0d0s6 newfs: construct a new file system /dev/rdsk/c5t0d0s6: (y/n)? y mkfs -F ufs /dev/rdsk/c5t0d0s6 1140260864 -1 -1 8192 1024 251 1 120 8192 t 0 -1 8 128 n Warning: 4096 sector(s) in last cylinder unallocated /dev/rdsk/c5t0d0s6: 1140260864 sectors in 185590 cylinders of 48 tracks, 128 sectors 556768.0MB in 11600 cyl groups (16 c/g, 48.00MB/g, 5824 i/g) super-block backups (for fsck -F ufs -o b=#) at: 32, 98464, 196896, 295328, 393760, 492192, 590624, 689056, 787488, 885920, Initializing cylinder groups: ............................................................................... ............................................................................... ......................................................................... super-block backups for last 10 cylinder groups at: 1139344160, 1139442592, 1139541024, 1139639456, 1139737888, 1139836320, 1139934752, 1140033184, 1140131616, 1140230048
Exporting the Block Device(s) to a non-global zone
If the file system exists on the device, block devices can be exported as they are, so the file system can be mounted inside the non-global zone using the normal Solaris command, mount.
The following example shows how to export the block device to a non-global zone when the zone is already configured.
# zonecfg -z appserv zonecfg:appserv> add device zonecfg:appserv:device> set match=/dev/dsk/c5t0d0s6 zonecfg:appserv:device> end zonecfg:appserv> verify zonecfg:appserv> commit zonecfg:appserv> exit
In this example /dev/dsk/c5t0d0s6 is being exported.
After the zonecfg step, reboot the non-global zone to make the block device visible inside the non-global zone. After the reboot, check the existence of the block device; and mount the file system within the non-global zone.
# hostname v440appserv # ls -l /dev/dsk/c5t0d0s6 brw-r----- 1 root sys 118, 126 Aug 27 14:40 /dev/dsk/c5t0d0s6 # fstyp /dev/dsk/c5t0d0s6 ufs # mount /dev/dsk/c5t0d0s6 /mnt # df -h /mnt Filesystem size used avail capacity Mounted on /dev/dsk/c5t0d0s6 535G 64M 530G 1% /mnt
Mounting a file system from the global zone into the non-global zone
Sometimes it is desirable to have the flexibility of mounting a file system in the global zone or non-global zone on-demand. In such situations, rather than exporting the file systems or block devices into the non-global zone, create the file system in the global zone and mount the file system directly from the global zone into the non-global zone. Make sure to unmount that file system in the global zone if mounted, before attempting to mount it in the non-global zone.
eg.,In the non-global zone:
# mkdir /repo1
In the global zone:
# df -h /repo1 /dev/dsk/c2t40d0s6 134G 64M 133G 1% /repo1 # umount /repo1 # ls -ld /zones/appserv/root/repo1 drwxr-xr-x 2 root root 512 Aug 27 14:45 /zones/appserv/root/repo1 # mount /dev/dsk/c2t40d0s6 /zones/appserv/root/repo1
Now go back to the non-global zone and check the mounted file systems.
# hostname v440appserv # df -h /repo1 Filesystem size used avail capacity Mounted on /repo1 134G 64M 133G 1% /repo1To unmount the file system from the non-global zone, run the following command from the global zone.
# umount /zones/appserv/root/repo1
Removing the file system from the non-global zone
eg.,Earlier in the zone creation step, the block device /dev/dsk/c2t40d1s6 was exported and mounted on the mount point /repo2 inside the non-global zone. To remove the file system completely from the non-global zone, run the following in the global zone.
# zonecfg -z appserv zonecfg:appserv> remove fs dir=/repo2 zonecfg:appserv> verify zonecfg:appserv> commit zonecfg:appserv> exit
Reboot the non-global zone for this setting to take effect.
Shutting down and booting up the local zones (Updated 01/15/2008)
- To bring down the local zone:
% zlogin appserv shutdown -i 0
- To boot up the local zone:
% zoneadm -z appserv boot
Just for the sake of completeness, the following steps show how to remove a local zone.
Steps to delete a Local Zone
- Shutdown the local zone
% zoneadm -z appserv halt % zoneadm list -cv ID NAME STATUS PATH 0 global running / - appserv installed /zones/appserver
- Uninstall the local zone -- remove the root file system
% zoneadm -z appserv uninstall Are you sure you want to uninstall zone appserv (y/[n])? y zoneadm list -cv ID NAME STATUS PATH 0 global running / - appserv configured /zones/appserver
- Delete the configured local zone
% zonecfg -z appserv delete Are you sure you want to delete zone appserv (y/[n])? y zoneadm list -cv ID NAME STATUS PATH 0 global running /
[New: 07/14/2009]
Cloning a Non-Global Zone
The following instructions are for cloning a non-global zone on the same system. The example shown below clones the siebeldb zone. After the cloning process, a brand new zone oraclebi emerges as a replica of siebeldb zone.
# zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared - siebeldb installed /zones/dbserver native excl
- Export the configuration of the zone that you want to clone/copy
# zonecfg -z siebeldb export > /tmp/siebeldb.config.cfg
-
Change the configuration of the new zone that differ from the existing one -- for example, IP address, data set names, network interface etc. To make these changes, edit
/tmp/siebeldb.config.cfg - Create the zone root directory for the new zone being created
# mkdir /zones3/oraclebi # chmod 700 /zones3/oraclebi # ls -ld /zones3/oraclebi drwx------ 2 root root 512 Mar 12 15:41 /zones3/oraclebi
- Create a new (empty, non-configured) zone in the usual manner with the edited configuration file as an input
# zonecfg -z oraclebi -f /tmp/siebeldb.config.cfg # zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared - siebeldb installed /zones/dbserver native excl - oraclebi configured /zones3/oraclebi native excl
- Ensure that the zone you intend to clone/copy is not running
# zoneadm -z siebeldb halt
- Clone the existing zone
# zoneadm -z oraclebi clone siebeldb Cloning zonepath /zones/dbserver...
This step takes at least 5 minutes to clone the whole zone. Larger zones may take longer to complete the cloning process.
- Boot the newly created zone
# zoneadm -z oraclebi boot
Bring up the halted zone (the source zone) as well, if wish.
- Login to the console of the new zone to configure IP, networking, etc., and you are done.
# zlogin -C oraclebi
[New: 07/15/2009]
Migrating a Non-Global Zone from One Host to Another
Keywords: Solaris, Non-Global Zone, Migration, Attach, Detach
The following instructions demonstrate how to migrate the non-global zone, orabi to another server with examples.
# zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared 4 siebeldb running /zones/dbserver native excl - orabi installed /zones3/orabi native shared
- Halt the zone to be migrated, if running
# zoneadm -z orabi halt
- Detach the zone. Once detached, it will be in the configured state
# zoneadm -z orabi detach # zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared 4 siebeldb running /zones/dbserver native excl - orabi configured /zones3/orabi native shared
- Move the
zonepathfor the zone to be migrated from the old host to the new host.Do the following on the old host:
# cd /zones3 # tar -Ecf orabi.tar orabi # compress orabi.tar # sftp newhost Connecting to newhost... sftp> cd /zones3 sftp> put orabi.tar.Z Uploading orabi.tar.Z to /zones3/orabi.tar.Z sftp> quit
On the newhost:
# cd /zones3 # uncompress orabi.tar.Z # tar xf orabi.tar
- On the new host, configure the zone.
Create the equivalent zone
orabion the new host -- use thezonecfgcommand with the-aoption and thezonepathon the new host. Make any required adjustments to the configuration and commit the configuration.# zonecfg -z orabi orabi: No such zone configured Use 'create' to begin configuring a new zone. zonecfg:orabi> create -a /zones3/orabi zonecfg:orabi> info zonename: orabi zonepath: /zones3/orabi brand: native autoboot: false bootargs: pool: limitpriv: all,!sys_suser_compat,!sys_res_config,!sys_net_config,!sys_linkdir,!sys_devices,!sys_config,!proc_zone,!dtrace_kernel,!sys_ip_config scheduling-class: ip-type: shared inherit-pkg-dir: dir: /lib inherit-pkg-dir: dir: /platform inherit-pkg-dir: dir: /sbin inherit-pkg-dir: dir: /usr net: address: IPaddress physical: nxge1 defrouter not specified zonecfg:orabi> set capped-memory zonecfg:orabi:capped-memory> set physical=8G zonecfg:orabi:capped-memory> end zonecfg:orabi> commit zonecfg:orabi> exit
- Attach the zone on the new host with a validation check and update the zone to match a host running later versions of the dependent packages
# ls -ld /zones3 drwxrwxrwx 5 root root 512 Jul 15 12:30 /zones3 # chmod g-w,o-w /zones3 # ls -ld /zones3 drwxr-xr-x 5 root root 512 Jul 15 12:30 /zones3 # zoneadm -z orabi attach -u Getting the list of files to remove Removing 1740 files Remove 607 of 607 packages Installing 1878 files Add 627 of 627 packages Updating editable files The file within the zone contains a log of the zone update. # zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared - orabi installed /zones3/orabi native shared
Note:
It is possible to force the attach operation without performing the validation. You can do so with the help of
-Foption# zoneadm -z orabi attach -F
Be careful when using this option because it could lead to an incorrect configuration; and an incorrect configuration could result in undefined behavior
[New: 07/19/2009]
Tip: How to find out whether connected to the primary OS instance or the virtual instance?
If the command zonename returns global, then you are connected to the OS instance that was booted from the physical hardware. If you see any string other than global, you might have connected to the virtual OS instance.
Alternatively try running prstat -Z or zoneadm list -cv commands. If you see exactly one non-zero Zone ID, it is an indication that you are connected to a non-global zone.
Suggested reading:
- System Administration Guide: Solaris Containers-Resource Management and Solaris Zones
- Zones and Containers FAQ at opensolaris.org
- Zones : Unofficial FAQ
Posted at 09:39PM Jul 19, 2009 by Giri Mandalika in Solaris | Comments[0]
Controlling [Virtual] Network Interfaces in a Non-Global Solaris Zone
In the software world, some tools like SAP NetWeaver's Adaptive Computing Controller (ACC) require full control over a network interface, so they can bring up/down the NICs at their will to fulfill their responsibilities. Those tools may function normally on Solaris 10 [and later] as long as they are run in the global zone. However there might be some trouble when those tools are attempted to run in a non-global zone, especially on machines with only one physical network interface installed, and when the non-global zones are created with the default configuration. This blog post attempts to suggest few solutions to get around those issues, so the tools can function the way they normally do in the global zone.
If the machine has only one NIC installed, there are at least two issues that will prevent tools like ACC from working in a non-global zone.
- Since there is only one network interface on the system, it is not possible to dedicate that interface to the non-global zone where ACC is supposed to run. Hence all the zones, including the global zone, must share the physical network interface.
- When the physical network interface is being shared across multiple zones, it is not possible to plumb/unplumb the network interface from a Shared-IP Non-Global Zone. Only the root users in the global zone can plumb/unplumb the lone physical network interface.
When a non-global zone is created with the default configuration, Shared-IP zone is created by default. Shared-IP zones have separate IP addresses, but share the IP routing configuration with the global zone.
Fortunately, Solaris 10 has a solution to the aforementioned issues in the form of Network Virtualization. Crossbow is the code name for network virtualization in Solaris. Crossbow provides the necessary building blocks to virtualize a single physical network interface into multiple virtual network interfaces (VNICs) - so the solution to the issue at hand is to create a virtual network interface, and then to create an Exclusive-IP Non-Global Zone using the virtual NIC. Rest of the blog post demonstrates the simple steps to create a VNIC, and to configure a non-global zone as Exclusive-IP Zone.
Create a Virtual Network Interface using Crossbow
- Make sure the OS has Crossbow functionality
global# cat /etc/release Solaris Express Community Edition snv_111 SPARC Copyright 2009 Sun Microsystems, Inc. All Rights Reserved. Use is subject to license terms. Assembled 23 March 2009Crossbow has been integrated into Solaris Express Community Edition (Nevada) build 105 - hence all Nevada builds starting with build 105 will have the Crossbow functionality. OpenSolaris 2009.06 and the next major update to Solaris 10 are expected to have the support for network virtualization out-of-the-box.
- Check the existing zones and the available physical and virtual network interfaces.
global# zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared global# dladm show-link LINK CLASS MTU STATE OVER e1000g0 phys 1500 up --
In this example, there is only one NIC, e1000g0, on the server; and there are no non-global zones installed.
- Create a virtual network interface based on device e1000g0 with an automatically generated MAC address. If the NIC has factory MAC addresses available, one of them will be used. Otherwise, a random address is selected. The
automode is the default action if none is specified.global# dladm create-vnic -l e1000g0 vnic1
- Check the available network interfaces one more time. Now you should be able to see the newly created virtual NIC in addition to the existing physical network interface. It is also possible to list only the virtual NICs.
global# dladm show-link LINK CLASS MTU STATE OVER e1000g0 phys 1500 up -- vnic1 vnic 1500 up e1000g0 global# dladm show-vnic LINK OVER SPEED MACADDRESS MACADDRTYPE VID vnic1 e1000g0 1000 2:8:20:32:9:10 random 0
Create a Non-Global Zone with the VNIC
- Create an Exclusive-IP Non-Global Zone with the newly created VNIC being the primary network interface.
global # mkdir -p /export/zones/sapacc global # chmod 700 /export/zones/sapacc global # zonecfg -z sapacc sapacc: No such zone configured Use 'create' to begin configuring a new zone. zonecfg:sapacc> create zonecfg:sapacc> set zonepath=/export/zones/sapacc zonecfg:sapacc> set autoboot=false zonecfg:sapacc> set ip-type=exclusive zonecfg:sapacc> add net zonecfg:sapacc:net> set physical=vnic1 zonecfg:sapacc:net> end zonecfg:sapacc> verify zonecfg:sapacc> commit zonecfg:sapacc> exit global # zoneadm -z sapacc install global # zoneadm -z sapacc boot global # zoneadm list -cv ID NAME STATUS PATH BRAND IP 0 global running / native shared 1 sapacc running /export/zones/sapacc native excl
- Configure the new non-global zone including the IP address and the network services
global # zlogin -C -e [ sapacc ... > Confirm the following information. If it is correct, press F2; to change any information, press F4. Host name: sap-zone2 IP address: 10.6.227.134 System part of a subnet: Yes Netmask: 255.255.255.0 Enable IPv6: No Default Route: Detect one upon reboot - Inside the non-global zone, check the status of the VNIC and the status of the network service
local# hostname sap-zone2 local# zonename sapacc local# ifconfig -a lo0: flags=2001000849
mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 vnic1: flags=1000843 mtu 1500 index 2 inet 10.6.227.134 netmask ffffff00 broadcast 10.6.227.255 ether 2:8:20:32:9:10 lo0: flags=2002000849 mtu 8252 index 1 inet6 ::1/128 local# svcs svc:/network/physical STATE STIME FMRI disabled 13:02:18 svc:/network/physical:nwam online 13:02:24 svc:/network/physical:default Check the network connectivity.
From inside the non-global zone to the outside world:
local# ping -s sap29 PING sap29: 56 data bytes 64 bytes from sap29 (10.6.227.177): icmp_seq=0. time=0.680 ms 64 bytes from sap29 (10.6.227.177): icmp_seq=1. time=0.452 ms 64 bytes from sap29 (10.6.227.177): icmp_seq=2. time=0.561 ms 64 bytes from sap29 (10.6.227.177): icmp_seq=3. time=0.616 ms ^C ----sap29 PING Statistics---- 4 packets transmitted, 4 packets received, 0% packet loss round-trip (ms) min/avg/max/stddev = 0.452/0.577/0.680/0.097
From the outside world to the non-global zone:remotehostonWAN# telnet sap-zone2 Trying 10.6.227.134... Connected to sap-zone2.sun.com. Escape character is '^]'. login: test Password: Sun Microsystems Inc. SunOS 5.11 snv_111 November 2008 -bash-3.2$ /usr/sbin/ifconfig -a lo0: flags=2001000849
mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 vnic1: flags=1000843 mtu 1500 index 2 inet 10.6.227.134 netmask ffffff00 broadcast 10.6.227.255 lo0: flags=2002000849 mtu 8252 index 1 inet6 ::1/128 -bash-3.2$ exit logout Connection to sap-zone2 closed.
Dynamic [Re]Configuration of the [Virtual] Network Interface in a Non-Global Zone
- Finally try plumbing down/up the virtual network interface inside the Exclusive-IP Non-Global Zone
global # zlogin -C -e [ sapacc [Connected to zone 'sapacc' console] .. zoneconsole# ifconfig vnic1 unplumb zoneconsole# /usr/sbin/ifconfig -a lo0: flags=2001000849
mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 zoneconsole# ifconfig vnic1 plumb zoneconsole# ifconfig vnic1 10.6.227.134 netmask 255.255.255.0 up zoneconsole# /usr/sbin/ifconfig -a lo0: flags=2001000849 mtu 8232 index 1 inet 127.0.0.1 netmask ff000000 vnic1: flags=1000843 mtu 1500 index 2 inet 10.6.227.134 netmask ffffff00 broadcast 10.6.227.255 lo0: flags=2002000849 mtu 8252 index 1 inet6 ::1/128
As simple as that! Before we conclude, be informed that prior to Crossbow, Solaris system administrators were required to use Virtual Local Area Networks (VLAN) to achieve similar outcomes.
Check Zones and Containers FAQ, if you are stuck with a strange situation or if you need some interesting ideas around virtualization on Solaris.
Posted at 01:29AM Apr 07, 2009 by Giri Mandalika in Solaris | Comments[0]
Mounting Windows' NTFS on [Open]Solaris x86/x64
The steps outlined in this blog post are derived from the Miscellaneous filesystem support for OpenSolaris on x86 web page. I just added few examples to illustrate the steps to mount a partition with NTFS filesystem that exists on the external hard drive (in this case, it is a Seagate FreeAgent external hard drive).
Step-by-Step instructions to mount NTFS filesystem on [Open]Solaris
-
Install the packages : FSWpart and FSWfsmisc.
-
Download http://www.belenix.org/binfiles/FSWpart.tar.gz and http://www.belenix.org/binfiles/FSWfsmisc.tar.gz
-
Uncompress and install the packages with 'root' privileges.
% gunzip -c FSWpart.tar.gz | tar xvf - # pkgadd -d . FSWpart % gunzip -c FSWfsmisc.tar.gz | tar xvf - # pkgadd -d . FSWfsmisc
-
-
Find the logical device name for the NTFS partition.
-loption of thermformatcommand lists all removable devices along with their device names.# rmformat -l Looking for devices... 1. Logical Node: /dev/rdsk/c1t0d0p0 Physical Node: /pci@0,0/pci-ide@1f,1/ide@1/sd@0,0 Connected Device: MATSHITA UJDA750 DVD/CDRW 1.60 Device Type: DVD Reader Bus: IDE Size:Label: Access permissions: 2. Logical Node: /dev/rdsk/c2t0d0p0 Physical Node: /pci@0,0/pci1179,1@1d,7/storage@1/disk@0,0 Connected Device: Seagate FreeAgentDesktop 100F Device Type: Removable Bus: USB Size: 953.9 GB Label: Access permissions: - Identify the NTFS partition on the external disk with the help of
fdisk# fdisk /dev/rdsk/c2t0d0p0 Total disk size is 60800 cylinders Cylinder size is 32130 (512 byte) blocks Cylinders Partition Status Type Start End Length % ========= ====== ============ ===== === ====== === 1 IFS: NTFS 0 60800 60801 100 SELECT ONE OF THE FOLLOWING: 1. Create a partition 2. Specify the active partition 3. Delete a partition 4. Change between Solaris and Solaris2 Partition IDs 5. Exit (update disk configuration and exit) 6. Cancel (exit without updating disk configuration) Enter Selection: 6 #In this example, partition #1 i.e., c2t0d0p1 has the NTFS filesystem.
-
Mount the NTFS partition just like mounting an UFS filesystem using the
mountcommand. Use the argumentntfsto the command line option-F. Since the filesystem was mounted in a slightly different way than the conventional way, use/usr/bin/xlsmountsto see the detailed mount table information.# mount -F ntfs /dev/dsk/c2t0d0p1 /mnt # /usr/bin/xlsmounts PHYSICAL DEVICE LOGICAL DEVICE FS PID ADDR Mounted on /dev/dsk/c2t0d0p1 /dev/dsk/c2t0d0p1 ntfs 6755 127.0.0.1:/ /mnt # ls /mnt expForSun.dmp MySQL5.1 RECYCLER medium-64-bit $RECYCLE.BIN System Volume Information
Notice the
127.0.0.1:/underADDRcolumn in the output ofxlsmounts. NTFS mount uses userland NFSv2 server to access the filesystems on raw partitions. That is why the mount was shown as NFS client mounted from 127.0.0.1:/ -
To unmount the NTFS filesystem, use
/usr/bin/xumount. Solaris standardumountcommand unmounts the filesystem but does not terminate the background NFS server process.# /usr/bin/xumount /mnt - OR - # /usr/bin/xumount /dev/dsk/c2t0d0p1
Check the Miscellaneous filesystem support for OpenSolaris on x86 page and Moinak Ghosh's blog post Mount and Access NTFS and Ext2FS from Solaris x86 for the rest of the fine details.
Posted at 02:17AM Feb 08, 2009 by Giri Mandalika in Solaris | Comments[3]
PeopleSoft on Solaris 10: Fixing the "msgget: No space left on device" Error
(Crossposting the 8+ month old blog entry from my other blog hosted on blogger. Source URL:http://technopark02.blogspot.com/2008/03/peoplesoft-fixing-msgget-no-space-left.html)
When a large number of application server processes are configured in a single PeopleSoft domain or in multiple domains cumulative, it is very likely that the PeopleSoft application server domain boot process may fail with errors like:
Booting server processes ...
exec PSSAMSRV -A -- -C psappsrv.cfg -D CS90SPV -S PSSAMSRV :
Failed.
113954.ben15!PSSAMSRV.29746.1.0: LIBTUX_CAT:681: ERROR: Failure to create message queue
113954.ben15!PSSAMSRV.29746.1.0: LIBTUX_CAT:248: ERROR: System init function failed, Uunixerr = :
msgget: No space left on device
113954.ben15!tmboot.29708.1.-2: CMDTUX_CAT:825: ERROR: Process PSSAMSRV at ben15 failed with /T
tperrno (TPEOS - operating system error)
In this particular example, the PeopleSoft Enterprise is running on a Solaris 10 system. Fortunately the error message is very clear in this case; and the failure is related to the message queues. During the domain boot up process, there is a call to msgget() to create a message queue. If the call to msgget() succeeds, it returns a non-negative integer that serves as the identifier for the newly created message queue. However in the case of a failure, it returns -1 and sets the error number to EACCES, EEXIST, ENOENT or ENOSPC depending on the underlying reason.
From the above error messages it clear that the msgget() failed with the errno set to ENOSPC (No space left on device). Man page of msgget(2) has the following explanation for ENOSPC error code on Solaris:
ERRORS
The msgget() function will fail if:
...
...
ENOSPC A message queue identifier is to be created but
the system-imposed limit on the maximum number of
allowed message queue identifiers system wide
would be exceeded. See NOTES.
NOTES
...
...
The system-imposed limit on the number of message queue
identifiers is maintained on a per-project basis using the
project.max-msg-ids resource control.
It has enough clues to suspect the configured number for the message queue identifiers.
Prior to the release of Solaris 10, the /etc/system System V IPC tunable, msgsys:msginfo_msgmni, was used to control the maximum number of message queues that can be created. The default value on pre-Solaris 10 systems is 50.
With the release of Solaris 10, majority of the System V IPC tunables were obsoleted and equivalent resource controls were created for the remaining tunables to reduce the administrative overhead. On Solaris 10 and later versions, System V IPC can be tuned on a per project basis using the newly introduced resource controls.
On any Solaris 10 system, the resource control, project.max-msg-ids, replaced the old /etc/system tunable, msginfo_msgmni. And the default value has been raised to 128.
Now back to the failure in PeopleSoft environment. Let's first check the current value configured for project.max-msg-ids.
- Get the project ID.
% id -p uid=222227(psft) gid=2294(dba) projid=3(default)
- Examine the
project.max-msg-idsresource control for the project with ID 3, using theprctlutility.% prctl -n project.max-msg-ids -i project 3 project: 3: default NAME PRIVILEGE VALUE FLAG ACTION RECIPIENT project.max-msg-ids privileged 128 - deny - system 16.8M max deny -
Alternatively run the command ipcs -q to check the number of active message queues. Note that the project with id '3' is configured to create a maximum of 128 (default) message queues. In any case, the number of active message queues from the ipcs -q output may almost match with the configured value for the project.max-msg-ids.
Since it appears the configured PeopleSoft domain(s) needs more than 128 message queues in order to bring up all the application server processes that constitute the PeopleSoft Enterprise, the solution is to increase the value for the resource control, project.max-msg-ids, to any value beyond 128. For the sake of simplicity, let's increase it to 256 (2 * default value, that is). Again prctl utility can be used to set the new value for the resource control.
- Assume the privileges of the 'root' user
% su Password:
- Increase the maximum value for the message queue identifiers to 256 using the
prctlutility.# prctl -n project.max-msg-ids -r -v 256 -i project 3
- Verify the new maximum value for the message queue identifiers
# prctl -n project.max-msg-ids -i project 3 project: 3: default NAME PRIVILEGE VALUE FLAG ACTION RECIPIENT project.max-msg-ids privileged 256 - deny - system 16.8M max deny -
With this change, the PeopleSoft Enterprise should boot up at least with no Failure to create message queue .. msgget: No space left on device errors.
Before we conclude, note that the above mentioned solution is not persistent across multiple operating system reboots. To make it persistent, create a new project using the projadd command. The man page for projadd(1M) has an example showing the creation of a project.
Posted at 12:38AM Dec 01, 2008 by Giri Mandalika in Solaris | Comments[4]
Oracle on Solaris 10 : Fixing the 'ORA-27102: out of memory' Error
(Crossposting the 2+ year old blog entry from my other blog hosted on blogger. Source URL:
http://technopark02.blogspot.com/2006/09/solaris-10oracle-fixing-ora-27102-out.html)
Symptom:
As part of a database tuning effort you increase the SGA/PGA sizes; and Oracle greets with an ORA-27102: out of memory error message. The system had enough free memory to serve the needs of Oracle.
SQL> startup ORA-27102: out of memory SVR4 Error: 22: Invalid argument
Diagnosis
$ oerr ORA 27102 27102, 00000, "out of memory" // *Cause: Out of memory // *Action: Consult the trace file for details
Not so helpful. Let's look the alert log for some clues.
% tail -2 alert.log WARNING: EINVAL creating segment of size 0x000000028a006000 fix shm parameters in /etc/system or equivalent
Oracle is trying to create a 10G shared memory segment (depends on SGA/PGA sizes), but operating system (Solaris in this example) responded with an invalid argument (EINVAL) error message. There is a little hint about setting shm parameters in /etc/system.
Prior to Solaris 10, shmsys:shminfo_shmmax parameter has to be set in /etc/system with maximum memory segment value that can be created. 8M is the default value on Solaris 9 and prior versions; where as 1/4th of the physical memory is the default on Solaris 10 and later. On a Solaris 10 (or later) system, it can be verified as shown below:
% prtconf | grep Mem
Memory size: 32760 Megabytes
% id -p
uid=59008(oracle) gid=10001(dba) projid=3(default)
% prctl -n project.max-shm-memory -i project 3
project: 3: default
NAME PRIVILEGE VALUE FLAG ACTION RECIPIENT
project.max-shm-memory
privileged 7.84GB - deny -
system 16.0EB max deny -
Now it is clear that the system is using the default value of 8G in this scenario, where as the application (Oracle) is trying to create a memory segment (10G) larger than 8G. Hence the failure.
So, the solution is to configure the system with a value large enough for the shared segment being created, so Oracle succeeds in starting up the database instance.
On Solaris 9 and prior releases, it can be done by adding the following line to /etc/system, followed by a reboot for the system to pick up the new value.
set shminfo_shmmax = 0x000000028a006000However shminfo_shmmax parameter was obsoleted with the release of Solaris 10; and Sun doesn't recommend setting this parameter in /etc/system even though it works as expected.
On Solaris 10 and later, this value can be changed dynamically on a per project basis with the help of resource control facilities . This is how we do it on Solaris 10 and later:
% prctl -n project.max-shm-memory -r -v 10G -i project 3
% prctl -n project.max-shm-memory -i project 3
project: 3: default
NAME PRIVILEGE VALUE FLAG ACTION RECIPIENT
project.max-shm-memory
privileged 10.0GB - deny -
system 16.0EB max deny -
Note that changes done with prctl command on a running system are temporary, and will be lost when the system is rebooted. To make the changes permanent, create a project with projadd command as shown below:
% projadd -p 3 -c 'eBS benchmark' -U oracle -G dba -K 'project.max-shm-memory=(privileged,10G,deny)' OASBFinally make sure the project is created with
projects -l or cat /etc/project commands.
% projects -l
...
...
OASB
projid : 3
comment: "eBS benchmark"
users : oracle
groups : dba
attribs: project.max-shm-memory=(privileged,10737418240,deny)
% cat /etc/project
...
...
OASB:3:eBS benchmark:oracle:dba:project.max-shm-memory=(privileged,10737418240,deny)
With these changes, Oracle would start the database up normally.
SQL> startup ORACLE instance started. Total System Global Area 1.0905E+10 bytes Fixed Size 1316080 bytes Variable Size 4429966096 bytes Database Buffers 6442450944 bytes Redo Buffers 31457280 bytes Database mounted. Database opened.
Related information:
- What's New in Solaris System Tuning in the Solaris 10 Release?
- Resource Controls (overview)
- System Setup Recommendations for Solaris 8 and Solaris 9
- Man page of prctl(1)
- Man page of projadd
Addendum : Oracle RAC settings
Anonymous Bob suggested the following settings for Oracle RAC in the form of a comment for the benefit of others who run into similar issue(s) when running Oracle RAC. I'm pasting the comment as is (Disclaimer: I have not verified these settings):
Thanks for a great explanation, I would like to add one comment that will help those with an Oracle RAC installation. Modifying the default project covers oracle processes great and is all that is needed for a single instance DB. In RAC however, the CRS process starts the DB and it is a root owned process and root does not use the default project. To fix ORA-27102 issue for RAC I added the following lines to an init script that runs before the init.crs script fires.
# Recommended Oracle RAC system params ndd -set /dev/udp udp_xmit_hiwat 65536 ndd -set /dev/udp udp_recv_hiwat 65536 # For root processes like crsd prctl -n project.max-shm-memory -r -v 8G -i project system prctl -n project.max-shm-ids -r -v 512 -i project system # For oracle processes like sqlplus prctl -n project.max-shm-memory -r -v 8G -i project default prctl -n project.max-shm-ids -r -v 512 -i project default
So simple yet it took me a week working with Oracle and SUN to come up with that answer...Hope that helps someone out.
Bob
# posted by Blogger Bob : 6:48 AM, April 25, 2008
Posted at 01:10AM Nov 22, 2008 by Giri Mandalika in Solaris | Comments[11]
Sunday Jul 19, 2009
