Hello World. Welcome to my Weblog. Here, we will participate in discussions regarding how Sun Block Storage (Sun StorageTek 2500 Series, Sun StorageTek 6000 Series and Sun StorageTek 9000 Series) products integrate with Virualization technologies such as Sun xVM portfolio, VMware, Microsoft Hyper-V and so on.
I plan on posting several topics here in the coming weeks which will be open for community discussion. Please do not hesitate to openly participate.
Said-
==
Said A. Syed has over 14 years of industry experience, including over 8 years with Sun. Said started with Sun as a System Support Engineer in Chicago supporting high-end and mid-range servers and Sun storage products. Said joined Sun's Storage Product Technical Support group in 2004 as the Sun Support Services global lead for Brocade SAN products. In this position, Said managed the Sun Support Services relationship with Brocade Support Services directly and supported world-wide Sun customers on high visibility, high severity escalations involving SAN infrastructure products and Sun's high-end and low-cost storage products, the Sun Storage 3000 and 9000 series arrays. In 2008, Said was promoted to Staff Engineer role within Sun's NPI and OEM array engineering group and is currently chartered with gaining in-depth understanding of how virtualization applications such as VMware ESX server, the Sun xVM platform, Microsoft Hyper-V virtualization, Sun Logical Domains (LDoms), Solaris™ Containers, Cloud Computing, and other similar applications interact with Sun's modular and high-end storage arrays, the Sun Storage 2500, 6000 and 9000 series arrays. Said is a VMware Certified Professional (VCP) and Microsoft Certified Technology Professional - Hyper-V (MCTS: Hyper-V)
HI,
I would like to use my VMWare ESX server to run a Solaris x86 instance, and reserve a PCI-e scsi adapter card to that instance. I then would like to create a ZFS array from the SCSI JBOD I will attach to that dual-port U320 SCSI card.
1) Is this possible to do in ESX 3. u3 ? Or another way: is VMWare going to gracefully let me talk to the PCI-E U320 SCSI card without
interfering and corrupting my ZFS communications
2) Is my performance in ZFS going to suffer over using the jbod on bare metal
Thanks,
Matt
Posted by Matt on January 29, 2009 at 10:47 PM CST #
Matt,
Currently, you cant really reserve a PCI card for a Virtual Machine. However, one of the ways to get around this is to not map any luns from that JBOD to any other Virtual Machine.
To answer your question regarding ZFS. Yes, you can create ZFS pools from LUNs on that JBOD without VMware interfering with your ZFS communication. You can do that by mapping the LUNs to the Virtual machine as Raw Device Mappings instead of VMFS volumes. An RDMs give direct access to storage LUNs from Virtual Machines.
RDM (Raw Device Mapping)
=======================
In cases where a virtual machine requires direct access to storage and without VMFS-3 file system, a Raw Device Mapping can be used. This allows for the virtual machine to have raw access to the lun(s) just like a physical server would with the exception of data location, permission and locking which is performed by a mapping file resident in a VMFS volume.
RDM with Physical Compatibility Mode
====================================
An RDM with Physical Compatibility Mode provides direct access to the lun or physical disk for applications requiring lower level control. The virtualization layer in the ESX server storage I/O stack is by-passed and the mapping file is directly accessed.
RDM with Virtual Compatibility Mode
===================================
An RDM with Virtual Compatibility Mode provide for complete virtualization of the mapped device. The guest operating system thinks of the Lun as a virtual disk file in a VMFS-3 volume. This mode allows users to take advantage of snapshots as well as remote replication as it is more portable across storage hardware.
For ZFS, I would suggest RDM in Physical Compatibility Mode but you would lose the capability to create snapshots in VMware. Not sure if that is important to you or not.
Regarding the performance questions, as long as your ESX server is sized properly and not over committed too much on memory etc, I dont think you will see much performance degradation in terms zfs pools.
Here is a good VMware document that talks about RDMs and VMFS volumes and the differences:
http://www.vmware.com/pdf/vi3_35/esx_3/r35u2/vi3_35_25_u2_san_cfg.pdf
Give it a try. Let me know how it works out.
Said-
Posted by Said Syed on January 30, 2009 at 09:03 AM CST #
Said,
Thank you for your quick and helpful response - I did more reading and still dont completely understand about physical mapping a RDM for my external attached SCSI JBOD. Do I need to set up a Physical Compatibility Mode RDM for each hard drive in the JBOD, or can I just make an RDM for the PCI card and get all the devices attached to it?
Your comment about the VM Snapshot is interesting - I assume that I would be able to snapshot the Solaris VM's boot disk images that are hosted on the ESX server's VMFS-3 filesystem, but it would simply ignore the RDM-mode ZFS scsi array. If so, that is fine, as I can snapshot in ZFS within the VM if I want to do that. Just have to keep all the meta layers straight.
Another question - If I run a "zpool xport tank" in the VM, can I detach this array then from the ESX server and plug it in to physical machine and import it?
Maybe thats more of a ZFS compatibility question, i.e. can I export a ZFS pool on Solaris 10 x86 VM and then import it on a physical SF v240 Solaris 10 SPARC box?
And vice versa? It seems like I should be able to. What do you think?
Thanks again,
Matt
Posted by Matt on January 31, 2009 at 01:23 AM CST #
Matt,
I got your response in my email but it looks like for some reason it did not post to the blog. So I am incorporating my answer to your questions here:
Matt:
=====
Said,
Thank you for your quick and helpful response - I did more reading and still dont completely understand about physical mapping a RDM for my external attached SCSI JBOD. Do I need to set up a Physical Compatibility Mode RDM for each hard drive in the JBOD, or can I just make an RDM for the PCI card and get all the devices attached to it?
SAID:
=====
First off, you need to ensure that the JBOD you are using is supported on ESX servers. To do that, you will need refer to the SAN HCL guide:
http://www.vmware.com/resources/compatibility/pdf/vi_san_guide.pdf
Now, once thats done, the way it works is:
- Each hard drive would be recognized by the ESX server as a lun (depending on the kind of array/JBOD you have)
- You chose to either create VMFS volumes from on each of these LUNs or you simply go to your VM's "Edit Settings", go to "Hardware", Select Hard drive and click on "Add". Go through the menu and select Raw Device Mapping with Physical Compatibility Mode.
- Then you go to your OS and perform a devfsadm
Again, refer to the SAN administration guide I referred to and VMware Basic administration guide for further detailed assistance on this.
MATT:
=====
Your comment about the VM Snapshot is interesting - I assume that I would be able to snapshot the Solaris VM's boot disk images that are hosted on the ESX server's VMFS-3 filesystem, but it would simply ignore the RDM-mode ZFS scsi array. If so, that is fine, as I can snapshot in ZFS within the VM if I want to do that. Just have to keep all the meta layers straight.
SAID:
=====
You chose to create snapshots for your VM data (OS and data luns if they are in VMFS or RDM with Virtual Compatibility Mode). You just wont be able to snapshot RDMs in Physical Compatibility Mode. Yes, you can use ZFS snapshot within the VM's OS to do this.
MATT:
=====
Another question - If I run a "zpool xport tank" in the VM, can I detach this array then from the ESX server and plug it in to physical machine and import it?
SAID:
=====
You should be able to do that. ZFS pools dont know they are part of a virtual machine.
MATT:
=====
I export a ZFS pool on Solaris 10 x86 VM and then import it on a physical SF v240 Solaris 10 SPARC box?
SAID:
=====
You know, I dont know.. HMMM. I have to try it. Never thought about that. I will post my answer soon on this.
Posted by Said Syed on January 31, 2009 at 03:18 PM CST #
Matt,
Indeed, ZFS pools are portable between sparc and x86/x64 Solaris 10 version. Keep in mind that only raw data will be portable not application level information, even Oracle data possibly may not be accurately interpretable due to embedded platform specific sig.
You can surely give it a try if you like and report back... I'd be interested... :-)
Posted by Said Syed on February 04, 2009 at 12:15 PM CST #
Said,
Im getting closer to trying this out.... I have a VMWare ESX 3.5 U3 server with an
LSI PCI-e scsi card in it. I attached an external SCSI jbod - a Dell PV220S w/6 U320
SCSI drives in it. I could not get the ESX server to add a disk as a RAW device and posted a Q to VMWare about it... this is what they sent me... a hack to make a raw device:
http://itknowledgeexchange.techtarget.com/virtualization-pro/vmware-esx-storage-how-to-get-local-storage-to-act-as-a-raw-disk-for-vms/
next move is to try this and add all 6 drives to ESX as raw devices - then use
opensolaris to try to make a ZFS FS with the set. I feel a little like Ive gone off the rails here-- im out in the weeds somewhere -- but have to give it a go anyway. :)
Matt
Posted by Matt on February 16, 2009 at 12:17 PM CST #
I am not sure why you need to do any of that. You can really do that simply by adding the disks as RDMs via the VMware Virtual Center or VI Client, the process more simpler than what they are suggesting. I guess let me know how it goes.
Posted by Said Syed on February 25, 2009 at 09:15 PM CST #
Hi; curretly we have several performance issues with a Brocade 48k Intermix FICON/FC/FCip environment. Under Ficon run VSM, FC for EMC DMX and FCip for replica to other Brocade 48k.
Some of you know the best practices for Intermix configuration or limitants that have this environment.
Which could be the best architecture for connections / blades / ports in this particular configuration?
Regards
Posted by Israel Erick Pacheco Caballero on February 26, 2009 at 02:09 AM CST #
Hello Israel,
Can you give me a little bit of an idea on what kind of blades specifically you have installed on your 48Ks? And where do you see the performance issues?
I am assuming you have FR4-18i blades for FCIP?
Posted by Said Syed on February 26, 2009 at 08:32 AM CST #
Israel,
I am attaching your response here instead of email. I prefer we have our discussion over the Blog if that is ok with you so everyone can benefit:
====================
Hi;
Yes, our blades are FR4-18i.
This is a history.
Our customer has a FCIP extension solution that he’s using for FICON traffic. It’s VSM on one side, and 9940B tape drives on the other side, at a distance of 1000 Km. The FCIP tunnel has 150 Mbps of uncommited bandwidth. When the customer starts a remote tape backup through VSM, he says that the bandwidth used through the tunnel to write to two tape drives is only of 10 Megabytes per second (or 5 MB/s per tape drive). He would expect to see the full 150 Mbps in use, and he doesn’t know if he should blame the FCIP solution, or if the problem lies somewhere else. What do you think?
The other problem is about FC (EMC DMX). When the compression is disabled in brocade, we are able to transfer at 85% BW available (145Mbps), but with compression enabled, the performance goes down to 60% BW. FC have 4 process incoming from EMC DMX. When 1 process is running, the BW utilization is good, but when 4 processes are delivery, the BW utilization goes down, like a process saturation problem into the blades.
Do you know the best practices and limitations for INTERMIX environments.What could we expect as a maximum performance? FC has some recommendations that can not be enabled due to the FICON mixon architecture, like DLS or IOD.
I have a SAN healt if you want to know in deep our confguration. This problem actually is under revision by Sun Support USA and Brocade Support USA, but this question are not answered. I want to have your point of view and know cases like in your experience maybe have.
Thanks
Israel
============
Posted by Said Syed on February 26, 2009 at 12:28 PM CST #
RDM's can only be created on LUNs that are SAN attached. You can't do RDM's using a SCSI DAS like the PV220S or internal SCSI / SAS Storage.
Posted by Francois Auger on March 11, 2009 at 03:39 PM CDT #