Taylor's Take on Sun Storage : Weblog

Taylor's Take on Sun Storage

My storage team and I focus on three of the most important aspects in any industry: customers, competitors and market trends. There is insight to gain and share in this role, so here is our take on Sun and Storage - Taylor Allis


amazon disk emc honeycomb hp hpc ibm netapp open opensolaris openstorage saas solaris storage storagetek sun sunstorage tape thumper virtualization vtl web2.0 x4500 zfs
Thursday Jun 26, 2008

Open Storage: Early Customers

Now let's talk about what really matters when it comes down to Open Storage - customers.  Now, forgive me for repeating myself because I mentioned some of these customer testimonials in my first Open Storage blog.  But it just so happens that the ones I blogged about ended up in the Open Storage Adoption White Paper.    So, I'll talk about them (again), but also throw some new ones in... 

Customer Blog Comments:  Before I do, I want to point to a couple of blog comments.  The first one was submitted to Chuck Hollis' EMC blog post Do-It-Yourself Storage.  After Mr. Hollis finished criticizing Open Storage, an experienced storage and IT customer posted this comment in response to his critique:  

"I think you are missing the point. I did actually a price comparison. Building your storage solution (in my case several hundred TBs) from cheap disks using x86 servers + ZFS + Open Solaris + Solaris Cluster + ZFS, where all software is not only open sourced it is also for free, *does* make a huge difference. We actually started building our solution on EMC Symmetrix (great box) and EMC Celerra years ago and endup on really cheap storage + ZFS as a replacement and a way to move forward. Additionally all features like snapshots, cloning, end-to-end checksuming, remote replication, built-in compression, built-in cryptography, NFS, CIFS, iSCSI, ... are also for free. Better - they work exactly the same regardles what cheap storage or server we put underneath.

What ZFS brings to the market is the open sourced and free Google like approach to storage - how to cheaply build reliable storage from small to large scale installations.

Sure, especially for SMB market, what is needed is an easy GUI interface built on-top of Solaris + ZFS. I'm sury you will see one sooner or later."

I like the "Google like approach to storage" comment (and the fact that a disk + ZFS system replaced an EMC DMX and Celerra system for this particular customer!)  I also had a customer post these comments to my blog -  they just deployed a 2TB OpenSolaris ZFS + COMSTAR storage system for a VMWare Cluster with off the shelf components saving €2,000 in the process.  Cool stuff.   

Open Storage Customers:  Not surprisingly, a lot of these early adopters use open source to compete in their respective businesses.  You may also expect that early Open Storage adopters would come from Sun's Solaris install base - while true, many new customers do NOT come from Sun's current base.  In fact, a lot are Linux users.  They have chosen Sun because of...Storage. (Open Storage to be precise)


DigiTar
DigiTar provides messaging security and processing services over the Internet (antivirus, antispam, antiphishing, firewall, and archiving).  DigiTar is using Open Storage to improve the performance and efficiency of their database servers. They are using Sun's X4500 storage servers and ZFS to automate database storage administration - with ZFS they have reduced the identification and fixing of database corruption by days and/or weeks.   They are also an active member of Sun's OpenSolaris community and use the OpenSolaris community and SunSpectrum for tech support.

What I personally love about DigiTar's story is that they were (still are) a Linux shop.  So what made them a Sun OpenSolaris customer???  Storage!

Why? Read their CEO/CTO's blog:  Democratizing Storage.  He basically states that OpenSolaris was brought into the company because it made for a superior storage platform.  The clencher (and essence of Open Storage) for me is when he compares deploying Open Storage vs. a traditional storage architecture: 

"To replicate the level of redundancy we get with two X4500s, we’d have to install two completely separate storage arrays…not to mention also buy two very large beefy servers to run the databases. By using X4500s, we get the same reliability and redundancy for about 85% less cost. That kind of savings means we can deploy 6.8x more storage for the same price footprint and do all sorts of cool things like:

  • Create multiple data warehouses for data mining spam and mal-ware trends.
  • Develop and deploy new service features whenever we want without considering storage costs.
  • Be cost competitive with competitors 10x our size."

Want do do more with less?  DigiTar is with Open Storage...


Nexenta
Nexenta is unique in that they are an Open Storage customer - but they are using Open Storage to build and sell storage products of their own.  They bill their product as “Enterprise-class data storage for everyone!” Nexenta has built its NexentaOS and NexentaStor software appliance from OpenSolaris and ZFS; and they can deploy it over the Sun Fire X4500 as well as other HW.

Like DigiTar, Nexenta was founded by Linux gurus.  In fact, the Nexenta team developed the iSCSI stack that was adopted by the Linux community. So what platform did these experts in storage and open-source software choose to build a new storage offering?   OpenSolaris & ZFS - due to its advanced storage functionality and long history in enterprise environments.  

So now the company offers NexentaStor - a software-based NAS and iSCSI solution with unlimited incremental backups, snapshot mirroring and the inherent virtualization, performance, thin provisioning and ease of use benefits of ZFS.


Sapotek Inc.
Sapotek Inc. is a Software-as-a-Service (SaaS) provider that offers online desktop services to (currently) 200,000 users worldwide.  This SaaS provider had a classic Web 2.0 storage problem - how to massively scale, efficiently and affordably? 

Sapotek was running Red Hat Enterprise Linux on Dell servers and had been maxing out at five concurrent threads per server. So they migrated to Sun Fire X4200 servers and the Sun Fire X4500 storage server running ZFS.  They replaced four (4) Dell/EMC storage systems with one (1) Sun Fire X4500.  Sapotek also used Sun's ZFS snapshot feature, and reported that it had reduced backup and recovery times by 99%!  (From hours/days to minutes).  Here is what their cheif tech officer,  Oscar Mondragon, said about Sun Open Storage: 

"The ZFS file system feature of the Solaris 10 OS is a marvel. It creates a common storage pool where all storage performs as fast as if it were local. Our administrators can grow, add, or remove storage on the fly in a single step. Just two people administer 24 TB."


Above is just a sampling of early customers that have deployed components of Sun's hardware and software Open Storage portfolio.  Read the Open Storage Adoption White Paper for more case studies or my previous blog post.  Additionally, take a look at these customer testimonials (Some have bought commercial versions of Sun's storage offerings and some have deployed open source offerings)

  • Dow Corning Corporation: Increased storage capacity by 50% with ZFS
  • Joyent: Sun Fire X4500 gave them a hosted storage business at $1/GB as opposed to $6-$7/GB for EMC
  • Nomura Bank: Used ZFS to create a common storage pool, with no partitions to manage. Admins can now provision or grow storage, and add or remove a file system with a single command.  ZFS also safeguards data at the bank by running 64-bit checksums
  • OmniTI: During a catastrophic accident when 1.8 TBs were lost, they restored their entire database in just seconds with ZFS
  • University of Calgary: Deployed SAM-QFS software for 229TBs of storage, moves data between Sun StorageTek Disk and Tape via policy
  • University of Oxford: Their digital library project will manage 9 million library items and support an average of 9,000 library resource requests a week
  • IN2P3/CNRS: Optimized datacenter space and reduced energy consumption with X4500

Tuesday Jun 17, 2008

Open Storage: Vendor Landscape

Ok - chapter 5 in the Open Storage Adoption White Paper talks about the vendor landscape.  We'll start with Sun.  

Sun's Open Storage Differentiation
There are three areas that position Sun as the best partner for Open Storage solutions:

  1. Innovative HW systems: Sun’s hardware differentiation lies in design innovation.  Three examples of this are the Sun Fire X4500 (Thumper) which combines a four-way x64 server with 48 TB of SATA disk in a 4U rack space - one of the most efficient and dense storage servers on the planet (and there is more to come).  Sun's unique ST5800 (Honeycomb) archive platform scored perfect 10s in reliability and scalability with InfoWorld.  On the server side, Sun's Blade 6000 is the most open blade platform in the industry - delivering Solaris, Linux, Windows or VMware on single and multicore processors by Sun, AMD, and Intel in one chassis.
  2. OpenSolaris as a storage platform:  One of the most robust and reliable OSes in IT.  Sun also offers advanced open-source file systems including NFS, the upcoming Parallel NFS (pNFS) and ZFS.  ZFS can manage zettabytes of storage and offers data services including volume management, data integrity and software RAID.  
  3. Open-source storage applications: Sun has now open-sourced more high-level storage application software than any other storage vendor.  Applications like remote-mirror-copy and point-in-time-copy.  Take a look at Sun's complete open-source, end-to-end storage portfolio (a storage developer's dream):

Other "Open Storage" Efforts

In keeping with the definition of open source software + industry standard HW = Open Storage (aka a more scalable, economic storage architecture) let's look at what other vendors are doing with open source software and industry-standard hardware.  But remember, several vendors use open source software and industry standard hardware - but still limit customer choice and charge higher rates in their implementations...

IBM: In terms of industry-standard hardware, IBM sells Intel and AMD servers as well as SAS- and SATA-based disk and JBOD systems. IBM does see value in open source as it is a large Linux supporter. (However, Sun has more than 3,000 members and 30 open-source storage projects in development for OpenSolaris AND has even open-sourced its commercial applications like the Sun StorageTek Availability Suite - giving its customers full and affordable access to its own IP).  IBM’s recent investment in the storage market has been its recent acquisition of XIV - see IBM buys XIV - good move or bad?  XIV NEXTRA does use industry-standard hardware, but its software is proprietary (not open source).  XIV shows IBM has realized customers need more than what traditional disk products offer today - the design points of the XIV architecture are low cost and massive scalability.  However, the technology is new and IBM’s claims of low cost are yet to be determined.

HP: HP also sells Intel and AMD processor-based servers; as well as SAS, SATA and SCSI JBOD arrays.  HP sells ProLiant servers or industry-standard servers running Windows Storage Server (Not open source, but obviously a high-volume OS).  HP acquired PolyServe in 2007 to cluster its storage and server systems.  HP recently announced its HP StorageWorks 9100 Extreme Data Storage System (ExDS9100) - a large NAS appliance with an entry configuration of a whopping 246TB of standard disk.  HP also broke away from Windows in its implementation - using open-source Linux as the platform for this higher-capacity platform.   HP says it will ship the ExDS9100 by year's end, but without some features like CIFS support.  (And by the time the ExDS9100 ships, Sun's X4500 will have been on the market for over 2 years with over 250 petabytes installed.)

EMC: EMC primarily offers closed systems today - custom components and software that are available only through EMC.  But EMC can identify business/IT trends and adapt to them. In January 2008, EMC announced its first Storage as a Service (SaaS) or “Cloud” storage offering.  EMC is also investing in two products code-named “HULK” and “MAUI.”  MAUI is software that will provide what EMC calls a “global repository” - but not much in known at the time of this blog.   According to this blogger, HULK's official name is EMC InfiniFlex and a single system consists of a full 44U rack with up to 300 drives (10 disk trays x 30 disks).  They fit these 30 drives in 3U it looks like (and does the front-to-back cooling suggest they stick the extra disk behind the front-end disk?)  It also uses 12 dual core, 1U servers as well as 2 ethernet switches.   So, it essentially looks like industry-standard HW (storage & servers) with TBD software.  HULK and MAUI may be EMC’s first venture into the open-storage space - especially if the systems are able to work with other, third-party, industry-standard components. But the benefits to EMC’s new offerings, and just how “open” they are, are yet to be announced...

NetApp: NetApp sells their own proprietary hardware and also develops their own custom operating system called Data OnTap (while others in the industry have been moving to an open or high-volume operating system for storage - like Solaris, Linux or Windows.)  NetApp does not open-source its storage operating system software.

Dell: Dell has built its business on industry-standard, volume-based products.  Dell lets customers configure servers with industry-standard Intel and AMD processors, SATA disk drives and various Linux distributions. Dell can be credited for its online configuration and ordering services; but Open Storage customers really require enterprise-class software, services and tech support from their Open Storage vendors - like Sun offers

Next Blog...
Open Storage Case Studies

Thursday Jun 12, 2008

Open Storage: Size & Growth

Both EMC and IBM have chimed in on what their thoughts are on Open Storage's future.  Tony's analogy made me think about a simple way to explain what I think Open Storage's future will be like: 

Open Storage & Builders Square: Let's rewind to the 1990's.  At that time I knew very little about home improvement.  In my mind, the only option was to save up to hire an expensive contractor to update my kitchen/bathroom/bedroom, etc.  But then a new store opened up close to my home - Builders Square.  That shop was subsequently replaced by a new store called Home Depot.  I was intimidated when I first walked in - there were only serious contractors around and I couldn't believe the amount of stuff they had.

Well the time came to update our bathroom - and as I was looking up contractors and plumbers in the phone book my wife said, "why don't you see how much it would cost to do it yourself by going to that new home store down the street?"  Fast forward to today and the people at our local Home Depot and Lowe's know me by name.  I have installed bathrooms, kitchens and built our basement.  I had help from store employees, instruction guides and affordable house "components" to do everything.  And when I walk into a store today; I see families make up the majority of customers - not contractors.   The simple fact is this - I could  not have done what I wanted to do with my house if it wasn't for the introduction of these new DIY home stores.   Open storage will help users do more with less - and reference architectures, solution blueprints, online communities and consultant services will help customers deploy open storage.

Open Storage & Stay-at-Home Moms:  Let's go back to the 90's.  I was also an early Web developer back then.  In fact, a quick Google search of my name will find a question I posted to a developer's help desk in 1996 where I ask, "Do you know when Netscape 2.0 for mac will be able to pick up Java applets?"  Pretty funny; and a testament to the longevity to open standards like HTML.  But my point is this - back then my wife was working in consumer PR and had absolutely no idea what I was doing for a living in high tech.  The Internet? The WWW? 

Today my wife has taken on the full-time job of raising our kids.  And I am happy to report that she is looking into starting her own blog - and she is looking at a blog from another stay-at-home mom as an example.  Now, if I told my wife she would be "developing" her own WWW journal back in the 90's she would have thought me crazy (which happens often) or the task impossible.   But Internet tools have evolved in away that let's anyone self-publish on the web for little to no cost.

I believe open storage will find its way in early markets; and the support resources and tools around it will evolve over time, bringing open storage to more traditional, mature markets.

Open Storage Size & Growth: (From the Open Storage Adoption White Paper)
How do we know this short of getting out our crystal ball?  Well, we don't - we need to estimate - and this is how we did it:  We took multiple IDC forecasts and rolled them into an internal Sun model - forecasts which included revenue from industry standard storage (JBOD, SSD, etc.) and open source software.  Then we made our own assumptions on how the market will behave - for example, we believe the fastest open storage adoption rates will appear in the entry and midrange NAS, unified storage (iSCSI, FC, etc.) and object-based storage markets.  We assume adoption will be slower in more traditional, higher-end markets.  So, this is NOT an IDC model - but an internal Sun model based on IDC and industry data (to be clear).  And this is an industry/market revenue estimate - it includes ALL vendor revenue from industry-standard hardware + open source software used in ANY storage system and does not give ANY info on Sun revenue or performance (to be clear again)

So...we estimate open storage products and services will represent just under 12% percent of the total storage market in 2011. With IDC estimating the total storage market (hardware, software and services) generating approximately $90B in 2011, the open storage portion could be just over $10.6 billion.

We also predict that open storage will represent more than 20% of the external disk market by 2011.  At $5.2 billion, hardware represents the largest portion of the open storage market.  Tape storage and storage networking products are not included in the open storage forecast (although one could argue that LTO is industry standard hardware).

Next Blog: Sun Open Storage vs. Other Vendor Efforts

Tuesday Jun 10, 2008

Open Storage: Early Markets

As promised, below is a second White Paper on Open Storage. This second one addresses Open Storage market drivers and growth and is titled Open Storage Adoption.  It can be downloaded here:

Again, I would like to thank Bruce Norikane, our Sr. Analyst, for his help as well as our market research manager, Chris Ilg, for his forecasting work. And again, I'll use this blog to post the CliffsNotes for those short on time.  Below I will cover the need for a new storage architectures and early target markets.   In subsequent blogs I'll cover the Open Storage future market forecast, other vendor initiatives and customer case studies - early adopters who have used Open Storage to solve their critical business needs...

The Need for a New Storage Architecture

Bruce, mentioned above, made a profound statement during our Open Storage planning that ended up in the White Paper.  He said, "Google and Amazon would not exist if they hadn’t built their own storage infrastructures."  They certainly wouldn't exist in their current state.  When they started, traditional storage architectures were too expensive and inflexible to support the business model they had in mind.  So what did they do?  They had to buy commodity components and developed their own software like the Google File System (GFS)

Certainly not everyone can build their own file system today.  But the requirements that drove Google to build their own file system have done nothing but increased.   Consider the following facts:

  • IDC’s report “The Expanding Digital Universe" states that new digital data will grow 6X between 2006 and 2010 (Emerging market data is growing up to 40% faster)
    • 70% of this data will be created by individuals (me, on this blog for example)
    • Enterprises will be responsible for storing 85% of this new digital data (Sun's IT dept has to manage the data I create)
  • YouTube was founded in February 2005, and by May 2006, 50,000 videos were being added every day
    • A YouTube search today can return about 72.1 million videos and 2.89 million user channels

A new, more economic and scalable storage architecture is desperately needed - enter Open Storage...

Open Storage Growth Markets

Open Storage can (and will) compete with traditional storage architectures.  But Open Storage won't "take over the world" overnight.  Most likely the data center mix of open storage architectures vs. closed storage architectures will change over time and vary data center to data center (if history is our guide).  But what markets will adopt sooner?  What are the Open Storage "sweet spots"?

Web 2.0:  I count Web 2.0 apps as applications delivered via the Web.  Apps like blogs, wikis, podcasts, RSS feeds, mashups, and social-networking sites like MySpace, Facebook or SmugMug.  Consider this:

  • Technorati stated in March 2008 that there were 112.8 million blogs
    • Bloggers update their content with 1.6 million posts EVERY day (or more than 18 updates per second!)
  • There are currently 3,115 mashup applications on the Internet

And Web 2.0 apps are not just for up starts - Forrester surveyed 2,200 IT decision makers from traditional enterprises and found that 33% were planning on investing in Web 2.0 applications.  Web 2.0 storage requirements differ from traditional storage requirements as well. They need massively scalable but low-cost systems. Web 2.0 users are even willing to trade high availability for lower costs.  Everyone needs high scalability at lower costs - but the need in the Web 2.0 space is acute.   Thus, Web 2.0 will be the key driver for Open Storage architectures. 

HPC Storage:  IDC estimates that HPC storage systems added about $3.9 billion to the 2006 server revenue total and will undergo faster annual growth than HPC servers.  Maximizing I/O bandwidth and minimizing latency while scaling storage capacity is the top priority for HPC storage users. Because of this, data locality is an issue for many HPC implementations.  What's data locality?  HPC services provider Instrumental, Inc.explains:

Data locality is a big issue in some architectures. Sometimes you need to know where data is in memory to get the best performance. Locality issues are compounded by the enormous amount of software ‘in the middle (OS, file system, volume management, failover, host bus adapters, and so on)’.

To manage issues such as data locality, an open storage architecture is needed. The one thing that HPC storage deployments have in common is that they are all custom built. HPC users need direct access to their storage components and software along with the flexibility to swap components and customize software to optimize their storage.  This is difficult to do with closed storage systems.

Additionally, parallel, shared or clustered file systems that leverage global namespace technologies are used in most HPC storage environments. This includes the HPC open source file systems Sun offers - like Lustre.  In fact, Lustre is used in 15% of the top 500 supercomputers in the world and in six of the top 10 supercomputers.

Lastly, an additional top storage requirement in HPC is Hierarchal Storage Management (HSM) software (moving data from disk to tape)Why?  Just look at the massive amounts of data HPC applications generate. The San Diego Supercomputer Center states their earthquake simulations alone generate 47TB every week!  By 2011, they expect archived data to grow to more than 100PB. HPC centers must leverage the economics of tape to store such massive amounts of data.  Sun offers tape as well as open-source HSM software for disk-to-tape data migration - Sun's Storage Archive Manager (SAM) software.  

To see the real-world benefits an open storage architecture can offer HPC customers, see the Texas Advanced Computing Center (TACC) implementation of Sun Constellation - aka Ranger.   

Server Virtualization:  Open storage introduces more flexibility and consolidation benefits to the server-virtualization market. This added functionality can be realized in two ways:

  1. By running open storage software inside a virtual machine (VM)
  2. By running any vendor’s storage software on an open storage server
In the first scenario, storage users can consolidate servers using offerings such as Sun xVM software or VMware. Each operating system instance on the server is a VM. However, one VM can deploy storage software in order to create a virtual appliance inside the server.  In the below diagram, VM1 is running open source Sun StorageTek Archive Manager, creating a virtual archive appliance inside the server.

Storage users can now consolidate three servers and a storage appliance onto a single server. In a closed architecture, storage software cannot be separated from the hardware.

In the second scenario, users can use an open storage server, such as the Sun Fire X4500, as a storage target or shared appliance. What’s unique is that users can repurpose their storage appliance as their needs change. For example, customers can repurpose the same Sun Fire X4500 into a NAS device, a Virtual Tape Library (VTL) or a data replication appliance without buying more hardware. Now that's investment protection!

In the following diagram, a customer has taken a Sun Fire X4500 server running Linux-based VTL software and has repurposed it into a remote replication appliance by leveraging server virtualization and open source Sun StorageTek Availability Suite software. 

Sever Virtualization and Open Storage can deliver better investment protection and significant cost and consolidation advantages ...

Next Blog...

 What we predict the size and growth of the Open Storage will be

Tuesday Feb 12, 2008

Server Virtualization's Impact on Storage

Today Sun announced it entered into a stock purchase agreement to acquire innotek.  In a nutshell, innotek develops VirtualBox, an open source desktop virtualization platform.   On the server virtualization side, Sun has had an alliance with VMware for 2.5 years now, and Sun has also delivered its own Sun's xVM platform with some pretty unique self-healing and management capabilities. (Get OpenxVM here).  innotek will add to Sun's xVM platform, it allows laptops or desktops running Windows, Mac, Linux or Solaris to run multiple OSes side-by-side.  Cool stuff.  To learn more on innotek see the Weblogs of Steve Wilson and Joe Bonasera or download VirtualBox here

So all this server virtualization talk got us to thinking...

What is server virtualization's impact on Storage? 

I admit, my team and I have discussed this and we believe that the true impact is yet to be determined - this is new stuff after all.   This is what we do know:

Server Virtualization's link to storage: The most important link b/w server virtualization and storage is application mobility.  In server virtualization, customers can ultimately move applications from box-to-box and system-to-system much easier than ever before.  But as applications move to different systems, customers need to maintain the links to storage.  If customers have to maintain links to storage as they move their applications around, it would make sense for virtualized environments to leverage networked storage - maintaining the links through the network.

Which storage network benefits the most from Server Virtualization? SAN, iSCSI or NAS?  All of the prominent storage networks, FC, iSCSI and NAS, are fighting for virtual server market share. After reading several IDC briefs, all three show signs of growth.  In one brief, IDC claimed that Server Virtualization contributed to the increase of industry FC SAN sales in 2007.  IDC also predicts that ~50% of virtualized servers will be attached to iSCSI in the future - citing that server admins are generally more comfortable with IP-based storage and networks.  NAS vendors are also pushing file server networking to support virtual servers.

Server Virtualization has the potential to dramatically impact Storage customer requirements:  Server Virtualization is still emerging and maturing, but it will impact storage purchase patterns.  This will (or should)  impact how storage is marketed and sold and will most likely disrupt analyst's long-term forecasts of the storage market.  The amount and type of impact to vendors and customers should be interesting to watch.    

 I'd love to hear any comments on how others think server virtualization will impact storage....

Friday Nov 30, 2007

Top 10 Storage Technology Trends

It’s getting to be the end of the year – and Sun, like every other vendor I assume, is looking at the new technology trends that have (or will) impact our industry.

My team and I were asked to evaluate some – to see where to prioritize future investments, evaluate current and future competitive threats, etc.  Ensuring we stay ahead of the technology game.  So below is our very own top 10 list of emerging technologies.  (As well as some color to what Sun is doing here) 

I’m sure we’re missing some and comments and claims are our own, they don’t represent any commitment or position from Sun Corporate (For official news on Sun go here…)    

 Top 10 Storage Technology Trends:  

1. Open Storage Platform (aka general purpose storage, open source storage):  Trend #1 is a term we coined, so it may not sound familiar.  It is a combination of market trends as well as a direction Sun is taking with its newer products.  The concept of a common platform is not new – several vendors have tried to build one platform that can run multiple storage applications, saving users time and money.  “Open” is a relatively new concept for storage, but not for software or servers.  There are generally three components that make up an Open Storage Platform:

  • General Purpose Components: General-purpose servers, processors, storage and operating systems are now being deployed in enterprise-class storage devices. Previously, storage vendors had built proprietary operating systems, ASIC chips and other custom-built components. As commodity chips, components and software have matured, storage vendors are now using general purpose components in their systems. The cost savings are significant - however, most vendors' prices remain the same (contributing to vendor margin). Sun's philosophy is to pass these savings to the consumer. Enterprise-class systems based on general purpose components give customers higher-value systems at a fraction of the cost (see graphic below). They are also much more flexible, as they can be re-purposed for other uses.
  • OS/File System Storage Services: Traditional appliances charge customers extra in software licensing costs for data management services like administration, replication or volume management.  What if this functionality came already embedded in the storage system itself?  Sun's newest file system, ZFS, has started to incorporate these services at the File System level. ZFS deploys point-in-time-copy, volume management, administration and data integrity features like Copy-on-write and RAID. Storage services deployed at the OS-level have several benefits, including efficiency, performance, reliability and affordability (no more licenses for extra software needed).  And before we think this is a new concept, mainframe has been doing it for years. The new concept is doing this in the open systems space – and Sun Solaris ZFS is leading the charge.      
  • Open Source Storage Software: The ability to download software, test it and add features to it is critical to developers building new applications – we’ve seen this in the OS market.  But what about storage?   This has been an investment area of Sun’s - Sun actually offers one of the most complete open source storage software stacks (from protocols to drivers to data management software).  Developers can build their own storage solutions and sell them by leveraging Sun’s open source software (see the Nexenta Storage Appliance for a perfect example).  Open source has an added benefit to customers with in-house development resources - customers can deploy new software features by searching for it in the open source community or even developing it themselves (why wait for vendor roadmaps?). This is not possible with a proprietary appliance. 

The simplest way to show the impact the Open Storage Platform concept will have on the storage industry is a basic economic comparison.  We used IDC's Pricing Database to compare over 50 actual purchase orders of enterprise disk (Sun ST9900, EMC DMC3), Midrange Disk (Sun ST6140, EMC CX3) and a Sun product built on our Open Storage Platform (the SunFire X4500):

  • Enterprise Disk = $18.84/GB
  • Midrange Disk = $10.39/GB
  • Open Storage Platform = $1.50/GB
    (7x less than Midrange and 13x less than Enterprise)
NOTE:  There are certain applications and features that must run on higher-end disk.  But at a $1.50, customers will be compelled to find which applications should be running on the SunFire X4500. What is even more remarkable, the SunFire X4500 includes the server, OS, data services, storage and networking components - all for $1.50/GB! 

Game-changing economics…

2.  Virtualization:  Where to begin?  The benefits are obvious – massive savings through optimization, consolidation and optimization.  The best definition of virtualization I have seen came from the 451 Group’s Virtualization Report which defined Virtualization as, “A software abstraction layer that permits aggregation, emulation or partitioning.”  Let’s look at the different types of virtualization out there today

  • Server Virtualization:  The ability to host Windows, Linux and Solaris operating systems on one platform has literally taken the market by storm.  A huge boon to IT managers and developers alike.  VMWare led the charge with some start-ups and open source initiatives in tow.  On October 5, 2007 Sun entered the race with Sun xVM.  (And if virtualization has hit the systems world, expect to see it hit the desktop market soon…)      
  • File Virtualization (a.k.a Clustering, Global Namespace, Unified Namespace, NAS Virtualization):  A few years ago we were talking about Grid Storage – a bunch of storage nodes acting as one, single system.  File virtualization is shaping up to be the technology that will take us there.  In 2005 & 2006 we saw EMC buy Rainfinity, NetApp buy Spinnaker and Brocade buy NuView.  In recent years we we have seen EMC announce Rainfinity, NetApp announce a rough start to Data OnTap GX (based on Spinnaker technology) and file virtualization start ups Ibrix, Isilon and LeftHand Networks continue to grow.  Expect this market to heat up, and consolidate as start ups are either bought, or simply run out of runway.    
  • Virtual Tape: A disk system that emulates a tape library.  Virtual Tape has been in the mainframe market for years, and Sun is the market leader here.  Virtual Tape Libraries (VTL) in the open systems market are relatively new – disk prices have eroded to a point where backup administrators can now take advantage of disk’s access speeds without replacing their exiting backup infrastructure.  IDC pegs this market as small, but growing at 16.2% (2006-2011 CAGR). 
  • Virtual Disk (aka Storage Virtualization):  A disk system that aggregates 3rd-party disk and offers disk partitioning.  A necessity for anyone that needs to consolidate their environment and/or make data migration between independent systems easier.  Virtual Disk Systems include IBM’s San Volume Controller (SVC), FalconStor’s IPStor and Sun’s award-winning StorageTek 9000 arrays.          

3. Thin Provisioning:  Better system utilization is the name of the game.  Most admins know that the utilization rates on their disk systems are not where they need to be.  Thin Provisioning allows admins to allocate or provision space to specific applications, making full use of their system’s capacity.  3PAR spearheaded open systems Thin Provisioning and NetApp offers it as a part of Data OnTap.  Sun announced Thin Provisioning on its StorageTek 9990V system in May – meaning consumers can have the world’s fastest enterprise array, Virtual Disk AND Thin Provisioning all on one platform. 

Pretty cool…  

 
4. Data Deduplication (aka De-dup, Single-instance storage):  In a world where there is more data coming into a company than can possibly be managed – data compression ratios ranging from 10:1 to 50:1 sound pretty darn nice (See how De-dup works here).  Data Domain, Diligent, FalconStor and other upstarts get credit for bringing this new technology to market and larger vendors are quickly following suite.  De-dup is still emerging, can have performance issues and does not work perfectly for every application – but economics dictate its worth consumers investigating where it can work for them.

There are two emerging de-dup architectures:  “Inline” – where the de-dup magic happens in real-time, as data comes into the system, as found in Diligent's ProtecTIER appliance.  Or “Post-Processing” where the magic happens as a secondary process after the backup job, as found in FalconStor’s Single Instance Repository (SIR) software.  Both have their pros and cons, and deciding which approach to use depends on balancing your performance vs. complexity needs.  For the record, Sun sells both….         

5. Data Encryption:  One need only read the horror stories of lost tape and disk drives to see the importance of data encryption.  While it has been around for a while – the need has never been greater.  Growing storage capacity has caused another problem – one can store a lot of personnel records on a single cartridge or drive.  In an age of identify theft, losing one storage device can put a company out of business.  The new trend is not how to encrypt, but where to encrypt  On the host server?  On an appliance in the network?  In the storage device itself?  Decru (since bought by NetApp) benefited from this trend with their encryption appliance.

I once worked with a brilliant engineer whose favorite saying was “never put a product where a feature should be.  I’d say this was Sun’s philosophy when we delivered the Sun StorageTek T10000 tape drive.  Put simply, Sun put an encryption chip next to the compression chip on the drive – so data is encrypted as it is fed onto the tape.  Simple and affordable – no extra appliance needed.  Sun also offers the StorageTek Crypto Key Management Station to centrally authorize, secure and manage encryption keys.

6. Eco Storage (aka Green Storage/IT):  I freely admit that when I was first approached with “Green Storage” I was a skeptic.  I would have also never guessed Al Gore would win the Nobel Peace Prize!  But Eco also stands for Economics.  If you save power and footprint, and the world while you are at it – who can argue with that?  But the challenge for storage customers will be sorting through the vendors who make REAL Eco investments vs. the ones that just add “Eco” or “Green” to their marketing collateral.  Sun’s in the “real” category, investing heavily in Eco IT.  Sun’s Eco efforts can be seen here...        

7. Object Archive (aka CAS, Application Aware Archive): The dizzying array of regulations, compliance requirements and influx of data have made the archive market one of the fastest growing markets in IT and storage.  And customers must continually evaluate which archive approach will work best for them.  The trend here is to “build a better mousetrap archive.”  The challenge is this, an archive system must:

  • Store a lot of data affordablyHoneycomb
  • Have WORM  functionality so documents show up un-altered in an audit or court of law
  • Be easily and quickly accessible
  • For more than 100 years…
Easier said than done.  While tape continues to be the old staple in archive, and faster and denser tape systems are coming out each year – a lot of new innovations are happening on the disk and software side.  Disk-based object-level archive systems include EMC’s Centera, HP’s RISS and Sun’s StorageTek 5800 “Honeycomb” pictured at right.  Sun’s ST5800 system uses advanced meta data features and processors close to each storage cluster for fast access to deep archives – great for digital library, Web 2.0 and HPC applications and environments. 

But do keep in mind for deep archive; Sun’s StorageTek SL8500 Tape Library is tough to beat – just one library's max raw capacity is 56 Petabytes, and data sitting on tape consumes 0 kilowatts and generates 0 CO2 (see above trend #6)   

8. New Interfaces, Protocols & Configurations: There is a lot of change happening in storage systems and how they are configured.  The three primary ways storage is attached is Direct Attached Storage (DAS), Storage Area Networks (SAN) and Network Attached Storage (NAS).  A disk system can also be configured in a couple different ways.  RAID configurations stripe data across multiple drives and impact a system’s reliability and performance.  JBOD (Just a Bunch of Disks) is more affordable because it does not require a disk controller, but provides no data redundancy.  New interfaces and protocols will impact each of these markets significantly. 

  • SAS:  In DAS, Serial Attached SCSI (SAS) is a newer serial communication protocol that will make data transfer speeds much faster and at a lower cost.  SAS drives have replaced parallel SCSI for internal storage, and SAS HBA external host interfaces are just starting to ship in volume.  SAS disk arrays started shipping this year and SAS JBOD are starting to ship.  To complement the SAS HBA's and SAS arrays, there are new network components called expanders which are basically switches.  Generally SAS disk will be cheaper than Fibre Channel (FC) with similar performance and capacity.  (Sun has SAS host I/F, disk I/F and SAS disk drives)
  • iSCSI:  In the SAN market, the iSCSI protocol is making a significant impact by taking the cost out of expensive Fibre Channel SANs.  iSCSI allows users to send SCSI commands over their existing IP networks. 
  • File Virtualization:  In the NAS market, file virtualization is making NAS farms much more scalable and manageable (See trend #2)
  • Clustered RAID:  A new innovation to watch in the RAID market is horizontally scalable RAID or clustered RAID for large applications.  Digi-Data is a small storage company pushing this innovation. 
  • JBOD:  Keep your eye on Sun in the JBOD market.  JBOD is more affordable than RAID, but does not have RAID’s redundancy and reliability features.  But what if you had an infinitely scalable file system with data integrity and RAID features running on JBOD, say, something like ZFS?
  • Unified Storage:  Another trend in this area are unified or hybrid devices – storage systems that can handle multiple protocols and interfaces, including iSCSI, Fibre Channel, and NAS – all in one unit.  Makes sense to users with a dizzying array of choices in the market.             
  • 10GbE:  Lastly, 10 Gigabit Ethernet or 10GbE is the latest and fastest of the Ethernet standards that will re-shape data center networking – offering a fast, common and affordable network technology for IT and Web 2.0 applications from supercomputing to networked storage.
  •  

9. Solid State Disk (aka SSD, Flash, Memory):  We see the perfect storm happening around SSD.  SSD has no moving parts, making it one of the most reliable and fastest storage mediums in the world.  But it is incredibly expensive when you compare $/GB vs. Disk and Tape storage.  However, SSD has made a name for itself in the consumer market (digital cameras, phones, iPods, etc.)  The price pressure in the consumer arena is enormous – and this had been rapidly eroding the price of SSD.  As SSD prices drop, expect to see hybrid disk drives and storage systems that leverage more SSD for greater speed and IOPS.        

 

10.  Storage as a Service: 

Storage as a service offered over the Internet has been talked about for years – but poor performance and implementations have cooled this trend.  However, Amazon has given Storage as a Service a power boost with its Amazon Simple Storage Service (Amazon S3).  By leveraging Amazon’s existing e-commerce and storage infrastructure, the company is offering customers storage capacity for $0.15 per GB-Month of storage used – possibly the cheapest $/GB on the planet.  And while this may have more play in the consumer market, Amazon could re-invigorate the storage as a service trend.  Also keep an eye on Sun’s Internet Service offerings over Network.com...             

--- Updates ---

Tuesday Jun 19, 2007

Lacrosse and the future of storage...

I like storage, strategy and sports - so I really liked Scott Tracy's "Telegraph" Blog.  I commented on it, but will elaborate further here.  He talks about Storage running on general-purpose Solaris, and shows a nice OpenSolaris Storage Platform diagram. I'll serve up my own sports analogy (and re-live the glory days while I am at it...)

I played Football and Lacrosse in high school and college  (yes, I was a UPS Logger) - but I'll stick with the Lacrosse theme. 

In high school my team competed in Florida's Cocoa Expo Lacrosse Tournament. We were quickly mocked by all the teams there - they represented the best of east coast lacrosse and we were from cow-town Colorado. We were at first intimidated by their "moves" - quick sticks, behind the back passes, etc. But our coach told us to stick to the basics - two hands on the stick, straight forward passes, etc.

What happened?

We won the tournament that year - first place, #1 (and went on to win State by the way). So what's the storage tie in?

There is an appliance or point-product for everything in storage today - virtualization, data movers, back up, CDP, encryption, etc.  A lot of fancy stuff that can solve individual problems, but adds to the overall complexity of IT storage. I like the OpenSolaris Storage Platform approach because it sticks to the basics - data volume management, data services, file systems - open and residing at the heart of any system or solution - its OS.

That's a winning strategy...


Today's Page Hits: 144