Taylor's Take on Sun Storage : Weblog

Taylor's Take on Sun Storage

My storage team and I focus on three of the most important aspects in any industry: customers, competitors and market trends. There is insight to gain and share in this role, so here is our take on Sun and Storage - Taylor Allis


amazon disk emc honeycomb hp hpc ibm netapp open opensolaris openstorage saas solaris storage storagetek sun sunstorage tape thumper virtualization vtl web2.0 x4500 zfs
Tuesday Jul 01, 2008

NetApp's David Hitz Declaration

I have made it a point to not comment or offer opinion on the ongoing NetApp litigation for obvious reasons - I'll leave that task up to Sun's lawyers and our Chief General Counsel.  

Fortunately, the court recently unsealed a document in the ongoing litigation.  So for anyone interested in details; a copy of a declaration from NetApp's co-founder and EVP, Dave Hitz, can be viewed below: 

Below are some quotes taken directly from the document:     

On patents being discussed that revolve around Sun ZFS and NetApp's WAFL technology: 

"Sun's ZFS technology appears to be a conscious reimplementation of NetApp's innovative WAFL filesystem, as admitted by the creators of ZFS: 'The fie system that has come closest to our design principles, other than ZFS itself, is WAFL...the first commercial file system to use the copy-on-write tree of blocks approach to file system consistency.'" 

On Sun open sourcing ZFS software: 

"Irreparable harm to NetApp through Sun's open sourcing of ZFS: 
Sun has open-sourced ZFS and thereby given away for free NetApp's patented technology to anyone that wants to download a copy. That means Sun has created infringing computer code and made it easy for software users and software companies everywhere to infringe, instead of having users compensate NetApp for its technology through normal product purchases. This is not much different from the problems caused when an entity builds a business by distributing for free infringing copies of music. In both cases, there are practical problems in any attempt to recover the infringing copies or to enforce rights against everyone that has downloaded copies of the infringing software. One difference is that adoption of ZFS requires time because it is a software program and not just a song. The next two or three years are very significant for the proliferation of ZFS and it is vital to shut down Sun's distribution promptly."

Mike Dillon updates us that Sun submitted 6 reexamination requests on the patents asserted by NetApp to the US Patent Office (PTO); asking that it reconsider whether those patents should have ever been issued.  According to Mr. Dillon, the PTO has already issued a first action rejecting all the claims of one patent - no 6,857,001.  Sun has also asserted 22 patents in response against NetApp's product line.  

On Sun freely licensing ZFS to other companies:

"Sun is not the only company that could potentially hurt NetApp's market position. Because Sun freely licenses ZFS to other companies, it is possible that the entire competitive profile of the data storage market might change within the next 3-5 years.

REDACTED

Moreover, because ZFS is open-sourced, it lowers the barier to entry for startup companies to bring products incorporating ZFS technology to market and start competing with NetApp. Indeed, because Sun is distributing ZFS at no cost, it dramatically lowers the product development costs for any company, not just startups."

I encourage anyone to read the document in its full context.  For any legal updates, be sure to bookmark Mike's Blog.

Thursday Jun 26, 2008

Open Storage: Early Customers

Now let's talk about what really matters when it comes down to Open Storage - customers.  Now, forgive me for repeating myself because I mentioned some of these customer testimonials in my first Open Storage blog.  But it just so happens that the ones I blogged about ended up in the Open Storage Adoption White Paper.    So, I'll talk about them (again), but also throw some new ones in... 

Customer Blog Comments:  Before I do, I want to point to a couple of blog comments.  The first one was submitted to Chuck Hollis' EMC blog post Do-It-Yourself Storage.  After Mr. Hollis finished criticizing Open Storage, an experienced storage and IT customer posted this comment in response to his critique:  

"I think you are missing the point. I did actually a price comparison. Building your storage solution (in my case several hundred TBs) from cheap disks using x86 servers + ZFS + Open Solaris + Solaris Cluster + ZFS, where all software is not only open sourced it is also for free, *does* make a huge difference. We actually started building our solution on EMC Symmetrix (great box) and EMC Celerra years ago and endup on really cheap storage + ZFS as a replacement and a way to move forward. Additionally all features like snapshots, cloning, end-to-end checksuming, remote replication, built-in compression, built-in cryptography, NFS, CIFS, iSCSI, ... are also for free. Better - they work exactly the same regardles what cheap storage or server we put underneath.

What ZFS brings to the market is the open sourced and free Google like approach to storage - how to cheaply build reliable storage from small to large scale installations.

Sure, especially for SMB market, what is needed is an easy GUI interface built on-top of Solaris + ZFS. I'm sury you will see one sooner or later."

I like the "Google like approach to storage" comment (and the fact that a disk + ZFS system replaced an EMC DMX and Celerra system for this particular customer!)  I also had a customer post these comments to my blog -  they just deployed a 2TB OpenSolaris ZFS + COMSTAR storage system for a VMWare Cluster with off the shelf components saving €2,000 in the process.  Cool stuff.   

Open Storage Customers:  Not surprisingly, a lot of these early adopters use open source to compete in their respective businesses.  You may also expect that early Open Storage adopters would come from Sun's Solaris install base - while true, many new customers do NOT come from Sun's current base.  In fact, a lot are Linux users.  They have chosen Sun because of...Storage. (Open Storage to be precise)


DigiTar
DigiTar provides messaging security and processing services over the Internet (antivirus, antispam, antiphishing, firewall, and archiving).  DigiTar is using Open Storage to improve the performance and efficiency of their database servers. They are using Sun's X4500 storage servers and ZFS to automate database storage administration - with ZFS they have reduced the identification and fixing of database corruption by days and/or weeks.   They are also an active member of Sun's OpenSolaris community and use the OpenSolaris community and SunSpectrum for tech support.

What I personally love about DigiTar's story is that they were (still are) a Linux shop.  So what made them a Sun OpenSolaris customer???  Storage!

Why? Read their CEO/CTO's blog:  Democratizing Storage.  He basically states that OpenSolaris was brought into the company because it made for a superior storage platform.  The clencher (and essence of Open Storage) for me is when he compares deploying Open Storage vs. a traditional storage architecture: 

"To replicate the level of redundancy we get with two X4500s, we’d have to install two completely separate storage arrays…not to mention also buy two very large beefy servers to run the databases. By using X4500s, we get the same reliability and redundancy for about 85% less cost. That kind of savings means we can deploy 6.8x more storage for the same price footprint and do all sorts of cool things like:

  • Create multiple data warehouses for data mining spam and mal-ware trends.
  • Develop and deploy new service features whenever we want without considering storage costs.
  • Be cost competitive with competitors 10x our size."

Want do do more with less?  DigiTar is with Open Storage...


Nexenta
Nexenta is unique in that they are an Open Storage customer - but they are using Open Storage to build and sell storage products of their own.  They bill their product as “Enterprise-class data storage for everyone!” Nexenta has built its NexentaOS and NexentaStor software appliance from OpenSolaris and ZFS; and they can deploy it over the Sun Fire X4500 as well as other HW.

Like DigiTar, Nexenta was founded by Linux gurus.  In fact, the Nexenta team developed the iSCSI stack that was adopted by the Linux community. So what platform did these experts in storage and open-source software choose to build a new storage offering?   OpenSolaris & ZFS - due to its advanced storage functionality and long history in enterprise environments.  

So now the company offers NexentaStor - a software-based NAS and iSCSI solution with unlimited incremental backups, snapshot mirroring and the inherent virtualization, performance, thin provisioning and ease of use benefits of ZFS.


Sapotek Inc.
Sapotek Inc. is a Software-as-a-Service (SaaS) provider that offers online desktop services to (currently) 200,000 users worldwide.  This SaaS provider had a classic Web 2.0 storage problem - how to massively scale, efficiently and affordably? 

Sapotek was running Red Hat Enterprise Linux on Dell servers and had been maxing out at five concurrent threads per server. So they migrated to Sun Fire X4200 servers and the Sun Fire X4500 storage server running ZFS.  They replaced four (4) Dell/EMC storage systems with one (1) Sun Fire X4500.  Sapotek also used Sun's ZFS snapshot feature, and reported that it had reduced backup and recovery times by 99%!  (From hours/days to minutes).  Here is what their cheif tech officer,  Oscar Mondragon, said about Sun Open Storage: 

"The ZFS file system feature of the Solaris 10 OS is a marvel. It creates a common storage pool where all storage performs as fast as if it were local. Our administrators can grow, add, or remove storage on the fly in a single step. Just two people administer 24 TB."


Above is just a sampling of early customers that have deployed components of Sun's hardware and software Open Storage portfolio.  Read the Open Storage Adoption White Paper for more case studies or my previous blog post.  Additionally, take a look at these customer testimonials (Some have bought commercial versions of Sun's storage offerings and some have deployed open source offerings)

  • Dow Corning Corporation: Increased storage capacity by 50% with ZFS
  • Joyent: Sun Fire X4500 gave them a hosted storage business at $1/GB as opposed to $6-$7/GB for EMC
  • Nomura Bank: Used ZFS to create a common storage pool, with no partitions to manage. Admins can now provision or grow storage, and add or remove a file system with a single command.  ZFS also safeguards data at the bank by running 64-bit checksums
  • OmniTI: During a catastrophic accident when 1.8 TBs were lost, they restored their entire database in just seconds with ZFS
  • University of Calgary: Deployed SAM-QFS software for 229TBs of storage, moves data between Sun StorageTek Disk and Tape via policy
  • University of Oxford: Their digital library project will manage 9 million library items and support an average of 9,000 library resource requests a week
  • IN2P3/CNRS: Optimized datacenter space and reduced energy consumption with X4500

Thursday Jun 19, 2008

My Switch from PC to Mac....

Well, I'm officially blogging from my new MacBook Pro after my old Dell PC fizzled out on me.... 

What a difference a decade makes.  I started out on Apple as a Web developer.  It was my platform of choice and I never thought I'd go to a PC - I was loyal to the core.  I remember the good 'ol days of making sure my Config PPP screen showed two connected smiley faces; and the days of posting docs to my websites with my Gopher or Fetch (FTP) applications.  The days of Mosaic...and then Netscape (I still remember how excited I was when I first saw Netscape).  But then I left Web development to join an HP distributor and it was PCs from then on...

Why the switch?  

Despite the great ads; I got into multimedia and Web 2.0 for lack of a better term.  Even though I'm a "business/marketing guy"; I get to re-live the good 'ol days through my blog, Wikis and Podcasts.  I also develop home movies and videos.  So the switch started at home - I bought a Mac for our home computer several months ago b/c of all the multimedia features it had. 

I made the switch to Mac for business this week after leaving Mac over 10 years ago.  I use multimedia and Web features in business more now than I ever have before.  The design and innovation coming out of Mac has been incredible lately - so the geek in me had some influence as well.  Also, so far, all the applications I had on my PC work well (sometimes better) on this Mac (but of course I just got it).  

But a large driver was also working at Sun - while my new Mac is pretty foreign to me at this point in time; I did see something familiar in its disk utility - ZFS.  It's pretty cool to see your company's enterprise-class file system on your consumer laptop.  A testament to the versatility to ZFS and open software...  

Thursday May 15, 2008

What is Open Storage? White Paper

Our Open Storage announcement drummed up interest in the industry - and for good reason...

With that said - there is still a lot more to discuss about Open Storage.   We will have announcements upcoming, and we will also be publishing some White Papers on the subject. 

This first one is appropriately titled, What is Open Storage?  And it can be downloaded here:

I'll post the CliffsNotes below.  I would also like to thank Bruce Norikane, our Sr. Analyst, for his help in writing these papers - his intellect on storage infrastructures, past and present, is astounding.  I'd like to also thank or intelligence manager, Chris Ilg, for his forecasting work; as well as Graham Lovell, Jason Schaffer and our army of developers/engineers for their direction and edits (and leading the way to Open Storage...)

What is Open Storage?   (The short version)
Open Storage refers to open systems storage products based on industry-standard hardware and open source software.  Also, in an open storage architecture, customers can select the best hardware and software components for their individual needs.  By contrast, almost all modern disk arrays and NAS storage offerings are closed systems.

To see the evolution of disk architectures and benefits to Open Storage, see the blog post, "Sun is on to something"

Open Storage Versus Traditional Storage
The following chart compares vendor products, first ship dates, and storage system components. Red items are system components that are closed, proprietary, and available through a single vendor. Yellow items are built from industry-standard components yet sold as part of a closed, proprietary system. Green items are truly open components. They are industry-standard or open-source software, giving customers choice over how their systems are built.


Of significance, since 1992, storage systems have started to utilize open-source, commodity, or industry-standard technologies.  FreeBSD and Linux are used in several systems from vendors such as IBM, EMC, NetApp, and HP.  However, these open technologies are deployed in a closed-system approach.

Storage software is the last major storage infrastructure software to become open source, and Sun’s OpenSolaris Storage community has been leading the change.

Sun Open Storage Value Proposition 
The market shift to open storage systems and Sun’s open storage approach offer four distinct advantages:

  1. A global development community working to make storage more innovative, functional and affordable
  2. Breakthrough economics that can save storage users up to 90% over closed, proprietary storage
  3. Enterprise-class quality storage systems that are reliable and offer some of the highest data integrity in the industry
  4. The ability to easily and dynamically scale IT infrastructures to meet changing business needs

Sun Open Storage Offerings.

  • Sun Open Storage Servers: The Sun Fire X4500 server leverages industry-standard hardware and software. It is a Dual-Core AMD Opteron processor-based server with 48 hot-swappable SATA drives in a single 4 U chassis that can achieve 48 TB of raw capacity with 1 TB SATA drives.
  • Sun Open Storage Archive: The Sun StorageTek 5800 system also leverages industry standard components. It is the first integrated, fixed content archiving system built using open-source software.  In a recent InfoWorld product review, it scored perfect 10s in reliability and scalability.
  • Sun Open Storage HPC: The Sun Constellation System builds on cost-effective, off-the-shelf components
    and state-of-the-art technologies to deliver an open, petascale architecture capable of up to 1.7 petaFLOPS of
    computing power.
    • Lustre file system: Lustre is Sun’s open-source shared disk file system that is generally used for largescale
      cluster computing. The Lustre file system is currently used in 15% of the top 500 supercomputers in the world, and six of the top 10 supercomputers.
  • Sun Open Storage Software: OpenSolaris is the cornerstone of Sun Open Storage offerings and provides a solid foundation as an open storage platform.  It includes:
    • Solaris ZFS: ZFS can address 256 quadrillion zettabytes of storage and handle a maximum
      file size of 16 exabytes.
    • Solaris DTrace: DTrace enables users to ask arbitrary diagnostic questions of the storage subsystem
    • Solaris Fault Management Architecture: Solaris Fault Management Architecture provides automatic monitoring and diagnosis of I/O subsystems and hardware faults
    • Sun StorageTek Availability Suite: Sun AVS delivers open-source remote-mirror-copy and point-in-time-copy applications as well as a collection of supporting software and utilities.
Stay tuned, there is more to come...

Tuesday Apr 29, 2008

Sun is on to something - Open Storage

I freely admit, when Sun announced its open source storage community a year go I was a skeptic.  Sure, open source has its play in software and servers - but storage?  

Well, after a year of watching Sun's open storage investments, industry-standard hardware used in enterprise storage and working within Sun on today's announcement - Sun Extends World's First Open Storage Platform - I think Sun is on to something...

Also read all about Sun Open Storage here.... 

I'll make a brief case for open storage and Sun's leadership in it here.   We have also developed a series of open storage White Papers that give more detail on the below info - I'll post them here.  

What is Open Storage? 

Here is a simple definition:  Open storage is the combination of open source software with industry-standard hardware to create enterprise-class storage systems

Open source software like Linux or OpenSolaris OS.  Open source applications like MySQL database software.  And Sun has been one of the first companies to break the barrier with higher-level open source storage applications which include:

Industry-standard hardware is typically available through multiple vendors and is very price-competitive.  Examples include x86 servers and standard FC/SATA/SAS disk drives.  One could also include LTO tape because it is an industry-standard tape technology - but I'll primarily focus on disk systems, as this is the market that will be most impacted by open storage. 

In an open storage architecture, the customer selects the best hardware and software for the job.  In contrast, almost all of today's disk arrays and NAS appliances are closed - customers are locked into using the vendor's disk drives, controllers and proprietary software. 

The irony being that a lot of closed systems are built from open source software and industry-standard hardware - helping vendor margins but not customer budgets
    

The Evolution of disk Architectures  

Our brilliant disk analyst, Bruce Norikane, also points out that industry disk systems have been evolving to more open architectures over time and with each new market introduction.  A similar trend has also happened in the server market.  Consider the graphic below:   

 

Early disk systems were custom, proprietary engineering projects starting with IBM's SLED (Single Large Expensive Disk)  in 1956 where everything was custom.  Then in the 1980's a high-volume disk market emerged thanks to PCs and servers; and in the 1990's Enterprise RAID was adopted.  Enterprise RAID incorporated a custom disk controller and these new market drives.  Modular storage then hit the market, consisting of a separate controller and disk enclosures that fit in a standard rack - more flexible and affordable.  Most recently we have seen the RAIN (Redundant Array of Inexpensive Nodes) architecture emerge - distributed storage based on server technologies offering better scalability at a better price point.  RAIN architectures are largely based on industry-standard servers, operating systems and networks.  However, while RAIN systems leverage open components, they tend to be build as closed systems with locked-in components from traditional vendors.

The next logical step is open storage - industry-standard hardware and open source software that drive down storage economics and spur greater innovation.  Again, this storage systems evolution is not unlike what happened in the server world - where servers were large, proprietary and expensive years ago. Smaller, industry-standard servers and open source software changed the economics in the server market - and they are doing the same thing to storage.

Why Open Storage?   

Four reasons:

1. Enterprise-class storage:  Systems that offer as much or more quality, reliability and data integrity as closed systems.  Sounds like a stretch?  Just see the InfoWorld review of Sun's x86, SATA, open source software-based archive solution.  It scored perfect 10s in reliability and scalability.  Let's also not forget that ZFS offers 19 9s of data integrity with predictive, self-healing features.          

2. An Open Storage Software Community:   This is important if you are a developer, a company that has developers or a company that is planning on hiring developers to differentiate through IT.  When we launched OpenSolaris Storage last year we had only a handful of open source projects - we now have over 30.  The OpenSolaris community has more than 96,000 registered members in all.  Why is this important?  Customers don't have to wait on a vendor for the features they need - they can find new innovations in the community or develop features themselves.  Innovation is not held back by vendor objectives or limited R&D budgets...  

3. Breakthrough Economics:  Probably the most compelling argument for open storage and why open storage is needed today.   The best way to understand how open storage can impact storage economics is through an overly-simplistic diagram of a closed system:

 

Now consider an open storage architecture:

 

In a nutshell, storage applications are free from licensing costs and open to developers.  Open storage users can choose the platform their IT staff is most familiar with.   An industry-standard server with ZFS (which again includes RAID, data management and data integrity features) can take the place on an expensive controller.   And affordable, market-priced disks can be deployed under the system - even fast and cheap JBOD if you leverage ZFS.

To see the real-world impact we compared some closed systems vs. open systems using Ideas International pricing:



In full disclosure, these are US list prices.  We did configure every system to be as close in capacity as possible using affordable SATA drives in most every configuration.  And finally, certain applications, features and environments simply must run higher-end arrays today - I am not implying that everyone throw out their closed storage and go with 100% open.  But at this economic price difference - users will be compelled to determine which applications and which data should migrate to more open storage - and we presume the data center mix of open vs. closed storage will change over time.         

You can also read about the open storage impact in the VTL space here...

4. Dynamic Scalability: Lastly, the ability to dynamically and efficiently scale to meet today's huge data demand has become business critical, especially with emerging Web 2.0 applications.  Sun sells systems that scale from less than 10TB to greater than 100PBs.   And OpenSolaris ZFS is a 128-bit file system that provides 16 billion, billion times (yes you read that right) the capacity of 32-bit or even 64-bit file systems. 

Sun Open Storage 

Sun offers every component of open storage:  A tried-and-tested enterprise platform in OpenSolaris, the leading HPC file system Lustre, and the open source storage applications mentioned above.  Sun also offers a complete portfolio of innovative and efficient industry-standard servers and storage. 

Sun has, and will announce, storage systems built on an open storage platform as well - Sun Fire X4500 and Sun StorageTek 5800 are Sun's first products built on a truly open storage platform. 

Sun also offers open storage services and resources through its community :

Sun Open Storage Customers

More compelling than anything I can write is what our customers say about Sun and open storage:

DigiTar
DigiTar provides advanced messaging security and processing services over the Internet.  They are using the opensolaris, Solaris ZFS and Sun Fire X4500's.  Jason Williams is DigiTar's COO/CTO and highlights his expereince with Sun open storage in his blogpost Democratizing Storage.  Even though the DigiTar team is self-described as “Linux zealots,” OpenSolaris was brought in because it made a superior storage platform.  Some of my favorite quotes are below:

“That’s the really amazing thing about OpenSolaris as a storage platform. It has all of the features of an expensive array and because it allows you to build reliable storage out of commodity components, you can build the storage architecture you need instead of being held hostage by the one you can afford.”
“When you’ve got rock-solid iSCSI, NFS, and I/O multipathing implementations, as well as a file system (ZFS) that loves cheap disks…and none of it requires licensing…you can suddenly do anything.  Need to handle 3600 non-cached IOPs for under $60K? No problem. Have an existing array but can’t justify $10K for snapshotting? No problem. How ‘bout serving line-rate iSCSI with commodity storage and CPUs? No problemo.” 

“By using X4500s, we get the same reliability and redundancy for about 85% less cost. That kind of savings means we can deploy 6.8x more storage for the same price footprint and do all sorts of cool things..."

Nexenta
Nexenta has built its NexentaOS and NexentaStor software appliance on Sun open storage products – OpenSolaris and ZFS.  This is significant, as the Nexenta team developed an iSCSI stack that was adopted by the Linux community.  Nexenta's team choose OpenSolaris for their storage platform to actually build a new NAS appliance.  Nexenta's NexentaStor offering is a software-based NAS and iSCSI solution - read about it here.  There is also an excellent blog on ZFS and Nexenta here.   

TACC 
Open Storage also has a large play in HPC - consider one of the world's largest supercomputer built from Sun's open storage, servers and traditional storage offerings.  TACC's Ranger system will be used in computational science & technology research.  Ranger runs 3,936 nodes and 62,976 processing cores; has 23TB of memory and 504TFlops at peak performance; and uses 1.73PB of shared disk and 31.4TB of local disk.  Ranger uses Lustre file system running across 72 Sun Fire X4500 servers. For long-term data retention and archive, Ranger runs Sun StorageTek SAM software over six metadata servers - and deploys five Sun StorageTek SL8500 libraries with 48 StorageTek T10000 tape drives.  Ranger will scale to over 3.1PB of online storage and 200PB of near-line storage. 

From a simple NAS appliance to one of the world's largest supercomputers - open storage scales!

You can read more user case studies below: 

  • The University of Oxford is storing 19th century works with the ST5800
  • Gracenote uses Sun Fire X4500's for its mobile music services
  • Web 2.0 SaaS provider Sapotek says, "The ZFS file system feature of the Solaris 10 OS is a marvel. It creates a common storage pool where all storage performs as fast as if it were local. Our administrators can grow, add or remove storage on the fly in a single step. Just 2 people administer 24 terabytes.”

What about Sun's other Storage offerings? 

I invariably get this question when we highlight one architecture or approach.  So, to be clear - Sun sells closed systems too...and we sell a lot of them.  We now sell both depending on customer needs.  But we see the need for open storage - and we are investing in it while other vendors are not.  We are also investing in our traditional storage products - our customers deploy a mix of storage architectures depending on their needs - so Sun sells both.  Lastly, you can't claim breakthrough economics without leveraging tape in your portfolio.  If you want to hear about Sun's tape commitments, read about my trip down to Imation

But as far as open storage goes, I think Sun is on to something...

---- Updates ---- 

Other Open Storage Blogs: 

Friday Nov 30, 2007

Top 10 Storage Technology Trends

It’s getting to be the end of the year – and Sun, like every other vendor I assume, is looking at the new technology trends that have (or will) impact our industry.

My team and I were asked to evaluate some – to see where to prioritize future investments, evaluate current and future competitive threats, etc.  Ensuring we stay ahead of the technology game.  So below is our very own top 10 list of emerging technologies.  (As well as some color to what Sun is doing here) 

I’m sure we’re missing some and comments and claims are our own, they don’t represent any commitment or position from Sun Corporate (For official news on Sun go here…)    

 Top 10 Storage Technology Trends:  

1. Open Storage Platform (aka general purpose storage, open source storage):  Trend #1 is a term we coined, so it may not sound familiar.  It is a combination of market trends as well as a direction Sun is taking with its newer products.  The concept of a common platform is not new – several vendors have tried to build one platform that can run multiple storage applications, saving users time and money.  “Open” is a relatively new concept for storage, but not for software or servers.  There are generally three components that make up an Open Storage Platform:

  • General Purpose Components: General-purpose servers, processors, storage and operating systems are now being deployed in enterprise-class storage devices. Previously, storage vendors had built proprietary operating systems, ASIC chips and other custom-built components. As commodity chips, components and software have matured, storage vendors are now using general purpose components in their systems. The cost savings are significant - however, most vendors' prices remain the same (contributing to vendor margin). Sun's philosophy is to pass these savings to the consumer. Enterprise-class systems based on general purpose components give customers higher-value systems at a fraction of the cost (see graphic below). They are also much more flexible, as they can be re-purposed for other uses.
  • OS/File System Storage Services: Traditional appliances charge customers extra in software licensing costs for data management services like administration, replication or volume management.  What if this functionality came already embedded in the storage system itself?  Sun's newest file system, ZFS, has started to incorporate these services at the File System level. ZFS deploys point-in-time-copy, volume management, administration and data integrity features like Copy-on-write and RAID. Storage services deployed at the OS-level have several benefits, including efficiency, performance, reliability and affordability (no more licenses for extra software needed).  And before we think this is a new concept, mainframe has been doing it for years. The new concept is doing this in the open systems space – and Sun Solaris ZFS is leading the charge.      
  • Open Source Storage Software: The ability to download software, test it and add features to it is critical to developers building new applications – we’ve seen this in the OS market.  But what about storage?   This has been an investment area of Sun’s - Sun actually offers one of the most complete open source storage software stacks (from protocols to drivers to data management software).  Developers can build their own storage solutions and sell them by leveraging Sun’s open source software (see the Nexenta Storage Appliance for a perfect example).  Open source has an added benefit to customers with in-house development resources - customers can deploy new software features by searching for it in the open source community or even developing it themselves (why wait for vendor roadmaps?). This is not possible with a proprietary appliance. 

The simplest way to show the impact the Open Storage Platform concept will have on the storage industry is a basic economic comparison.  We used IDC's Pricing Database to compare over 50 actual purchase orders of enterprise disk (Sun ST9900, EMC DMC3), Midrange Disk (Sun ST6140, EMC CX3) and a Sun product built on our Open Storage Platform (the SunFire X4500):

  • Enterprise Disk = $18.84/GB
  • Midrange Disk = $10.39/GB
  • Open Storage Platform = $1.50/GB
    (7x less than Midrange and 13x less than Enterprise)
NOTE:  There are certain applications and features that must run on higher-end disk.  But at a $1.50, customers will be compelled to find which applications should be running on the SunFire X4500. What is even more remarkable, the SunFire X4500 includes the server, OS, data services, storage and networking components - all for $1.50/GB! 

Game-changing economics…

2.  Virtualization:  Where to begin?  The benefits are obvious – massive savings through optimization, consolidation and optimization.  The best definition of virtualization I have seen came from the 451 Group’s Virtualization Report which defined Virtualization as, “A software abstraction layer that permits aggregation, emulation or partitioning.”  Let’s look at the different types of virtualization out there today

  • Server Virtualization:  The ability to host Windows, Linux and Solaris operating systems on one platform has literally taken the market by storm.  A huge boon to IT managers and developers alike.  VMWare led the charge with some start-ups and open source initiatives in tow.  On October 5, 2007 Sun entered the race with Sun xVM.  (And if virtualization has hit the systems world, expect to see it hit the desktop market soon…)      
  • File Virtualization (a.k.a Clustering, Global Namespace, Unified Namespace, NAS Virtualization):  A few years ago we were talking about Grid Storage – a bunch of storage nodes acting as one, single system.  File virtualization is shaping up to be the technology that will take us there.  In 2005 & 2006 we saw EMC buy Rainfinity, NetApp buy Spinnaker and Brocade buy NuView.  In recent years we we have seen EMC announce Rainfinity, NetApp announce a rough start to Data OnTap GX (based on Spinnaker technology) and file virtualization start ups Ibrix, Isilon and LeftHand Networks continue to grow.  Expect this market to heat up, and consolidate as start ups are either bought, or simply run out of runway.    
  • Virtual Tape: A disk system that emulates a tape library.  Virtual Tape has been in the mainframe market for years, and Sun is the market leader here.  Virtual Tape Libraries (VTL) in the open systems market are relatively new – disk prices have eroded to a point where backup administrators can now take advantage of disk’s access speeds without replacing their exiting backup infrastructure.  IDC pegs this market as small, but growing at 16.2% (2006-2011 CAGR). 
  • Virtual Disk (aka Storage Virtualization):  A disk system that aggregates 3rd-party disk and offers disk partitioning.  A necessity for anyone that needs to consolidate their environment and/or make data migration between independent systems easier.  Virtual Disk Systems include IBM’s San Volume Controller (SVC), FalconStor’s IPStor and Sun’s award-winning StorageTek 9000 arrays.          

3. Thin Provisioning:  Better system utilization is the name of the game.  Most admins know that the utilization rates on their disk systems are not where they need to be.  Thin Provisioning allows admins to allocate or provision space to specific applications, making full use of their system’s capacity.  3PAR spearheaded open systems Thin Provisioning and NetApp offers it as a part of Data OnTap.  Sun announced Thin Provisioning on its StorageTek 9990V system in May – meaning consumers can have the world’s fastest enterprise array, Virtual Disk AND Thin Provisioning all on one platform. 

Pretty cool…  

 
4. Data Deduplication (aka De-dup, Single-instance storage):  In a world where there is more data coming into a company than can possibly be managed – data compression ratios ranging from 10:1 to 50:1 sound pretty darn nice (See how De-dup works here).  Data Domain, Diligent, FalconStor and other upstarts get credit for bringing this new technology to market and larger vendors are quickly following suite.  De-dup is still emerging, can have performance issues and does not work perfectly for every application – but economics dictate its worth consumers investigating where it can work for them.

There are two emerging de-dup architectures:  “Inline” – where the de-dup magic happens in real-time, as data comes into the system, as found in Diligent's ProtecTIER appliance.  Or “Post-Processing” where the magic happens as a secondary process after the backup job, as found in FalconStor’s Single Instance Repository (SIR) software.  Both have their pros and cons, and deciding which approach to use depends on balancing your performance vs. complexity needs.  For the record, Sun sells both….         

5. Data Encryption:  One need only read the horror stories of lost tape and disk drives to see the importance of data encryption.  While it has been around for a while – the need has never been greater.  Growing storage capacity has caused another problem – one can store a lot of personnel records on a single cartridge or drive.  In an age of identify theft, losing one storage device can put a company out of business.  The new trend is not how to encrypt, but where to encrypt  On the host server?  On an appliance in the network?  In the storage device itself?  Decru (since bought by NetApp) benefited from this trend with their encryption appliance.

I once worked with a brilliant engineer whose favorite saying was “never put a product where a feature should be.  I’d say this was Sun’s philosophy when we delivered the Sun StorageTek T10000 tape drive.  Put simply, Sun put an encryption chip next to the compression chip on the drive – so data is encrypted as it is fed onto the tape.  Simple and affordable – no extra appliance needed.  Sun also offers the StorageTek Crypto Key Management Station to centrally authorize, secure and manage encryption keys.

6. Eco Storage (aka Green Storage/IT):  I freely admit that when I was first approached with “Green Storage” I was a skeptic.  I would have also never guessed Al Gore would win the Nobel Peace Prize!  But Eco also stands for Economics.  If you save power and footprint, and the world while you are at it – who can argue with that?  But the challenge for storage customers will be sorting through the vendors who make REAL Eco investments vs. the ones that just add “Eco” or “Green” to their marketing collateral.  Sun’s in the “real” category, investing heavily in Eco IT.  Sun’s Eco efforts can be seen here...        

7. Object Archive (aka CAS, Application Aware Archive): The dizzying array of regulations, compliance requirements and influx of data have made the archive market one of the fastest growing markets in IT and storage.  And customers must continually evaluate which archive approach will work best for them.  The trend here is to “build a better mousetrap archive.”  The challenge is this, an archive system must:

  • Store a lot of data affordablyHoneycomb
  • Have WORM  functionality so documents show up un-altered in an audit or court of law
  • Be easily and quickly accessible
  • For more than 100 years…
Easier said than done.  While tape continues to be the old staple in archive, and faster and denser tape systems are coming out each year – a lot of new innovations are happening on the disk and software side.  Disk-based object-level archive systems include EMC’s Centera, HP’s RISS and Sun’s StorageTek 5800 “Honeycomb” pictured at right.  Sun’s ST5800 system uses advanced meta data features and processors close to each storage cluster for fast access to deep archives – great for digital library, Web 2.0 and HPC applications and environments. 

But do keep in mind for deep archive; Sun’s StorageTek SL8500 Tape Library is tough to beat – just one library's max raw capacity is 56 Petabytes, and data sitting on tape consumes 0 kilowatts and generates 0 CO2 (see above trend #6)   

8. New Interfaces, Protocols & Configurations: There is a lot of change happening in storage systems and how they are configured.  The three primary ways storage is attached is Direct Attached Storage (DAS), Storage Area Networks (SAN) and Network Attached Storage (NAS).  A disk system can also be configured in a couple different ways.  RAID configurations stripe data across multiple drives and impact a system’s reliability and performance.  JBOD (Just a Bunch of Disks) is more affordable because it does not require a disk controller, but provides no data redundancy.  New interfaces and protocols will impact each of these markets significantly. 

  • SAS:  In DAS, Serial Attached SCSI (SAS) is a newer serial communication protocol that will make data transfer speeds much faster and at a lower cost.  SAS drives have replaced parallel SCSI for internal storage, and SAS HBA external host interfaces are just starting to ship in volume.  SAS disk arrays started shipping this year and SAS JBOD are starting to ship.  To complement the SAS HBA's and SAS arrays, there are new network components called expanders which are basically switches.  Generally SAS disk will be cheaper than Fibre Channel (FC) with similar performance and capacity.  (Sun has SAS host I/F, disk I/F and SAS disk drives)
  • iSCSI:  In the SAN market, the iSCSI protocol is making a significant impact by taking the cost out of expensive Fibre Channel SANs.  iSCSI allows users to send SCSI commands over their existing IP networks. 
  • File Virtualization:  In the NAS market, file virtualization is making NAS farms much more scalable and manageable (See trend #2)
  • Clustered RAID:  A new innovation to watch in the RAID market is horizontally scalable RAID or clustered RAID for large applications.  Digi-Data is a small storage company pushing this innovation. 
  • JBOD:  Keep your eye on Sun in the JBOD market.  JBOD is more affordable than RAID, but does not have RAID’s redundancy and reliability features.  But what if you had an infinitely scalable file system with data integrity and RAID features running on JBOD, say, something like ZFS?
  • Unified Storage:  Another trend in this area are unified or hybrid devices – storage systems that can handle multiple protocols and interfaces, including iSCSI, Fibre Channel, and NAS – all in one unit.  Makes sense to users with a dizzying array of choices in the market.             
  • 10GbE:  Lastly, 10 Gigabit Ethernet or 10GbE is the latest and fastest of the Ethernet standards that will re-shape data center networking – offering a fast, common and affordable network technology for IT and Web 2.0 applications from supercomputing to networked storage.
  •  

9. Solid State Disk (aka SSD, Flash, Memory):  We see the perfect storm happening around SSD.  SSD has no moving parts, making it one of the most reliable and fastest storage mediums in the world.  But it is incredibly expensive when you compare $/GB vs. Disk and Tape storage.  However, SSD has made a name for itself in the consumer market (digital cameras, phones, iPods, etc.)  The price pressure in the consumer arena is enormous – and this had been rapidly eroding the price of SSD.  As SSD prices drop, expect to see hybrid disk drives and storage systems that leverage more SSD for greater speed and IOPS.        

 

10.  Storage as a Service: 

Storage as a service offered over the Internet has been talked about for years – but poor performance and implementations have cooled this trend.  However, Amazon has given Storage as a Service a power boost with its Amazon Simple Storage Service (Amazon S3).  By leveraging Amazon’s existing e-commerce and storage infrastructure, the company is offering customers storage capacity for $0.15 per GB-Month of storage used – possibly the cheapest $/GB on the planet.  And while this may have more play in the consumer market, Amazon could re-invigorate the storage as a service trend.  Also keep an eye on Sun’s Internet Service offerings over Network.com...             

--- Updates ---

Tuesday Jun 19, 2007

Lacrosse and the future of storage...

I like storage, strategy and sports - so I really liked Scott Tracy's "Telegraph" Blog.  I commented on it, but will elaborate further here.  He talks about Storage running on general-purpose Solaris, and shows a nice OpenSolaris Storage Platform diagram. I'll serve up my own sports analogy (and re-live the glory days while I am at it...)

I played Football and Lacrosse in high school and college  (yes, I was a UPS Logger) - but I'll stick with the Lacrosse theme. 

In high school my team competed in Florida's Cocoa Expo Lacrosse Tournament. We were quickly mocked by all the teams there - they represented the best of east coast lacrosse and we were from cow-town Colorado. We were at first intimidated by their "moves" - quick sticks, behind the back passes, etc. But our coach told us to stick to the basics - two hands on the stick, straight forward passes, etc.

What happened?

We won the tournament that year - first place, #1 (and went on to win State by the way). So what's the storage tie in?

There is an appliance or point-product for everything in storage today - virtualization, data movers, back up, CDP, encryption, etc.  A lot of fancy stuff that can solve individual problems, but adds to the overall complexity of IT storage. I like the OpenSolaris Storage Platform approach because it sticks to the basics - data volume management, data services, file systems - open and residing at the heart of any system or solution - its OS.

That's a winning strategy...


Today's Page Hits: 167