Tag: "NTAP"

Flood Affects Storage Industry


There was a great post a couple of weeks ago, with Tom Coughlin as a contributing editor, on Forbes’ news site about the floods that hit Thailand and how they will affect the disk drive market.  The great thing about the article is that it truly highlights that necessity is the mother of invention.  What do I mean by that?  Over the past few years “storage efficiency” has been a big topic with vendors.  Helping customers “do more with less”, especially in these stringent economic times, is key to the vitality of a number of businesses.  Technologies such as storage virtualization and thin provisioning have helped customers slow their storage spend and get better utilization out of their existing storage.  Once customers have moved their utilization rates from 35% to 65% or 70%, the time comes when new storage needs to be acquired to keep up with the growth of data.  The issue comes when there are no more disk drives to be acquired.  Due to the floods in Thailand, analysts predict that the storage industry could be 50 to 60 million units shy of demand this quarter.  This does two things:

1)      Drives the price of disk higher, at a time when the expectation is to spend less for disk

2)      Has IT getting more creative on how they use and deploy their storage

It is the latter that I want to focus on, as paying more for disk is not necessarily the best option.  It is important to note that data grows for one reason: business does not stop.  The business needs to keep going, and that is what is driving the demand on the data.

In the Forbes piece Tom talks about “a surge in new technologies because of this disk shortage”, but he doesn’t cover some of the most innovative technologies that are available to help customers.  I would agree with Tom that we “could” see a surge in SSD, as well as a surge in tape, but the SSD surge would be short-lived due to both supply and cost, and these aren’t really “new technologies”.
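To put rough numbers on the utilization point made earlier in this post: raising utilization from 35% to 70% lets the same disks hold twice the data, but compounding growth uses up that headroom quickly. The sketch below is only an illustration. The function name and the 40% annual growth rate are assumptions made for the example, not figures from the Forbes piece.

```python
import math


def years_of_headroom(current_util, target_util, annual_growth):
    """Years until data growing at annual_growth fills capacity from
    current_util up to target_util (both are fractions of raw capacity)."""
    if not 0 < current_util <= target_util <= 1:
        raise ValueError("need 0 < current_util <= target_util <= 1")
    if annual_growth <= 0:
        raise ValueError("annual_growth must be positive")
    # Compound growth: current * (1 + g) ** t == target, solved for t.
    return math.log(target_util / current_util) / math.log(1 + annual_growth)


if __name__ == "__main__":
    # 35% -> 70% utilization, with an assumed 40% annual data growth.
    print(round(years_of_headroom(0.35, 0.70, 0.40), 1))
```

Under that assumed growth rate the doubling in utilization buys roughly two years before new drives have to be bought, which is why a supply shortage arriving at the wrong moment hurts.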


Storage Efficiency Panel – SNW 2011 Fall


Yesterday I was on a panel at SNW in Orlando, Florida.  The panel was hosted by Dave Vellante, founder of Wikibon and always a great host for these kinds of things.  On the panel were Larry Freeman of NetApp, Craig Nunes of HP (formerly 3Par), Jered Floyd, CTO and founder of Permabit, and myself from IBM (formerly Storwize).

Some interesting data came out of this panel.  It was a well-attended session, with probably over 150 people in the audience, and Dave is VERY good about asking the audience questions.  Let me start by making sure we all know where each panelist sits at the “storage efficiency table”.

  • Larry Freeman is from NetApp – they claim, and I believe them, that they have 10 storage efficiency technologies that are embedded into WAFL
  • Craig Nunes’ main focus on the panel was ‘zero reclamation’ to optimize storage
  • I have a Real-time Compression drum I am beating
  • Jered Floyd focuses on data deduplication

Here are some of the questions Dave asked the audience, along with the audience responses (percentages are close estimates):

  • Question: How many people use deduplication / compression in their storage?
    Response: 60% said they use one or both of these technologies in their environment.
  • Question: How do users use these technologies, embedded or appliance?
    Response: 100% of the 60% said "embedded".
  • Question: Which storage vendor provided these technologies?
    Response: 100% of the 60% said NTAP.
  • Question: What is the number 1 issue with the embedded solution that keeps it from being more widely adopted?
    Response: Performance.  They all believed that for 70% of their applications the embedded solution was “good enough”, but for the 30% where performance is critical it couldn’t do the job.
  • Question: Why are more appliances not deployed to solve the performance issues?
    Response: Customers didn’t want to have to manage multiple solutions in their environment doing the same thing.

Storage in Eastern Europe


Today I begin a 12-day trip to Eastern Europe to talk about IBM Storage.

The trip will take me to:

  • Moscow, Russia
  • Warsaw, Poland
  • Prague, Czech Republic
  • Ljubljana, Slovenia
  • Umag, Croatia

In Russia, on September 6, I will be at the Information Infrastructure Conference and the following day meeting with customers to discuss storage and storage efficiency.

In Poland on September 8, I will be presenting IBM’s Real-time Compression at Storage University.

In Prague I will be meeting with the press as well as speaking with customers.  Additionally, I will be spending the weekend in Prague, a city I have always wanted to visit.

In Slovenia on September 14, I will be presenting at IBM’s Innovation Center at an IBM Solutions Event.

Finally, in Croatia on September 15, I will be at the IBM Forum, the largest IBM event in Croatia.

In each location, I will be speaking with partners and customers about IBM’s innovation in storage, storage efficiency and Real-time Compression.  I am looking forward to learning what the largest storage challenges are across Eastern Europe and how users go about solving them.  Additionally, I will be doing some local enablement for our partners and sellers.

I will blog from each location.  I will talk about the professional part of my travels as well as, hopefully, one personal event.  I have tried to make sure that in each city I have time to do one interesting thing.  I don’t know when, if ever, I’ll be back to these cities, and these are some places I have always hoped to go.  Too often we travel and it’s all business.

Also, stay tuned: when I land I will have an update from my trip to VMworld.  It was fantastic.  Truly the best end user show around.  I learned a great deal and can’t wait to share some of what I saw.  As always, comments are welcome.


Storage Efficiency Spotlight at VMworld


VMworld Live 2011
Via: Wikibon


Virtual Disk Storage


History truly does repeat itself.  We are talking about the history of data storage.  Every once in a while a new technology comes along that requires a new way to think about infrastructure.  Notice I said “infrastructure”.  I’d like to paint two analogies:

Analogy 1: RAID – Prior to RAID, users stored their data on disk and, if they could afford it, backed that data up to have a protected copy.  When RAID came out, users were able to store their data on multiple disks appearing as one device.  The benefits were increased data reliability and better performance.  This new technology fundamentally changed how disk was sold, but the questions were the same:

  1. How much capacity do you need?
  2. What type of performance does your application require?

The sales rep’s point of view changed.  There were a number of new considerations that needed to be taken into account.  First, the age-old question: “Will I sell less storage ‘stuff’?”  Remember, the person selling the disk at the time was probably also selling the backup tape and software to protect that information.  If the disks are more reliable, maybe the customer won’t need as much tape?  Second, when the capacity question came up, the seller also needed to know what type of RAID the customer wanted in order to sell them enough drives.  It was no longer as simple as taking the capacity requirement and dividing it by the drive capacity at the time.  Now, depending upon the RAID level, there was a new set of math that needed to be done.  Third was the notion of performance.  More spindles meant more performance, so once the capacity equation was solved, you also needed to know the I/O requirements to make sure the right number of drives was sold to cover both capacity and performance.
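The “new set of math” can be sketched roughly as follows. This is an illustration under stated assumptions, not a sizing tool: the function names are made up for the example, the array is treated as a single RAID group, and the write penalties (1, 2, 4 and 6 back-end I/Os per host write for RAID 0, 10, 5 and 6) are the commonly quoted rules of thumb rather than measured values.

```python
import math

# Rule-of-thumb back-end I/Os generated by one host write (assumed values).
WRITE_PENALTY = {"raid0": 1, "raid10": 2, "raid5": 4, "raid6": 6}


def drives_for_capacity(usable, drive_size, level):
    """Drives needed to reach `usable` capacity in a single RAID group."""
    data_drives = math.ceil(usable / drive_size)
    if level == "raid0":
        return data_drives        # striping only, no redundancy
    if level == "raid10":
        return 2 * data_drives    # every data drive is mirrored
    if level == "raid5":
        return data_drives + 1    # one drive's worth of parity
    if level == "raid6":
        return data_drives + 2    # two drives' worth of parity
    raise ValueError(f"unknown RAID level: {level}")


def drives_for_iops(read_iops, write_iops, iops_per_drive, level):
    """Drives needed so the spindles can absorb the back-end I/O load."""
    if level not in WRITE_PENALTY:
        raise ValueError(f"unknown RAID level: {level}")
    backend_iops = read_iops + write_iops * WRITE_PENALTY[level]
    return math.ceil(backend_iops / iops_per_drive)


def drives_to_quote(usable, drive_size, read_iops, write_iops,
                    iops_per_drive, level):
    """The larger of the capacity-driven and performance-driven counts."""
    count = max(
        drives_for_capacity(usable, drive_size, level),
        drives_for_iops(read_iops, write_iops, iops_per_drive, level),
    )
    if level == "raid10" and count % 2:
        count += 1  # mirrored pairs need an even number of drives
    return count
```

With these assumed numbers, 10 units of usable capacity on 1-unit drives in RAID 5 needs 11 drives for capacity alone, but a workload of 2,000 reads and 1,000 writes per second on 150-IOPS drives needs 40, so performance rather than capacity sets the drive count.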


Efficiency vs. Optimization


“Storage Efficiency” has become a big topic over the past 12 months.  There are a number of new technologies that have come out in the last few years that are helping to deal with storage growth.  We all know that data is the root of the decisions that drive business today.  The more data you have, hopefully, the better decisions you can make to drive your business to success.  The question is, “What is the value (and hence the cost) of the infrastructure to create that success?”  What we do know is that the ability to put more data in a highly efficient footprint can give your company a competitive edge.  There are five technologies that can help an IT organization create an efficient storage infrastructure.  These are:

1)      Tiering

2)      Virtualization

3)      Thin Provisioning

4)      Compression

5)      Deduplication

It is also important to point out that there are some semantics when talking about storage efficiency, specifically between efficiency and optimization technologies.  I think it is useful to attempt to define these as they lead us to picking the right solutions for what we are trying to accomplish.  For the purpose of this post, efficiency will relate to making existing capacity more useful and optimization will mean making more capacity out of existing capacity.

Using these definitions, technologies such as Tiering, Virtualization and Thin Provisioning are efficiency technologies.  These technologies help to utilize the existing capacity that you have.

Tiering is technology that is used on about 10% of your data or less.  It is used to move data that requires higher performance to flash storage.  Good tiering technology analyzes data access patterns and moves the most active data to the highest performing disk.  It doesn’t really change the amount of physical capacity that is required; it just changes what type of capacity is required and allows IT to make sure data is operating as fast and efficiently as possible.
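As a rough illustration of the selection step described above, the sketch below ranks extents by how active they are and promotes the hottest ones until a flash budget is used up. Everything here is an assumption made for the example: the function name, the extent IDs, the idea of ranking by accesses per GB, and the default 10% flash share. Real tiering engines track I/O over time windows and are considerably more involved.

```python
def pick_flash_extents(extents, flash_fraction=0.10):
    """Return IDs of the most active extents that fit in the flash tier.

    extents maps an extent ID to (size_gb, access_count); sizes must be > 0.
    flash_fraction is the share of total capacity available as flash.
    """
    total_gb = sum(size for size, _ in extents.values())
    flash_budget_gb = total_gb * flash_fraction

    # Hottest first: rank by accesses per GB so small, busy extents win.
    ranked = sorted(
        extents.items(),
        key=lambda item: item[1][1] / item[1][0],
        reverse=True,
    )

    promoted, used_gb = [], 0
    for extent_id, (size_gb, _) in ranked:
        # Skip anything that would overflow the flash budget.
        if used_gb + size_gb <= flash_budget_gb:
            promoted.append(extent_id)
            used_gb += size_gb
    return promoted
```

Note that the total capacity never changes in this sketch. Only the placement of data changes, which is why tiering counts as an efficiency technology under the definitions used in this post.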


LinkedIn Storage Discussion on Storage Efficiency


There is a great conversation on LinkedIn about deduplication and compression for storage efficiency in the Data Storage Professionals Group.  Help the storage community answer this question:

Does anyone has any experience in NAS de-duplication at filesystem level, like NetApps. Does it really work? I concerns/limitations?


Top 10 Reasons Real-time Compression Provides Extraordinary Storage Efficiency


Over the past few weeks I have witnessed the proverbial mudslinging that takes place in the blogosphere when marketing feathers are ruffled.  Most recently I was reading Rich Anderson of The StorageSavvy Blog.  The article was "Compression better than Dedup?  NetApp Confirms!"

I have to agree with Rich on many fronts.  First, "When all you have is a hammer, everything is a nail."  Rich points out vendors have to sell "what's in the bag" so it is conceivable that all problems look like they can be solved with their solution.  If you look back over the last few years NTAP has always had a "me too" reputation.  Whatever the industry has, they have one too and it's better.  For the last few years, while competing against Storwize, they have pulled the EMC tactic of trying to stall a market by saying, "We have optimization for primary storage with deduplication."   The reality is, you can't use it in real time, it is a resource hog, and, as Rich again mentions, the only use case it works well on in primary storage is VMware (and that is ONLY IF the customer stores their data outside the .vmdk file; otherwise compression is much better).  Now that NTAP has compression their story has changed to saying that compression on primary storage is better for most use cases.  Duh!  The folks at Storwize (now IBM Real-time Compression) have been saying that for years.  Why?  Deduplication is great for repetitive data sets, i.e. backup, not primary storage.  There just isn't that much repetitive data in primary storage.  Again, NTAP is trying to stall the market by saying they have "in-line" compression for primary storage.  Sorry guys, not good enough.  In-line is NOT Real-time.  Rich also points out that the key characteristics of storage for customers are capacity and performance.  Patrick Rogers of NTAP has said publicly that compression WILL indeed impact performance and that they even have a tool that will tell you how much performance will be impacted.  While NTAP may say compression is "free", we all know nothing worth having in life is free; you get what you pay for.
If you need the performance to do compression you are going to have to perform a major upgrade to your filer just to be able to perform compression, let alone try to do compression in real time.  No real savings there.
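The difference between the two techniques can be shown with a toy experiment. The sketch below is only an illustration of the mechanism and says nothing about any vendor's product: it uses fixed-size block hashing as a stand-in for deduplication and zlib as a stand-in for compression, and the two synthetic data sets and all function names are assumptions made for the example.

```python
import hashlib
import zlib


def dedup_ratio(data, block_size=4096):
    """Logical blocks divided by unique blocks (fixed-size block dedup)."""
    if not data:
        raise ValueError("data must not be empty")
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    unique = {hashlib.sha256(block).digest() for block in blocks}
    return len(blocks) / len(unique)


def compression_ratio(data):
    """Original size divided by zlib-compressed size."""
    if not data:
        raise ValueError("data must not be empty")
    return len(data) / len(zlib.compress(data))


def make_block(label, block_size=4096):
    """A compressible block whose content depends on `label`."""
    pattern = f"record {label:06d} ".encode()
    repeats = block_size // len(pattern) + 1
    return (pattern * repeats)[:block_size]


# "Backup-like" data: the same block written 100 times over.
backup_like = make_block(0) * 100
# "Primary-like" data: 100 different blocks, each compressible on its own.
primary_like = b"".join(make_block(i) for i in range(100))

if __name__ == "__main__":
    for name, data in (("backup-like", backup_like),
                       ("primary-like", primary_like)):
        print(name, dedup_ratio(data), round(compression_ratio(data), 1))
```

In this synthetic case deduplication finds 100 identical blocks in the backup-like set and nothing to remove in the primary-like set, while compression still shrinks both, because it works on redundancy inside each block rather than on whole repeated blocks.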


A Blog with no Comments?


Today I read a very well written blog by The SANMan.  The only issue is, you can't comment on his blog.  This is the first technology blog I have seen like this.  So, I will have to post my thoughts here.

In his post "NetApp Takes the "Primary" Lead for Data Reduction" - which seems more like theory and a commercial for NTAP than reality (see comments @ The Register) the SANMan states:

"Yes, Ocarina and Storwize have appliances that compress and uncompress data as it’s alternatively stored and read but what performance overhead do such technologies have when hundreds of end users concurrently access the same email attachment? As for Oracle’s Solaris ZFS file system sub level deduplication which is yet to see the light of day one wonders how much hot water it will get Oracle into should it turn out to be a direct rip off of the NetApp model."

I have two comments:

1) You are right - you CAN'T do deduplication on primary if you affect performance.  All indications from customers are that they cannot use NTAP deduplication or even compression 'in-line' as the performance is just too terrible, so all processing must be done post-process.

2) I direct your attention to the Wikibon Blog on CORE - "Dedupe Rates Matter...Just Not as Much as You Think" - Storwize can do in-line data optimization without any performance degradation.  So the question is - if customers can 'Optimize without Compromise' - why wouldn't they?

Updated 6/7/2010 - Oh, quick question - how does the SANMan get away with the graphics he uses?  I would think that Walt Disney & Pixar would get a bit upset with the use of the character Carl Fredricksen, no?


Storage Tiers – Take 3


 I find myself in a true quandary.  First, I have true admiration for my good friend and fellow blogger 3Par Farley and never feel comfortable being on the other side of the coin from him.  Second, I find myself agreeing, to a degree, with Jon Toigo (who still uses crazy permalinks and considers Novell a serious storage player.  What is up with that?).

I’m sure by now most of you have read about the furor lately over Tom Georgens’ comments about the future of storage tiering.  A number of folks who have ‘tiering’ technology reacted with disdain (see a list on Storagerap).  Some wondered how a storage visionary like Tom could turn his back on technology that helps people save money in storage.  Some even suggested that this is just marketing to overcome a deficiency in the NetApp product line.  However, one applauded Tom for understanding how the real world deploys storage.  All good points, but I have my own theory on storage tiering...

I want to come right out and say I think that storage tiering is an incredibly smart concept.  (Now that that is off the table…) I would also say that, much like the prediction that tape is ‘dead’ (I guess Data Domain didn’t get that memo), storage tiering can’t be dead, because in reality it never actually was alive, nor do I think it will be for a very long time.  Let’s look at the facts:

First, HSM never really went anywhere.  There is no mass adoption of HSM technology.  Second, tiering is not a technology issue.  Humans are lazy.  What do I mean?  HSM / Tiering or whatever you want to call it depends on policy.  IT can’t get any two groups in a company to agree on anything other than that storage is too expensive.  When I speak to well-respected people in IT in the ‘real world’ (my dad), they tell me it is too difficult to get organizations to agree on when data can be archived in order to save money (and that is what this is all about really).  Finally, IT processes get in the way of a good tiering strategy.  Getting data to go one way is easy – move data to cheaper and cheaper tiers of storage until it vanishes.  Try getting it back.  That takes a lot of management tools and integration and costs just as much as doing nothing.
