<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Are we missing the point with SSD/Flash?</title>
	<atom:link href="http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/</link>
	<description>Welcome to the bigger truth! I&#039;ll try to add some context around &#34;how&#34; or &#34;why&#34; things might mean more than meets the eye.</description>
	<lastBuildDate>Fri, 03 Feb 2012 06:42:49 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Mike Workman</title>
		<link>http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/comment-page-1/#comment-46</link>
		<dc:creator>Mike Workman</dc:creator>
		<pubDate>Tue, 02 Jun 2009 18:27:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/#comment-46</guid>
		<description>Steve,
I think the guys above said it very well, except I think they were too nice :-)
Until we come up with breakthrough architecture that solve ALL the problems, we make incremental improvements. Always have, always will. Faster processors, more cache, higher RPM&#039;s and the like.
Adding SSD to the storage pool just gives us one more weapon in the war on storage *subsystem* performance. It is probably, for now, not as great a weapon as the hype would suggest, but it is certainly a good one, and it will get better over time. Someday, it will be stupid not to have it in your array. In the meantime people will argue against it only if their subsystem design cannot make effective use of it, or if their management system turns it into a huge kludge to manage. We all know who these folks are by the structure of their arguments for or against SSD.
In my first Blog on this topic I made exactly your first point - put the SSD closest to the processors doing the work - Fusion IO for example connected with PCIe busses - far better than accessing a shared network with SSD&#039;s hanging off the back of it. That is all fine if we are willing to give up the shared storage architecture. It some cases, it is the best weapon for performance.
Finally, I am curious as to how all of a sudden people think FAST is so cool, when Pillar did this Tiering in the box for the last last 3+ years?  Our QoS does exactly that, automated movement of data between different Tiers of non-volatile storage components. Out management structure makes addition of SSD into the storage pool trivial. Before people beat me up over this - I didn&#039;t say using SSD effectively is trivial, the technology certainly presents it&#039;s challenges.
Now for the Silver Oak: I made mine Shafer One Point Five.  If you haven&#039;t tried it, you should. At least you should if you like big wines. If you are partial to things closer to standard Pinot&#039;s then stay away from any Shafer - it will cause you to convulse....if you have a big budget, go for Hillside Select.
Have a great weekend.
Mike
</description>
		<content:encoded><![CDATA[<p>Steve,<br />
I think the guys above said it very well, except I think they were too nice <img src='http://www.thebiggertruth.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /><br />
Until we come up with breakthrough architecture that solve ALL the problems, we make incremental improvements. Always have, always will. Faster processors, more cache, higher RPM&#8217;s and the like.<br />
Adding SSD to the storage pool just gives us one more weapon in the war on storage *subsystem* performance. It is probably, for now, not as great a weapon as the hype would suggest, but it is certainly a good one, and it will get better over time. Someday, it will be stupid not to have it in your array. In the meantime people will argue against it only if their subsystem design cannot make effective use of it, or if their management system turns it into a huge kludge to manage. We all know who these folks are by the structure of their arguments for or against SSD.<br />
In my first Blog on this topic I made exactly your first point &#8211; put the SSD closest to the processors doing the work &#8211; Fusion IO for example connected with PCIe busses &#8211; far better than accessing a shared network with SSD&#8217;s hanging off the back of it. That is all fine if we are willing to give up the shared storage architecture. It some cases, it is the best weapon for performance.<br />
Finally, I am curious as to how all of a sudden people think FAST is so cool, when Pillar did this Tiering in the box for the last last 3+ years?  Our QoS does exactly that, automated movement of data between different Tiers of non-volatile storage components. Out management structure makes addition of SSD into the storage pool trivial. Before people beat me up over this &#8211; I didn&#8217;t say using SSD effectively is trivial, the technology certainly presents it&#8217;s challenges.<br />
Now for the Silver Oak: I made mine Shafer One Point Five.  If you haven&#8217;t tried it, you should. At least you should if you like big wines. If you are partial to things closer to standard Pinot&#8217;s then stay away from any Shafer &#8211; it will cause you to convulse&#8230;.if you have a big budget, go for Hillside Select.<br />
Have a great weekend.<br />
Mike</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: the storage anarchist</title>
		<link>http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/comment-page-1/#comment-45</link>
		<dc:creator>the storage anarchist</dc:creator>
		<pubDate>Fri, 29 May 2009 22:03:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/#comment-45</guid>
		<description>A very thought provoking conversation - thank you sir.
(shrugging off my storage-centric blinders)...and yes, I have to agree, in the big picture, all we can struggle to accomplish is relocate the bottleneck somewhere else.
And we (by definition) will NEVER succeed in eliminating all the bottlenecks, unless we stop &quot;improving&quot; software, that is.
Heady stuff - I think there&#039;s a bottle of Silver Oak waiting at home that needs my attention.
Thanks for making Friday Fun!
</description>
		<content:encoded><![CDATA[<p>A very thought provoking conversation &#8211; thank you sir.<br />
(shrugging off my storage-centric blinders)&#8230;and yes, I have to agree, in the big picture, all we can struggle to accomplish is relocate the bottleneck somewhere else.<br />
And we (by definition) will NEVER succeed in eliminating all the bottlenecks, unless we stop &#8220;improving&#8221; software, that is.<br />
Heady stuff &#8211; I think there&#8217;s a bottle of Silver Oak waiting at home that needs my attention.<br />
Thanks for making Friday Fun!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: the storage anarchist</title>
		<link>http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/comment-page-1/#comment-44</link>
		<dc:creator>the storage anarchist</dc:creator>
		<pubDate>Fri, 29 May 2009 20:44:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/#comment-44</guid>
		<description>Pragmatically speaking, I agree with you that centralizing storage resources is more efficient than distributing them out into the servers. This is the foundation of the external storage value proposition.
And I believe that centralizing expensive resources designed to improve I/O performance is also most prudent - we&#039;ve done that for years with large DRAM caches and the compute power to fuel intelligent pre-fetch to increase cache hit rates, for example.
But I take exception with the assertion that adding Flash to the centralized resource pool in an external array necessarily increases management complexity - in fact, I believe that the opposite is true.
I contend that with the appropriate intelligent algorithms and architecture, the external array can leverage Flash to improve performance in much the same way that it leverages DRAM cache today.
Furthermore, extensive installed-base analysis shows that the vast majority of workloads are far more static than people imagine. This is especially true the larger your granularity of analysis: DRAM cache is appropriate for block-level optimization because the &quot;busy&quot; block set will always be larger than DRAM. But when you move up in granularity, you find that the vast majority of systems service 80-90% of all IO cache-miss requests from only 5-10% of the LUNs.
And perhaps surprisingly, the set of &quot;workhorse&quot; LUNs don&#039;t really change all that much over time.
This fact is indeed the foundational reason why tiering across 15K, 10K and 7200rpm SATA drive works for so many customers. And it is ALSO the foundational reason why wide-striping can work - each drive has to support a small fraction of the workhorse LUNs, and the rest of the LUNs don&#039;t see much I/O anyway so their impact on the workhorses is rare.
What most people haven&#039;t considered is that taking the HEAVIEST workhorse LUNs - the ones that service the MOST IOs per day/week/month - and moving just these to Flash has two immediate benefits: 1) these busy LUNs get the benefit of the drastically reduced response times afforded by Flash, and 2) the rest of the workload left on spinning rust ALSO gets better response time BECAUSE THE DISK DRIVES DON&#039;T HAVE TO WORK AS HARD! Less head contention = better performance!
In fact, customers often find that they can wide-stripe the remaining workload on SATA drives, using a combination of Flash (lowest $/IOP) and SATA (lowest $/GB) to reduce their overall purchase cost and environmental footprint - without sacrificing performance OR increasing operational overhead!
And given that the workload distribution of most arrays is typically very stable, you usually won&#039;t have to add any overhead to constantly rebalance or redistribute the data in/out of Flash.
And for those environments where the workloads ARE more dynamic, EMC has announced FAST - Fully Automated Storage Tiering. FAST will dynamically analyze running workloads and adjust the distribution of LUNs and/or sub-LUNs across the available storage tiers.
FAST is probably the &quot;that&quot; you&#039;re looking for, but it&#039;s not necessarily the case that everyone needs FAST to benefit from centralized Flash.
----Well at least pragmatically you agree with me!  A few points:
1. I am NOT suggesting that improving ANY end node (array, server, filer, etc.) element is bad - I like &quot;better&quot; given the alternative.  If the problem can be contained to a single system and isolated to a single tier, then go nuts. If you can fix any problem by adding some memory, why wouldn&#039;t you?
2. Your argument is completely valid (albeit only for the storage end of the equation) PRESUMING that all my data is in your stuff - and the magic that your stuff contains makes sure that the right data is in the right tier at all times.  If all I had were super-mega Symms and FAST, then yes, I could theoretically solve all of my optimization/automation issues at the STORAGE layer.  I also acknowledge that this scenario would be wicked good - for EMC!  However, this is not the case with MOST of the real world, who do deal with (gasp!) non-EMC systems, and even those who deal with multiple EMC systems - it&#039;s a fact that there are lots and lots of disparate elements - and that is the real issue.  Plus, your argument falls down if the real issue turns out to be external to the actual storage layer.  If the problem is at the load balancer, the server, or the network, you could sell me an array made out of pure gold that squirts Silver Oak and it won&#039;t do me any good.  Until I get drunk and sell the thing, of course.
3. In one sense the unrealistic answer to this is that we take all of the individual elements we deal with and throw them out and buy a 1973 Mainframe.  Surely as an industry we are attempting to do that by virualizing the disparat elements.  Pragmatically however, I&#039;m realistic enough to know that we aren&#039;t going to be able to do that, not in short order anyway, and as such will continue to have a zillion elements that comprise our &quot;infrastructure&quot;.  Storage is only 1 layer.  Storage by itself is useless.  We either implode back to a single system or we deal with the distributed nature of the stack as best we can.
4. Finally, in terms of making things harder, I revert to the statements above.  FAST is an awesome concept but is predicated upon having the data under it&#039;s control - and that the storage layer is the entire issue.  What happens if after I have FAST put my proper data into my properly sized SSD and Caches - and the user experience still sucks?  What happens is the storage guy says, &quot;ain&#039;t me man&quot;, and drinks the Silver Oak while the server, application, and network guys sweat out the problem.  They ultimately find out that because the VMware experiment that moved Exchange from 25 shittly little servers down to 4 mongo-ass kicking blade servers as well as 11 Sharepoint servers they piled on but forgot to upgrade to SQL 2007, that the workload got all funky and even though CPU is only at 45% now, performance still sucks.  So it ain&#039;t the storage, and it ain&#039;t the processor - but it sure is something.  Your system is spitting love on demand, but do you think the user cares?  Do you think the operations staff is psyched with their Flash investment now or the magic new Nehalems?  Nope.  They want to shoot someone.
Which I guess is my point here - systemic issues can only be masked by point improvements - but as long as change is the rule of the day, eventually the next weakest link will be exposed.   Flash, nor FAST, nor Silver Oak can solve every problem - only the ones it can touch.  -----Steve
</description>
		<content:encoded><![CDATA[<p>Pragmatically speaking, I agree with you that centralizing storage resources is more efficient than distributing them out into the servers. This is the foundation of the external storage value proposition.<br />
And I believe that centralizing expensive resources designed to improve I/O performance is also most prudent &#8211; we&#8217;ve done that for years with large DRAM caches and the compute power to fuel intelligent pre-fetch to increase cache hit rates, for example.<br />
But I take exception with the assertion that adding Flash to the centralized resource pool in an external array necessarily increases management complexity &#8211; in fact, I believe that the opposite is true.<br />
I contend that with the appropriate intelligent algorithms and architecture, the external array can leverage Flash to improve performance in much the same way that it leverages DRAM cache today.<br />
Furthermore, extensive installed-base analysis shows that the vast majority of workloads are far more static than people imagine. This is especially true the larger your granularity of analysis: DRAM cache is appropriate for block-level optimization because the &#8220;busy&#8221; block set will always be larger than DRAM. But when you move up in granularity, you find that the vast majority of systems service 80-90% of all IO cache-miss requests from only 5-10% of the LUNs.<br />
And perhaps surprisingly, the set of &#8220;workhorse&#8221; LUNs don&#8217;t really change all that much over time.<br />
This fact is indeed the foundational reason why tiering across 15K, 10K and 7200rpm SATA drive works for so many customers. And it is ALSO the foundational reason why wide-striping can work &#8211; each drive has to support a small fraction of the workhorse LUNs, and the rest of the LUNs don&#8217;t see much I/O anyway so their impact on the workhorses is rare.<br />
What most people haven&#8217;t considered is that taking the HEAVIEST workhorse LUNs &#8211; the ones that service the MOST IOs per day/week/month &#8211; and moving just these to Flash has two immediate benefits: 1) these busy LUNs get the benefit of the drastically reduced response times afforded by Flash, and 2) the rest of the workload left on spinning rust ALSO gets better response time BECAUSE THE DISK DRIVES DON&#8217;T HAVE TO WORK AS HARD! Less head contention = better performance!<br />
In fact, customers often find that they can wide-stripe the remaining workload on SATA drives, using a combination of Flash (lowest $/IOP) and SATA (lowest $/GB) to reduce their overall purchase cost and environmental footprint &#8211; without sacrificing performance OR increasing operational overhead!<br />
And given that the workload distribution of most arrays is typically very stable, you usually won&#8217;t have to add any overhead to constantly rebalance or redistribute the data in/out of Flash.<br />
And for those environments where the workloads ARE more dynamic, EMC has announced FAST &#8211; Fully Automated Storage Tiering. FAST will dynamically analyze running workloads and adjust the distribution of LUNs and/or sub-LUNs across the available storage tiers.<br />
FAST is probably the &#8220;that&#8221; you&#8217;re looking for, but it&#8217;s not necessarily the case that everyone needs FAST to benefit from centralized Flash.<br />
&#8212;-Well at least pragmatically you agree with me!  A few points:<br />
1. I am NOT suggesting that improving ANY end node (array, server, filer, etc.) element is bad &#8211; I like &#8220;better&#8221; given the alternative.  If the problem can be contained to a single system and isolated to a single tier, then go nuts. If you can fix any problem by adding some memory, why wouldn&#8217;t you?<br />
2. Your argument is completely valid (albeit only for the storage end of the equation) PRESUMING that all my data is in your stuff &#8211; and the magic that your stuff contains makes sure that the right data is in the right tier at all times.  If all I had were super-mega Symms and FAST, then yes, I could theoretically solve all of my optimization/automation issues at the STORAGE layer.  I also acknowledge that this scenario would be wicked good &#8211; for EMC!  However, this is not the case with MOST of the real world, who do deal with (gasp!) non-EMC systems, and even those who deal with multiple EMC systems &#8211; it&#8217;s a fact that there are lots and lots of disparate elements &#8211; and that is the real issue.  Plus, your argument falls down if the real issue turns out to be external to the actual storage layer.  If the problem is at the load balancer, the server, or the network, you could sell me an array made out of pure gold that squirts Silver Oak and it won&#8217;t do me any good.  Until I get drunk and sell the thing, of course.<br />
3. In one sense the unrealistic answer to this is that we take all of the individual elements we deal with and throw them out and buy a 1973 Mainframe.  Surely as an industry we are attempting to do that by virualizing the disparat elements.  Pragmatically however, I&#8217;m realistic enough to know that we aren&#8217;t going to be able to do that, not in short order anyway, and as such will continue to have a zillion elements that comprise our &#8220;infrastructure&#8221;.  Storage is only 1 layer.  Storage by itself is useless.  We either implode back to a single system or we deal with the distributed nature of the stack as best we can.<br />
4. Finally, in terms of making things harder, I revert to the statements above.  FAST is an awesome concept but is predicated upon having the data under it&#8217;s control &#8211; and that the storage layer is the entire issue.  What happens if after I have FAST put my proper data into my properly sized SSD and Caches &#8211; and the user experience still sucks?  What happens is the storage guy says, &#8220;ain&#8217;t me man&#8221;, and drinks the Silver Oak while the server, application, and network guys sweat out the problem.  They ultimately find out that because the VMware experiment that moved Exchange from 25 shittly little servers down to 4 mongo-ass kicking blade servers as well as 11 Sharepoint servers they piled on but forgot to upgrade to SQL 2007, that the workload got all funky and even though CPU is only at 45% now, performance still sucks.  So it ain&#8217;t the storage, and it ain&#8217;t the processor &#8211; but it sure is something.  Your system is spitting love on demand, but do you think the user cares?  Do you think the operations staff is psyched with their Flash investment now or the magic new Nehalems?  Nope.  They want to shoot someone.<br />
Which I guess is my point here &#8211; systemic issues can only be masked by point improvements &#8211; but as long as change is the rule of the day, eventually the next weakest link will be exposed.   Flash, nor FAST, nor Silver Oak can solve every problem &#8211; only the ones it can touch.  &#8212;&#8211;Steve</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tony Asaro</title>
		<link>http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/comment-page-1/#comment-43</link>
		<dc:creator>Tony Asaro</dc:creator>
		<pubDate>Fri, 29 May 2009 18:09:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.thebiggertruth.com/2009/05/are-we-missing-the-point-with-ssdflash/#comment-43</guid>
		<description>Steve - disk drives are the slowest thing in the performance chain which is why solving this problem does become valuable.  There already is caching in multiple places - the host systems and the storage controllers - and cache + cache is additive.  However, there will be cache misses and that is why technologies like wide stripping make a real impact on overall performance.  Additionally, there is a need for faster processors - if Intel didn&#039;t drive this then VMware couldn&#039;t exist.  Who knows what unforeseen value improving performance at levels will manifest?  I agree that SSD doesn&#039;t necessarily solve all problems and remember it is price/performance that is what ultimately matters and right now SSD has not hit the &quot;no-brainer&quot; curve.  There is no panacea for solving this problem when all the parts of the data center are so disaggregated and heterogeneous.  That is why having the market drive improvement with each vendor motivated to do so.
----I agree that we absolutely need to constantly improve at the component level, and apologize if I didn&#039;t state that clearly.  However, the concept of wide-striping is a great example of my point - it is only relevant if the data I&#039;m striping is within that system.  If the data most important at this point in time sits elsewhere, the fact that array A is wide striped is meaningless to me.  The same holds true if that array contains zero disks - and all SSD.  Success is still predicated on the data required BEING in that place.
With data growth never abating, it signifies that you will never have all your data in any one container, thus there will always be a growing percentage of that data that is sub-optimzed.  I guess what I&#039;m rambling about is that even if EVERY storage array were 100% SSD - it would still not make for 100% optimized IT infrastructure hollistically - it would just remove the storage end of the wire from being the issue and push it elsewhere.  Thanks --- Steve
</description>
		<content:encoded><![CDATA[<p>Steve &#8211; disk drives are the slowest thing in the performance chain which is why solving this problem does become valuable.  There already is caching in multiple places &#8211; the host systems and the storage controllers &#8211; and cache + cache is additive.  However, there will be cache misses and that is why technologies like wide stripping make a real impact on overall performance.  Additionally, there is a need for faster processors &#8211; if Intel didn&#8217;t drive this then VMware couldn&#8217;t exist.  Who knows what unforeseen value improving performance at levels will manifest?  I agree that SSD doesn&#8217;t necessarily solve all problems and remember it is price/performance that is what ultimately matters and right now SSD has not hit the &#8220;no-brainer&#8221; curve.  There is no panacea for solving this problem when all the parts of the data center are so disaggregated and heterogeneous.  That is why having the market drive improvement with each vendor motivated to do so.<br />
&#8212;-I agree that we absolutely need to constantly improve at the component level, and apologize if I didn&#8217;t state that clearly.  However, the concept of wide-striping is a great example of my point &#8211; it is only relevant if the data I&#8217;m striping is within that system.  If the data most important at this point in time sits elsewhere, the fact that array A is wide striped is meaningless to me.  The same holds true if that array contains zero disks &#8211; and all SSD.  Success is still predicated on the data required BEING in that place.<br />
With data growth never abating, it signifies that you will never have all your data in any one container, thus there will always be a growing percentage of that data that is sub-optimzed.  I guess what I&#8217;m rambling about is that even if EVERY storage array were 100% SSD &#8211; it would still not make for 100% optimized IT infrastructure hollistically &#8211; it would just remove the storage end of the wire from being the issue and push it elsewhere.  Thanks &#8212; Steve</p>
]]></content:encoded>
	</item>
</channel>
</rss>

