<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0" xmlns:osnews="http://osnews.com/rss2#">
	<channel>
		<title>OSNews: </title>
		<link>http://www.osnews.com/story/20635/Inside_Tsubame_Japan_s_NVIDIA_GPU_Supercomputer</link>
		<description>Exploring the Future of Computing</description>
		<language>en-us</language>
		<copyright>Copyright 2001-2010, David Adams</copyright>
		<webMaster>adam+nospam@osnews.com</webMaster>
		<lastBuildDate>Tue, 16 Mar 2010 22:06:47 GMT</lastBuildDate>
		<image>
			<url>http://www.osnews.com/images/osnews.gif</url>
			<title>OSNews.com</title>
			<link>http://www.osnews.com</link>
		</image>
		<item>
			<title>The heat dissipation must be a joy on the Power bill</title>
			<link>http://osnews.com/thread?339856</link>
			<guid isPermaLink="true">http://osnews.com/thread?339856</guid>
			<description>The cooling of that setup must be a delight.</description>
			<pubDate>Thu, 11 Dec 2008 00:21:00 GMT</pubDate>
			<author>donotreply@osnews.com (tyrione)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: The heat dissipation must be a joy on the Power bill</title>
			<link>http://osnews.com/thread?339877</link>
			<guid isPermaLink="true">http://osnews.com/thread?339877</guid>
			<description>Well, the GPUs only take like 4 racks. I assume they are using another 4 racks for the host computers. 8/10 racks is not that bad for a system that puts out 77 tflops.</description>
			<pubDate>Thu, 11 Dec 2008 02:50:00 GMT</pubDate>
			<author>donotreply@osnews.com (javiercero1)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[2]: The heat dissipation must be a joy on the Power bill</title>
			<link>http://osnews.com/thread?339880</link>
			<guid isPermaLink="true">http://osnews.com/thread?339880</guid>
			<description>4 racks? Are you kidding?<br />
<br />
&quot;Tsubame itself - once you move past the air-conditioners - is split across several rooms in two floors of the building and is largely made up of rack-mounted Sun x4600 systems. There are 655 of these in all, each of which has 16 AMD Opteron CPU cores inside it, and Clearspeed CSX600 accelerator boards.<br />
<br />
The graphics chips are contained in 170 Nvidia Tesla S1070 rack-mount units that have been slotted in between the Sun systems. Each of the 1U Nvidia systems has four GPUs inside, each of which has 240 processing cores for a total of 960 cores per system.&quot;<br />
<br />
More info on <a href="http://www.pcworld.com/article/155242/.html?tk=rss_news" rel="nofollow">http://www.pcworld.com/article/155242/.html?tk=rss_news</a></description>
			<pubDate>Thu, 11 Dec 2008 03:18:00 GMT</pubDate>
			<author>donotreply@osnews.com (GatoLoko)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>Not the future today</title>
			<link>http://osnews.com/thread?339893</link>
			<guid isPermaLink="true">http://osnews.com/thread?339893</guid>
			<description>NVidia's gpus really fall flat on double precision.  The nvidia tesla system advertises 1TFlop SP and 80GFlop DP.  That means 64bit doubles run at only 8% the speed of 32bit floats.  And 32bit floats are just about totally worthless for general scientific processing.  An 8 core powermac 2 years ago could churn ~40GFlop DP.  That was 2 intel generations ago....<br />
<br />
The new cell successor from IBM looks far more interesting but it seems that IBM isn't interested in selling it (same mistake DEC made).</description>
			<pubDate>Thu, 11 Dec 2008 04:50:00 GMT</pubDate>
			<author>donotreply@osnews.com (bnolsen)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[3]: The heat dissipation must be a joy on the Power bill</title>
			<link>http://osnews.com/thread?339914</link>
			<guid isPermaLink="true">http://osnews.com/thread?339914</guid>
			<description>They have 170 tesla cabinets, each is 1U rack unit. Each standard rack tends to be 42 units. <br />
<br />
Simple math dictates that 4 racks could hold those 170Us. <br />
<br />
I am not sure they packed them that much. But I was referring to the GPUs alone. Which is indeed a remarkable density of computing.</description>
			<pubDate>Thu, 11 Dec 2008 07:40:00 GMT</pubDate>
			<author>donotreply@osnews.com (javiercero1)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: Not the future today</title>
			<link>http://osnews.com/thread?339915</link>
			<guid isPermaLink="true">http://osnews.com/thread?339915</guid>
			<description>Please, refrain from making blanket statements like &quot;simple precision FP is worthless&quot; it turns out that for some applications, it is enough. Which I am sure that is what these machines are being targeted at.</description>
			<pubDate>Thu, 11 Dec 2008 07:42:00 GMT</pubDate>
			<author>donotreply@osnews.com (javiercero1)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[2]: Not the future today</title>
			<link>http://osnews.com/thread?339936</link>
			<guid isPermaLink="true">http://osnews.com/thread?339936</guid>
			<description>It was my understanding that nvidia was bringing native doubles with their latest hardware.  The boards used were announced in 2007, while the vendor ClearSpeed is saying that they support single and double, this is with their api.</description>
			<pubDate>Thu, 11 Dec 2008 10:31:00 GMT</pubDate>
			<author>donotreply@osnews.com (nsrbrake)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: Not the future today</title>
			<link>http://osnews.com/thread?339992</link>
			<guid isPermaLink="true">http://osnews.com/thread?339992</guid>
			<description>Not all scientific applications require 64-bit floating point support. Some algorithms are resilient enough for the 32-bit FP support to be sufficient, even with the restricted rounding modes. When you absolutely need 64-bit support, you can mix it with 32-bit floats where 64-bit is not necessary. And when you absolutely need 64-bits, remember that all that memory bandwidth (141.7 GB/sec) is available. Your 8% figure is only about raw execution resources but says nothing about actual application performance, which is of course totally dependent on the application. Considering super computers are not exactly cheap, they must have taken this into account somewhere. <br />
<br />
<div class="cquote"> The new cell successor from IBM looks far more interesting but it seems that IBM isn't interested in selling it (same mistake DEC made). </div><br />
<br />
True. Cell is still very expensive. IBM cannot compete with COTS nVidia solutions, since the Cell in the PS3 does not support DP, so the volumes must be quite small of the DP enabled Cell.</description>
			<pubDate>Thu, 11 Dec 2008 15:42:00 GMT</pubDate>
			<author>donotreply@osnews.com (drahca)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[2]: Not the future today</title>
			<link>http://osnews.com/thread?340033</link>
			<guid isPermaLink="true">http://osnews.com/thread?340033</guid>
			<description><div class="cquote">Not all scientific applications require 64-bit floating point support.  </div><br />
<br />
True, but the majority (if you ask me for a number I'd say &gt;95%) of scientific applications that require a super computer will need 64 bit floating point support.</description>
			<pubDate>Thu, 11 Dec 2008 19:35:00 GMT</pubDate>
			<author>donotreply@osnews.com (evangs)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>Suse Linux Enterprise Server 9</title>
			<link>http://osnews.com/thread?340037</link>
			<guid isPermaLink="true">http://osnews.com/thread?340037</guid>
			<description>Just in case anyone was wondering:<br />
<br />
<a href="http://tinyurl.com/6998vg" rel="nofollow">http://tinyurl.com/6998vg</a></description>
			<pubDate>Thu, 11 Dec 2008 19:56:00 GMT</pubDate>
			<author>donotreply@osnews.com (sbergman27)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[3]: Not the future today</title>
			<link>http://osnews.com/thread?340043</link>
			<guid isPermaLink="true">http://osnews.com/thread?340043</guid>
			<description>Be careful, pulling numbers like that out of one's derriere is a bit counter productive :-)</description>
			<pubDate>Thu, 11 Dec 2008 21:28:00 GMT</pubDate>
			<author>donotreply@osnews.com (javiercero1)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[3]: Not the future today</title>
			<link>http://osnews.com/thread?340046</link>
			<guid isPermaLink="true">http://osnews.com/thread?340046</guid>
			<description><div class="cquote">True, but the majority (if you ask me for a number I'd say &amp;gt;95%) of scientific applications that require a super computer will need 64 bit floating point support. </div><br />
Remember that this is a hybrid system.  It also has 10,480 Opterons. Parts of the job which require higher precision can use the Opterons, while parts which can get by with lower precision, can use the GPUs.  Also, as I understand it, GPUs are really great for vector processing, but really suck at most everything else. So they can be viewed as one more resource that the supercomputer application programmer has at hand which can be applied at his or her discretion, just like any other resource.</description>
			<pubDate>Thu, 11 Dec 2008 21:52:00 GMT</pubDate>
			<author>donotreply@osnews.com (sbergman27)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE[2]: Not the future today</title>
			<link>http://osnews.com/thread?340093</link>
			<guid isPermaLink="true">http://osnews.com/thread?340093</guid>
			<description>Actually cell SPEs can do DP, butt also with a big hit in performance.  The newer ibm PowerXCell boosts the double precision.  Apparently a single cell with 8 SPEs could only do 14GFlops DP.  PowerXCell can do 102GFlops with 8 SPEs...<br />
<br />
In fact a super computer made up of these PowerXCell was (maybe still is?) the top super computer in the world.<br />
<br />
<a href="http://www.ppcnux.com/?q=node/7144" rel="nofollow">http://www.ppcnux.com/?q=node/7144</a><br />
<br />
What sucks is that cell should absolutely not be so expensive.  Proof: sony who's claiming no hardware cost losses ~$399 per ps3 sale.  That's with a helluva lot more hardware than just cell.<br />
<br />
IBM wants their 10,000% markup, that's all.</description>
			<pubDate>Fri, 12 Dec 2008 04:58:00 GMT</pubDate>
			<author>donotreply@osnews.com (bnolsen)</author>
			<category>Comments</category>
		</item>

		<item>
			<title>RE: Not the future today</title>
			<link>http://osnews.com/thread?340108</link>
			<guid isPermaLink="true">http://osnews.com/thread?340108</guid>
			<description>Interesting to see people agree with this statement.<br />
<br />
OK, the thing is this: There are computations that require double precision. And there then there are some which don't. Now guess what this is made for. Also, the machine doesn't consist of GPUs only. Btw, if you see the performance of NV GPUs within the folding at home project you will find that they easily outdo even the Cell. Again, it depends what you wanna do with it. I hope you don't believe the issue escaped them before they had built the thing... <img src="/images/emo/wink.gif" alt=";)" /></description>
			<pubDate>Fri, 12 Dec 2008 10:07:00 GMT</pubDate>
			<author>donotreply@osnews.com (Googol)</author>
			<category>Comments</category>
		</item>
	</channel>
</rss>
