10,000-core Linux supercomputer built in Amazon cloud (networkworld.com)
94 points by jbrodkin on April 6, 2011 | 44 comments



Curious, since I hadn't heard the expression before, I went searching and discovered that "embarrassingly parallel" is indeed the accepted technical term: http://en.wikipedia.org/wiki/Embarrassingly_parallel
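
For anyone who wants to see the idea concretely, here's a minimal Python sketch (the score() function and its inputs are made up, not anyone's actual workload): each task is fully independent, so nothing ever travels between workers and throughput scales with the number of cores.

    # Embarrassingly parallel: independent tasks, no inter-worker communication.
    from multiprocessing import Pool

    def score(task_id):
        # Stand-in for one independent unit of work (e.g. scoring one compound).
        return sum(i * i for i in range(task_id % 1000))

    if __name__ == "__main__":
        with Pool() as pool:                         # one worker per core by default
            results = pool.map(score, range(10000))  # workers never talk to each other
        print(len(results), "independent results")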


The speed of the interconnect really matters for most supercomputer problems. Without knowing the characteristics of that interconnect I would be hesitant to call 10,000 machines at Amazon a supercomputer.


The article says the algorithm is "embarrassingly parallel," so in this case it's okay.

But you're right, if there are any interdependencies then the interconnect becomes important.


Ah, forgive me; I didn't see that.

I wonder how they'd count Folding@Home, then: 500K active clients, 6M total clients, but only a fraction of their clients are active at any given point in time.


I highly doubt the interconnect is anything better than a shared gigabit Ethernet connection, which makes me doubt it has any chance at all of reaching TOP500 levels. I also doubt you could get enough of the HPC instances (where you are more or less promised a dedicated network if you request enough of them) to reach 5k cores.


This makes me wonder how many idle computing resources Amazon has on standby. Are there any public numbers on this?


The thing that always bothered me about cloud computing is where the cloud gets its resources when everybody has a spike, such as during holidays.


Not everyone gets spikes during holidays--for example Amazon and other retailers get super busy at Christmas, but it's the slowest time of the year for a lot of web apps.


That said, the last two Decembers we have had trouble launching new instances in us-east-1a (their default and hence most popular availability zone). We solved the problem just by switching some tasks to us-east-1b.


"In order to prevent an overloading of a single availability zone when everybody tries to run their instances in us-east-1a, Amazon has added a layer of indirection so that each account’s availability zones can map to different physical data center equivalents."

http://alestic.com/2009/07/ec2-availability-zones


Or even worse, when some emergent AI suddenly wants all the cycles, all the time.

(Will the first sign of a runaway AI be skyrocketing AWS spot prices?)


This is the beauty of aggregation: when you add many different types of users to the system, the load tends to balance out because not everybody needs the same resource at exactly the same time. As long as the distribution of resource requirements is relatively even, the larger the system is, the more stable it becomes.

Statistically, the standard deviation of a sum of independent loads is smaller than the sum of their individual standard deviations.
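
A rough numeric sketch of that point (the customer count and load numbers are invented): for independent loads, the spread of the aggregate grows like the square root of the sum of the variances, which is far smaller than the sum of the individual standard deviations.

    import math, random

    random.seed(0)
    n, mu, sigma = 100, 10.0, 5.0   # 100 hypothetical independent customers
    trials = [sum(random.gauss(mu, sigma) for _ in range(n)) for _ in range(20000)]

    mean_total = sum(trials) / len(trials)
    std_total = math.sqrt(sum((x - mean_total) ** 2 for x in trials) / len(trials))

    print("sum of individual std devs:", n * sigma)        # 500.0
    print("std dev of aggregate load:", round(std_total))  # ~50 (= sqrt(100) * 5)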


No, that's the theoretical beauty of aggregation. If Black Friday causes a 10x spike in transactions, a larger system won't help you.


These kinds of edge cases happen much more frequently when you have many smaller systems; a larger system helps by balancing them out. Will a larger system run out of resources if everyone requests capacity at the same time? Of course it will, but the likelihood is statistically smaller than with smaller systems.


> The thing that always bothered me about cloud computing is where the cloud gets its resources when everybody has a spike, such as during holidays.

When one side of the world is spiking, the other side is sleeping soundly.


Occasionally, you just have to wait a while for an instance to start. It's rarely more than half an hour.


I recall EC2 going to complete crap during the holidays -- it is (or at least was) their spare Christmas capacity.



It seems to happen to Reddit every day.


Amazon gave a tech talk at my school.

EC2 sprang from the problem that Amazon had to buy a bunch of servers to handle the load around the holidays and these servers went underutilized during the rest of the year. So they decided to lease those resources.

When asked about what happens to EC2 during the holidays, the engineer basically replied that Amazon has priority.


According to Werner Vogels' (Amazon's CTO) Quora answer, the excess capacity story is a myth. http://www.quora.com/How-and-why-did-Amazon-get-into-the-clo...


Well then, that's odd. Thanks for clearing that up.


Thanks to the spot market there should be little or no idle capacity.


My guess is it's more like waves of computation, with masses of people targeting the cheap spot prices.


The bottleneck in HPC is less often pure CPU horsepower, though; it is more often cache or memory bandwidth, or the interconnect.

I guess you might be able to build a system in the cloud to provide TOP500 level of performance, but it would be pretty hard even with the fancy EC2 HPC instances (http://aws.amazon.com/ec2/hpc-applications/).


Thanks for pointing out the HPC instances that Amazon has. A few commenters were saying that it's not really a supercomputer without a fast interconnect. Yes, they have that! You just pay more for those instances.

In my experience Amazon did a pretty good job setting things up. It's fun to play around with the HPC instances; you can get some sweet performance.


Better than "might"... you can: http://www.top500.org/system/10661


Amazon can. That page has no information about how the nodes were allocated. They could have hand-picked racks of nodes that were all connected to the same switch, etc. You don't get that guarantee as an AWS customer.


Fair enough, I guess.... they could have done many things.

Although they don't provide an answer, here are some links with more info - I spent some time searching for details on the Top500 setup, but found little:

* http://aws.typepad.com/aws/2010/07/the-new-amazon-ec2-instan...
* http://news.ycombinator.com/item?id=1904590


> its calculations were "embarrassingly parallel," with no communication between nodes

That's probably the only type of process that would work in the cloud. Most HPC applications require lots of communication between nodes, so I don't think I would call this a proper supercomputer.


I agree. At large scales the speed of light becomes a limiting factor.


I must say these are my favorite types of articles on HN. I also think these are the perfect use cases for cloud computing platforms such as AWS. Not sure why massively parallel and "embarrassingly parallel" computing intrigues me.


The best part of it? It doesn't cost millions of dollars to use! Only thousands!


Am I missing something? If the performance scales linearly, they have 1,000 computers internally (1/10 * 10,000), and the run was said to take 8 hours. That would only be 80 hours if they hadn't used this service.

This makes me believe someone is lying about something in this article.
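
To spell out the arithmetic (taking the 1,000-core internal figure above at face value):

    internal_cores = 1000    # assumed in-house capacity, 1/10 of the cloud run
    cloud_cores = 10000
    cloud_hours = 8

    # With linear scaling, the same job in-house would take:
    print(cloud_hours * cloud_cores / internal_cores, "hours")   # 80.0 hours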


Perhaps their internal capacity is already tied up in other tasks, so while they have 1000 cores internally, they can't all be monopolized for 80 hours for a single task like the AWS machines can.


Who cares about P=NP when you can do that for $8,000?


You are misunderstanding. P=NP is a theoretical, in some ways even a philosophical, question about the limits of logic and knowledge. Many problems that are NP-complete have algorithms that find almost-perfect answers in polynomial time. P=NP is about knowing whether you can find perfect answers.

Then, there are many problems out there which are not NP-complete for which we are nowhere near finding fast, accurate solutions. The problem is not that logic prevents us, but that we're simply not clever enough yet.

What I'm trying to say in a roundabout way is that spinning up many cores will not help you find a perfect, fast solution to an NP-complete problem. And having 10,000 cores is not, by itself, an indicator of how hard a given problem is, regardless of its complexity class.


Because the best known algorithms for NP-complete problems have exponential time growth.

Example: a brute-force attack on an encryption algorithm that uses a 256-bit key would require trying out all 2^256 possible keys, which right now would take far longer than the age of the universe to complete.

AND, most importantly, dividing that number by 10,000 (the number of computers in the article), or heck, let's be generous and say we have 1,000,000,000 computers ... would be absolutely meaningless.

It's simple really -- 2^256 / 1 billion computers =~ 2^226 -- and computing it still takes far longer than the age of our universe.

And let's say that with technological advances you can have 70,000,000,000 computers (that's 70 billion computers, or a 700,000,000% increase over the number in our article). Never mind the energy required to power them, the storage capacity needed, or other such nonsense. So instead of 2^226, you now have 2^220 keys to go through per machine -- an absolutely meaningless decrease, and it still takes far longer than the age of our universe.

As a fun exercise, try figuring out how many computers would be required to bring that number down to ~ 2^200 -- that would still take far longer than the age of our universe to compute ;)
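
If you want to check those numbers yourself, here's the back-of-the-envelope version (the keys-per-second rate per machine is an arbitrary assumption):

    # Back-of-the-envelope check of the 2^256 brute-force numbers above.
    KEYSPACE = 2 ** 256
    machines = 10 ** 9                  # the generous 1-billion-computer case
    keys_per_sec_per_machine = 10 ** 9  # arbitrary assumed rate

    seconds = KEYSPACE / (machines * keys_per_sec_per_machine)
    years = seconds / (3600 * 24 * 365)
    age_of_universe_years = 1.38e10

    print("years needed:", f"{years:.2e}")                                    # ~3.7e+51
    print("multiples of the universe's age:", f"{years / age_of_universe_years:.2e}")  # ~2.7e+41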


Almost everyone. 2^n/1000 is still slower than n^2 for every n >= 19, and the gap explodes from there.
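
Quick brute-force check of that crossover (nothing clever, just trying each n):

    # Smallest n where 2^n / 1000 exceeds n^2.
    n = 1
    while 2 ** n / 1000 <= n ** 2:
        n += 1
    print(n)   # 19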


Thanks, guys, for telling me about P/NP.

Time for me to teach you something: http://en.wikipedia.org/wiki/Humour


Humour is pretty much seen as noise on Hacker News. If a joke also has some insight about the article, it will get upvotes, but if it's just a joke it gets downvoted.


Seems like those guys knew nothing about HPC. Why didn't they run the LINPACK test? It's essential for measuring any parallel computing system, even one with only two cores. Also, any first-year CS student knows that the most significant part of an HPC system is not the cores but the network. You need to connect hosts using InfiniBand or the like. Using regular Ethernet is futile because of the high latency; you will waste 90% of your CPU cycles in data-exchange/synchronization wait loops. I bet they could achieve way better results on just 1/3 the number of cores, or even fewer.


LINPACK is about as useful a benchmark as BogoMips.

""". Genentech benefited from the high number of cores because its calculations were "embarrassingly parallel," with no communication between nodes, so performance stats "scaled linearly with the number of cores," Corn said."""

That is a direct quote from the article.


Not that I think it entirely bursts your internet tough guy rant, but Amazon does offer cluster computing instances (http://aws.amazon.com/ec2/hpc-applications/) for exactly this purpose. Granted, they only have 10 Gigabit Ethernet, but it's not exactly like this is some failure of a cluster running all over a busy datacenter on 10 Mbit Ethernet.



