[Beowulf] PetaBytes on a budget, take 2

Greg Lindahl lindahl at pbm.com
Thu Jul 21 15:07:42 PDT 2011

On Thu, Jul 21, 2011 at 02:55:30PM -0400, Ellis H. Wilson III wrote:

> My personal experience with getting large amounts of data from local
> storage to HDFS has been suboptimal compared to something more raw,

If you're writing 3 copies of everything on 3 different nodes, then
sure, it's a lot slower than writing 1 copy. The benefit you get from
this extra up-front expense is resilience: the data is still there
after you lose a node or a disk.
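
To put rough numbers on the 3-copies-vs-1 point, here is a
back-of-envelope sketch. It is a toy model, not a measurement: the
function name and its parameters are made up for illustration, and it
assumes that a writer which is itself a storage node keeps the first
copy locally and ships the remaining copies over the network.

```python
def replicated_write_cost(data_bytes, replication=3, writer_is_datanode=True):
    """Rough cost of one write; ignores pipelining, caching and compression."""
    # Every replica has to land on some disk.
    disk_bytes = data_bytes * replication
    # Assumption: a writer that is also a storage node keeps copy 1 locally,
    # so only the remaining copies cross the network.
    network_copies = replication - 1 if writer_is_datanode else replication
    network_bytes = data_bytes * network_copies
    return disk_bytes, network_bytes


one_tb = 10**12
for r in (1, 3):
    disk, net = replicated_write_cost(one_tb, replication=r)
    print(f"replication={r}: {disk / one_tb:.0f} TB to disk, "
          f"{net / one_tb:.0f} TB over the network")
```

Under those assumptions, loading 1 TB with 3 copies means about 3 TB
of disk writes and about 2 TB of network traffic, against 1 TB and
none for a single local copy. That is the up-front expense.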

-- greg
