[Beowulf] Nobody ever got fired for using Hadoop on a cluster

Joe Landman landman at scalableinformatics.com
Wed Apr 24 13:04:35 PDT 2013


On 04/24/2013 04:00 PM, Adam DeConinck wrote:

[...]

> "However, evidence suggests that the majority of analytics jobs do not
> process huge data sets. For example, as we will discuss in more detail
> later, at least two analytics production clusters (at Microsoft and
> Yahoo) have median job input sizes under 14 GB, and 90%
> of jobs on a Facebook cluster have input sizes under 100 GB."

It's very helpful to have an ontological mapping in place so you know
what "big" and "fast" mean to the end user in their context, because
it's often the case that what they mean by "big and fast" is different
from what you mean by that.

-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics, Inc.
email: landman at scalableinformatics.com
web  : http://scalableinformatics.com
        http://scalableinformatics.com/siflash
phone: +1 734 786 8423 x121
fax  : +1 866 888 3112
cell : +1 734 612 4615


