Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] VMC - Virtual Machine Console

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Peter Skomoroch peter.skomoroch at gmail.com
Fri Mar 7 10:30:11 PST 2008


My blog is here:

http://www.datawrangling.com/

I had a post last year describing launching a cluster of the small 32 bit
instances:

http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3.html

Since then, Amazon upgraded to 64 bit "extra-large" instances with full
gigabit ethernet which might make this feasible.  Essentially you are
getting a full physical box, where with the small instances you would be
sharing network, disk, etc.  I built a set of new Fedora images and config
scripts for the extra-large instances which include
NFS,mpich,lam,openmpi,ganglia etc.  I'd like to use a standard cluster
distribution, but it would take some hacking to get the networking to work
properly within EC2.  Amazon uses a custom firewall setup where
autodiscovery won't work, also multicast is not supported and subnets are
randomly assigned.



On Fri, Mar 7, 2008 at 1:18 PM, Peter St. John <peter.st.john at gmail.com>
wrote:

> Peter Sk,
> Where is the blog you mentioned (where you'll be posting followups)?
> Thanks,
> Peter St
>
> On Fri, Mar 7, 2008 at 12:49 PM, Peter Skomoroch <
> peter.skomoroch at gmail.com> wrote:
>
> > I'm running bonnie++ on a xlarge instance right now with 30 GB files on
> > /mnt.  I'll post the results when it finishes.  I also have Ganglia set up
> > on the node, so you can check that out until I shut the instance down:
> >
> > http://ec2-72-44-53-20.compute-1.amazonaws.com/ganglia
> >
> >
> > On Fri, Mar 7, 2008 at 12:05 PM, Peter Skomoroch <
> > peter.skomoroch at gmail.com> wrote:
> >
> > > Joe, thanks for the feedback.  The bonnie results were not actually
> > > mine, I was just pointing to some numbers run by Paul Moen.
> > >
> > > Your 1GB file data is likely more representative, but with 15 GB ram,
> > > > you need to be testing 30-60 GB files.
> > > >
> > >
> > > I'll try to tweak the BPS bonnie tests to run some large files...
> > >
> > >
> > >
> > > On Fri, Mar 7, 2008 at 11:57 AM, Joe Landman <
> > > landman at scalableinformatics.com> wrote:
> > >
> > > > Peter Skomoroch wrote:
> > > >
> > > > > Extra Large Instance:
> > > > >
> > > > >       15 GB memory
> > > > >       8 EC2 Compute Units (4 virtual cores with 2 EC2 Compute
> > > > Units each)
> > > > >       1,690 GB instance storage (4 x 420 GB plus 10 GB root
> > > > partition)
> > > > >       64-bit platform
> > > > >       I/O Performance: High
> > > >
> > > > Note:  minor criticism, but overall, nice results.
> > > >
> > > > Looking over your bonnie results is worth a quick comment.  Any time
> > > > you
> > > > have bonnie or IOzone (or other IO benchmarks) which are testing
> > > > file
> > > > sizes less than ram size, you are not actually measuring disk IO.
> > > >  This
> > > > is cache speed pure and simple.  Either page/buffer cache, or RAID
> > > > cache, or whatever.
> > > >
> > > > We have had people tell us to our face that their 2GB file results
> > > > (on a
> > > > 16 GB RAM machine) were somehow indicative of real file performance,
> > > > when, if they walked over to the units they were testing, they would
> > > > have noticed the HD lights simply not blinking ...  Yeah, an amusing
> > > > beer story (the longer version of it), but a problem none-the-less.
> > > >
> > > > Your 1GB file data is likely more representative, but with 15 GB
> > > > ram,
> > > > you need to be testing 30-60 GB files.
> > > >
> > > > Not trying to be a marketing guy here or anything like that ... we
> > > > test
> > > > our JackRabbit units with 80GB to 1.3TB sized files.  We see
> > > > (sustained)
> > > > 750 MB/s - 1.3 GB/s in these tests.  We also note some serious
> > > > issues
> > > > with the linux buffer cache and multiple RAID controllers (buffer
> > > > cache
> > > > appears to serialize access).  We do this as we actually want to
> > > > measure
> > > > disk performance, and not buffer cache performance.
> > > >
> > > > That criticism aside, nice results.  It shows what a "cloud" can do.
> > > >
> > > > >       Price: $0.80 per instance hour
> > > >
> > > >
> > > > --
> > > > Joseph Landman, Ph.D
> > > > Founder and CEO
> > > > Scalable Informatics LLC,
> > > > email: landman at scalableinformatics.com
> > > > web  : http://www.scalableinformatics.com
> > > >        http://jackrabbit.scalableinformatics.com
> > > > phone: +1 734 786 8423
> > > > fax  : +1 866 888 3112
> > > > cell : +1 734 612 4615
> > > >
> > >
> > >
> > >
> > >  --
> > > Peter N. Skomoroch
> > > peter.skomoroch at gmail.com
> > > http://www.datawrangling.com
> > >
> >
> >
> >
> > --
> > Peter N. Skomoroch
> > peter.skomoroch at gmail.com
> > http://www.datawrangling.com
> >
> > _______________________________________________
> > Beowulf mailing list, Beowulf at beowulf.org
> > To change your subscription (digest mode or unsubscribe) visit
> > http://www.beowulf.org/mailman/listinfo/beowulf
> >
> >
>


-- 
Peter N. Skomoroch
peter.skomoroch at gmail.com
http://www.datawrangling.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.scyld.com/pipermail/beowulf/attachments/20080307/383d5e5b/attachment.html


More information about the Beowulf mailing list