Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] RE: [Bioclusters] FPGAin bioinformatics clusters (again?)

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

bella at carolina.rr.com bella at carolina.rr.com
Mon Jan 16 17:36:56 PST 2006


Mike Davis wrote:

> But BLAST is only a small part and argueably the easiest part of 
> genomics work. The advantages of parallelization and/or smp come into 
> play when attempting to assemble the genome. Phred/Phrap can do the 
> work but starts to slow even large machines when your talking 50k+ of 
> sequences (which it wants to be in one folder). A quiz for  the Unix 
> geeks out there, what happens when a folder has 50,000 files in it. 
> Can you say SLOOOOOOOOOWWWW?
>
> Mike Davis
>
Sorry... I just couldn't let this one go by.  And no offense meant to 
anyone but...

Many times I have found users and application folks making inordinately 
and (in my opinion) unacceptably large numbers of files in 
sub-directories on one of "my" UNIX or Linux boxes. 

I simply gently take them aside and have a little "prayer meetin'" with 
them.  There is always a way to fix this kind of problem by consulting 
with the applications folks, and helping them see a better way.  That's 
why God made "mkdir (2)".

In my opinion, if this "Phred/Phrap" thingy (about which I KNOW NOTHING 
- all disclaimers apply) _absolutely_  requires one to place 50,000 (or 
more) files in a single sub-directory... and therefore is slow... the 
application is simply broken.  Contact the developers, or get the 
source... and we'll go fix it.

My 1 & 1/2 cents worth.

Arthur Bell
Senior UNIX/Linux System Administrator




More information about the Beowulf mailing list