<HTML><BODY style="word-wrap: break-word; -khtml-nbsp-mode: space; -khtml-line-break: after-white-space; "><DIV><BR class="khtml-block-placeholder"></DIV><DIV>Hi folks,</DIV><DIV><BR class="khtml-block-placeholder"></DIV><DIV>During an HPC talk some years ago, I recall someone mentioned a tool which can copy large datasets across a cluster using a ring topology.  Perhaps someone here knows of this tool?</DIV><DIV><BR class="khtml-block-placeholder"></DIV><DIV>More to the point, we are pushing around datasets that are about 1Gbyte.  The datasets are pushed out to dozens of nodes all at once and we foresee saturating the I/O system on our cluster as we grow.  We are limited to using just the available disks and are looking for a reasonable solution that can support this kind of simultaneous access.   Currently we push the data out using rsync, but if I don't get any better ideas I may simply move to a pull system where the data is fetched by HTTP.  I can get better throttling that way, at least.</DIV><DIV><BR class="khtml-block-placeholder"></DIV><DIV>-geoff</DIV><DIV><BR class="khtml-block-placeholder"></DIV><BR><DIV> <SPAN class="Apple-style-span" style="border-collapse: separate; border-spacing: 0px 0px; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; text-align: auto; -khtml-text-decorations-in-effect: none; text-indent: 0px; -apple-text-size-adjust: auto; text-transform: none; orphans: 2; white-space: normal; widows: 2; word-spacing: 0px; "><DIV>Geoff Galitz</DIV><DIV><A href="mailto:geoff@galitz.org">geoff@galitz.org</A></DIV><DIV><BR class="khtml-block-placeholder"></DIV><BR class="Apple-interchange-newline"></SPAN> </DIV><BR></BODY></HTML>