<html>

<head>

<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">

<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>

</head>

<body dir="ltr">

<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0); background-color: rgb(255, 255, 255);">

Would love to help you guys out in anyway i can in terms of hardware processing. <br>

</div>

<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0); background-color: rgb(255, 255, 255);">

<br>

</div>

<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0); background-color: rgb(255, 255, 255);">

Have you guys thought of doing something like SETI@home and those projects to get idle compute power to help churn through the massive amounts of data?</div>

<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0); background-color: rgb(255, 255, 255);">

<br>

</div>

<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0); background-color: rgb(255, 255, 255);">

Regards,<br>

Jonathan<br>

</div>

<div id="appendonsend"></div>

<hr style="display:inline-block;width:98%" tabindex="-1">

<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" style="font-size:11pt" color="#000000"><b>From:</b> Tim Cutts <tjrc@sanger.ac.uk><br>

<b>Sent:</b> 04 February 2021 11:26<br>

<b>To:</b> Jonathan Aquilina <jaquilina@eagleeyet.net><br>

<b>Cc:</b> Beowulf <beowulf@beowulf.org><br>

<b>Subject:</b> Re: [Beowulf] Project Heron at the Sanger Institute [EXT]</font>

<div> </div>

</div>

<div class="" style="word-wrap:break-word; line-break:after-white-space"><br class="">

<div><br class="">

<blockquote type="cite" class="">

<div class="">On 4 Feb 2021, at 10:14, Jonathan Aquilina via Beowulf <<a href="mailto:beowulf@beowulf.org" class="">beowulf@beowulf.org</a>> wrote:</div>

<br class="x_Apple-interchange-newline">

<div class="">

<div class="" style="font-style:normal; font-variant-caps:normal; font-weight:normal; letter-spacing:normal; text-align:start; text-indent:0px; text-transform:none; white-space:normal; word-spacing:0px; text-decoration:none; font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; background-color:rgb(255,255,255)">

I am curious though to chunk out such large data is something like hadoop/HBase and the like of those platforms, are those whats being used?</div>

<br class="x_Apple-interchange-newline">

</div>

</blockquote>

</div>

<br class="">

<div class="">It’s a combination of our home-grown sequencing pipeline which we use across the board, and then a specific COG-UK analysis of the genomes themselves.  This pipeline is common to all consortium members who are contributing sequence data.  It’s

 a Nextflow pipeline, and the code is here:</div>

<div class=""><br class="">

</div>

<div class=""><a href="https://github.com/connor-lab/ncov2019-artic-nf" class="">https://github.com/connor-lab/ncov2019-artic-nf</a></div>

<div class=""><br class="">

</div>

<div class="">Being nextflow, you can run it on anything for which nextflow has a backend scheduler.   It supports data from both Illumina and Oxford Nanopore sequencers.</div>

<div class=""><br class="">

</div>

<div class="">Tim</div>

-- The Wellcome Sanger Institute is operated by Genome Research Limited, a charity registered in England with number 1021457 and a company registered in England with number 2742969, whose registered office is 215 Euston Road, London, NW1 2BE.

</div>

</body>

</html>