[Beowulf] Jupyter and EP HPC
Lux, Jim (337K)
james.p.lux at jpl.nasa.gov
Fri Jul 27 13:56:49 PDT 2018
-----Original Message-----
From: Beowulf [mailto:beowulf-bounces at beowulf.org] On Behalf Of Joe Landman
Sent: Friday, July 27, 2018 11:54 AM
To: beowulf at beowulf.org
Subject: Re: [Beowulf] Jupyter and EP HPC
On 07/27/2018 02:47 PM, Lux, Jim (337K) wrote:
>
> I’ve just started using Jupyter to organize my Pythonic ramblings..
>
> What would be kind of cool is to have a high level way to do some
> embarrassingly parallel python stuff, and I’m sure it’s been done, but
> my google skills appear to be lacking (for all I know there’s someone
> at JPL who is doing this, among the 6000 people doing stuff here).
>
> What I’m thinking is this:
>
> I have a high level python script that iterates through a set of data
> values for some model parameter, and farms out running the model to
> nodes on a cluster, but then gathers the results back.
>
> So, I’d have N copies of the python model script on the nodes.
>
> Almost like a pythonic version of pdsh.
>
> Yeah, I’m sure I could use lots of subprocess() and execute() stuff
> (heck, I could shell pdsh), but like with all things python, someone
> has probably already done it before and has all the nice hooks into
> the Ipython kernel.
>
I didn't do this with ipython or python ... but this was effectively the way I parallelized NCBI BLAST in 1998-1999 or so. Wrote a perl script to parse args, construct jobs, move data, submit/manage jobs, recover results, reassemble output. SGI turned that into a product.
-- yes.. but I was hoping someone had done that for Jupyter..
>>> for parametervalue in parametervaluelist:
.... result = simulation(parametervalue)
Results.append(result)
More information about the Beowulf
mailing list