[Beowulf] slow jobs when run through queue
Nick Evans
nick.c.evans at gmail.com
Tue Dec 19 21:37:36 PST 2017
I completely agree. We have a web page where people can see
- where their jobs are running
- what sort of resources were requested
- the peak resources actually used
- wall time remaining (orange highlighted at 20% remaining and red at
10% remaining)
On 20 December 2017 at 03:41, Peter Clapham <pc7 at sanger.ac.uk> wrote:
> Show back of utilization and use patterns openly also removes admins from
> being “the Police”.
>
> Instead each user of the system can see who is requesting excessive
> memory, using inappropriate queues or just inefficient workloads at scale.
> This creates a self-Policing environment and certainly both re-enforces a
> community feel and improves communication between the groups of users.
> Pete
>
> On 12/6/17, 6:36 PM, "Beowulf on behalf of Tim Cutts" <
> beowulf-bounces at beowulf.org on behalf of tjrc at sanger.ac.uk> wrote:
>
> Of course, if you charge for your cluster time, that hurts them in the
> wallet, since they pay for all the allocated unused time. If you don’t
> charge (which is the case for us) it’s hard to incentivise them not to do
> this. Shame works, a bit. We publish cluster analytics showing CPU
> efficiency and memory efficiency league tables for the users, and that has
> had some good effects in the past...
>
> Tim
>
> > On 6 Dec 2017, at 18:20, David Mathog <mathog at caltech.edu> wrote:
> >
> > on Wed, 06 Dec 2017 21:39:20 +1100 Chris Samuel wrote:
> >> If this is, as I suspect is likely, bioinformatics code it could
> well be that
> >> it is a pipeline type application and only part of the application
> may be able
> >> to make use of parallelism (and then might not be very good at it).
> >
> > Exactly. Super frustrating to set something like '--cpus=40' and
> then watch the resulting heap of programs sit for long periods of time
> (hours, not seconds) running only on a single CPU.
> >
> > Regards,
> >
> > David Mathog
> > mathog at caltech.edu
> > Manager, Sequence Analysis Facility, Biology Division, Caltech
> > _______________________________________________
> > Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin
> Computing
> > To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
>
>
>
> --
> The Wellcome Trust Sanger Institute is operated by Genome Research
> Limited, a charity registered in England with number 1021457 and a
> company registered in England with number 2742969, whose registered
> office is 215 Euston Road, London, NW1 2BE.
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin
> Computing
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
>
>
>
>
> --
> The Wellcome Trust Sanger Institute is operated by Genome Research
> Limited, a charity registered in England with number 1021457 and a
> company registered in England with number 2742969, whose registered
> office is 215 Euston Road, London, NW1 2BE.
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20171220/7a834768/attachment.html>
More information about the Beowulf
mailing list