[Beowulf] cluster building advice?
Christopher Samuel
samuel at unimelb.edu.au
Wed Sep 26 17:50:47 PDT 2012
On 27/09/12 07:59, Vincent Diepeveen wrote:
> Linus posted something like: "there is no excuse to not generate
> CMOV instructions now that also intel core2 can execute them
> fast".
No, what he actually said was:
http://ondioline.org/mail/cmov-a-bad-idea-on-out-of-order-cpus
# - on a P4, branch mispredictions are expensive, but so is cmov,
# so all the above is to some degree exaggerated. On nicer
# microarchitectures (the Intel Core 2 in particular is something
# I have to say is very nice indeed), the difference will be a lot
# less noticeable. The loss from cmov isn't very big (it's not as
# sucky as P4), but neither is the win (branch misprediction isn't
# that expensive either).
[...]
[example showing CMOV version slower than non-CMOV on Core2 elided]
[...]
# So don't use cmov. Except for non-performance-critical code, or
# if you really care about code-size, and it helps (which is actually
# fairly rare: quite often cmov isn't even smaller than a conditional
# jump and a regular move, partly because a regular move can take
# arguments that a cmov cannot: move to memory, move from an immediate
# etc etc, so depending on what you're moving, cmov simply isn't good
# even if it's _just_ a move).
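To make the trade-off in the quoted text concrete, here is a rough sketch in C of the two code shapes being compared. It is not the elided example from the linked post. The function names select_branch and select_branchless are made up for illustration, and the mask trick assumes two's complement integers. Whether a compiler turns either function into a conditional jump or a cmov depends on the compiler, flags and target, so treat this as an illustration of control dependency versus data dependency rather than a way to force one instruction or the other.

```c
/* Branchy form: the result is chosen by control flow. A compiler may
 * emit a compare, a conditional jump and a regular move here, which is
 * cheap when the branch predictor guesses right. */
long select_branch(int cond, long a, long b)
{
    if (cond)
        return a;
    return b;
}

/* Data-dependent form: no jump is needed, but the result always waits
 * on cond, a and b. This is the same kind of dependency a cmov creates.
 * Assumes two's complement, so -(long)1 is all ones. */
long select_branchless(int cond, long a, long b)
{
    long mask = -(long)(cond != 0);  /* all ones if cond is nonzero, else zero */
    return (a & mask) | (b & ~mask);
}
```

Both functions return a when cond is nonzero and b otherwise; the only difference is how the choice is expressed. Timing the two in a loop with predictable and unpredictable values of cond is one way to see the effect on a given CPU.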
cheers,
Chris
--
Christopher Samuel Senior Systems Administrator
VLSCI - Victorian Life Sciences Computation Initiative
Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
http://www.vlsci.org.au/ http://twitter.com/vlsci