Paul Hinker's Weblog

Tuesday Jan 17, 2006

Dual-Core AMD performance of Perflib

I recently got some quality time on a Galaxy2 machine with AMD Opteron 285 CPUs. The machine has two CPUs, each with two cores, is clocked at 2.6 GHz, and has 16 GB of RAM. I ran the Linpack benchmark using the Sun Studio 11 compiler. The Sun Performance Library (Perflib) comes packaged with the compiler collection and was used when building the Linpack executable. The results are as follows:

Size     Sun Studio 11
1000     3343.33
2000     3815.24
3000     3942.67
4000     3983.08
5000     4024.29
6000     4029.99
7000     4060.43
8000     4065.5
9000     4052.7
10000    4027.53
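
For readers who have not worked with Linpack figures before, here is a rough sketch of how a rate like the ones in the table is usually derived from a problem size and a wall-clock time. It assumes the classic Linpack operation count of 2/3 n^3 + 2 n^2 floating-point operations and assumes the figures above are in MFLOPS; the post states neither, and the function name is purely illustrative.

```python
def linpack_mflops(n, seconds):
    """Estimate the MFLOPS rate for solving an n x n dense linear system.

    Assumption: uses the classic Linpack operation count
    (2/3 * n^3 + 2 * n^2), which may differ from the exact count
    used by the benchmark build discussed in this entry.
    """
    if seconds <= 0:
        raise ValueError("elapsed time must be positive")
    flops = (2.0 / 3.0) * n ** 3 + 2.0 * n ** 2
    return flops / seconds / 1.0e6


# Hypothetical timing: a size-1000 solve that took 0.2 seconds.
rate = linpack_mflops(1000, 0.2)
print(f"{rate:.2f} MFLOPS")
```

Because the operation count grows with the cube of the problem size, larger sizes spend proportionally more time in the matrix-multiply-heavy part of the factorization, which is one common reason rates climb and then level off as the size increases.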

These numbers are not great, and after using the collect/analyzer tools (which incidentally also come packaged with the Sun Studio Compiler Collection), Brad Lewis was able to make some changes to the algorithm to increase performance.

Size     Sun Studio 11    Sun Studio 11u1
1000     3343.33          3714.81
2000     3815.24          4205.77
3000     3942.67          4320.86
4000     3983.08          4379.35
5000     4024.29          4423.52
6000     4029.99          4423.46
7000     4060.43          4467.19
8000     4065.5           4468.22
9000     4052.7           4463.48
10000    4027.53          4461.84

This shows a performance improvement of roughly 10% across all sizes, and the changes will be incorporated into an upcoming performance patch to the Sun Studio 11 compiler collection version of the Performance Library.
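
To check the improvement figure against the table, the per-size gain can be computed directly from the two columns. The short sketch below does just that; the numbers are copied from the table above, while the names `results`, `percent_gain`, and `gains` are only illustrative.

```python
# size -> (Sun Studio 11, Sun Studio 11u1), copied from the table above
results = {
    1000: (3343.33, 3714.81),
    2000: (3815.24, 4205.77),
    3000: (3942.67, 4320.86),
    4000: (3983.08, 4379.35),
    5000: (4024.29, 4423.52),
    6000: (4029.99, 4423.46),
    7000: (4060.43, 4467.19),
    8000: (4065.5, 4468.22),
    9000: (4052.7, 4463.48),
    10000: (4027.53, 4461.84),
}


def percent_gain(before, after):
    """Relative improvement of `after` over `before`, in percent."""
    return (after - before) / before * 100.0


gains = {size: percent_gain(b, a) for size, (b, a) in results.items()}

for size, gain in gains.items():
    print(f"{size:>6}  {gain:5.1f}%")
```

By this calculation the gain stays within a narrow band, from about 9.6% at size 3000 to about 11.1% at size 1000, so the improvement is fairly uniform rather than concentrated at small or large problem sizes.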

These are obviously scalar performance numbers, which are interesting but do not address some of the questions one might have about how things scale on a machine with multi-core processors. In upcoming entries, I'll present some of the multi-core numbers obtained on this machine and on a 4-CPU, dual-core AMD box.
