Wednesday Jun 13, 2007
Sun Blade X6250 Delivers a pair of x86 SPEC CPU2006 integer performance World Records:
Sun Blade X6250 (Dual-Core Intel Xeon 5160)
and running Solaris 10 and using Sun Studio 12 compiler delivered the
best x86 result for the SPECint2006 benchmark.
Sun Blade X6250 (Dual-Core Intel Xeon 5160) using Solaris 10 and
Studio 12, delivered x86 4-core world record on
SPECint_rate2006.
Sun Blade X6250 server had a SPECint2006 result of 21.0 and SPECint_rate2006 result of 65.0. The advanced features of freely available
Sun Studio 12 complier were critical for getting this level of
performance on the Sun Blade 6250.
The Sun Blade X6250 is only 3% slower than the peak score of the very-expensive
new IBM POWER6 p570, which was recently announced. SPECint2006 is a single
job stream. So let's now turn to comparing 4 thread results, in this case
the Sun Blade X6250 is 7% faster than the peak SPECint_rate2006 score of
he very-expensive new IBM POWER6 p570 (both IBM and Sun at 4 threads). Oh, and remember that anymore clock
rate is not how you compare systems the Sun Blade X6250 is at 3GHz and the
IBM POWER6 is at 4.7GHz. CPU frequency is basically irrelevant, it is CPU and system architecture that matters!
SPEC CPU2006 Landscape - bigger is better, selected recent results
SPECint2006
| System |
Processors |
Performance Results |
| Type |
GHz |
Chips |
Cores |
Peak |
Base |
| IBM p570 (power6) |
Power6 |
4.7 |
1 |
1 |
21.6 |
17.8 |
| Sun Blade X6250 |
Intel Xeon 5160 |
3.0 |
2 |
4 |
21.0 |
|
| Supermicro X7DB8+ board |
Intel Xeon 5160 |
3.0 |
2 |
4 |
20.8 |
18.9 |
| Sun Ultra 40 M2 |
AMD Opteron 2222SE |
3.0 |
2 |
4 |
16.1 |
|
SPECint_rate2006
| System |
Processors |
Performance Results |
| Type |
GHz |
Chips |
Cores |
Threads / Copies |
Peak |
Base |
| Sun Blade X6250 |
Intel Xeon 5160 |
3.0 |
2 |
4 |
4 |
65.0 |
|
| Supermicro X7DB8+ |
Intel Xeon 5160 |
3.0 |
2 |
4 |
4 |
64.9 |
60.0 |
| IBM p570 (Power6) |
Power6 |
4.7 |
1 |
2 |
4 |
60.9 |
53.2 |
| Sun Ultra 40 M2 |
AMD Opteron 2222SE |
3.0 |
2 |
4 |
4 |
60.4 |
|
| Fujitsu BX620 S3 |
Xeon 5160 (Woodcrest) |
3.0 |
2 |
4 |
4 |
59.4 |
56.7 |
Results as of 06 Jun 2007 from www.spec.org.
Benchmark Description
SPEC CPU2006 is made up of two suites of benchmarks, CFP2006 and
CINT2006. CFP2006 targets floating-point performance, while CINT2006
targets integer performance.
Each suite has two different measures. First is the CPU measure, which
is the performance on the suite as a single stream. This can be either
a single thread or automatic compiled parallel run. This measure is
further defined by base and optimized runs. Base uses the same compiler
flags for all kernels, where optimized is allowed to use different
compiler flags for each kernel. Results are compared against a baseline
system run that was standardized by SPEC.
The second measure is Rate. It is a measure of how many CPU measures
can be run at a time. Typically, it is run as n processes on n
processors. It shows how well the same job mix can run on a system
under some load. It also is run as a base and optimized set of
results.
Disclosure Statement:
SPEC, SPECint, SPECfp reg tm of Standard Performance Evaluation Corporation.
Results from
www.spec.org or from IBM public websites as of 6/06/07.
Sun Blade X6250 (Intel Xeon 5160, 2chips/4cores, Solaris 10) 65.0 SPECint_rate2006;
Sun Blade X6250 (Intel Xeon 5160, 2chips/4cores, Solaris 10) 21.0 SPECint2006;
IBM System p 570 (POWER6, 1chip/1core, AIX 5L v5.3) 21.6 SPECint2006;
IBM System p 570 (POWER6, 4 theads, 1chip/2cores, AIX 5L v5.3) 60.9 SPECint_rate2006.
System Configuration
| Results |
| Reference Date: |
|
Jun 06, 2007 |
| System: |
|
Sun Blade X6250
SPEED: 16GB memory 8x2GB
RATE : 32GB memory 8x4GB |
|
X6250 |
|
21.0 SPECint2006 |
|
X6250 |
|
65.0 SPECint_rate2006 |
| Total Number Processors: |
|
2 x Intel Xeon 5160 |
| Software: |
|
Solaris 10 11/06, Sun Studio 12 Compiler, MicroQuill's SmartHeap Library v7.4 |
See Also
All Benchmark results on Sun Blade 6000 Blade Server
Tuesday Mar 20, 2007
World Record SPECompL2001: Solaris beats RedHat Linux and
The Sun Fire X4600 M2 delivers the best performance on the SPEC OMPL2001
benchmark suite of all x86 systems.
-
Solaris 10 and Studio 11 duo help X4600 M2 perform 8% better than Red Hat
Linux (RHEL4) and PathScale compiler on SPECompL2001.
-
The Sun Fire X4600 M2 server in 4-socket configuration using dual-core AMD
Opteron Model 8220 processors, produced best SPECompL2001 result of 111,893.
-
The Sun Fire X4600 M2 beats the HP DL585 G2 (AMD Opteron 8220 4chips/8cores) using RedHat Linux
by 9%
-
The results show that the combination of Solaris 10 using Sun
Studio 11 is unmatched by the competition for assisting users in writing parallel code.
SPECompL2001 (bigger is better, ordered by peak)
| Result |
Cores |
Chips |
Thrds |
System |
| Peak |
Base |
| 111,893 |
105,465 |
8 |
8 |
2 |
Sun X4600M2 Opteron 8220 S10/SS11 |
| 103,466 |
100,610 |
8 |
8 |
2 |
Sun X4600M2 Opteron 8220 RHEL4u4, PathScale v2.5 |
| 102,283 |
99,907 |
8 |
8 |
2 |
HP DL585 G2 Opteron 8220SE, RHEL4u4, PathScale v2.5 |
| 92,725 |
92,418 |
16 |
16 |
1 |
HP Superdome 1.5GHz Itanium 2 |
| 79,627 |
68,051 |
24 |
24 |
1 |
Sun Fire 6800 |
| 44,376 |
42,400 |
16 |
16 |
1 |
HP Superdome 875MHz PA-8700+ |
| 42,864 |
41,056 |
16 |
16 |
1 |
HP server rp8400 (875MHz PA-8700+) |
Benchmark Description
The SPEC OMPL2001 Benchmark Suite was released in June 2001 and
tests HPC performance using OpenMP for parallelism.
9 programs (2 in C and 7 in Fortran) parallelized using OpenMP API.
Goals of suite are: first, target to Large-range (8-128 processor)
parallel systems, 2nd have run rules, tools and reporting similar
to SPEC CPU2000 and 3rd to have programs representative of HPC and
Scientific Applications.
Results Summary
| Result |
|
X8420 8-threads: |
|
111893 SPECompL2001 |
| Reference Date: |
|
Mar 19, 2007 |
| System: |
|
Sun Fire X4600 M2 |
| Total Number Processors: |
|
4 |
| Total Memory : |
|
32 GB (16x4GB DIMMs), DDR667 |
| Processor/GHz of Server: |
|
Opteron 8220, 2.8 GHz |
| Operating System: |
|
Solaris 10 |
| Compiler: |
|
Sun Studio 11 |
Disclosure Statement:
SPEC, SPEComp reg tm of Standard Performance Evaluation Corporation.
Results from www.spec.org as of Mar 19, 2007, Sun result submitted to SPEC.
Sun Fire X4600 M2 (S10/SS11, Opteron 8220, 8 cores, 4 chips, 8 threads), 111893 SPECompL2001.
Sun Fire X4600 M2 (RHEL4u4, Opteron 8220, 8 cores, 4 chips, 8 threads), 103466 SPECompL2001.
HP DL585 G2 Opteron 8220SE (RHEL4u4, Opteron 8220, 8 cores, 4 chips, 8 threads), 103466 SPECompL2001. Sockets refers to chips.
See Also
SPEC OMP2001 Page
SPEC Home Page
sun.com X4600 M2 Benchmark Page
Tuesday Feb 20, 2007
SPEC.org has released a version of SPECjAppServer2004 called EAStress2004.
This is a multi-tier Java benchmark which measures the performance of Java Platform Enterprise Edition (Java EE) application servers.
EAStress2004 that relaxes run and reporting rules, enabling informal results to be shared in open-source projects.
"Results from the EAStress2004 workload cannot be used for marketing purposes, and comparisons to other SPECjAppServer2004 results are not permitted."
read more at:
http://www.3dprofessor.org/Press%20Releases.htm
...so you won't see any EAstress2004 benchmarks reported on this blog,
but I really think this is a very cool thing to do. Cheers!
Tuesday Feb 20, 2007
Is IBM 3.3x or 1.4x faster? - I guess it depends if you use a
over-optimised benchmark like TPC-C. As mentioned yesterday,
IBM doesn't publish on a variety of standard benchmarks like
SPECint_rate2006 or SPECjbb2005 on their high-end systems so we
have to look at the SPECint_rate2000 which is just about to be EOL'ed
and completely replaced by SPECint_rate2006.
First let's compare an IBM p5 595 (Power5+ 2.3GHz 64p, 128thread) to
a HP Integrity Superdome (Itanium2 1.6 GHz 64p, 64thread, single core/CPU)
on SPECint_rate2000.
Constructing a SPECint_rate2000 ratio
1.4x = 1513/1108
we find that the IBM 595 is 1.4x faster, it makes sense because this
isn't the latest HP dual-core Itanium2. Both IBM and HP systems have
results on TPC-C U SPECint_rate2000.
OK now using TPC-C, let's compare a IBM p5 595 (Power5+ 2.3GHz 64p,
128thread) to a HP Integrity Superdome
(Itanium2 1.6 GHz 64p, 64thread, single core/CPU).
Constructing a TPC-C ratio
3.3x = 4033378/1231433
what?
comparing the same systems the IBM is 3.3x faster ?!?
Looks that TPC-C over-inflates what can be expected from IBM.
My guess is IBM over-optimised and played lots of tuning tricks
on TPC-C, correct? So is TPC-C relavent to customers if this
is the case?
...maybe that's why seven years ago Sun, upon publishing a world
record TPC-C result said:
"It's well-understood in the technical communities that TPC-C no longer
represents current customer workloads since the transaction load that
its models are made of are small, primitive and disconnected transactions.
While this model was acceptable for the workloads of the late 1980s, it
misses the mark..."
http://www.sun.com/smi/Press/sunflash/2000-08/sunflash.20000831.1.html
You'll also notice the Aug 2000 press release said, "Customer workloads
nowadays require a more ad hoc workload than the TPC-C specifies."
Disclosure Statements
IBM p5 595 (Power5+ 2.3GHz 64p, 128thread) 4,033,378 tpmC,
2.97 US $/tpmC, Avail 01/22/07, IBM DB2 9, IBM AIX 5L V5.3, Microsoft COM+.
HP Integrity Superdome (Itanium2 1.6 GHz 64p, 64thread), 1,231,433 tpmC,
4.82 US $/tpmC, Avail 06/05/06, Microsoft SQL Server 2005 Enterprise Edt SP1,
Microsoft Windows Server 2003 Datacenter Ed.(64-bit)SP1. Results as of
2/15/07, see http://www.tpc.org.
IBM System p5 595 (Power5+ 2.3GHz 64p, 128thread), 64 cores, 32 chips,
2 cores/chip (SMT on), 1513 SPECint_rate2000. HP Integrity Superdome
(Itanium2 1.6 GHz 64p, 64thread, 16 cells), 64 cores, 64 chips,
1 core/chip, 1108 SPECint_rate2000. SPEC, SPECint, SPECfp reg tm of
Standard Performance Evaluation Corporation. Results from http://www.spec.org. as of 2/15/07.
World record TPC-C results referenced above was an overall performance
world record at August 31, 2000. Sun Enterprise 10000 server (Starfire)
running Sybase Adaptive Server Enterprise (ASE), 156,873.03 tpmC, $48.81 price/tpmC, available February 28, 2001. A full disclosure report and executive summary are available through the TPC Web site located at
http://www.tpc.org.
Thursday Nov 02, 2006
Woodcrest having issues scaling with GHz? On the
SPECfp_rate2000 result website, when Woodcrest goes from 2 GHz to 3 Ghz (50% clock increase), but the SPECfpRate only adds 17.8%. Not good, is it?
| System |
Chip/GHz |
Config |
Score |
| Fujitsu Siemens CELSIUS R540 |
5160 3.0 GHz |
4-core 2-Socket |
80.6 |
| Fujitsu Siemens CELSIUS R540 |
5150 2.66 GHz |
4-core 2-Socket |
77.4 |
| Fujitsu Siemens CELSIUS R540 |
5130 2.0 GHz |
4-core 2-Socket |
68.4 |
I'm thinking memory latency is an issue? Tomorrow, I'll look at scaling
as you add cores, maybe an issue there?
Disclosure
Fujitsu Siemens CELSIUS R540 2.0GHz (2chips,4cores), 68.4 SPECfp_rate2000, Fujitsu Siemens CELSIUS R540 2.66GHz (2chips,4cores), 77.4 SPECfp_rate2000, Fujitsu Siemens CELSIUS R540 3.0GHz (2chips,4cores), 80.6 SPECfp_rate2000.
SPEC, SPECint reg tm of Standard Performance Evaluation Corporation. Results from www.spec.org as of 11/2/06.