BM Seer Facts & Questions from an Anonymous Sun Source

Sun Blade X6250 & Sun Studio 12 x86 World Record

Wednesday Jun 13, 2007

Sun Blade X6250 Delivers a pair of x86 SPEC CPU2006 integer performance World Records:

Sun Blade X6250 (Dual-Core Intel Xeon 5160) and running Solaris 10 and using Sun Studio 12 compiler delivered the best x86 result for the SPECint2006 benchmark.

Sun Blade X6250 (Dual-Core Intel Xeon 5160) using Solaris 10 and Studio 12, delivered x86 4-core world record on SPECint_rate2006.

Sun Blade X6250 server had a SPECint2006 result of 21.0 and SPECint_rate2006 result of 65.0. The advanced features of freely available Sun Studio 12 complier were critical for getting this level of performance on the Sun Blade 6250.

The Sun Blade X6250 is only 3% slower than the peak score of the very-expensive new IBM POWER6 p570, which was recently announced. SPECint2006 is a single job stream. So let's now turn to comparing 4 thread results, in this case the Sun Blade X6250 is 7% faster than the peak SPECint_rate2006 score of he very-expensive new IBM POWER6 p570 (both IBM and Sun at 4 threads). Oh, and remember that anymore clock rate is not how you compare systems the Sun Blade X6250 is at 3GHz and the IBM POWER6 is at 4.7GHz. CPU frequency is basically irrelevant, it is CPU and system architecture that matters!

SPEC CPU2006 Landscape - bigger is better, selected recent results

SPECint2006

System Processors Performance Results
Type GHz Chips Cores Peak Base
IBM p570 (power6) Power6 4.7 1 1 21.6 17.8
Sun Blade X6250 Intel Xeon 5160 3.0 2 4 21.0
Supermicro X7DB8+ board Intel Xeon 5160 3.0 2 4 20.8 18.9
Sun Ultra 40 M2 AMD Opteron 2222SE 3.0 2 4 16.1

SPECint_rate2006

System Processors Performance Results
Type GHz Chips Cores Threads
/ Copies
Peak Base
Sun Blade X6250 Intel Xeon 5160 3.0 2 4 4 65.0
Supermicro X7DB8+ Intel Xeon 5160 3.0 2 4 4 64.9 60.0
IBM p570 (Power6) Power6 4.7 1 2 4 60.9 53.2
Sun Ultra 40 M2 AMD Opteron 2222SE 3.0 2 4 4 60.4
Fujitsu BX620 S3 Xeon 5160 (Woodcrest) 3.0 2 4 4 59.4 56.7

Results as of 06 Jun 2007 from www.spec.org.

Benchmark Description

SPEC CPU2006 is made up of two suites of benchmarks, CFP2006 and CINT2006. CFP2006 targets floating-point performance, while CINT2006 targets integer performance.

Each suite has two different measures. First is the CPU measure, which is the performance on the suite as a single stream. This can be either a single thread or automatic compiled parallel run. This measure is further defined by base and optimized runs. Base uses the same compiler flags for all kernels, where optimized is allowed to use different compiler flags for each kernel. Results are compared against a baseline system run that was standardized by SPEC.

The second measure is Rate. It is a measure of how many CPU measures can be run at a time. Typically, it is run as n processes on n processors. It shows how well the same job mix can run on a system under some load. It also is run as a base and optimized set of results.

Disclosure Statement:

SPEC, SPECint, SPECfp reg tm of Standard Performance Evaluation Corporation. Results from www.spec.org or from IBM public websites as of 6/06/07. Sun Blade X6250 (Intel Xeon 5160, 2chips/4cores, Solaris 10) 65.0 SPECint_rate2006; Sun Blade X6250 (Intel Xeon 5160, 2chips/4cores, Solaris 10) 21.0 SPECint2006; IBM System p 570 (POWER6, 1chip/1core, AIX 5L v5.3) 21.6 SPECint2006; IBM System p 570 (POWER6, 4 theads, 1chip/2cores, AIX 5L v5.3) 60.9 SPECint_rate2006.

System Configuration

Results
Reference Date: Jun 06, 2007
System: Sun Blade X6250
SPEED: 16GB memory 8x2GB
RATE : 32GB memory 8x4GB
X6250 21.0 SPECint2006
X6250 65.0 SPECint_rate2006
Total Number Processors: 2 x Intel Xeon 5160
Software: Solaris 10 11/06, Sun Studio 12 Compiler, MicroQuill's SmartHeap Library v7.4

See Also

  • All Benchmark results on Sun Blade 6000 Blade Server
  • [4] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    Solaris beats Linux performance

    Tuesday Mar 20, 2007

    World Record SPECompL2001: Solaris beats RedHat Linux and The Sun Fire X4600 M2 delivers the best performance on the SPEC OMPL2001 benchmark suite of all x86 systems.

    • Solaris 10 and Studio 11 duo help X4600 M2 perform 8% better than Red Hat Linux (RHEL4) and PathScale compiler on SPECompL2001.
    • The Sun Fire X4600 M2 server in 4-socket configuration using dual-core AMD Opteron Model 8220 processors, produced best SPECompL2001 result of 111,893.
    • The Sun Fire X4600 M2 beats the HP DL585 G2 (AMD Opteron 8220 4chips/8cores) using RedHat Linux by 9%
    • The results show that the combination of Solaris 10 using Sun Studio 11 is unmatched by the competition for assisting users in writing parallel code.

    SPECompL2001 (bigger is better, ordered by peak)
    Result Cores Chips Thrds System
    Peak Base
    111,893 105,465 8 8 2 Sun X4600M2 Opteron 8220 S10/SS11
    103,466 100,610 8 8 2 Sun X4600M2 Opteron 8220 RHEL4u4, PathScale v2.5
    102,283 99,907 8 8 2 HP DL585 G2 Opteron 8220SE, RHEL4u4, PathScale v2.5
    92,725 92,418 16 16 1 HP Superdome 1.5GHz Itanium 2
    79,627 68,051 24 24 1 Sun Fire 6800
    44,376 42,400 16 16 1 HP Superdome 875MHz PA-8700+
    42,864 41,056 16 16 1 HP server rp8400 (875MHz PA-8700+)

    Benchmark Description

    The SPEC OMPL2001 Benchmark Suite was released in June 2001 and tests HPC performance using OpenMP for parallelism. 9 programs (2 in C and 7 in Fortran) parallelized using OpenMP API.

    Goals of suite are: first, target to Large-range (8-128 processor) parallel systems, 2nd have run rules, tools and reporting similar to SPEC CPU2000 and 3rd to have programs representative of HPC and Scientific Applications.

    Results Summary

    Result
    X8420 8-threads: 111893 SPECompL2001
    Reference Date: Mar 19, 2007
    System: Sun Fire X4600 M2
    Total Number Processors: 4
    Total Memory : 32 GB (16x4GB DIMMs), DDR667
    Processor/GHz of Server: Opteron 8220, 2.8 GHz
    Operating System: Solaris 10
    Compiler: Sun Studio 11

    Disclosure Statement:

    SPEC, SPEComp reg tm of Standard Performance Evaluation Corporation. Results from www.spec.org as of Mar 19, 2007, Sun result submitted to SPEC. Sun Fire X4600 M2 (S10/SS11, Opteron 8220, 8 cores, 4 chips, 8 threads), 111893 SPECompL2001. Sun Fire X4600 M2 (RHEL4u4, Opteron 8220, 8 cores, 4 chips, 8 threads), 103466 SPECompL2001. HP DL585 G2 Opteron 8220SE (RHEL4u4, Opteron 8220, 8 cores, 4 chips, 8 threads), 103466 SPECompL2001. Sockets refers to chips.

    See Also

    SPEC OMP2001 Page
    SPEC Home Page
    sun.com X4600 M2 Benchmark Page

    [1] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    SPEC updates SPECjAppServer2004 to benefit open-source community

    Tuesday Feb 20, 2007

    SPEC.org has released a version of SPECjAppServer2004 called EAStress2004. This is a multi-tier Java benchmark which measures the performance of Java Platform Enterprise Edition (Java EE) application servers.

    EAStress2004 that relaxes run and reporting rules, enabling informal results to be shared in open-source projects.

    "Results from the EAStress2004 workload cannot be used for marketing purposes, and comparisons to other SPECjAppServer2004 results are not permitted."

    read more at: http://www.3dprofessor.org/Press%20Releases.htm

    ...so you won't see any EAstress2004 benchmarks reported on this blog, but I really think this is a very cool thing to do. Cheers!

    Like this post? del.icio.us | furl | slashdot | technorati | digg

    judging by the wrong things: IBM & TPC-C

    Tuesday Feb 20, 2007

    Is IBM 3.3x or 1.4x faster? - I guess it depends if you use a over-optimised benchmark like TPC-C. As mentioned yesterday, IBM doesn't publish on a variety of standard benchmarks like SPECint_rate2006 or SPECjbb2005 on their high-end systems so we have to look at the SPECint_rate2000 which is just about to be EOL'ed and completely replaced by SPECint_rate2006.

    First let's compare an IBM p5 595 (Power5+ 2.3GHz 64p, 128thread) to a HP Integrity Superdome (Itanium2 1.6 GHz 64p, 64thread, single core/CPU) on SPECint_rate2000.

    Constructing a SPECint_rate2000 ratio
    1.4x = 1513/1108
    we find that the IBM 595 is 1.4x faster, it makes sense because this isn't the latest HP dual-core Itanium2. Both IBM and HP systems have results on TPC-C U SPECint_rate2000.

    OK now using TPC-C, let's compare a IBM p5 595 (Power5+ 2.3GHz 64p, 128thread) to a HP Integrity Superdome (Itanium2 1.6 GHz 64p, 64thread, single core/CPU).

    Constructing a TPC-C ratio
    3.3x = 4033378/1231433
    what?
    comparing the same systems the IBM is 3.3x faster ?!? Looks that TPC-C over-inflates what can be expected from IBM.

    My guess is IBM over-optimised and played lots of tuning tricks on TPC-C, correct? So is TPC-C relavent to customers if this is the case?

    ...maybe that's why seven years ago Sun, upon publishing a world record TPC-C result said:

    "It's well-understood in the technical communities that TPC-C no longer represents current customer workloads since the transaction load that its models are made of are small, primitive and disconnected transactions. While this model was acceptable for the workloads of the late 1980s, it misses the mark..."
    http://www.sun.com/smi/Press/sunflash/2000-08/sunflash.20000831.1.html

    You'll also notice the Aug 2000 press release said, "Customer workloads nowadays require a more ad hoc workload than the TPC-C specifies."

    Disclosure Statements

    IBM p5 595 (Power5+ 2.3GHz 64p, 128thread) 4,033,378 tpmC, 2.97 US $/tpmC, Avail 01/22/07, IBM DB2 9, IBM AIX 5L V5.3, Microsoft COM+. HP Integrity Superdome (Itanium2 1.6 GHz 64p, 64thread), 1,231,433 tpmC, 4.82 US $/tpmC, Avail 06/05/06, Microsoft SQL Server 2005 Enterprise Edt SP1, Microsoft Windows Server 2003 Datacenter Ed.(64-bit)SP1. Results as of 2/15/07, see http://www.tpc.org.

    IBM System p5 595 (Power5+ 2.3GHz 64p, 128thread), 64 cores, 32 chips, 2 cores/chip (SMT on), 1513 SPECint_rate2000. HP Integrity Superdome (Itanium2 1.6 GHz 64p, 64thread, 16 cells), 64 cores, 64 chips, 1 core/chip, 1108 SPECint_rate2000. SPEC, SPECint, SPECfp reg tm of Standard Performance Evaluation Corporation. Results from http://www.spec.org. as of 2/15/07.

    World record TPC-C results referenced above was an overall performance world record at August 31, 2000. Sun Enterprise 10000 server (Starfire) running Sybase Adaptive Server Enterprise (ASE), 156,873.03 tpmC, $48.81 price/tpmC, available February 28, 2001. A full disclosure report and executive summary are available through the TPC Web site located at http://www.tpc.org.

    [7] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    Woodcrest lagging on GHz gain?

    Thursday Nov 02, 2006

    Woodcrest having issues scaling with GHz? On the SPECfp_rate2000 result website, when Woodcrest goes from 2 GHz to 3 Ghz (50% clock increase), but the SPECfpRate only adds 17.8%. Not good, is it?

    System Chip/GHz Config Score
    Fujitsu Siemens CELSIUS R540 5160 3.0 GHz 4-core 2-Socket 80.6
    Fujitsu Siemens CELSIUS R540 5150 2.66 GHz 4-core 2-Socket 77.4
    Fujitsu Siemens CELSIUS R540 5130 2.0 GHz 4-core 2-Socket 68.4
    I'm thinking memory latency is an issue? Tomorrow, I'll look at scaling as you add cores, maybe an issue there?

    Disclosure

    Fujitsu Siemens CELSIUS R540 2.0GHz (2chips,4cores), 68.4 SPECfp_rate2000, Fujitsu Siemens CELSIUS R540 2.66GHz (2chips,4cores), 77.4 SPECfp_rate2000, Fujitsu Siemens CELSIUS R540 3.0GHz (2chips,4cores), 80.6 SPECfp_rate2000. SPEC, SPECint reg tm of Standard Performance Evaluation Corporation. Results from www.spec.org as of 11/2/06.

    [5] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg