BM Seer Facts & Questions from an Anonymous Sun Source

careful reading shows a lot

Wednesday Oct 24, 2007

You have to read some things carefully

    "...And the good news is that about 40-70% of the stuff we do in performance tuning actually ends up helping end users," -- Bruce Lindsay(IBM Fellow), May 06, http://www.sigmod.org/sigmod/record/issues/0506/p71-column-winslet.pdf

    "This is feasible in the TPC-C benchmark because there are only five tables and only ten to fifteen columns in each table. In a more realistic application, where there are many more queries to be considered, the tables are typically much, much wider, in the 80 to 100 column range; and there are dozens if not thousands of tables. Then this kind of analysis(ed note: tuning) is no longer practical." -- Bruce Lindsay(IBM Fellow since '96), May 06, http://www.sigmod.org/sigmod/record/issues/0506/p71-column-winslet.pdf

    "The idea is to get the numbers by hook and by crook." -- Bruce Lindsay(IBM Fellow since '96), May 06, http://www.sigmod.org/sigmod/record/issues/0506/p71-column-winslet.pdf

    The TPC-C benchmark is an industry standard for measuring the ability of a system to process complex online transactions and large volumes of business data. The TPC-C benchmark is unique in the way it exercises all components of a system, including processors, memory, networking, storage, operating system and database software, demonstrating total system performance in a way that many of the other benchmarks touted by some competitors do not. -- Bruce Lindsay(IBM Fellow since '96), July 25, 2006, http://www-03.ibm.com/solutions/sap/doc/content/news/pressrelease/1623288130.html

Issues:
  • This means that 30% to 60% of IBM's TPC-C tuning is useless for customers.
  • IBM clearly over-hyped TPC-C, just 2-3 months after they publicly showed all of its problems and "optimizations" they used.

    Next:

      "Significantly, the high utilization rate of the System z9 mainframes -- systems can and do operate at 80 to 100 percent utilization -- combined with its ability to "virtualize" workloads, can enable a single mainframe processor to perform far more work than a single x86 processor running Microsoft Windows. The latter may run as low as 5 percent utilization." - IBM Press Release http://www-03.ibm.com/press/us/en/pressrelease/19577.wss
    Issues:
  • used different work for mainframe and for its competitor.
  • "do" and "may" mean very different things
  • "mainframes do operate at 80-100%", "x86 processor running Microsoft Windows. The latter may run as low as 5%". So it is a valid but totally useless statement.
  • An equally invalid statement: x86 do operate at 80-100% and mainframes may run as low as 5%.

    Next:

      "First of all, the math is really simple. 4.7 is greater than 1.4. IBM's POWER6 4.7 GHz chip is faster than Sun's 1.4 GHz UltraSPARC T1 chip. And second of all, the IBM System p 570 remains the #1 SPECjbb2005 2-core result (1)." Marketing Program manager of IBM performance blog, Jun07
    Issues:
  • Did not compare system or chip performance but only quoted the GHz of a chip?
  • Made a true statement about core count but ignored that that IBM cores cost much more than Sun UltraSPARC T1 and/or UltraSPARC T2 on a per core basis, I know this is hard to verify since IBM isn't public about pricing, so you'll have to ask your IBM people to price specific configurations for you, be specific so you understand exactly what is priced.

    Next:

      "Even more impressive, the processor bandwidth of the POWER6 chip – 300 gigabytes per second -- could download the entire iTunes catalog in about 60 seconds" - IBM Press Release http://www-03.ibm.com/press/us/en/pressrelease/21580.wss
    Issues:
  • Added every bandwidth (L3 cache, address bandwidth?!?,...) in a chip, even though peak memory bandwidth is limited to at least a 10th of that, delivered is a lot less.
  • stated "processor bandwidth", even though "delivered" system bandwidth would actually be required to move the data (not address :) ).

    Next:

      "IBM calculates that 30 SunFire v890s can be consolidated into a single rack of the new IBM machine, saving more than $100,000 per year on energy costs (3)." - IBM Press Release http://www-03.ibm.com/press/us/en/pressrelease/21580.wss
    Issues:
  • used 2 year old sun result compared to power6 yet to be shipped as of may press release
  • said V890, so that people think it is a current comparison, had to read in the footnotes that it was 1.5 GHz slower CPU. Sun has introduced 1.8GHz, and 2.1GHz since.
  • made a "conservative" comparisons by giving IBM another 15% in performance
  • claimed Sun at 20% utilisation and IBM at 60% utilisation, that is one way to get 3x over your competition :)
  • never showed exactly what power was drawn by a 4.7GHz, 64GB memory system, at ??MHz DDR2 used in the comparison, etc.

    This was a bit of a repeat, but some things should not be forgotten.

    I've never been about popularity or names. You don't need my expertise to see funny things in IBM's statements. Don't attack me, attack the facts. Anonymously yours, Sun's BM Seer.

    Disclosure statement:

    TPC-C is a trademark of Transaction Processing Performance Council (TPC). More info www.tpc.org.

    Like this post? del.icio.us | furl | slashdot | technorati | digg
  • SPEC CPU2006 UltraSPARC T2 exactly real just like we said

    Tuesday Oct 09, 2007

    Today, Sun submitted the SPECint_rate2006 and SPECfp_rate2006 Single-Chip World Records on the Sun SPARC Enterprise T5120/T5220. What are these servers? UltraSPARC T2 1.4GHz servers that you will hear loads more on today.

    The Sun SPARC Enterprise T5120 is the 1RU version, and the Sun SPARC Enterprise T5220 is the 2RU version, both of these servers are electronically equivalent with the 2RU having a bit more connectivity and storage if you need.

    The Sun SPARC Enterprise T5220 server, running at 1.4 GHz, beat all single-chip results running SPECint_rate2006 with a result of 78.5.

    The Sun SPARC Enterprise T5220 server beats the best single IBM 4.7 GHz dual-core POWER6 processor result by 29% and beat the best published single 3 GHz Xeon quad-core by 28% on SPECint_rate2006. There are no single quad-core Opteron results published for SPECint_rate2006.

    "but I've heard there is no floating point on Niagara processors :) Nay, the 1.4GHz UltraSPARC T2 in the Sun SPARC Enterprise T5220 server, beat all single-chip results running SPECfp_rate2006 with a result of 62.3.

    The Sun SPARC Enterprise T5220 server beat the best single IBM 4.7 GHz POWER6 processor based system result by 7% and beats the best published single 3 GHz quad-core Intel Xeon by 61% for SPECfp_rate2006.

    There are no single quad-core Opteron results published for SPECfp_rate2006.

    SPEC CPU2006 Performance Charts - bigger is better, selected recent results, please see www.spec.org for complete results.

    SPECint_rate2006

    System Procs Perf Results
    Type GHz Chips
    Cores
    Threads Peak Base
    T5120/T5220 UltraSPARC T2 1.4 1, 8 64 78.5 73.0
    HP DL380 G5 Intel X5365 3.0 1, 4 4 61.3 53.8
    IBM p 570 Power6 4.7 1, 2 4 60.9 53.2
    Fujitsu RX300 Intel X5355 2.66 1,4 4 52.8 50.5

    SPECfp_rate2006

    System Processors Performance Results
    Type GHz Chips, Cores Threads Peak Base
    T5120/T5220 UltraSPARC T2 1.4 1, 8 64 62.3 57.9
    IBM p 570 Power6 4.7 1, 2 4 58.0 51.5
    HP DL380 G5 Intel X5365 3.0 1, 4 4 38.8 36.4
    Fujitsu RX300 Intel X5355 2.66 1, 4 4 37.5 36.2

    Results as of 27 Sep 2007 from www.spec.org.

    Benchmark Description

    SPEC CPU2006 is made up of two suites of benchmarks, CFP2006 and CINT2006. CFP2006 targets floating-point performance, while CINT2006 targets integer performance.

    Each suite has two different measures. First is the CPU measure, which is the performance on the suite as a single stream. This can be either a single thread or automatic compiled parallel run. This measure is further defined by base and optimized runs. Base uses the same compiler flags for all kernels, where optimized is allowed to use different compiler flags for each kernel. Results are compared against a baseline system run that was standardized by SPEC.

    The second measure is Rate. It is a measure of how many CPU measures can be run at a time. Typically, it is run as n processes on n processors. It shows how well the same job mix can run on a system under some load. It also is run as a base and optimized set of results.

    Disclosure Statement:

    SPEC, SPECint reg tm of Standard Performance Evaluation Corporation. Sun result submitted to SPEC, other results from www.spec.org as of 9/27/07. Sun SPARC Enterprise T5220/T5120 (UltraSPARC T2, 1 chip, 8 cores), 78.5 SPECint_rate2006, IBM p570 (POWER6, 1 chip, 2 cores), 60.9 SPECint_rate2006, HP DL380 G5 (X5365, 1 chip, 4 cores), 61.3 SPECint_rate2006, Sun SPARC Enterprise T5220 (UltraSPARC T2, 1 chip, 8 cores), 62.3 SPECfp_rate2006.

    SPEC, SPECfp reg tm of Standard Performance Evaluation Corporation. Sun result submitted to SPEC, other results from www.spec.org as of 9/27/07. Sun SPARC Enterprise T5220/T5120 (UltraSPARC T2, 1 chip, 8 cores), 62.3 SPECfp_rate2006. IBM p570 (POWER6, 1 chip, 2 cores), 58.0 SPECfp_rate2006, Sun SPARC Enterprise T5220 (UltraSPARC T2, 1 chip, 8 cores), 62.3 SPECfp_rate2006. HP DL380 G5 (X5365, 1 chip, 4 cores), 38.8 SPECfp_rate2006.

    System Configuration

    Results
    Reference Date: Oct 09, 2007
    System: Sun SPARC Enterprise T5120/T5220
    Processor: Sun UltraSPARC T2, 1.4 GHz
      78.5 SPECint_rate2006
      62.3 SPECfp_rate2006
    Software: Solaris 10, Sun Studio 12 Compiler

    [6] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    Critical reading absolutely required for IBM

    Friday Oct 05, 2007

    First:

      "First of all, the math is really simple. 4.7 is greater than 1.4. IBM's POWER6 4.7 GHz chip is faster than Sun's 1.4 GHz UltraSPARC T1 chip. And second of all, the IBM System p 570 remains the #1 SPECjbb2005 2-core result (1)." - Elisabeth Stahl, program manager of IBM performance marketing, 20 years experience, Jun07
    Issues:
  • Did not compare system or chip performance but only quoted GHz of a chip.
  • Made a true statement about core count but ignored that that IBM core costs an order of magnitude more than Sun T1 on a per core basis

    Next:

      "Even more impressive, the processor bandwidth of the POWER6 chip – 300 gigabytes per second -- could download the entire iTunes catalog in about 60 seconds" - IBM Press Release http://www-03.ibm.com/press/us/en/pressrelease/21580.wss
    Issues:
  • Added every bandwidth (L3 cache, address bandwidth?!?,...) in a chip, even though peak memory bandwidth is limited to at least a 10th of that, delivered is a lot less.
  • stated "processor bandwidth", even though "delivered" system bandwidth would actually be required to move the data (not address :) ).

    Next:

      "IBM calculates that 30 SunFire v890s can be consolidated into a single rack of the new IBM machine, saving more than $100,000 per year on energy costs (3)." - IBM Press Release http://www-03.ibm.com/press/us/en/pressrelease/21580.wss
    Issues:
  • used 2 year old sun result compared to power6 yet to be shipped as of may press release
  • implied V890, so that people think it is a current comparison, had to read in the footnotes that it was 1.5 GHz slower CPU. Sun has introduced 1.8GHz, and 2.1GHz since.
  • made a "conservative" comparisons by giving IBM another 15% in performance
  • claimed Sun at 20% util and IBM at 60% util to get bogus 3x factor
  • never showed exactly what power was drawn by a 4.7GHz, 64GB memory system, at ??MHz DDR2 used in the comparison, etc.
  • ...and many more.

    Next:

      "...And the good news is that about 40-70% of the stuff we do in performance tuning actually ends up helping end users," -- Bruce Lindsay(IBM Fellow), May 06, http://www.sigmod.org/sigmod/record/issues/0506/p71-column-winslet.pdf

      "This is feasible in the TPC-C benchmark because there are only five tables and only ten to fifteen columns in each table. In a more realistic application, where there are many more queries to be considered, the tables are typically much, much wider, in the 80 to 100 column range; and there are dozens if not thousands of tables. Then this kind of analysis(ed note: tuning) is no longer practical." -- Bruce Lindsay(IBM Fellow since '96), May 06, http://www.sigmod.org/sigmod/record/issues/0506/p71-column-winslet.pdf

      "The idea is to get the numbers by hook and by crook." -- Bruce Lindsay(IBM Fellow since '96), May 06, http://www.sigmod.org/sigmod/record/issues/0506/p71-column-winslet.pdf

      The TPC-C benchmark is an industry standard for measuring the ability of a system to process complex online transactions and large volumes of business data. The TPC-C benchmark is unique in the way it exercises all components of a system, including processors, memory, networking, storage, operating system and database software, demonstrating total system performance in a way that many of the other benchmarks touted by some competitors do not. -- Bruce Lindsay(IBM Fellow since '96), July 25, 2006, http://www-03.ibm.com/solutions/sap/doc/content/news/pressrelease/1623288130.html

    Issues:
  • This means that 30% to 60% of IBM's TPC-C tuning is useless for customers.
  • IBM clearly over-hyped TPC-C, just 2-3 months after they publicly showed all of its problems and "optimizations" they used.

    Next:

      "Significantly, the high utilization rate of the System z9 mainframes -- systems can and do operate at 80 to 100 percent utilization -- combined with its ability to "virtualize" workloads, can enable a single mainframe processor to perform far more work than a single x86 processor running Microsoft Windows. The latter may run as low as 5 percent utilization." - IBM Press Release http://www-03.ibm.com/press/us/en/pressrelease/19577.wss
    Issues:
  • very convenient to use different work for mainframe and for its competitor.
  • "mainframes do operate at 80-100%", "x86 processor running Microsoft Windows. The latter may run as low as 5%". So it is a valid but totally useless statement.
  • "do" and "may" mean very different things
  • An equally invalid statement: x86 do operate at 80-100% and mainframes may run as low as 5%.

    Moral: Be VERY VERY CAREFUL when you read big blue.

    Disclosure statement:

    TPC-C is a trademark of Transaction Processing Performance Council (TPC). More info www.tpc.org.

    Like this post? del.icio.us | furl | slashdot | technorati | digg
  • Oracle & UltraSPARC T1 - Commercial databases and CMT are no problem

    Thursday Aug 23, 2007

    In the last posting we showed Oracle Database with SAP-SD benchmarks all running on a Sun Fire T2000. As Sun has been saying since Day one of CMT. Major databases are perfectly matched for UltraSPARC T1. By the way Sun has also used Open source databases on benchmarks as well.

    We have lots of customers deploying RDBMS on UltraSPARC T1 and planning on UltraSPARC T2 servers. It really works well even though competitors and doubters want to try to say it is special purpose, sorry it isn't.

    Here is an opinion:

      "Now Sun's T2 is out and it's pretty much the world beater they promised - 30% faster on SPEC throughput than IBM's 4.7 Ghz Dual core Power6 and, more significantly, one third the cost and somewhere between two and three times the throughput of the Itanium. ... anyone still buying HP-UX and Itanium after Rock comes out will be doing it because they hate Sun and are quietly hoping for a miracle, just as DEC's partisans (and HP's own MPE customer base) did before them." -- zdnet's Paul Murphy

      Source: "A Dumb prediction: IBM will Buy HP's Unix Customers," By Paul Murphy, zdnet, 08/17/07, http://blogs.zdnet.com/Murphy/?p=941

    [2] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    World Record Single-chip SAP-SD 2-Tier ECC 6.0 Sun Fire T2000

    Thursday Aug 23, 2007

    The SPARC Enterprise Model T2000 | Sun Fire Model T2000 is the performance leader in Two-Tier SAP-SD Standard Application Benchmarks on single processor systems as of August 22nd, 2007. This result used the Oracle database on the UltraSPARC T1. Again as Sun has always maintained the UltraSPARC T1 is good at database-tier, application tier, and web tier!

    • Sun Fire Model T2000 supported 1100 SD Benchmark Users, 5530 SAPS, using Oracle 10g is the fastest single-processor systems.
    • Sun Fire Model T2000 beats a 2-chip dual-core Itanium2-based HP Integrity rx2660.
    • Sun Fire Model T2000 beats a 2-chip dual-core Opteron-based HP ProLiant DL365.
    • Sun Fire Model T2000 beats a 2-chip dual-core Xeon-based Fujitsu BFi20 S2 (Unicode).
    • The Fujitsu BX620 S4 that uses two-chip 3GHz Xeon Quad-cores is only 1.8x faster than a single chip Sun Fire Model T2000 using UltraSPARC T1.
    • The IBM p570 that uses two-chip 4.7GHz POWER6 is only 1.8x faster than a single chip Sun Fire Model T2000 using UltraSPARC T1.
    • The just-announced UltraSPARC T2 has twice the thread count of the UltraSPARC T1.

    SAP-SD 2-Tier Performance, Benchmark Users (bigger is better)

    Sys Users # / GHz / Type Mem OS DB LI/Hr SAPS BM rev Date
    IBM p570 2035 two 4.7 POWER6+ DC 32 GB AIX 5L 5.3 Oracle 10g 203,670 10,180 6.0 5/21/07
    Fujitsu BX620 S4 1940 two 3.0 Xeon QC 32 GB Windows Srvr 2003 EE SQL Server 2005 194,000 9,700 6.0 8/13/07
    Sun Fire T2000 1100 one 1.4 US T1 64 GB Solaris 10 Oracle 10g 110,670 5,530 6.0 8/22/07
    HP Integrity rx2660 1090 two 1.6 Itan2 DC 32 GB HP-UX 11iV3 DB2 9 109,670 5,480 6.0 3/20/07
    HP ProLiant DL365 1083 two 2.8 Opt DC 32 GB Windows Srvr 2003 EE SQL Srvr 2005 108,670 5,430 6.0 2/9/07
    Fujitsu BFi20 S2 Unicode 1020 two 3 Xeon 5160 DC 16 GB Solaris 10 Oracle 10g 102,330 5,120 6.0 5/4/07
    IBM p550 1000 four 1.9 POWER5+ DC 32 GB SuSE Linux ES9 DB2 UDB 8.2.2 100,330 5,020 5.0 10/04/05
    Sun Fire T2000 950 one 1.2 US T1 32 GB Solaris 10 MaxDB 7.5 95,670 4,780 5.0 11/17/05
    IBM x3250 850 one 2.13 Xeon 8 GB Windows SrVr 2003 EE DB2 9 88,000 4,400 6.0 5/11/07

    Complete benchmark results may be found at the SAP benchmark website http://www.sap.com/benchmark.

    Benchmark Description

    The SAP Standard Application SD (Sales and Distribution) Benchmark is a two-tier ERP business test that is indicative of full business workloads of complete order processing and invoice processing, and demonstrates the ability to run both the application and database software on a single system. The SAP Standard Application SD Benchmark represents the critical tasks performed in real-world ERP business environments.

    SAP is one of the premier world-wide ERP application providers, and maintains a suite of benchmark tests to demonstrate the performance of competetive systems on the various SAP products.

    SAP has specified that the Benchmark Users metric is the only metric to be used for public comparisons. However, Benchmark Users can be traded off with response time in performance tuning, and so comparing Line Items per Hour or SAPS may be a different way to compare the actual power of systems.

      Funny that Sun compares against current IBM results, IBM bloggers decide to do funny comparisons on a different SAP benchmark, but compared their latest system to a 16-month old result on a US-IV system that is 2 processor GHz upgrades behind. I guess that is one way to win...

    Disclosure Statement:

    Two-tier SAP Standard Sales and Distribution (SD) standard SAP ERP 2004/2005 application benchmark: SPARC Enterprise Model T2000 | Sun Fire T2000 (1-way, 1 proc, 8 cores, 32 threads) 1 x 1.4 GHz UltraSPARC T1, 64GB memory, 1100 SD Benchmark users, 1.91 sec avg response time, Cert#2007051, Oracle 10g, Solaris 10; Sun Fire T2000 (1-way, 1 proc, 8 cores, 32 threads) 1 x 1.2 GHz UltraSPARC T1, 32GB memory, 950 SD Benchmark users, 1.91 sec avg response time, Cert#2005047., MaxDB 7.5 database, Solaris 10; Fujitsu Siemens Computers PRIMERGY MOdel BX620 S4 (2-way, 2 procs, 8 cores, 8 threads), 2 x 3.0 GHz Quad-Core Intel Xeon, 32 GB memory, 1940 SD Benchmark users, 1.99 sec avg response time, Cert#2007049, SQL Server 2005, Windows Server 2003 Enterprise Edition; HP ProLiant DL365 (2-way, 2 procs, 4 cores, 4 threads) 2 x 2.8 GHz Opteron, 32GB memory, 1083 SAP SD Benchmark users, 1.98 sec avg response time, Cert#2007006, SQL Server 2005, Windows Server 2003 Enterprise Edition; HP Integrity rx2660 (2-way, 2 procs, 4 cores, 8 threads) 2 x 1.6 GHz Itanium, 32GB memory, 1090 SAP SD Benchmark users, 1.93 sec avg response time, Cert#2007016, DB2 9, HP-UX 11iV3; IBM System p 570 (2-way, 2 procs, 4 cores, 8 threads) 2 x 4.7 GHz POWER6+, 32GB memory, 2035 SD Benchmark users, 1.99s avg resp time, Cert#2007037, Oracle 10g, AIX 5L Version 5.3; Fujitsu Siemens Computers PRIMERGY Model BFi20 S2 (2-way, 2 procs, 4 cores, 4 threads) 2 x 3GHz Intel Xeon 5160 dual-core, 16GB memory,(Unicode) 1020 SAP SD Benchmark users, 1.94 sec avg response time, Cert#2007031, Oracle 10g, Solaris 10; IBM System x3250 (1-way, 1 proc, 4 cores, 4 threads) 1 x 2.13 GHz Xeon, 8GB memory, 850 SD Benchmark users, 1.59s avg resp time, Cert#2007036, DB2 9, Windows Server 2003 Enterprise Edition; IBM System eServer p5 550 (4-way, 4 procs, 4 cores, 8 threads) 4 x 1.9 GHz POWER5+, 32GB memory, 1000 SD Benchmark users, 1.97s avg resp time, Cert#2005040, IBM DB2 Universal Database 8.2.2, SuSE Linux Enterprise Server 9; SAP, R/3, mySAP reg TM of SAP AG in Germany and other countries. More info http://www.sap.com/benchmark.
    Certified Results
    Performance: 1100 benchmark users
    Server: Sun Fire
    Processors: 1 1.4 GHz UltraSPARC T1
    Memory: 64 GB
    Operating system: Solaris 10
    Database S/W: Oracle 10g
    SAP S/W: SAP ECC 6.0
    SAP Certification: 2007051
    Storage: Sun StorEdge 6020

    [8] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    UltraSPARC T2, and Old UltraSPARC T1 world records & new Xeon's

    Thursday Aug 09, 2007

    Postscript:

    Be careful when comparing performance results, as an example look at a comment in yesterday's "Can I use 64 threads in a chip?" posting. At least this comment pointed out that you can use 4-8 threads in 2 chip Intel-based systems, but it was really trying to be a stab at UltraSPARC Performance. Here was the comment: One really needs to look at the complete data on those .pdf's to make a fair comparison (also in the disclosure statement below).

  • First: The T2000 SAP-SD used a 1.2GHz UltraSPARC T1, Sun now ships faster 1.4GHz UltraSPARC T1, and has announced 1.4GHz UltraSPARC T2. The 1.4GHz T2 has double the threads of that 1.4GHz (double the computational power).
  • Second: The T2000 SAP-SD result was submitted in Dec 2005, at that time it was near the performance of the expensive 4-way POWER5 IBM p550.
  • Third: The 2-chip Dual-core Xeon SAP-SD result above was submitted 18 months after the T2000 SAP-SD result.
  • Fourth: Different versions of the benchmark. The 2-chip Dual-core Xeon was run with ECC 6.0 (not SAP 5.0). The a newer version of the benchmark takes more computational work to produce the same results. Dual-core SAP-SD result was also run with Solaris 10 on Xeon, how cool is that!
  • Fifth: The 2-chip quad-core Xeon SAP-SD result above was submitted 19 months after the T2000 SAP-SD result.
  • Sixth: The Sun result used open-source MySQL MaxDB database, how cool is that! The Xeon results used Oracle or MicroSoft SQL Server.
      postscript:
      Sun latter used Oracle, others suggested US T1 has some sort of silly database limitation - NOT TRUE!

    You'll see more results soon.

    Triffids, as a reminder if you work for a partner company of SAP you must put the following disclosures when you post results. If you are not they you don't need to put this in, but as you can see the data in it would have allowed you to make a better comparison of systems. Don't worry I'm not asking you to identify yourself at all.

    Disclosure Statement:

    Two-tier SAP ECC 5.0 Standard Sales and Distribution (SD) benchmark Sun Fire T2000 (1-way, 1 proc, 8 cores, 32 threads) 1x 1.2 GHz UltraSPARC T1, 32 GB mem, 950 SD benchmark users, 1.91 sec avg response time, Cert#2005047., MaxDB 7.5 database, Solaris 10; Two-tier SAP ECC 5.0 Standard Sales and Distribution (SD) benchmark IBM System eServer p5 550 (4-way, 4 procs, 4 cores, 8 threads) 4x 1.9 GHz POWER5+, 32GB mem, 1,000 SD benchmark users, 1.97s avg resp time, Cert#2005040, IBM DB2 Universal Database 8.2.2, SuSE Linux Enterprise Server 9; Two-tier SAP ECC 6.0 Standard Sales and Distribution (SD) benchmark Fujitsu Siemens Computers PRIMERGY Model BFi20 S2 (2 procs, 4 cores, 4 threads) 2x Intel Xeon 5160, 3.0 GHz, 16GB mem, 1,020 SD benchmark users, 1.94s avg resp time, Cert#2007031, Oracle 10g, Solaris 10; Two-tier SAP ECC 6.0 Standard Sales and Distribution (SD) benchmark Fujitsu Siemens Computers PRIMERGY Model TX300 S3 (2 procs, 8 cores, 8 threads) 4x Quad-Core Intel Xeon Processor X5355 2.66 GHz, 32GB mem, 1865 SD benchmark users, 1.99s avg resp time, Cert#2007025, SQL Server 2005, Windows Server 2003 Enterprise Edition; SAP, R/3, mySAP reg TM of SAP AG in Germany and other countries. More info www.sap.com/benchmark.

    I edited in:
    2 processors into Quad-Core Intel Xeon Processor X5355 2.66 GHz

    ...and..

    32 threads to the Sun Fire T2000, 1 processor / 8 cores ...in order to make the comparisons more consistent.

    Like this post? del.icio.us | furl | slashdot | technorati | digg
  • news trickles out on Niagara2

    Friday Aug 03, 2007

    News is starting to trickle out on Niagara2. Sun continues to do revolutionary things that will have huge effect on how we compute. For what has leaked so far see: http://www.informationweek.com/blog/main/archives/2007/08/suns_niagara_2.html

    At OpenSPARC.net you will find a in-depth presentation of the upcoming Niagara 2 processor: "An 8-core, 64-thread, 64-bit, power efficient SPARC SoC (Niagara2)". The presentation was held at the International Solid State Circuits Conference 2007.

    So the question becomes how do we now judge performance of chips with different architectures? One thing that hasn't changed is system performance and system cost. Look at what a system costs compare it against other systems in the same price range.

    The computer industry can no longer determine performance based on old dinosaur habits. Remember when vendors determined all performance by GHz. Then remember back in '04 when Intel quite naming processors by GHz... they realized that GHz is only part of the overall equation for processor performance. What does that equation look like in very rough terms?

      performance = GHz * number-of-cores * number-of-threads-per-core * efficiency-of-design * other * software efficiency

    ...so if you compare performance between different designs by any of these factors alone (example: perf/core, perf/Ghz, perf/thread) you are being very misleading!

    Case in point, IBM loves to talk about perf/core, but they avoid telling everyone that they lead the industry, by several factors in price/core!

    The IT jungle says: "340,000. It costs $17,700 to activate a core, so pushing it to eight cores costs another $141,600." So on a per core basis that means POWER6 costs $60,000/core. Actually I've estimated that in a system with memory an IBM power6 system costs $65K/core to $130K/core! Any customer needs to do their own pricing.

    So comparing systems of dramatically different cost-per-core on a per-core basis NO LONGER MAKES ANY SENSE! No wonder IBM wants to stilt comparisons and avoid pricing! The only other clues on list price on power6 is here.

    [3] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    IBM power6 chip already 3rd fastest

    Friday Jun 15, 2007

    IBM statement no longer true. IBM press release proudly stated 3 weeks ago that "IBM Unleashes World's Fastest Chip in Powerful New Computer". Now the 1.4 GHz UltraSPARC T1 chip is 10% faster than the IBM POWER6 chip.

    The IBM p570 Power6 4.7GHz 1 chip/2core/4threads had a score of 88,089 SPECjbb2005 bops. It is also very clear that you can not compare systems performance on a per core basis. You'd have to do it by system price, go ahead and price out a 4RU p570 and a 2RU T2000, or check the wattage. Also as you can see IBM's faster GHz and fewer very expensive cores end up delivering slower system performance.

    SPECjbb2005 (ordered by perf, bops : SPECjbb2005 Business Operations per Second, bigger is better)

    System Date Processors Performance
    (Chips, Cores, Threads) GHz / Type SPECjbb 2005
    bops
    JVMs SPECjbb 2005
    bops/JVM
    Dell PowerEdge 860 1/07 (1, 4, 4) 2.4 Xeon 112092 1 112092
    Sun Blade T6300 6/07 (1, 8, 32) 1.4 US-T1 96523 4 24121
    IBM p570 6/07 (1, 2, 4) 4.7 POWER6 88089 1 88089
    Fujitsu TX150 6/07 (1, 2, 2) 2.66 Xeon 70324 1 70324
    Dell PowerEdge 840 10/06 (1, 2, 2) 2.66 Xeon 52002 1 52002
    Fujitsu RX100 10/06 (1, 2, 2) 2.4 Xeon 49892 1 49892

    Complete benchmark results may be found at the SPEC benchmark website http://www.spec.org.

    Benchmark Description

    SPECjbb2005 (Java Business Benchmark) measures the performance of a Java implemented application tier (server-side Java). The benchmark is based on the order processing in a wholesale supplier application. The performance of the user tier and the database tier are not measured in this test. The metrics given are number of SPECjbb2005 bops (Business Operations per Second) and SPECjbb2005 bops/JVM (bops per JVM instance).

    Disclosure Statement:

    SPECjbb2005 Sun Fire T6300 (1 chip, 8 cores) 96523 SPECjbb2005 bops, 24131 SPECjbb2005 bops/JVM, IBM p570 (1 chip, 2 cores) 88089 SPECjbb2005 bops, 88089 SPECjbb2005 bops/JVM, Fujitsu TX150 (1 chip, 2 cores) 70324 SPECjbb2005 bops, 70324 SPECjbb2005 bops/JVM, Dell PowerEdge 860 (1 chip, 4 cores) 112092 SPECjbb2005 bops, 112092 SPECjbb2005 bops/JVM, Dell PowerEdge 840 (1 chip, 2 cores) 52052 SPECjbb2005 bops, 52052 SPECjbb2005 bops/JVM, Fujitsu RX100 (1 chip, 2 cores) 49892 SPECjbb2005 bops, 49892 SPECjbb2005 bops/JVM. IBM p570 Power6 (1 chip, 2 cores, 4 threads) 88089 SPECjbb2005 bops, 88089 SPECjbb2005 bops/JVM. SPEC, SPECjbb reg tm of Standard Performance Evaluation Corporation. Results as of 6/14/2007 on http://www.spec.org.
    Results
    Sun Blade T6300: 96523 SPECjbb2005 bops
    24131 SPECjbb2005 bops/JVM
    Reference Date: June 6, 2007
    Systems: Sun Blade T6300, 32GB
    Total Number Processors: 1
    Processor/GHz of Server: US-T1 1.4 GHz
    Operating System: Solaris 10 8/07
    JVM: Java HotSpot(TM) 32-Bit Server, Version 1.6.0_02

    See Also

    Sun Press Release

    [10] Comments
    Like this post? del.icio.us | furl | slashdot | technorati | digg

    Niagara2 early press

    Wednesday Feb 07, 2007

    CNET news has some press about Sun's next CMT technology.

      "We have silicon back (from manufacturer Texas Instruments) and Victoria Falls systems running in the labs at Sun at full speed, full performance," John Fowler, executive vice president of systems, said via Webcast Tuesday. "In the first half of calendar 2008, we will introduce products based on Victoria Falls."

    You can read more at:
    http://news.com.com/Sun+dual-Niagara+servers+due+in+2008/2100-1010_3-6156883.html

    ...all I can say is fast to production, and fast in the lab!

    Like this post? del.icio.us | furl | slashdot | technorati | digg

    Blueprint for UltraSPARC T1 success

    Wednesday Jan 31, 2007

    Sun Blueprints are technical best practices, derived from the real-world experience of Sun experts. The whitepapers are available for free! downloads. You can buy blueprint books on other topics.

    A new one of note is the updated: "Developing and Tuning Applications on UltraSPARC T1 Chip Multithreading Systems" written by Denis Sheehan. For more info see: http://www.sun.com/blueprints/0107/819-5144.html

    Like this post? del.icio.us | furl | slashdot | technorati | digg

    Sun T2000 3.4x beter than Itanium 2 on cost

    Wednesday Dec 13, 2006

    Sun Fire T2000 Sun Java Systems Application Server World Record Two-Node Price/Performance on SPECjAppserver2004. The HP Itanium2-based server is over 3.4x times most costly than the Sun Fire T2000 Server using Sun Java EE application. Sun's solutions is the lowest overall application tier deployment and maintenance cost. The Sun Fire T2000 server also requires only 2RU vs. 4RU for the more-costly HP Itanium 2 server.

    Sun's dual-node configuration delivered throughput in excess of 26,000 operations per minute with absolutely no Java EE Application Server licensing constraints. This submission is fully Java EE 5 compatible and fully supported by Sun Microsystems. Sun Microsystems is the only vendor to provide robust performance on the SPECjAppServer benchmark using a Java EE 5 certified application server.

    One Sun Fire T2000 server equipped with 1 UltraSPARC T1 processor (8 cores) at 1.2 GHz running Sun Java System Application Server 9.0 UR 1 PE and a Sun Fire T2000 equipped with 1 UltraSPARC T1 processor (6 cores) at 1.0 GHz running Oracle 10g Database Standard Edition delivered a result of 521.42 SPECjAppServer2004 JOPS@Standard for the SPECjAppServer2004 benchmark for the best application tier price/performance result for a two-node server.

    Competitive Landscape

    SPECjAppServer2004 Performance Chart - Ordered by Application Tier Cost per SPECjAppServer2004 JOPS@Standard (lower is better), CPJ column is the Cost Per SPECjAppServer2004 JOPS@Standard, Score column is the SPECjAppServer2004 JOPS@Standard result, Cost column is the application cost of the hardware and software.

    Submitter CPJ Score Cost J2EE Server DB Server
    Sun $51.81 521.42 $27,016.95 1 x Sun Fire T2000
    8 cores, 1 chip (8 cores/chip) @1.2GHz
    Sun Java System AS 9.0 PE
    1 x Sun Fire T2000
    6 cores, 1 chip (6 cores/chip) @1.0GHz
    Oracle 10g 10.2.0.2
    Sun/BEA $76.37 615.64 $47,016.95 1 x Sun Fire T2000
    8 cores, 1 chip (8 cores/chip) @1.2GHz
    WebLogic
    1 x Sun Fire V490
    8 cores, 4 chips (2 cores/chip) @1.5GHz
    Oracle 10g 10.1.0.4
    Sun $122.13 616.22 $75,234.92 1 x Sun Fire T2000
    8 cores, 1 chip (8 cores/chip) @1.2GHz
    WebSphere Application Server Network Deployment, Version 6.1
    1 x Sun Fire X4200
    4 cores, 2 chip (2 cores/chip) @2.2GHz
    DB2 Universal Database v8.2.4
    HP $298.20 471.28 $140,537.88 1 x rx4640
    4 cores, 4 chips (1 core/chip) Itanium 2 @1.6GHz
    BEA WebLogic 9.0
    1 x rx4640
    4 cores, 4 chips (1 core/chip) Itanium 2 @1.6GHz
    Oracle 10g 10.1.0.4

  • SPECjAppServer2004 Results Page
  • Benchmark Description

    SPECjAppServer2004 (Java Application Server) is a multi-tier benchmark for measuring the performance of Java 2 Enterprise Edition (J2EE) technology-based application servers. SPECjAppServer2004 is an end-to-end application which exercises all major J2EE technologies implemented by compliant application servers as follows:

    • The web container, including servlets and JSPs
    • The EJB container
    • EJB2.0 Container Managed Persistence
    • JMS and Message Driven Beans
    • Transaction management
    • Database connectivity
    Moreover, SPECjAppServer2004 also heavily exercises all parts of the underlying infrastructure that make up the application environment, including hardware, JVM software, database software, JDBC drivers, and the system network.

    The primary metric of the SPECjAppServer2004 benchmark is jAppServer Operations Per Second (JOPS) which is calculated by adding the metrics of the Dealership Management Application in the Dealer Domain and the Manufacturing Application in the Manufacturing Domain. There is NO price/performance metric in this benchmark.

    Disclosure Statement:

    SPEC, SPECjAppServer reg tm of Standard Performance Evaluation Corporation. Results from www.spec.org as of Dec 6, 2005. Sun Fire T2000 (8 cores, 1 chip) 521.42 SPECjAppServer2004 JOPS@Standard. HP rx4640 (4 cores, 4 chips) 471.28 SPECjAppServer2004 JOPS@Standard. Sun HW+SW application tier cost = $27,016.95, appl cost per JOP = $51.81. HP HW+SW application tier cost = $140,537.88, appl cost per JOP = $298.20 HP Bill of Material from http://www.spec.org/jAppServer2004/results/res2005q3/jAppServer2004-20050913-00016.html. BEA pricing from http://www.awaretechnologies.com/BEA/index.html. IBM pricing from http://www.cdw.com/ System pricing dated 12/06/06 HP rx4640 server specifications 10/19/05 from http://www.hp.com/products1/servers/integrity/entry_level/rx4640/index.html

    Certified Results 521.42 SPECjAppServer2004 JOPS@Standard
    Reference Date: Dec 6, 2006
    Systems: 1 x Sun Fire T2000, UltraSPARC T1 1.2 GHz (8 cores, 1 proc), 32 GB Memory
    1 x Sun Fire T2000, UltraSPARC T1 1.0 GHz (6 cores, 1 proc), 8GB Memory
    Operating System: Solaris 10 6/06 (same for both)
    Software: Sun Java System Application Server 9.0 Platform Edition UR1 Patch 1
    Oracle 10g Database Enterprise Edition v10.2.0.2
    JVM: J2SE 6.0

    Like this post? del.icio.us | furl | slashdot | technorati | digg