BM Seer Unofficial thoughts from an anonymous Sun employee

Intel chip power & details on Opteron vs. Woodcrest, etc

Friday Dec 08, 2006

Here is a processor wattage chart, but notice that Intel has the memory controller off chip so you need to add 30-35 watts to this figures (opteron includes this on chip) http://www.intel.com/products/processor_number/chart/xeon.htm

For more details on power budget breakdown I got pointed to these two pages for an AMD comparison (looks like it is part of a bigger preso?, no confidential statement): http://www.amd.com/us-en/assets/content_type/DownloadableAssets/opt_vs_wc_8_dimms.pps
http://www.amd.com/us-en/assets/content_type/DownloadableAssets/4_opt_vs_4_pax.pps

[2] Comments
Like this post? del.icio.us | furl | slashdot | technorati | digg

details on power budgets: Opteron advantage over Woodcrest

Thursday Dec 07, 2006

More details on power budget differences that give Opteron at least a 34% lead over Woodcrest.

I gave some basics of this in this posting: http://blogs.sun.com/bmseer/entry/design_strategies%3A_wattage_advantage_of

Woodcrest power budget: Dual-core Xeon's : 160 watts per socket (80w each) PLUS 44.8 watts for chipset (incl memory controllers) PLUS 66.4 watts 166.4 watts FBDIMM (16 DIMMs).

    {{typo corrected: yes FB-DIMMs suck an amazing 170 watts for 16 DIMMs -- that's nearly 100watts more than DDR2. That is why Intel-based systems only report wattage on small memory configs, but still use the same large memory configs for various benchmarks.}}

Opteron power budget: Dual-core Opteron's: 190 watts socket (95w max each) PLUS 16 watts for chipset PLUS 70.4 watts for DDR2 (16 DIMMs).

...and this is just looking at just the chips -- and not adding the typical controllers you'd have for a functioning system like disk , network, etc...

[3] Comments
Like this post? del.icio.us | furl | slashdot | technorati | digg

Design strategies: wattage advantage of Opteron vs. Woodcrest

Tuesday Dec 05, 2006

Some things to look at when you seen marketing around wattage. You can avoid errors by really looking at total measured wattage when systems running and doing real work. I've seen a lot of Intel marketing about wattage of Woodcrest being 65 watts. But that really doesn't show the whole picture. I'll break it down a bit...

What GHz at what wattage?:First recognize that Woodcrest 2.66 GHz & 2.33 GHz is 65 watts for chip only, but Woodcrest at 3.0 GHz is 80 watts. ...and all benchmarks I've seen is on the 80 watt 3.0 GHz systems.

What about the memory controller?: The CPU isn't everything. Woodcrest designs have an external memory controller. Opteron designs have an integrated memory controller. So you need to add another 30 watts (or more) for the pair of Woodcrest CPUs.

What about the memory technology differences?: The CPU+Memory_controller isn't everything. Woodcrest designs use FB-DIMMs. Opteron designs use the more power efficient DDR2. FB-DIMMS draw a lot more power. In fact, as I've blogged about before, 32GB 2-socket Woodcrest system draws 500 watts! Measured when the CPU is busy. Sun's Opteron systems is way over 100 watts less.

Every IT department I talk to really wants to cut cost out -- power consumption is a growing a major factor in IT costs.

...this just in...

Sun is now shipping a wattage meter with the "Try-and-buy" program for Sun Fire T2000. More details at: http://blogs.sun.com/cohen/entry/kill_a_watt_--_power

Like this post? del.icio.us | furl | slashdot | technorati | digg

New World Record SPECint_rate2006 Sun Ultra 40 M2 Workstation

Thursday Nov 16, 2006

The Sun Ultra 40 M2 Workstation demonstrates a new World Record integer throughput performance for all x86 systems, sets a new world record on the new and improved SPEC cpu benchmark called "SPECint_rate2006." It fixes things like SPECint_rate2000 has/had floating-point applications in the integer suite, whaaaat? yes strange but true.

The Sun Ultra 40 M2 delivered the SPECint_rate2006 score of 48.8, using Solaris 10 and Studio 11 combination. Sun's Opteron beats Woodcrest by 7%. As you can see below 'Peak' means you add a few more compiler flags. I guess Woodcrest didn't have any others to try on Woodcrest or maybe they saw no improvement so they avoided publishing? Anyone know?

Competitive Landscape

Selected SPEC CPU2006 (SPECint_rate2006) Performance Results - bigger is better, see www.spec.org for complete results.

System Processors Performance Results
Type GHz Chips Cores Threads Peak Base
Sun Ultra 40 M2 AMD Opteron 2220SE 2.8 2 4 4 48.4 41.9
HP DL585 Opteron 854 2.8 4 4 4 46.9 41.4
Supermicro X7DBE Woodcrest, Xeon 5160 3.0 2 4 4 --- 45.2
Sun Fire X4200 Opteron 285 2.6 2 4 4 42.8 37.8
Fujjitsu RX220 Opteron 280 2.4 2 4 4 40.0 35.7
Sun Fire X4200 Opteron 256 3.0 2 2 2 26.4 23.1
HP DL585 Opteron 854 2.8 2 2 2 25.2 22.3
Dell PrecWork 380 Pentium EE 3.73 1 2 2 -- 23.1
HP DL380 G4 Pentium 4 3.8 2 2 2 -- 20.9

Benchmark Description

SPEC CPU2006 is made up of two suites of benchmarks, CFP2006 and CINT2006. CFP2006 targets floating-point performance, while CINT2006 targets integer performance.

Each suite has two different measures. First is the CPU measure, which is the performance on the suite as a single stream. This can be either a single thread or automatic compiled parallel run. This measure is further defined by base and optimized runs. Base uses the same compiler flags for all kernels, where optimized is allowed to use different compiler flags for each kernel. Results are compared against a baseline system run that was standardized by www.spec.org.

The second measure is Rate. I think this one is a LOT more important. It is a measure of how many CPU measures can be run at a time. Typically, it is run as n processes on n processors or threads. It shows how well the same job mix can run on a system under some load. It also is run as a base and optimized set of results. "Rate" is what you use for any mult-threaded workstation and all servers.

Disclosure Statement:

SPEC, SPECint reg tm of Standard Performance Evaluation Corporation. Results from www.spec.org as of 11/14/06. Sun Ultra 40 M2, 48.8 SPECint_rate2006.

System Configuration

  • Sun Ultra 40 M2
  • 2 x 2.8 GHz Opteron 2220SE
  • 16GB memory
  • Solaris 10
  • Sun Studio 11
  • 48.8 SPECint_rate2006

Like this post? del.icio.us | furl | slashdot | technorati | digg

Sun Opteron x4100 outscaling woodcrest part 2

Wednesday Nov 15, 2006

As mentioned in the posting earlier today, scalability is important factor in system performance. Woodcrest's poor scaling may not bode well for Cloverton. Sure you can package for threads onto a module, but unless you design for them you'll just have more threads not delivering performance but just burning more watts.

Wattage: I'll get detailed wattage results posted soon, but it looks like as we mentioned Opteron performance is about 20% more than Woodcrest. The wattage for both configurations looks the same. Therefore expect Sun's Opteron to have about 20% perf/watt advantage.

Sun's Fluent results will be posted shortly on the website, it is a busy week with Supercomputing conference and lots of busy people. So keep checking back. A few of the smaller gave Woodcrest a small percent advantage, but most were significantly faster on Sun's Opteron.

...maybe Woodcrest will have better idle power, but why in the world would you buy the latest server and leave it idle?

Like this post? del.icio.us | furl | slashdot | technorati | digg

Sun Opteron x4100 outscaling woodcrest (and outperforming = side benefit)

Wednesday Nov 15, 2006

Woodcrest scaling issues? Yes, remember scaling is critical for system performance, so don't look too much at single core performance or single job performance as it can lead to the wrong conclusions. In fact Sun's Opteron scaling means that the Sun systems can outperform Woodcrest by 18% to 22% as shown below.

On a 4 core/2chip Intel Woodcrest systems they are only seeing 2.8x to 2.9x on 4 cores -- this doesn't bode well for quad-core or larger systems made out of these. Sun sees 3.6x to 4.1x scaling in the table below. Couple this with the high-wattage of these Woodcrest (31-Oct posting) and Woodcrest may have issues?

Opteron leads poor Woodcrest scaling & performance on Fluent 6 Benchmark (Both systems 2 sockets and using dual-core)

System GHz/Chip #cores FL5M3 (scaling) FL5L2 (scaling)
INTEL S5000XAL 3.0GHz Xeon Woodcrest 5160 4-core 827.0 (2.8x) 400.0 (2.9x)
INTEL S5000XAL 3.0GHz Xeon Woodcrest 5160 2-core 553.7 (1.9x) 226.0 (1.6x)
INTEL S5000XAL 3.0GHz Xeon Woodcrest 5160 1-core 297.3 (1.0x) 138.0 (1.0x)
Sun
Sun X4100 M2 2.8GHz Opteron DC 2200 4-core 979.9 (3.6x) 486.6 (4.1x)
Sun X4100 M2 2.8GHz Opteron DC 2200 2-core 516.1 (1.9x) 241.8 (2.1x)
Sun X4100 M2 2.8GHz Opteron DC 2200 1-core 273.5 (1.0x) 117.6 (1.0x)

Rating = No. of sequential runs of test case possible in 1 day, 86,400/(Total Elapsed Run Time in Seconds)

Fluent results at: http://www.fluent.com/software/fluent/fl5bench/flbench_6.2/fullres.htm

...I suspect even better performance and scaling on Sun Fire X4100 M2 with Solaris...

Like this post? del.icio.us | furl | slashdot | technorati | digg

World Record Performance SPECapc Sun beats Woodcrest

Tuesday Nov 14, 2006

Sun Ultra 40 M2 w/2xFX 5500 nVidia Framebuffers (SLI) World Record Performance SPECapc Unigraphics UGS-NX3

The Sun Ultra 40 M2 with dual nVidia Quadro FX 5500s in SLI mode sets a world record running the SPEC APC UGS-NX3 graphics oriented MCAD benchmark beating all desktop platforms, including the Woodcrest and Intel Core2 "Extreme Processor" X6800 cpu's.

The SPEC APC MCAD benchmarks consist of tasks representative of what a designer would do in a typical session. This consists of "Graphics", "CPU", and "I/O" activities.

  • In dual framebuffer SLI mode the Ultra 40 M2 with 2.8GHz 2220SE dual core Opteron processors outperforms a Dell 690 (3.0 GHz Woodcrest) by 14% overall and by 37% in the graphics test components.
  • In addition, in dual framebuffer SLI mode the existing Ultra 40 outperforms the Dell 690 (Woocrest 3.0 GHz) by 16% overall and by 61% in the graphics component. The Ultra 40 with 3.0 GHz single core Opteron 256 processors (400 MHz DDR1 dimms) versus the 2.8 GHz dual core Opteron 2220SE processors (667 MHz DDR2 dimms), edging the Ultra 40 M2 by about 1%.

The Sun Ultra 40 with a single nVidia Quadro FX 5500 outperforms most other high end desktops equipped with a single framebuffer with currently posted results obtained running the SPEC APC UGS-NX3 benchmark.

  • The Sun Ultra 40 with FX 5500 framebuffer outperforms (is faster than) Woodcrest desktops. H-P XW 6400 (4% overall, 39% on graphics); Dell Precision 690 (9% overall, 52% on graphics); IBM Intellistation Z Pro 9228 (14% overall, 62% on graphics)

  • The Sun Ultra 40 with FX 5500 framebuffer also outperforms all desktops equipped with the Intel 2.93 GHz X6800 "Extreme Processors". H-P XW 4400 (6% overall, 47% on graphics); Dell Precision 390 (10% overall, 47% on graphics)

Sun Opteron desktops have dominated with leading MCAD benchmark results dating back to the introduction of the Sun W1100 and W2100. Sun desktops continue to exhibit excellent MCAD performance as demonstrated by the world record results here for this SPEC APC UGS-NX3 benchmark.

SPECapc Unigraphics NX 3 Benchmark Competitive Landscape (larger is faster):

System Overall
Composite
CPU
Composite
File I/0
Composite
Graphics
Composite
Sun Ultra 40
3.0GHz Opteron 256
2x FX 5500 (SLI)
7.28 2.94 2.85 19.81
Sun Ultra 40 M2
2.8GHz Opteron 2220SE
2x FX 5500 (SLI)
7.19 3.08 3.00 16.85
Fujitsu Siemens CELSIUS
3.0GHz Intel 5150
FX 5500
6.42 3.67 2.28 10.17
Dell Precision 690
3.0GHz Woodcrest
2x FX 4500 (SLI)
6.30 3.25 1.64 12.29
Sun Ultra 40
3.0GHz Opteron 256
FX 5500
5.66 2.94 1.96 10.11
HP xw6400 WS
3.0GHz Woodcrest
FX 4500
5.42 3.39 3.51 7.26
HP xw4400
2.93GHz X6800
FX 3500
5.33 3.40 4.52 6.87
Dell Precision 690
3.00 GHz Woodcrest
FX 3500
5.17 3.38 3.69 6.64
Dell Precision 390
2.93 GHz X6800
FX 3500
5.16 3.46 2.18 6.87
IBM Intellistation Z Pro 9228
3.0GHz Woodcrest
FX 3500
4.96 3.43 2.84 6.23

Results Summary for the SPECapc Unigraphics NX 3 benchmark:
Results
Dual
FX 5500
Dual
FX 5500
Overall Composite: 7.19 7.28
CPU Composite: 3.08 2.94
File I/O Composite: 3.00 2.85
Graphics Composite: 16.85 19.81
Reference Date: 11/10/06 10/12/06
System: Sun Ultra U40 M2 Sun Ultra U40
Processor/GHz: Opteron 2220SE/2.8 Opteron 256/3.0

Disclosure Statement:

SPEC reg tm, SPECapc server mark of Standard Performance Evaluation Corporation. Results from www.spec.org as of Oct 12, 2006: Sun Ultra 40, 2xFX 5500, overall composite 7.28; Dell Precision 690, 2xFX 4500, overall composite 6.30. Results from www.spec.org as of Oct 12, 2006: Sun Ultra 40, FX 5500, overall composite 5.66; HP xw6400, FX 4500, overall composite 5.42; Dell Precision 690, FX 3500, overall composite 5.17; IBM Intellistation Z Pro 9228, FX 3500, overall composite 4.96. Results from www.spec.org as of Nov. 8, 2006: Fujitsu Siemens CELSIUS, FX 5500, overall composite 6.42. Results from www.spec.org as of Nov 10, 2006: Sun Ultra 40 M2, 2xFX 5500, overall composite 7.19. Results from www.spec.org as of Oct 12, 2006: Sun Ultra 40, FX 5500, overall composite 5.66; HP xw4400, FX 3500, overall composite 5.33; Dell Precision 390, FX 3500, overall composite 5.16.

Like this post? del.icio.us | furl | slashdot | technorati | digg