Benchmarks
| Summary SGI/Cray Origin 2000
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/Processor
1 1x1x1 3.5313E+03 3.5112E+03 1.4309E+05 1.00 1.00
2 2x1x1 1.8813E+03 1.8705E+03 2.6853E+05 1.88 0.94
4 2x2x1 1.1321E+03 1.1254E+03 4.4627E+05 3.12 0.78
8 2x2x2 5.1975E+02 5.1642E+02 9.7238E+05 6.80 0.85
16 4x2x2 2.8300E+02 2.8098E+02 1.7863E+06 12.50 0.78
32 4x4x2 1.4050E+02 1.3947E+02 3.5984E+06 25.18 0.79
64 4x4x4 7.4563E+01 7.3919E+01 6.7854E+06 47.50 0.74
128 8x4x4 5.2875E+01 5.1545E+01 9.7136E+06 68.12 0.53
256(gsn) 8x8x4 2.5750E+01 2.5194E+01 1.9828E+07 139.37 0.54
512(gsn) 8x8x8 1.7500E+01 1.7012E+01 2.8890E+07 206.40 0.40
GRID: fixed 512 x 512 x 512 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/8 Proc.
8 2x2x2 5.3834E+03 5.3524E+03 7.5229E+05 1.00 1.00
16 4x2x2 2.7225E+03 2.7076E+03 1.4871E+06 1.98 0.99
32 4x4x2 1.4439E+03 1.4345E+03 2.8070E+06 3.73 0.93
64 4x4x4 5.5581E+02 5.5205E+02 7.2937E+06 9.70 1.21
128 8x4x4 3.5080E+02 3.4651E+02 1.1620E+07 15.45 0.97
256 8x8x4 2.0556E+02 2.0193E+02 1.9940E+07 26.19 0.82
512(gsn) 8x8x8 1.3425E+02 1.3069E+02 3.0811E+07 40.96 0.64
Magneto-Hydrodynamics Benchmarks
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/8 Proc.
8 2x2x2 2.4671E+03 2.4581E+03 2.0463E+05 1.00 1.00
16 4x2x2 1.2191E+03 1.2140E+03 4.1418E+05 2.02 1.01
32 4x4x2 5.8719E+02 5.8469E+02 8.5997E+05 4.20 1.05
64 4x4x4 2.9969E+02 2.9805E+02 1.6859E+06 8.25 1.03
128 8x4x4 1.5938E+02 1.5812E+02 3.1787E+06 15.55 0.97
256(gsn) 8x8x4 7.3250E+01 7.2483E+01 6.9100E+06 33.91 1.06
512(gsn) 8x8x8 6.0000E+01 5.9061E+01 8.4334E+06 41.62 0.65
GRID: fixed 512 x 512 x 512 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/16 Proc.
16 4x2x2 1.1872E+04 1.1825E+04 3.4051E+05 1.00 1.00
32 4x4x2 5.7961E+03 5.7703E+03 6.9781E+05 2.12 1.06
64 4x4x4 2.9386E+03 2.9229E+03 1.3776E+06 4.05 1.01
128 8x4x4 1.4267E+03 1.4141E+03 3.1787E+06 8.36 1.05
256 8x8x4 7.7675E+02 7.7006E+02 5.2289E+06 15.36 0.96
512(gsn) 8x8x8 4.3013E+02 4.2405E+02 9.4954E+06 27.89 0.87
Platinum IA-32 Linux Cluster
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/8 Proc.
8 2x2x2 4.2594E+02 4.4800E+02 1.1213E+06 1.00 1.00
16 4x2x2 2.5666E+02 2.2625E+02 2.2198E+06 1.98 0.99
32 4x4x2 1.1651E+02 1.2560E+02 3.9813E+06 3.57 0.89
64 4x4x4 6.0999E+01 6.4960E+01 7.6972E+06 6.90 0.86
128 8x4x4 2.4205E+01 3.7540E+01 1.3329E+07 11.93 0.75
256 8x8x4 3.9606E+01 2.9120E+01 1.7149E+07 15.39 0.48
GRID: fixed 512 x 512 x 512 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/64 Proc.
64 4x4x4 5.7435E+02 5.9830E+02 6.7300E+06 1.00 1.00
128 8x4x4 3.2650E+02 2.9315E+02 1.3735E+07 2.04 1.02
256 8x8x4 1.5479E+02 1.5067E+02 2.6724E+07 3.97 0.99
Magneto-Hydrodynamics Benchmarks
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/8 Proc.
8 2x2x2 1.2303E+03 1.2034E+03 4.1782E+05 1.00 1.00
16 4x2x2 6.0194E+02 6.2282E+02 8.0728E+05 1.93 0.97
32 4x4x2 3.4175E+02 3.3023E+02 1.5203E+06 3.64 0.91
64 4x4x4 1.5862E+02 1.7122E+02 2.9283E+06 7.03 0.88
128 8x4x4 9.8669E+01 9.0650E+01 5.5364E+06 13.28 0.83
256 8x8x4 3.5096E+01 5.1570E+01 9.7034E+06 23.34 0.73
GRID: fixed 512 x 512 x 512 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/64 Proc.
64 4x4x4 1.3812E+03 1.3694E+03 2.9404E+06 1.00 1.00
128 8x4x4 7.2175E+02 7.1130E+02 5.6608E+06 1.93 0.96
256 8x8x4 3.6330E+02 3.6265E+02 1.1103E+07 3.78 0.94
Titan IA-64 Linux Cluster
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/4 Proc.
4 2x2x1 3.8400E+02 3.9503E+02 1.2719E+06 1.00 1.00
8 2x2x2 1.9200E+02 2.1357E+02 2.3539E+06 1.85 0.93
16 4x2x2 1.2800E+02 1.1508E+02 4.3683E+06 3.43 0.86
32 4x4x2 5.6788E+01 5.6693E+01 8.8631E+06 6.97 0.87
64 4x4x4 3.0597E+01 3.0480E+01 1.6450E+07 12.96 0.81
128 8x4x4 1.6303E+01 1.6214E+01 3.0873E+07 24.36 0.76
GRID: fixed 512 x 512 x 512 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/32 Proc.
32 4x4x2 5.1200E+02 5.2716E+02 7.6381E+06 1.00 1.00
64 4x4x4 2.5600E+02 2.6164E+02 1.5390E+07 2.02 1.01
128 8x4x4 1.2800E+02 1.2679E+02 3.1758E+07 4.16 1.04
Magneto-Hydrodynamics Benchmarks
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/4 Proc.
4 2x2x1 1.2074E+03 1.2053E+03 4.1686E+05 1.00 1.00
8 2x2x2 6.4000E+02 6.0782E+02 8.2762E+05 1.99 0.99
16 4x2x2 3.2000E+02 3.2453E+02 1.5501E+06 3.71 0.93
32 4x4x2 1.2800E+02 1.6390E+02 3.0656E+06 7.35 0.92
64 4x4x4 6.4000E+01 8.8565E+01 5.6738E+06 13.61 0.85
128 8x4x4 6.4000E+01 4.7618E+01 1.0562E+07 25.31 0.79
GRID: fixed 512 x 512 x 512 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/32 Proc.
32 4x4x2 1.3440E+03 1.3258E+03 3.0371E+06 1.00 1.00
64 4x4x4 7.0400E+02 6.8462E+02 5.8814E+06 1.94 0.97
128 8x4x4 3.2000E+02 3.4359E+02 1.1719E+07 3.86 0.97
PSC Terascale Computing System
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/4 Proc.
4 2x2x1 3.7998E+02 3.7859E+02 1.3262E+06 1.00 1.00
8 2x2x2 1.9520E+02 1.9216E+02 2.6127E+06 1.97 0.99
16 4x2x2 1.0344E+02 1.0132E+02 4.9560E+06 3.74 0.94
32 4x4x2 5.4104E+01 5.0349E+01 9.9689E+06 7.52 0.94
64 4x4x4 2.7884E+01 2.5834E+01 1.9421E+07 14.66 0.92
128 8x4x4 1.8322E+01 1.4263E+01 3.5060E+07 26.54 0.83
256 8x8x4 1.2815E+01 8.5420E+00 5.8702E+07 44.32 0.69
Magneto-Hydrodynamics Benchmarks
GRID: fixed 256 x 256 x 256 (30 steps) MFLOPS
Processors Layout Wall Clock(s) tused(s) Zone-Cycles/sec Speedup Speedup/8 Proc.
8 2x2x2 7.2296E+02 7.2033E+02 6.9811E+05 1.00 1.00
16 4x2x2 3.7760E+02 3.7330E+02 1.3471E+06 1.93 0.97
32 4x4x2 1.8024E+02 1.7666E+02 2.8458E+06 4.08 1.02
64 4x4x4 8.6566E+01 8.3450E+01 6.0239E+06 8.63 1.08
128 8x4x4 4.5811E+01 4.3601E+01 1.1525E+07 16.52 1.03
256 8x8x4 2.5732E+01 2.3071E+01 2.1798E+07 31.22 0.98
Hydrodynamics: Magneto-Hydrodynamics: Poisson Solvers Benchmarks
GRID: fixed 255 x 255 x 255
Processors Layout CPU time(s) Speedup Speedup/8 Processors
8 2x2x2 206.98 1.00 1.00
16 4x2x2 112.02 1.85 0.93
32 4x4x2 56.70 3.65 0.91
64 4x4x4 56.43 3.66 0.46
128 8x4x4 97.71 2.12 0.13
GRID: fixed 256 x 256 x 256
Processors Layout CPU time(s) Speedup Speedup/Processor
1 1x1x1 40.72 1.00 1.00
2 2x1x1 19.29 2.11 1.06
4 4x1x1 10.83 3.76 0.94
8 8x1x1 5.65 7.21 0.90
16 16x1x1 2.47 16.49 1.03
32 32x1x1 1.68 24.24 0.76
64 64x1x1 3.28 12.41 0.19
128 128x1x1 3.11 13.09 0.10
Back to ZEUS-MP 1.0 |