This article has been peer-reviewed.
Computing in Science & Engineering
Novel Architectures
QPACE: Q CP C  C B E
Gottried Goldrian, Thomas Huth, Benjamin Krill, Jack Lauritsen, and Heiko Schick
 IBM Research and Development Lab, Böblingen, Germany
Ibrahim Ouda
 IBM Systems and Technology Group, Rochester, Minnesota
Simon Heybrock, Dieter Hierl, Thilo Maurer, Nils Meyer, Andreas Schäfer, Stefan Solbrig, Thomas Streuer, and Tilo Wettig
University of Regensburg, Germany
Dirk Pleiter, Karl-Heinz Sulanke, and Frank Winter
Deutsches Elektronen Synchrotron, Zeuthen, Germany
Hubert Simma
University of Milano-Bicocca, Italy
Sebastiano Fabio Schifano and Raffaele Tripiccione
University of Ferrara, Italy
 Andrea Nobile
 European Center for Theoretical Studies, Trento, Italy
 Matthias Drochner and Thomas Lippert
 Research Center Jülich, Germany
Zoltan Fodor
University of Wuppertal, Germany
Application-driven computers for Lattice Gauge Theory simulations have often been based on system-on-chip designs, but the development costs can be prohibitive for academic project budgets. An alternative approach uses compute nodes based on a commercial processor tightly coupled to a custom-designed network processor. Preliminary analysis shows that this solution offers good performance, but it also entails several challenges, including those arising from the processor’s multicore structure and from implementing the network processor on a field-programmable gate array.
Q
  (QCD)  w-  - w       q  , w      , ,  . I    ,    QCDv— ,   w  - v    x    . I  ( ) ,   x ’ -,      . T    w  -v    QCD (LQCD), , - v   K W   30  .
1
I  w, QCD  -       w    M C q. M  v  LQCD MC  v    v - v     -  q. S   1980, LQCD  v    v     - ,  v  v    -
1521-9615/08/$25.00 © 2008 IEEE. Copublished by the IEEE CS and the AIP. November/December 2008
,  PC ,  -. W’      w LQCD        w w  v  --  w- . I  -,  v   QCD O C (QCDOC)
2
 NEXT,
3
w - LQCD   v   2004  2005, v, IBM’ BG
4
. T    -- . I , wv, VLSI  v      SC    -    . T   ,  QCD PC   C (QPACE)       IBM’ PwXC 8       - w   w w        3D  . T   w -’ , w   Xx Vx-5-   (FPGA). I   w ,   v w BG     wv . H, w   -   w   PwXC 8 , w w  ’ w   LQCD. W       w w’ -  v   .
la Gag ty cmpng
LQCD     -’  . T,   v - LQCD   v          - Tf-. A  LQCD   -    ’ . LQCD v    x-    v      x    .LQCD   4D – w     (
 N 
=
 L
4
   
 L
). A -v  - vv  x     :     D    q 
ψ
. I LQCD,  q 
ψ
 x ia
   
 x = 1, . . . , N 
   
a
= 1, . . . , 3   
= 1, . . . , 4. T,
ψ
  v w 12
 N 
x . A Eq 1 w,      W D 
D
 
 
ψ
 w(   x,   -    ): 
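$$\psi'_x = D_h \psi = \sum_{\mu=1}^{4} \left\{ U_{x,\mu}\,(1+\gamma_\mu)\,\psi_{x+\hat\mu} + U^{\dagger}_{x-\hat\mu,\mu}\,(1-\gamma_\mu)\,\psi_{x-\hat\mu} \right\}. \qquad (1)$$
Here,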
 μ
   – , 
 x
,
 μ
  SU(3)    w   w - . T    v -       (   4
 N 

 x
,
 μ
 3 × 3 x). T
γ 
 μ
  () D ,  . T, ---   w   
 L
  32 .R ,   LQCD         v -        -   D  w v . Ex   w. A Eq 1 w,
D
  -  , w   v    LQCD : 
-    . E        . W              ( 
< 4, w   4 −
       ).I       4D -   
 p
4
,    w     4D   (
 L
 /
 p
)
4
- . E     -   . T            ’      -- . P    w :          ’ ,     q       .P  w     -  ,      v      ,   w      ’ . W         -   w     v
 
  q - w -  w
 p
(w v  w q ).C,   v  -  w   w  v- v     w      -- w q. IBM’ C/B.E.(www../vw/w/)   v .
t PwXc 8P’ Pfman
 T C/B.E.    PwPC     - .
5
E SPE       w 256 K -   (LS), w         /    128 - 128- . A SPE x w   ,    - f- . T,   -     SPE   C/B.E.  204.8 Gf 3.2 GH. T PwXC 8    v  C/B.E., w   - -    -  102.4 Gf w IEEE- .I   -     w  25.6 G/   - R FxIO I/O   -     w   w  25.6 G/.I,           (EIB)    (DMA) . A  w v   PS3,  C/B.E.  v v  -   w.
6–8
T  v   PwXC 8. A w   L 2007 C,
8
w v ’  x     .
9
 W w -  .
Pfman Md
 W  w   w v: v,     LS,       -      v      - /    v . S v   f-(FP) , DMA ,       w
β 
 - 
λ
. W  v         -   ’  v. Tx 
   
     ,
$$T_i \simeq I_i / \beta_i + O(\lambda_i), \qquad (2)$$
where
 I 
     . I w, w         x         . Gv ,   x 
$$T_{\mathrm{exe}} \approx \max_i T_i. \qquad (3)$$
I
 
   v    ’ FP    “ -,” w   FP  
ε
FP

$\varepsilon_{\mathrm{FP}} = T_{\mathrm{peak}} / T_{\mathrm{exe}}$
.F 1 w  fw   - x 
   :f- ,
FP
;/  w   (RF) LS,
RF
;-  ,

;  w SPE   ,

;x  w ,
x
= x(
NIF
,
 
);  v  EIB ( , , x ),
EIB
.
[Figure 1 diagram: register file (RF), local store (LS), network interfaces, main memory (MM), and EIB, with execution-time labels FP, ILB, RF, link, NIF, mem, and EIB.]
Figure 1. Data-flow paths for a single synergistic processor element and corresponding execution times ($T_i$). The model assumes a communication network with a 3D torus topology in which 12 links (six inbound and six outbound) simultaneously transfer data.