Professional Documents
Culture Documents
Edmond Chow
School of Computational Science and Engineering
Georgia Institute of Technology
y1 A11 A12 · · · A1p x1
y2 A21 A22 · · · A2p x2
.. = ..
.. .. .. ..
. . . . . .
yp Ap1 Ap2 · · · App xp
Processor i stores Ai? , xi and computes yi
(block row-wise partitioning).
Distributed sparse matrix-vector multiplication
y1 A11 A12 ··· A1p x1
y2 A21 A22 ··· A2p x2
.. = ..
.. .. .. ..
. . . . . .
yp Ap1 Ap2 ··· App xp
On processor i:
1. Send components of xi needed by other processors
2. Compute ylocal = Aii xi
3. Receive components of xj from other processors j needed
by processor i
4. Compute yext = Ai,ext xext
5. Form yi = ylocal + yext
NP-hard problem.
Hypergraph partitioning
Hypergraph partitioning
Multilevel algorithm: coarsen hyperedges, refinement with
hyperedges.
:
1-----3--:--5-----7
| | : | |
| | : | |
2-----4--:--6-----8
: