You are on page 1of 1

Manually edited CUDA files

2DConvolution
2mm
3DConvolution
3mm
add2D
add3D
atax
bicg
correlation
covariance
fdtd
gemm
gemver
gesummv
lu
matmul
mvt
syr2k
syrk
symm
trisolv
trmm

Array Size
512 X 512
512 X 512
128 X 128 X 16
512 X 512
512 X 512
128 X 128 X 16
4096
4096
512 X 512
512 X 512
512 X 512
512 X 512
4096
4096
512 X 512
512 X 512
4096
512 X 512
512 X 512
512 X 512
4096
512 X 512

Speedup (double)
2.72030651340996
6.60309974779968
2.09708737864078
5.5279070569173
2.08565737051793
0.72422258592471
2.04681582132677
1.96073053668267
2.00869982600348
1.95618649944225
1.31843770338909
4.92366665814827
9.28575325029548
0.95336011975104
3.65745942724262
6.0834396195933
1.84589507106139
1.60771729918071
2.76924476087776
1.21367

2.04343613757468
0.16578114988719

Fermi
Tesla
Fermi
Misses (double)
Speedup (float)
Misses (float)
Speedup (double)
Misses (double)
Speedup (float)
Misses (float)
Misses (float)
Misses (double)
0
1.785519125683
0 0.23820174457616
260160
0.143673738595
0
262144
262144
0
4.694683495888
0 0.87277296132047
262144
0.705367186041
0
523773
523773
0
1.942857142857
0 15.4285714285714
222264
11.64640883978
222264
134217728
error
0
4.995785047186
0 2.80053793354674
262144
1.249173789661
0
785405
785405
0
0.707818930041
0 0.37259786476868
262132
0.115630252101
0
262108
262108
0
0.35101010101
0 4.91666666666667
259053
1.544444444444
259053
773793
error
0
1.924525425021
0 1.08547822665811
0
1.760513285445
0
6106
3166
0
1.737602820212
0 1.28381356623949
8192
1.598940311419
0
0
0
0
1.981960803722
0 2.01123393007329
261121
1.840728370283
0
0
0
0
1.941862860627
78681 1.94293350102185
262144
1.803209317274
95513
249001
249001
0
3.895137860143
0 0.12382107647761
308224
3.701150242818
0
512
512
0
3.898005666352
85505 0.65402861298804
262144
0.590849611562
107389
84858
0
0
7.986765715713
1335 1.05668615177694
4095
1.352324075787
1736
15966196
15966196
0
1.033561971606
1814 0.6794115995733
4095
0.946699395108
2878
995
0
2041
2.844428922148
2041 2.54715568862275
1531
1.888275166046
2041
0
0
0
4.036575792578
40297 0.79138024482739
261632
0.608160382658
30405
0
0
0
1.577230920452
482 1.00205191340925
8192
1.219372523476
170
0
0
0
2.011653326962
230149 0.52176550678706
262143
0.45035408833
230146
340800
0
0
2.334608030593
86135 0.38369219082182
262144
0.375943274773
105863
342437
0
130305
0.458907640665
262144 1.52984614415686
130305
0.563994044955
261632
262144
262144
0
1.630994234285
0 0.7385899381347
0
1.105940688471
0
0
0
207011
0.196735379282
259243 0.10883057883606
210498
0.343047275373
259243
37255
14659

HMPP altered programs


Tesla (Serrano)
Misses (float)
Misses (double)
262144
262144
0
523773
134217728
error
785405
785405
262108
262108
773793
error
9440
15998
6686
15998
0
282953
63000
249001
1000
592000
107090
262144
15966196
15966197
1431
7998
0
0
0
15288
4
7998
431511
1048576
432547
1048576
152178
262144
0
0
48460
261110

Tesla (cuda.acad)
Misses (float)
Misses (double)
0
0
0
0
0
error
0
0
0
0
0
error
error
0
0
0
0
0
0
0
110609
110698
0
0
15966196
15966196
0
0
0
0
0
0
0
0
0
0
0
0
5862
262144
0
0
0
0

You might also like