| Home Page |
$ ssh <username>@clairaut.cs.rit.edu
$ export PATH=/opt/cuda/bin:$PATH
$ export LD_LIBRARY_PATH=/opt/cuda/lib:$LD_LIBRARY_PATH
Things to notice:
all : OuterProductCPU \
	OuterProductGPU

OuterProductCPU : OuterProductCPU.cu Util.cu Random.cu
	nvcc -o OuterProductCPU OuterProductCPU.cu

OuterProductGPU : OuterProductGPU.cu Util.cu Random.cu
	nvcc -arch compute_20 -code compute_20,sm_20 -o OuterProductGPU OuterProductGPU.cu

clean :
	rm -f OuterProductCPU
	rm -f OuterProductGPU
$ ./OuterProductGPU 142857 16384
CUDA device 0: Tesla C2075, compute capability 2.0
A[0] = 0.856121
A[16383] = 0.074512
B[0] = 0.927307
B[16383] = 0.063003
C[0][0] = 0.793887
C[0][16383] = 0.053938
C[16383][0] = 0.069096
C[16383][16383] = 0.004694
23 msec computation
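The printed values are consistent with C[i][j] = A[i] * B[j]; for instance, 0.856121 * 0.927307 is approximately 0.793887. A minimal host-side sketch of that computation follows. The function name outer_product and the row-major layout of C are assumptions made for illustration; they are not taken from OuterProductCPU.cu or OuterProductGPU.cu.

```c
#include <stddef.h>

/* Hypothetical helper (not from the course sources): computes the outer
   product of the length-n vectors a and b. The result is stored row-major
   in c, so element C[i][j] lives at c[i*n + j]. */
void outer_product(const double *a, const double *b, double *c, size_t n)
{
    for (size_t i = 0; i < n; ++i) {
        for (size_t j = 0; j < n; ++j) {
            c[i * n + j] = a[i] * b[j];
        }
    }
}
```

Calling it with the two printed elements of A and of B (n = 2) gives values that agree with the four printed elements of C to within the rounding of the six-digit output.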
Running times (msec), three runs for each N:

       ----CPU----    ----GPU----
    N   Comp  Total    Comp  Total
 1024     26     32       0    156
 1024     25     32       1    192
 1024     25     32       0    244
 2048    101    112       0    284
 2048    102    112       0    284
 2048     74     88       1    284
 4096    250    268       1    352
 4096    229    244       1    284
 4096    209    224       1    224
 8192    711    748       5    664
 8192    734    768       5    668
 8192    737    772       5    564
16384   2625   2728      23   1752
16384   2549   2656      22   1784
16384   2600   2704      22   1776

Smallest running time (msec) over the three runs for each N:

       ----CPU----    ----GPU----
    N   Comp  Total    Comp  Total
 1024     25     32       0    156
 2048     74     88       0    284
 4096    209    224       1    224
 8192    711    748       5    564
16384   2549   2656      22   1752
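
One way to read the table is as a ratio of CPU time to GPU time. The sketch below assumes both times are in msec, as in the program output; the name speedup and the -1.0 sentinel are illustrative choices, not part of the course code. A GPU time of 0 means the run was shorter than the 1 msec timer resolution, so no meaningful ratio can be formed.

```c
/* Hypothetical helper: ratio of CPU running time to GPU running time.
   Returns -1.0 when the GPU time is 0 msec or less, because a time below
   the timer resolution gives no usable ratio. */
double speedup(long cpu_msec, long gpu_msec)
{
    if (gpu_msec <= 0) {
        return -1.0;
    }
    return (double)cpu_msec / (double)gpu_msec;
}
```

With the N = 16384 row, speedup(2549, 22) is roughly 116 for the computation alone, while speedup(2656, 1752) is roughly 1.5 for the total running time. With the N = 1024 row, speedup(32, 156) is below 1, meaning the GPU program's total time exceeds the CPU program's.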