SpeedIT Tools: Beyond Acceleration

January 4, 2010

Preliminary tests

Filed under: Uncategorized, by admin @ 9:48 pm

Since around 90% of the computational time is devoted to Sparse Matrix-Vector Multiplication (SpMV), we focused on testing this operation first. The attached charts present our results for 23 matrices that differ in size, number of nonzeros (NNZ) and structure. As you can see, the performance depends strongly on the matrix structure. This is why we decided to have two separate kernels for two types of matrices: sparse and denser ones. Please also note that, because of memory transfers and the PCIe bottleneck, it is not worthwhile to use our solvers for only a few iterations.
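For readers unfamiliar with the operation being benchmarked, here is a minimal reference sketch of SpMV on a matrix stored in CSR format (the format named in the methodology). It is plain Python for illustration only and is not our GPU kernel; the function name `spmv_csr`, the array names and the small 3x3 matrix are assumptions made for this example.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A * x for a matrix A stored in CSR format.

    row_ptr[i] .. row_ptr[i + 1] delimits the nonzeros of row i,
    col_idx[k] is the column of the k-th nonzero, values[k] its value.
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Walk over the nonzeros of row i only.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


# Illustrative 3x3 matrix (made up for this example):
# [[4, 0, 1],
#  [0, 2, 0],
#  [3, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 2.0, 3.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
print(y)  # [7.0, 4.0, 18.0]
```

The inner loop length `row_ptr[i + 1] - row_ptr[i]` differs from row to row, and the reads `x[col_idx[k]]` jump around in memory according to the sparsity pattern. This is one plausible reason why the measured performance varies so much with the matrix structure.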

Figure: Performance of SpMV Multiplication in Double Precision

Figure: Performance of SpMV Multiplication in Single Precision

Figure: Speed-up GPU vs. CPU

Methodology

  1. Peak performance was calculated as a mean value from 10 runs with the same experimental conditions.
  2. Benchmark matrices were collected from the University of Florida Sparse Matrix Collection in CSR format.
  3. Not all of the matrices could be loaded into GPU memory because of its limited capacity.
  4. CPU denotes a SpMV operation from Intel Math Kernel Library.
  5. GPU denotes our SpMV kernel.
  6. CPU machine: AMD Athlon(tm) 64 X2 Processor 3800+ running at 2010.373 MHz with 3 GB DDR 400 MHz (dual channel, bandwidth 6.4 GB/s) and nForce 4 SLI chipset.
  7. GPU machine: NVIDIA GeForce GTX 295 (480 SP) with 1792 MB GDDR3 (896 bits) at 999 MHz and 223.8 GB/s memory bandwidth, on PCI-Express 2.0.
  8. Bandwidth for ONE device measured with utility bandwidthTest from CUDA SDK:
      device to device: 93 GB/s
      host to device pageable memory: 1090 MB/s
      host to device non-pageable memory: 1591 MB/s
  9. System: Ubuntu 9.10 64-bit, NVIDIA driver version 190.42, CUDA version 2.3.
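The arithmetic behind several of these points can be sketched as follows: turning the mean of repeated run times into GFLOP/s, estimating how much memory a CSR matrix needs, and estimating the host-to-device upload time. This is a rough sketch under stated assumptions, not the benchmark code itself: it assumes 2 floating-point operations per nonzero, 4-byte indices, and 1 MB = 10^6 bytes (bandwidthTest may use a different MB definition). The helper names, the matrix size and the timings are hypothetical; only the 1591 MB/s figure is taken from the list above.

```python
from statistics import mean


def spmv_gflops(nnz, run_times_s):
    """GFLOP/s from the mean run time.

    Assumes 2 flops (one multiply, one add) per stored nonzero.
    """
    return 2.0 * nnz / mean(run_times_s) / 1e9


def csr_bytes(n_rows, nnz, value_bytes=8, index_bytes=4):
    """Bytes needed for the three CSR arrays.

    values and column indices have nnz entries each,
    the row pointer array has n_rows + 1 entries.
    value_bytes=8 is double precision, 4 would be single precision.
    """
    return nnz * (value_bytes + index_bytes) + (n_rows + 1) * index_bytes


def transfer_seconds(n_bytes, bandwidth_mb_s):
    """Upload time, assuming 1 MB = 10**6 bytes and sustained bandwidth."""
    return n_bytes / (bandwidth_mb_s * 1e6)


# Hypothetical matrix and timings, made up for illustration.
n_rows, nnz = 1_000_000, 5_000_000
run_times_s = [0.004] * 10  # ten runs of 4 ms each

rate = spmv_gflops(nnz, run_times_s)
size = csr_bytes(n_rows, nnz)
upload = transfer_seconds(size, 1591)  # non-pageable host to device

print(f"{rate:.2f} GFLOP/s")
print(f"{size} bytes in CSR (double precision)")
print(f"{upload * 1e3:.1f} ms to upload once")
```

With these made-up numbers the one-time upload takes roughly ten times as long as a single SpMV run, which illustrates why the transfer cost only pays off over many iterations.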

Benchmark Matrices

Benchmark Matrices from University of Florida Sparse Matrix Collection
