This procedure for simple CUDA programming, both programs is to multiply matrices. Which is the difference between Matrix1 and Matrix2 Matrix2 uses shared memory. Each program using serial and parallel two ways to multiply, eventually will result in parallel and serial computation results, verify the correctness of calculations. Meanwhile, program timing module statistics using CUDA parallel computing time. After using the shared memory can be drawn to improve uptime.
Tips: You can preview the content of files by clicking file names^_^