(including memory allocation on the device and host-device data transfers)
Solution:
#include
Void main(float *A, int N)
{
float *A_h;
float *A_d;
Fig: 1