CUDA Runtime Error 11: Invalid Argument
Invalid Argument error when copying data from device to host

I am having problems copying data from my device back to the host. My data are arranged in a struct:

    typedef struct Array2D {
        double* arr;
        int rows;
        int cols;
    } Array2D;

arr is a 'flat' array. rows and cols describe the dimensions. The code below shows how I am trying to copy the data back to the host:

    h_output = (Array2D*) malloc(sizeof(Array2D));
    cudaMemcpy(h_output, d_output, sizeof(Array2D), cudaMemcpyDeviceToHost);
    double* h_arr = (double*) malloc(h_output->cols*h_output->rows*sizeof(double));
    cudaMemcpy(h_arr, h_output->arr, h_output->cols*h_output->rows*sizeof(double), cudaMemcpyDeviceToHost);
    h_output->arr = h_arr;

However, in the fourth line the execution fails with cuda error 11 (invalid argument). I cannot see why this is happening. The size of the array is correct, and I can access both h_output and h_arr from the host and both have 'real' addresses.

EDIT

Sorry for the late response to the request for more information (= more code). I have tested that the pointer d_output->arr is a device pointer by trying to access the value of the device pointer on the host. As expected, I was not allowed to do that, leaving me with the thought that d_output->arr is in fact a valid device pointer. The code's objective is to solve Thiele's differential equation using the fourth-order Runge-Kutta method.

    class CalculationSpecification {
        /* FUNCTIONS OMITTED */
    public:
        __device__ void RK4_n(CalculationSpecification* cs, CalcData data, Array2D* d_output) {
            double* rk4data = (double*)malloc((data.pdata->endYear - data.pdata->startYear + 1)*data.pdata->states*sizeof(double));
            /* CALCULATION STUFF HAPPENS HERE */
            // We know that rows = 51, cols = 1 and that rk4data contains 51 values as it should.
            // This was confirmed by using printf directly in
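One plausible reading of the excerpt above, offered as an assumption because the code is cut off before d_output->arr is assigned: if arr ends up pointing at rk4data, it refers to memory obtained from in-kernel malloc. The CUDA programming guide states that memory allocated by malloc in device code cannot be used in runtime API calls such as cudaMemcpy, which would be consistent with error 11 on the second copy. The sketch below shows one alternative under that assumption: the host allocates the result buffer with cudaMalloc and the kernel only writes into it. The names CUDA_CHECK, fill and copy_back_demo are hypothetical and not part of the original code, and the kernel is a stand-in for the real Runge-Kutta calculation.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Same layout as the struct in the question.
typedef struct Array2D {
    double* arr;
    int rows;
    int cols;
} Array2D;

// Hypothetical helper: print the failing call's location and bail out.
// The early return skips cleanup, which is acceptable only in a sketch.
#define CUDA_CHECK(call)                                          \
    do {                                                          \
        cudaError_t err_ = (call);                                \
        if (err_ != cudaSuccess) {                                \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,    \
                    cudaGetErrorString(err_));                    \
            return 1;                                             \
        }                                                         \
    } while (0)

// Stand-in for the real calculation: writes into a buffer that the
// host allocated with cudaMalloc instead of calling malloc in the kernel.
__global__ void fill(Array2D* out, double* buf, int rows, int cols) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < rows * cols) buf[i] = (double)i;
    if (i == 0) {
        out->arr = buf;
        out->rows = rows;
        out->cols = cols;
    }
}

// Returns 0 if the round trip succeeds and the data matches.
int copy_back_demo(int rows, int cols) {
    size_t bytes = (size_t)rows * cols * sizeof(double);
    Array2D* d_output = NULL;
    double* d_arr = NULL;
    CUDA_CHECK(cudaMalloc(&d_output, sizeof(Array2D)));
    CUDA_CHECK(cudaMalloc(&d_arr, bytes));

    int threads = 64;
    int blocks = (rows * cols + threads - 1) / threads;
    fill<<<blocks, threads>>>(d_output, d_arr, rows, cols);
    CUDA_CHECK(cudaGetLastError());       // launch errors
    CUDA_CHECK(cudaDeviceSynchronize());  // execution errors

    // Copy the struct first, then follow its device pointer.
    Array2D h_output;
    CUDA_CHECK(cudaMemcpy(&h_output, d_output, sizeof(Array2D),
                          cudaMemcpyDeviceToHost));
    double* h_arr = (double*)malloc(bytes);
    if (h_arr == NULL) return 1;
    CUDA_CHECK(cudaMemcpy(h_arr, h_output.arr, bytes,
                          cudaMemcpyDeviceToHost));
    h_output.arr = h_arr;

    int ok = 1;
    for (int i = 0; i < rows * cols; ++i)
        if (h_output.arr[i] != (double)i) ok = 0;

    free(h_arr);
    CUDA_CHECK(cudaFree(d_arr));
    CUDA_CHECK(cudaFree(d_output));
    return ok ? 0 : 1;
}

int main(void) {
    // 51 rows and 1 column, matching the sizes mentioned in the question.
    return copy_back_demo(51, 1);
}
```

Checking the return value of every runtime call, as the macro does here, also narrows down which call reports the invalid argument instead of leaving it to be discovered later.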
karpathy/char-rnn issue #81: cutorch problem - invalid argument at (...)/cutorch/lib/THC/THCTensor.cu:32
https://github.com/karpathy/char-rnn/issues/81
Closed. ttajmaje opened this issue Aug 11, 2015 · 3 comments

ttajmaje commented Aug 11, 2015

I run into an error when trying to execute char-rnn with CUDA support:

    $ th train.lua -data_dir data/tinyshakespeare/ -gpuid 0
    using CUDA on GPU 0...
    loading data files...
    cutting off end of data so that the batches/sequences divide evenly
    reshaping tensor...
    data load done. Number of data batches in train: 423, val: 23, test: 0
    vocab size: 65
    creating an lstm with 2 layers
    /home/ttajmaje/priv/rnn/torch/distro/install/bin/lua: train.lua:156: cuda runtime error (11) : invalid argument at /tmp/luarocks_cutorch-scm-1-5019/cutorch/lib/THC/THCTensor.cu:32
    stack traceback:
        [C]: in function 'clone'
        train.lua:156: in main chunk
        [C]: in function 'dofile'
        ...rch/distro/install/lib/luarocks/rocks/trepl/scm-1/bin/th:131: in main chunk
        [C]: in ?

Using CUDA 5.5 SDK
Lua 5.2

    th> getDeviceProperties(1)
    {
      computeMode : 0
      minor : 1
      memPitch : 2147483647
      sharedMemPerBlock : 49152
      warpSize : 32
      pciBusID : 1
      totalConstMem : 65536
      multiProcessorCount : 2
      major : 2
      clockRate : 1500000
      totalGlobalMem : 1072889856
      name : "GeForce GT 630"
      integrated : 0
      regsPerBlock : 32768
      kernelExecTimeoutEnabled : 1
      textureAlignment : 512
      maxTexture1D : 65536
      pciDeviceID : 0
      pciDomainID : 0
      maxTexture1DLinear : 134217728
      freeGlobalMem : 661180416
      canMapHostMemory : 1
      deviceOverlap : 1
      maxThreadsPerBlock : 1024
    }

Any idea what may be the cause?

soumith commented Aug 11, 2015

When you get a cudaRuntimeError, the first thing I recommend is to upgrade from CUDA 5.5 to CUDA 7.0 (or 7.5).

ttajmaje commented Aug 11, 2015

That would be tricky as CUDA 7.0 does not support 32bit system
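
Before deciding on an upgrade, it can help to confirm which CUDA runtime a program is actually built against and what the installed driver supports. The sketch below uses the runtime API calls cudaRuntimeGetVersion, cudaDriverGetVersion and cudaGetDeviceProperties. The helper name print_cuda_versions is hypothetical, and querying device index 0 is an assumption based on the -gpuid 0 flag in the report.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: returns 0 on success, 1 if any query fails.
int print_cuda_versions(void) {
    int runtime_version = 0;
    int driver_version = 0;

    cudaError_t err = cudaRuntimeGetVersion(&runtime_version);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaRuntimeGetVersion: %s\n", cudaGetErrorString(err));
        return 1;
    }
    err = cudaDriverGetVersion(&driver_version);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaDriverGetVersion: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Versions are encoded as 1000 * major + 10 * minor (5050 means 5.5).
    printf("runtime: %d.%d\n",
           runtime_version / 1000, (runtime_version % 1000) / 10);
    printf("driver supports up to: %d.%d\n",
           driver_version / 1000, (driver_version % 1000) / 10);

    // Device index 0 is an assumption; adjust for multi-GPU machines.
    cudaDeviceProp prop;
    err = cudaGetDeviceProperties(&prop, 0);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties: %s\n", cudaGetErrorString(err));
        return 1;
    }
    printf("device 0: %s, compute capability %d.%d\n",
           prop.name, prop.major, prop.minor);
    return 0;
}

int main(void) {
    return print_cuda_versions();
}
```

The major and minor fields printed here correspond to the major : 2 and minor : 1 entries in the getDeviceProperties output above.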