Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

Recent questions tagged cuda

0 votes
560 views
1 answer
    I'm running this command into a shell and get: C:Usersme>nvidia-smi -L GPU 0: Quadro K2000 (UUID: GPU- ... " Still looking for a kludge... See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
790 views
1 answer
    I want to disable a specific compiler warning with nvcc, specifically warning: NULL reference is not allowed The ... the warning in question. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
393 views
1 answer
    What is the correct option for measuring bandwidth using nvprof --metrics from the command line? I am using ... measuring FLOPS and bandwidth. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
539 views
1 answer
    In a multi-GPU computer, how do I designate which GPU a CUDA job should run on? As an example, when ... when running a compiled CUDA executable? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
877 views
1 answer
    From online documentation: cudaError_t cudaMemset (void * devPtr, int value, size_t count ) Fills the first count ... *dJunk; work? Others? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
603 views
1 answer
    I've successfully installed tensorflow (GPU) on Linux Ubuntu 16.04 and made some small changes in order to make it ... could be fixed as well. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
649 views
1 answer
    I need a device version of the following host code: double (**func)(double x); double func1(double x) { return x+1 ... (); test(); return 0; } See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
433 views
1 answer
    I'm now only need to show an intermediate progress of matrix multiplication. for(unsigned int col=0; col<mtxSize; ... know in the case of CUDA. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
423 views
1 answer
    How can I use two devices in order to improve for example the performance of the following code (sum of ... Thank you in advance. Michele See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
1.2k views
1 answer
    I have searched many places but ALL I get is HOW to install it, not how to verify that it is ... everything is working without CuDNN enabled. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
670 views
1 answer
    The first cudaMalloc call is slow (like 0.2 sec) because of some initialization work on GPU. Is there any ... not eliminate all init overheads. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
689 views
1 answer
    Can I copy a C++ object to the device? say I have: class CudaClass { public: int* data; CudaClass(int ... memory and launch kernel "useClass"? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
674 views
1 answer
    I ran cuda-11.2 nsight-compute on my cuda kernel. It reports that SOL SM is at 79.44% which I interpret as ... , making the Roofline chart not applicable for my code, I suspect....
asked Jan 29, 2021 in Technique[技术] by 深蓝 (71.8m points)
Ask a question:
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...