Welcome to the OStack Knowledge Sharing Community for programmers and developers: Open, Learning and Share
Recent questions tagged cuda
0 votes | 893 views | 1 answer
cuda - Reading from an unaligned uint8_t recast as a uint32_t array - not getting all values
I am trying to cast a uint8_t array to a uint32_t array. However, when I try to do this, I can't seem to be able ... any way that I can do this?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
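A note on the question above: the listing truncates the original code, so the following is only a minimal sketch of one common way to avoid alignment trouble, not the asker's code or the accepted answer. Instead of reinterpreting a `uint8_t*` as a `uint32_t*`, each 32-bit value is assembled from four single-byte reads. The helper name `load_u32_le` and the little-endian byte order are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the original question): build a 32-bit value
// from four consecutive bytes at any address. Only single-byte reads are
// issued, so the pointer does not need 4-byte alignment.
// Assumption: bytes are stored in little-endian order.
inline std::uint32_t load_u32_le(const std::uint8_t* p) {
    assert(p != nullptr);
    return static_cast<std::uint32_t>(p[0])
         | static_cast<std::uint32_t>(p[1]) << 8
         | static_cast<std::uint32_t>(p[2]) << 16
         | static_cast<std::uint32_t>(p[3]) << 24;
}
```

The function body uses nothing host-specific, so it could presumably be marked `__host__ __device__` and called from a kernel. Whether the extra byte loads are acceptable depends on the access pattern.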
0 votes | 1.2k views | 1 answer
cuda - nvcc.exe linking error: Microsoft Visual Studio configuration file 'vcvars64.bat' could not be found
I want to use nvcc -ptx from the Windows command line, but I always get this error message: nvcc : fatal error ... . What can be the solution?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 688 views | 1 answer
cuda - CUDA_ERROR_INVALID_IMAGE during cuModuleLoad
I've created a very simple kernel (can be found here) which I successfully compile using "C:Program ... valid and compiles without issues.
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 686 views | 1 answer
cuda - How to start debug version of project in nsight with optirun command?
I've been writing a simple CUDA program (I'm a student, so I need to practice), and the thing is I can ... for helping in advance folks. :)
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 659 views | 1 answer
cuda - Do I have to use the MPS (MULTI-PROCESS SERVICE) when using CUDA6.5 + MPI?
The following is written at this link: https://docs.nvidia.com/deploy/pdf/CUDA_Multi_Process_Service_Overview.pdf 1.1. AT A GLANCE ... will stay the same?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 786 views | 1 answer
cuda - How is the 2D thread blocks padded for warp scheduling?
I understand that for a 1D thread block with 31 threads, it will be padded to 32 threads for warp execution. What ... (31*31=961; 961%32=1)?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
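The arithmetic in the question above (31*31=961; 961%32=1) can be worked through in a small host-side sketch. It assumes the behaviour described in the CUDA programming guide: threads of a multi-dimensional block are linearized (x varies fastest) and consecutive thread IDs are grouped into warps, so only the last warp is partially filled rather than each row being padded. The struct and function names are hypothetical.

```cpp
#include <cassert>

// Hypothetical result type for illustration.
struct WarpLayout {
    int warps;          // number of warps the block occupies
    int lanes_in_last;  // active threads in the final warp
};

// Assumption: threads are linearized as x + y * dim_x and split into
// groups of warp_size consecutive IDs.
inline WarpLayout warp_layout(int dim_x, int dim_y, int warp_size = 32) {
    assert(dim_x > 0 && dim_y > 0 && warp_size > 0);
    const int threads = dim_x * dim_y;
    const int warps = (threads + warp_size - 1) / warp_size;  // ceiling division
    const int rem = threads % warp_size;
    return {warps, rem == 0 ? warp_size : rem};
}
```

Under that assumption, `warp_layout(31, 31)` gives 31 warps with a single active thread in the last one, while `warp_layout(31, 1)` gives 1 warp with 31 active threads.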
0 votes | 744 views | 1 answer
cuda - thrust::sequence - how to increase the step after each N elements
I am using thrust::sequence(myvector.begin(), myvector.end(), 0, 1) and get a nicely ordered list like: 0, 1, ... or am I missing a simple way?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 492 views | 1 answer
cuda kernels not executing concurrently
I'm trying to explore the concurrent kernel execution property of my Nvidia Quadro 4000, which has 2.0 ... CHK_ERR(cudaDeviceReset()); }
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 675 views | 1 answer
cuda - Should I check the number of threads in kernel code?
I am a beginner with CUDA, and my coworkers always design kernels with the following wrapping: __global__ ... specified block/grid dimensions?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 576 views | 1 answer
cuda - JIT in JCuda, loading multiple ptx modules
I said in this question that I had some problem loading ptx modules in JCuda and after @talonmies's idea, I ... variable by reference in JCuda?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 658 views | 1 answer
cuda - Caffe compilation fails due to unsupported gcc compiler version
I am struggling with Caffe compilation and have failed to compile it. Steps I followed: git clone https://github.com/ ... .9 - what to do?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 915 views | 1 answer
cuda - CURAND Library - Compiling Error - Undefined reference to functions
I have the following code which I am trying to compile using nvcc. Code: #include <stdio.h> #include <stdlib.h ... to solve my problem. Thanks!
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 755 views | 1 answer
cuda - Performance of atomic operations on shared memory
How do atomic operations perform when the address they are provided with resides in block shared memory? During ... atomic operation is done?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 906 views | 1 answer
cuda - thrust reduction result on device memory
Is it possible to leave the return value of a thrust::reduce operation in device-allocated memory? In case it is ... I use a thrust::device_ptr?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 675 views | 1 answer
cuda - What are "Other" Issue Stall Reasons displayed by the Nsight profiler?
I have a kernel that is performing poorly on CC 3.0 (Kepler) as opposed to CC 2.0 (Fermi). In the Nsight profiler, ... Nsight 3.0. RC / CC 3.0.
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 569 views | 1 answer
cuda - Does 'code=sm_X' embed only binary (cubin) code, or also PTX code, or both?
I am a little bit confused about the 'code=sm_X' option within the '-gencode' statement. An example: What does ... is conflicting in my opinion.
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 684 views | 1 answer
cuda - Amdahl's law and GPU
I have a couple of doubts regarding the application of Amdahl's law with respect to GPUs. For instance, I ... for the parallel code? Thanks
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
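For reference alongside the question above, here is the textbook form of Amdahl's law as a small sketch: speedup = 1 / ((1 - p) + p / n), where p is the parallelizable fraction of the run time and n is the speedup of that fraction. What value of n is appropriate for a GPU appears to be what the truncated question asks, so n is left as a plain parameter here. The function name is hypothetical.

```cpp
#include <cassert>

// Textbook Amdahl's law. Assumptions for this sketch:
//   p = fraction of the original run time that can be parallelized (0..1)
//   n = speedup factor applied to that parallel fraction (>= 1)
inline double amdahl_speedup(double p, double n) {
    assert(p >= 0.0 && p <= 1.0 && n >= 1.0);
    return 1.0 / ((1.0 - p) + p / n);
}
```

As a sanity check, `amdahl_speedup(0.0, n)` is 1 for any n (nothing parallel, no gain), and the result can never exceed 1 / (1 - p) however large n becomes.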
0 votes | 941 views | 1 answer
cuda - How to use GPUDirect RDMA with Infiniband
I have two machines. There are multiple Tesla cards on each machine. There is also an InfiniBand card on each ... dealing with this in OpenMPI.
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 433 views | 1 answer
cuda - __activemask() vs __ballot_sync()
After reading this post on the CUDA Developer Blog, I am struggling to understand when it is safe/correct to use __activemask ... the function interface.
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 919 views | 1 answer
cuda - How do we use cuPrintf()?
What do we have to do to use cuPrintf()? (device compute capability 1.2, Ubuntu 12) I couldn't find " ... "hello_kernel") is not allowed Thanks!
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 974 views | 1 answer
cuda - Set default host compiler for nvcc
I have just installed Debian Stretch (9) and CUDA 8 on a new GPU server. Stretch does not come with ... cuda config or an environment variable?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 889 views | 1 answer
cuda - What is the difference between cudaMemcpy() and cudaMemcpyPeer() for P2P-copy?
I want to copy data from GPU0-DDR to GPU1-DDR directly without CPU-RAM. As stated here on page 15: http: ... any advantage, why it is needed?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 917 views | 1 answer
cuda - Equivalent of cudaGetErrorString for cuBLAS?
CUDA runtime has a convenience function cudaGetErrorString(cudaError_t error) that translates an error enum into a ... function like this?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 1.1k views | 1 answer
cuda - Branch and predicated instructions
Section 5.4.2 of the CUDA C Programming Guide states that branch divergence is handled either by "branch ... set the predicate". Why?
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda
0 votes | 503 views | 1 answer
cuda - How to compile PTX code
I need to modify the PTX code and compile it directly. The reason is that I want to have some specific instructions ... cubin) to "X.o" file.
asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points) | cuda