Merged
2 changes: 1 addition & 1 deletion cuda_core/docs/source/getting-started.rst
@@ -47,7 +47,7 @@ First, we define a string containing the CUDA C++ kernel. Note that this is a te
size_t N) {
const unsigned int tid = threadIdx.x + blockIdx.x * blockDim.x;
for (size_t i=tid; i<N; i+=gridDim.x*blockDim.x) {
- C[tid] = A[tid] + B[tid];
+ C[i] = A[i] + B[i];
}
}
"""
1 change: 1 addition & 0 deletions cuda_core/docs/source/release/0.X.Y-notes.rst
@@ -52,3 +52,4 @@ Fixes and enhancements
- Fixed a bug in :class:`Stream` and other classes where object cleanup would error during interpreter shutdown.
- :class:`StridedMemoryView` of an underlying array using the DLPack protocol will no longer leak memory.
- General performance improvement.
+- Fixed incorrect index usage in vector_add example
2 changes: 1 addition & 1 deletion cuda_core/examples/simple_multi_gpu_example.py
@@ -35,7 +35,7 @@
size_t N) {
const unsigned int tid = threadIdx.x + blockIdx.x * blockDim.x;
for (size_t i=tid; i<N; i+=gridDim.x*blockDim.x) {
- C[tid] = A[tid] + B[tid];
+ C[i] = A[i] + B[i];
}
}
"""
2 changes: 1 addition & 1 deletion cuda_core/examples/vector_add.py
@@ -21,7 +21,7 @@
size_t N) {
const unsigned int tid = threadIdx.x + blockIdx.x * blockDim.x;
for (size_t i=tid; i<N; i+=gridDim.x*blockDim.x) {
- C[tid] = A[tid] + B[tid];
+ C[i] = A[i] + B[i];
}
}
"""
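
Why the index matters: in a grid-stride loop each thread handles elements `tid`, `tid + stride`, `tid + 2*stride`, and so on, so the loop body has to index with the loop variable `i`. Indexing with `tid` makes every iteration rewrite the same element, and any element past the first `gridDim.x*blockDim.x` entries is never written. The sketch below is a pure-Python emulation for illustration only, not code from this PR. The function name `vector_add_grid_stride` and the `use_loop_index` flag are hypothetical, and the simulated threads run sequentially rather than on a GPU.

```python
def vector_add_grid_stride(a, b, grid_dim, block_dim, use_loop_index=True):
    """Sequentially emulate the grid-stride vector_add kernel.

    use_loop_index=True mirrors the fixed kernel (C[i] = A[i] + B[i]);
    use_loop_index=False mirrors the old kernel (C[tid] = A[tid] + B[tid]).
    Unwritten output elements are left as None.
    """
    n = len(a)
    c = [None] * n
    stride = grid_dim * block_dim  # gridDim.x * blockDim.x
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            tid = thread_idx + block_idx * block_dim
            # for (size_t i = tid; i < N; i += gridDim.x * blockDim.x)
            for i in range(tid, n, stride):
                idx = i if use_loop_index else tid
                c[idx] = a[idx] + b[idx]
    return c


# 10 elements but only 2 * 2 = 4 simulated threads, so the loop must stride.
a = list(range(10))
b = [10 * x for x in a]
fixed = vector_add_grid_stride(a, b, grid_dim=2, block_dim=2)
buggy = vector_add_grid_stride(a, b, grid_dim=2, block_dim=2, use_loop_index=False)
print("fixed:", fixed)
print("buggy:", buggy)
```

When the launch configuration covers all `N` elements, the loop body runs at most once per thread with `i == tid`, so the two variants agree. That may be why the old indexing could go unnoticed in the examples.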