Skip to content

Commit b46b314

Browse files
authored
Merge pull request #519 from rwgk/cuda_core_release_notes_update
cuda.core: release notes update
2 parents 1e0e7c3 + 5f31dd8 commit b46b314

File tree

1 file changed

+24
-2
lines changed

1 file changed

+24
-2
lines changed

cuda_core/docs/source/release/0.2.0-notes.rst

Lines changed: 24 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -27,8 +27,30 @@ New features
2727
- Expose :class:`ObjectCode` as a public API, which allows loading cubins from memory or disk. For loading other kinds of code types, please continue using :class:`Program`.
2828
- A C++ helper function ``get_cuda_native_handle()`` is provided in the new ``include/utility.cuh`` header to retrive the underlying CUDA C objects (ex: ``CUstream``) from a Python object returned by the ``.handle`` attribute (ex: :attr:`Stream.handle`).
2929
- For objects such as :class:`Program` and :class:`Linker` that could dispatch to different backends, a new ``.backend`` attribute is provided to query this information.
30-
- Support CUDA event timing.
31-
- An :class:`~_event.Event` may now be created without recording it to a :class:`~_stream.Stream` using the :meth:`Device.create_event` method.
30+
- Support CUDA :class:`Event` timing. (#481, #498, #508)
31+
- An :class:`Event` may now be created without recording it to a :class:`~_stream.Stream` using the :meth:`Device.create_event` method.
32+
- :class:`Program` now supports the additional ``PTX`` code type. (#317)
33+
- :meth:`Linker.link` exceptions now include the original error log. (#423)
34+
- In a systematic sweep through the cuda.core implementations, many exceptions messages were made more consistent and informative. (#458)
35+
36+
New examples
37+
------------
38+
- ``jit_lto_fractal.py`` — Demonstrates just-in-time link-time optimization for fractal generation. (:class:`Device`, :class:`LaunchConfig`, :class:`Linker`, :class:`LinkerOptions`, :class:`Program`, :class:`ProgramOptions`) (#475)
39+
- ``simple_multi_gpu_example.py`` — Example of using multiple GPUs. (:class:`Device`, :class:`Program`, :class:`LaunchConfig`) (#304)
40+
- ``show_device_properties.py`` — Displays detailed device properties. (:class:`Device`) (#474)
41+
42+
Minor fixes and enhancements
43+
----------------------------
44+
- A dangling pointer problem in ``_linker.py`` was fixed. (#516)
45+
- Add ``@functools.lru_cache`` decorator for :func:`get_binding_version`. (#512)
46+
- Selected ``.decode()`` were changed to ``.decode("utf-8", errors="backslashreplace")`` to ensure that decoding error messages does not abort the process. (#510)
47+
- The performance of :meth:`Device.compute_capability` was improved. (#459)
48+
- The :class:`Program` constructor now issues a warning when falling back to :func:`cuLink`. (#315)
49+
- To avoid deprecation warnings, the cuda.bindings imports in the cuda.core implementations were cleaned up. (#404)
50+
51+
Test fixes
52+
----------
53+
- Clean up device initialization in some tests. (#507)
3254

3355
Limitations
3456
-----------

0 commit comments

Comments
 (0)