DLPack overhaul #5661

mzient · 2024-10-07T11:37:50Z

Category:

New feature (non-breaking change which adds functionality)
Refactoring (Redesign of existing code that doesn't affect functionality)

Description:

This PR reworks the DLPack support in DALI.
New features:

shared buffer ownership (not just views)
add pinned memory support
Refactoring:
remove DLTensorResource inheritance
unify JAX DLTensorResource and other resources

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Existing tests: ExternalSource, PythonFunction, JAX, dali_test.bin:DL
New tests: buffer sharing (reference counting test).

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: N/A

dali-automaton · 2024-10-07T11:42:24Z

CI MESSAGE: [19098946]: BUILD STARTED

banasraf · 2024-10-07T12:27:26Z

dali/pipeline/data/dltensor.h

+  if (dl_type.code < std::size(code_str))
+    ss << code_str[dl_type.code];
+  else
+    ss << "<unknown" << dl_type.code + 0 << ">";


Suggested change

ss << "<unknown" << dl_type.code + 0 << ">";

ss << "<unknown:" << dl_type.code + 0 << ">";

banasraf · 2024-10-07T12:27:39Z

dali/pipeline/data/dltensor.h

+ *
+ * The text representation looks like:
+ * <type><bits>[x<lanes>]
+ * with <lanes>x present only if the number of lanes is > 1


Suggested change

* with <lanes>x present only if the number of lanes is > 1

* with x<lanes> present only if the number of lanes is > 1

banasraf · 2024-10-07T12:32:30Z

dali/pipeline/data/dltensor.h

+ *     +-- ...
+ * ```
+ *
+ * You can use any payload structure of your choice, but it must privde the storage for DLTensor's


Suggested change

* You can use any payload structure of your choice, but it must privde the storage for DLTensor's

* You can use any payload structure of your choice, but it must provide the storage for DLTensor's

dali-automaton · 2024-10-07T15:35:23Z

CI MESSAGE: [19098946]: BUILD PASSED

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

- Remove inheritance. - Make more stuff inline. - Make function names consistent. - Add DLPack resources which share tensor ownership - Add reference counting tests. - Add CPU pinned memory support. TODO(michalz): Unify with JAX. Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

dali-automaton · 2024-10-08T09:28:52Z

CI MESSAGE: [19136647]: BUILD STARTED

dali-automaton · 2024-10-08T22:35:04Z

CI MESSAGE: [19136647]: BUILD PASSED

klecki · 2024-10-09T11:00:41Z

dali/pipeline/data/dltensor.cc

@@ -19,18 +19,18 @@

 namespace dali {

-DLDataType GetDLType(DALIDataType type) {
+DLDataType ToDLType(DALIDataType type) {


Just a side note, the TYPE_SWITCH here seems like an overkill.

Perhaps, but then again it's almost certainly faster than using TypeTable.

klecki · 2024-10-09T12:42:43Z

dali/pipeline/data/dltensor.h

+template <typename Backend>
+std::vector<DLMTensorPtr> GetSharedDLTensorList(TensorList<Backend> &tensor_list) {
+  int device_id = tensor_list.device_id();
+  bool pinned = tensor_list.is_pinned();
+
+  std::vector<DLMTensorPtr> dl_tensors{};
+  dl_tensors.reserve(tensor_list.num_samples());
+
+  for (int i = 0; i < tensor_list.num_samples(); ++i)
+    dl_tensors.push_back(GetDLTensorView(tensor_list[i], pinned, device_id));


Looks like a copy-paste for GetDLTensorListView. It probably should use unsafe_sample_owner.

Nice catch.

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

dali-automaton · 2024-10-09T13:03:45Z

CI MESSAGE: [19179834]: BUILD STARTED

dali-automaton · 2024-10-09T16:55:24Z

CI MESSAGE: [19179834]: BUILD PASSED

banasraf self-assigned this Oct 7, 2024

banasraf reviewed Oct 7, 2024

View reviewed changes

banasraf approved these changes Oct 7, 2024

View reviewed changes

mzient added 7 commits October 8, 2024 11:21

[WIP]

34f5a64

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

[WIP]

4e72a38

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

[WIP]

f2fd224

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

JAX plugin refactor.

b1efece

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

Documentation.

5d3c953

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

Fix to_string(DLDataType). Fix typos.

9529a5a

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

mzient force-pushed the disentangle_dlpack branch from ef4502c to 9529a5a Compare October 8, 2024 09:21

Remove unnecessary const_cast.

f46b575

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

dali-automaton assigned klecki Oct 8, 2024

klecki reviewed Oct 9, 2024

View reviewed changes

Fix GetSharedDLTensorList. Add test.

84711ca

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>

klecki approved these changes Oct 9, 2024

View reviewed changes

mzient merged commit 7a51e09 into NVIDIA:main Oct 10, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DLPack overhaul #5661

DLPack overhaul #5661

	ss << "<unknown" << dl_type.code + 0 << ">";
	ss << "<unknown:" << dl_type.code + 0 << ">";

	* with <lanes>x present only if the number of lanes is > 1
	* with x<lanes> present only if the number of lanes is > 1

	* You can use any payload structure of your choice, but it must privde the storage for DLTensor's
	* You can use any payload structure of your choice, but it must provide the storage for DLTensor's

DLPack overhaul #5661

DLPack overhaul #5661

Conversation

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment