llvm update the matrix spec based on new use argument

Aug 29 '22 20:08 dkhaldi

By the way, I corrected a lot of typos that are still relevant for the new document here: https://github.com/intel/llvm/pull/6525/files

Basically if you ignore the new sections I added there (#### Binary Multiply and Add and ##### wi_data as an marray for Nvidia® Tensor Cores) everything else is just small corrections that you could consider adding here.

Sep 06 '22 09:09 JackAKirk

By the way, I corrected a lot of typos that are still relevant for the new document here: https://github.com/intel/llvm/pull/6525/files

Basically if you ignore the new sections I added there (#### Binary Multiply and Add and ##### wi_data as an marray for Nvidia® Tensor Cores) everything else is just small corrections that you could consider adding here.

Yes, I will add these corrections here. Thanks.

Sep 09 '22 16:09 dkhaldi

I have a question about the old specification, which is now named "sycl_ext_oneapi_deprecated_matrix_no_use". Is that version of the API still supported? Does (or will) the implementation print a warning message for applications that use the old API?

Or, is the old API no longer supported, and it is (or will be) removed?

Both options are OK for an experimental extension like this. However, we should treat the documentation differently in the two cases.

Sep 13 '22 14:09 gmlueck

Looks pretty good overall. I just made some pretty simple suggestions.

Sep 13 '22 16:09 JackAKirk

I have a question about the old specification, which is now named "sycl_ext_oneapi_deprecated_matrix_no_use". Is that version of the API still supported?

@gmlueck, yes, this version is still supported and will be supported until the new API is implemented on the GPU. Currently, it is only implemented on the CPU.

Does (or will) the implementation print a warning message for applications that use the old API?

No it does not because the new API is not supported on the GPU yet. So for users, they still need to use the old API until we remove it for oneAPI 2023.1 release.

Or, is the old API no longer supported, and it is (or will be) removed? The old API will be removed for oneAPI 2023.1 release.

Both options are OK for an experimental extension like this. However, we should treat the documentation differently in the two cases.

For this release, we want to keep the old API unchanged and supported. At the same time, we move towards the new API. Once the new API is also supported on the GPU, we can completely remove the old API.

Sep 14 '22 15:09 dkhaldi

@intel/dpcpp-specification-reviewers, any more review on this?

Sep 19 '22 17:09 dkhaldi

For this release, we want to keep the old API unchanged and supported. At the same time, we move towards the new API. Once the new API is also supported on the GPU, we can completely remove the old API.

Sorry for the delayed response. It looks like both extensions define the same types in different ways. For example, the new version has:

namespace sycl::ext::oneapi::experimental::matrix {
template <typename T, use Use,
          size_t Rows=sycl::dynamic_extent, size_t Cols=sycl::dynamic_extent,
          layout Layout = layout::unused, typename Group = sub_group>
struct joint_matrix {
    joint_matrix(Group g) {}
};
}

and the old version has:

namespace sycl::ext::oneapi::experimental::matrix {
template <typename T, size_t Rows=sycl::dynamic_extent, size_t Cols=sycl::dynamic_extent,
          matrix_layout Layout = matrix_layout::row_major, typename Group = sub_group>
struct joint_matrix {
    joint_matrix(Group g) {}
};
}

How can you support both simultaneously? How does the user determine whether they get the new vs. the old API?

Sep 20 '22 14:09 gmlueck

How can you support both simultaneously? How does the user determine whether they get the new vs. the old API?

they can do this by specifying the SYCL_EXT_ONEAPI_MATRIX feature test macro. See what we have today here: https://github.com/intel/llvm/blob/sycl/sycl/include/sycl/ext/oneapi/matrix/matrix.hpp

Sep 20 '22 14:09 dkhaldi

they can do this by specifying the SYCL_EXT_ONEAPI_MATRIX feature test macro. See what we have today here: https://github.com/intel/llvm/blob/sycl/sycl/include/sycl/ext/oneapi/matrix/matrix.hpp

This is not how the feature-test macros are supposed to work. The implementation determines the value of the feature-test macro, not the application. (The extension specification says this in the section "Feature test macro".) If you want the user to choose the API via a macro, we should choose a different macro name. For example, SYCL_EXT_ONEAPI_MATRIX_LEGACY_API could enable the old API.

Sep 20 '22 14:09 gmlueck

You have this inconsistency throughout the spec. In many places, you refer to the three matrices as a, b, and c. However, the use enum refers to them as a, b, and accumulator. You should decide on one or the other and then use it consistently throughout. Since you call the first two matrices a and b, it seems natural to me to call the third matrix c. Is there a good reason to call the third matrix accumulator?

@JackAKirk what do you think?

Sep 22 '22 19:09 dkhaldi

Whenever you rename (or move) a specification, you should search the repo and see if there are any links to the old name. In your case, there are several references from "sycl/ReleaseNotes.md". These should be updated:

Point to the old file, or

Point to the new file, or

Change the wording to remove the link.

Thank you! yes, I will point to the new file

Sep 22 '22 19:09 dkhaldi

You have this inconsistency throughout the spec. In many places, you refer to the three matrices as a, b, and c. However, the use enum refers to them as a, b, and accumulator. You should decide on one or the other and then use it consistently throughout. Since you call the first two matrices a and b, it seems natural to me to call the third matrix c. Is there a good reason to call the third matrix accumulator?

@JackAKirk what do you think?

I think that the argument for having a,b and accumulator is that the programming idiom is either

d=a*b+c

Where the d matrix would also have to be use::c(use::accumulator)

or as in most use cases:

c=a*b+c

use::accumulator is a more general name than use::c. However I think either choice is fine.

Sep 23 '22 12:09 JackAKirk

LGTM. I just suggested some small changes.

Oct 11 '22 15:10 JackAKirk

@intel/dpcpp-doc-reviewers, any more reviews on this? It will be good if we can merge this ASAP.

Oct 17 '22 18:10 dkhaldi

@intel/dpcpp-doc-reviewers, can you please help merge this if there are no more comments?

Oct 21 '22 17:10 dkhaldi

@dkhaldi, please move deprecated extension to https://github.com/intel/llvm/tree/sycl/sycl/doc/extensions/deprecated

Oct 21 '22 18:10 pvchupin

@dkhaldi, please move deprecated extension to https://github.com/intel/llvm/tree/sycl/sycl/doc/extensions/deprecated

done

Oct 21 '22 19:10 dkhaldi

@intel/dpcpp-doc-reviewers, please help merge.

Oct 25 '22 18:10 dkhaldi

Ping @intel/dpcpp-specification-reviewers

Oct 25 '22 18:10 pvchupin

https://github.com/orgs/intel/teams/dpcpp-doc-reviewers, please help merge.

Sorry for the delay. This is the next thing on my review list.

Oct 25 '22 18:10 gmlueck

Since you moved the deprecated version of the API spec to "doc/extensions/deprecated", there isn't any need for the "doc/extensions/experimental/sycl_ext_oneapi_matrix" directory. Can you move the spec from:

sycl/doc/extensions/experimental/sycl_ext_oneapi_matrix/sycl_ext_oneapi_matrix.asciidoc

to:

sycl/doc/extensions/experimental/sycl_ext_oneapi_matrix.asciidoc

The folder is still necessary to add CUDA backend additional specific API (like marray-based element wise operations). see #6968.

Oct 26 '22 18:10 dkhaldi

@intel/dpcpp-specification-reviewers, please help merge

Oct 31 '22 14:10 dkhaldi