
Half Tensor Dispatch compatibility ? #15


Closed
ClementPinard opened this issue Jul 25, 2018 · 4 comments

Comments

@ClementPinard
Contributor

Hi, I've been tweaking the repo a bit and wanted to try Half Tensor compatibility.

So in the CUDA kernel (lltm_cuda_kernel.cu), in the two places that use AT_DISPATCH_FLOATING_TYPES, I just changed the dispatch macro to AT_DISPATCH_FLOATING_TYPES_AND_HALF, naively hoping that everything would work without changing anything else.
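
Concretely, the only edit was the macro name; roughly this (identifiers paraphrased from the tutorial's forward dispatch, so they may not match the repo exactly):

```cpp
// before: AT_DISPATCH_FLOATING_TYPES(gates.type(), "lltm_forward_cuda", ([&] { ... }));
AT_DISPATCH_FLOATING_TYPES_AND_HALF(gates.type(), "lltm_forward_cuda", ([&] {
  lltm_cuda_forward_kernel<scalar_t><<<blocks, threads>>>(
      /* same arguments as before */);
}));
```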

Unfortunately, I got this error (whereas dispatching only floating types works):

```
lltm_cuda_kernel.cu(123): error: identifier "Half" is undefined
lltm_cuda_kernel.cu(157): error: identifier "Half" is undefined
```

Is there something I forgot to do? Apparently Half is not recognized by the compiler the way float or double are, so maybe I need to include a header? I tried #include <cuda_fp16.h>, #include <ATen/Half.h>, and #include <ATen/Type.h>, but it didn't work.

Thanks!

Clément

@colesbury
Member

  1. Add using namespace at; at the top. (We need to qualify Half as at::Half in Dispatch.h to avoid this.)
  2. Replace fmax(0.0, z) with fmax(scalar_t(0.0), z) (see the sketch below).
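
Roughly, in lltm_cuda_kernel.cu (a sketch only; your includes and helpers may already look a bit different):

```cpp
// top of lltm_cuda_kernel.cu
#include <ATen/ATen.h>

using namespace at;  // (1) the unqualified "Half" emitted by AT_DISPATCH_..._AND_HALF now resolves

// (2) in the templated __device__ helpers, give the literal the dispatched type,
//     so both arguments of fmax are scalar_t:
//     before: fmax(0.0, z)
//     after:  fmax(scalar_t(0.0), z)
```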

colesbury added a commit to colesbury/pytorch that referenced this issue Jul 25, 2018
This makes AT_DISPATCH_ALL_TYPES_AND_HALF valid outside of the at
namespace.

See pytorch/extension-cpp#15
@ClementPinard
Contributor Author

ClementPinard commented Jul 26, 2018

Thanks for the quick answer! I commented on your PR. In the meantime, I applied your recommendations, and it works, thanks!

However, I initially thought that your "2." point was a general rule, like "replace all 0.0 occurrences with scalar_t(0.0)". But when doing that, the compiler stops at fmin(scalar_t(0.0), ...)! Any idea why this change works for fmax and not for fmin? Is fmin overloaded differently from fmax?
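
For reference, the helper in question looks roughly like this (paraphrased from lltm_cuda_kernel.cu, so it may not match exactly):

```cpp
template <typename scalar_t>
__device__ __forceinline__ scalar_t elu(scalar_t z, scalar_t alpha = 1.0) {
  // the fmax call now compiles with scalar_t(0.0), but the fmin call is where
  // the compiler still gives up when scalar_t is at::Half
  return fmax(scalar_t(0.0), z) + fmin(scalar_t(0.0), alpha * (exp(z) - 1.0));
}
```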

facebook-github-bot pushed a commit to pytorch/pytorch that referenced this issue Jul 27, 2018
Summary:
This makes AT_DISPATCH_ALL_TYPES_AND_HALF valid outside of the at
namespace.

See pytorch/extension-cpp#15
Pull Request resolved: #9848

Differential Revision: D9006921

Pulled By: colesbury

fbshipit-source-id: a6e4f097a9d6fb85c921e1c9b9ea25d0f2db06dc
@goldsborough
Contributor

@ClementPinard someone else reported issues with fmax being interpreted as std::fmax instead of the CUDA function in #14.

@colesbury
Member

colesbury commented Jul 30, 2018

It's just a matter of operator overloading and implicit conversions. C++ allows an argument to undergo one implicit conversion to satisfy an overload, but not two. Half has an implicit conversion to float. float has an implicit conversion to double. But there's no implicit conversion from Half to double directly.

You can see the list of overloads for fmax here: https://en.cppreference.com/w/cpp/numeric/math/fmax

fmax(0.0, z) has the types (double, at::Half), which does not satisfy any overload. Changing it to fmax(scalar_t(0.0), z) gives the types (at::Half, at::Half). Both arguments are implicitly convertible to float, so it uses the overload fmax(float, float).

fmin has the same rules as fmax, but the argument is different: alpha * (exp(z) - 1.0) is a double when z is Half, because of the - 1.0.

There might be a performance penalty for using doubles here; I'm not sure. You could probably change all the 0.0 and 1.0 literals to 0.0f and 1.0f.
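
For the elu helper that would look something like this (an untested sketch, assuming the helper quoted above; the exact code in the repo may differ):

```cpp
template <typename scalar_t>
__device__ __forceinline__ scalar_t elu(scalar_t z, scalar_t alpha = 1.0f) {
  // Float literals keep the intermediate math in float rather than double, so
  // when scalar_t is at::Half both fmax and fmin can resolve to their
  // (float, float) overloads instead of ending up with a double argument.
  return fmax(scalar_t(0.0f), z) + fmin(scalar_t(0.0f), alpha * (exp(z) - 1.0f));
}
```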
