TensorComprehensions
A domain specific language to express machine learning workloads.
#354 exhibits potential issues with size selection. Revisit after #307 and tinker with the tuner.
Currently TC requires compiling a new, fully specialized version for each new tensor size. #327 has some context about usage in the C2 case: > What if dimensions changed? You...
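To make the cost concrete, here is a minimal, hypothetical sketch (plain Python, not the real TC API; `compile_specialized` and the cache are illustrative stand-ins) of why per-size specialization means a fresh compilation whenever a new shape shows up:

```python
# Hypothetical sketch: a kernel cache keyed by concrete tensor sizes.
# Every unseen shape pays the full specialized-compilation cost.

compiled_cache = {}

def compile_specialized(shape):
    # Stand-in for an expensive, shape-specialized compilation step.
    return f"kernel<{','.join(map(str, shape))}>"

def run(shape):
    key = tuple(shape)
    if key not in compiled_cache:          # cache miss: recompile
        compiled_cache[key] = compile_specialized(shape)
    return compiled_cache[key]

run((32, 64))   # first call for this shape: triggers a compilation
run((32, 64))   # cache hit, no recompilation
run((32, 128))  # new size -> another full compilation
print(len(compiled_cache))  # -> 2
```

Under this model, any workload with dynamic shapes (e.g. variable batch sizes) keeps hitting the compile path, which is what the C2 discussion in #327 is about.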
I got this error message on a run of `test.sh` at commit 806e3b066a6a841786be4dde38357c972816dcaf:

```
[----------] 7 tests from CompilationCacheTest
[ RUN      ] CompilationCacheTest.ExpectQuerySuccess
[ OK       ] CompilationCacheTest.ExpectQuerySuccess (3558 ms)
[ RUN...
```
For parallel reductions, CUB requires a block of shared memory, which is wrapped in an opaque object. The shared memory promotion mechanism needs to know the amount of available shared memory....
The reason for having the LHS indirection: suppose someone has a lookup table:

```
def lut(float(B, R) M, int32(B, N) I) -> (O) {
  O(b, n) +=!...
```
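For intuition, a minimal plain-Python sketch (not TC) of the two kinds of indirection: index arithmetic on the right-hand side is a gather, as in the `lut` example above, while indexing through `I` on the left-hand side is a scatter. The tensors `M`, `I`, `O`, and `S` below are illustrative:

```python
# Illustrative sketch: RHS indirection = gather, LHS indirection = scatter.

M = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0]]          # float(B, R), B=2, R=3
I = [[2, 0],
     [1, 1]]                   # int32(B, N), N=2

# RHS indirection (gather): O(b, n) = M(b, I(b, n))
O = [[M[b][I[b][n]] for n in range(2)] for b in range(2)]
print(O)  # [[3.0, 1.0], [5.0, 5.0]]

# LHS indirection (scatter): S(b, I(b, n)) += 1
S = [[0, 0, 0] for _ in range(2)]
for b in range(2):
    for n in range(2):
        S[b][I[b][n]] += 1
print(S)  # [[1, 0, 1], [0, 2, 0]]
```

The scatter case is what makes LHS indirection harder to support: several iterations may write to the same output element, so the updates need reduction semantics to stay well-defined.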
As discussed in Slack, support for argmin/argmax operations should be added in the future.
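What makes argmin/argmax different from the reductions TC already has (`+`, `min`, `max`) is that the combine step carries a *pair* of values. A plain-Python sketch of that pairwise reduction (`argmax_reduce` is a hypothetical name, not a TC function):

```python
# Sketch: argmax as a reduction over (value, index) pairs --
# the combine operator such support would need to implement.

def argmax_reduce(xs):
    best_val, best_idx = xs[0], 0
    for i, v in enumerate(xs[1:], start=1):
        if v > best_val:            # combine step: keep the larger pair
            best_val, best_idx = v, i
    return best_val, best_idx

print(argmax_reduce([0.5, 2.5, 1.0]))  # -> (2.5, 1)
```

Because the combine operator is associative, the same reduction can be parallelized the way `max` already is; the extra work is threading the index through the reduction.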
```python
import tensor_comprehensions as tc
import math as ma
import torch
import torch.nn as nn
import torch.nn.functional as F
import random
import time
import os
from torch.autograd import Function

LANG...
```
I am experimenting with a simple TC op; the calculation finishes quickly, but the program then hangs and never exits. Both GPU and CPU remain at full utilization even after...
@ttheodor I autotuned the function on a server with 4 GPUs and got a cache file. I then tried to reuse the generated cache file on a different server with 4 GPUs, but it throws an error...
Hi all. When playing around with large convolution filters, I discovered unexpected behavior: with a large filter, if the input image size exceeds 32, a standard convolution outputs gibberish...