Jutho comments

Results 241 comments of


                                            Jutho

Consider splitting out CuArrays?

Hi @amilsted. My apologies for the late response; I was on holidays. This is probably a good suggestion. I initially developed the CUDA wrappers in a separate package (https://github.com/Jutho/CuTensorOperations.jl ),...

more general catdomain and catcodomain?

I am certainly open to suggestions, but how would you like this to work or look like, even for the case of a tensor without symmetries (i.e. an object of...

more general catdomain and catcodomain?

Maybe the construction with the embedding is actually the easiest way also to provide an actual implementation. I often find the most difficult question to be the name and interface...

Correct way to parallelize this code?

What `data1` and `data2` do you have in mind? I.e. could you just post a runnable example, using some random generated data, so that I can test what is going...

Correct way to parallelize this code?

I can reproduce your timings and observe that something is going wrong with threading. When printing out the `threadid()` when actually doing the work, it shows `1` is most cases,...

Correct way to parallelize this code?

Although, digging a bit deeper, I don't think that's really the issue. Giving the large number of allocations, there seems to be something going on with type inference. Note that...

Correct way to parallelize this code?

I guess this has to do with some compiler heuristics on the complexity of tuples and nested parametric types.

Correct way to parallelize this code?

There is a constant `const MINTHREADLENGTH = 1

Correct way to parallelize this code?

I have not been following very actively myself all the recent developments. I think you want to check out things like LoopVectorization.jl , which will also do other optimizations

Not understand the tutorial about symmetry part

The tutorial can probably use some work. This notation means, build a vector space which carries a representation of ℤ₂ where irrep 0 (the trivial irrep) appears 3 times, and...