xla
xla copied to clipboard
Fix sided ouput index computation for transpose fusion
trafficstars
There exist bug in ComputeThreadIdToOutputIndexing func. Currently, this func can not calculate indexing map correctly for sided output. Fix it and add corresponding test.