NyuziToolchain issues

LLDB: allow evaluating function expressions

``` (lldb) p (int) strcmp("abcd", "efgh") error: Can't run the expression locally: Interpreter doesn't handle one of the expression's opcodes ``` Probably dependent on issue #84

jbush001

feature

Generate masked forms for sext8/sext16

In lib/Target/Nyuzi/NyuziInstrInfo.td, SEXT8VS, SEXT8VVM, SEXT8VSM, SEXT16VS, SEXT16VVM, an SEXT16VSM have their instruction matching patterns commented out, e.g.: ``` def SEXT16VSM : FormatRMaskedOneOpInst< (outs VR512:$dest), (ins GPR32:$mask, GPR32:$src2, VR512:$oldvalue), "sext_16_mask $dest,...

jbush001

cleanup

Cache control instructions with offset

Currently the assembler/compiler does not support generating cache control instructions with an offset. Should either add support/tests for these or eliminate the feature from the instruction set.

jbush001

minor

investigate

Port POCL

A compiler pass vectorizes appropriate functions: http://portablecl.org/docs/html/kernel_compiler.html https://github.com/pocl/pocl/tree/master/lib/llvmopencl

jbush001

feature

investigate

Support C++ exceptions

Explicitly disabled here: https://github.com/jbush001/NyuziToolchain/blob/2098fd3a8bd8fa511ad9d52cfcdc989d503baa95/tools/clang/lib/Driver/ToolChains/Clang.cpp#L439 Requires defining ABI and backend support. Also requires porting libunwind to link for target code

jbush001

feature

Optimize ADDE/ADDC/SUBE/SUBC

1

Currently these use branches, which are unnecessary. Grab carry from the comparison result register. Implement in NyuziISelLowering.

jbush001

minor

performance

Optimize constant pool/global memory accesses

With new implementation, a constant pool load takes 3 instructions: ``` movehi s0, hi(.LCPI1_0) or s0, s0, lo(.LCPI1_0) load_32 s0, (s0) ``` This can be done in two instructions by...

jbush001

performance

Remove redundant cmpne_i instructions

In some cases, the backend will emit: ``` cmpne_i s1, s2, 0 bnz s1, label ``` The cmpne is redundant and could be removed: ``` bnz s2, label ```

jbush001

minor

performance

Implement tail calls

Currently these are converted to normal calls. Probably minor performance improvement.

jbush001

minor

performance

Use LLVM masked load/store intrinsics

Currently, Nyuzi uses custom intrinsics for vector operations: __builtin_nyuzi_gather_loadi/f __builtin_nyuzi_gather_loadi/f_masked __builtin_nyuzi_scatter_storei/f __builtin_nyuzi_scatter_storei/f_masked Switch to using pre-defined LLVM intrinsics for these: http://llvm.org/docs/LangRef.html#masked-vector-load-and-store-intrinsics For unmasked variants, set the mask to all ones...

jbush001

cleanup

NyuziToolchain
NyuziToolchain copied to clipboard

Metadata

LLDB: allow evaluating function expressions

Generate masked forms for sext8/sext16

Cache control instructions with offset

Port POCL

Support C++ exceptions

Optimize ADDE/ADDC/SUBE/SUBC

Optimize constant pool/global memory accesses

Remove redundant cmpne_i instructions

Implement tail calls

Use LLVM masked load/store intrinsics

← Metadata

Owner

Metadata

NyuziToolchain NyuziToolchain copied to clipboard

Metadata

← Metadata

Owner

Metadata

NyuziToolchain
NyuziToolchain copied to clipboard