spatial-lang
spatial-lang copied to clipboard
Spatial: "Specify Parameterized Accelerators Through Inordinately Abstract Language"
Initially this can be round robin allocation, which is probably good enough for apps with many load/store channels In the future partitioning of data structures can also be implemented
If you parallelize an unaligned load, you could potentially get the wrong result in the app since the Linebuffer assumes a constant enqueue stride and therefore has one wren signal....
The bug where it takes two reads to actually get the data from a register over AXI.. I tried delaying the axi_rvalid signal inside of the AXI4LitetoRFBridge verilog but I...
Example: From Sort_Radix, this can be merged to be a single store node ``` if (valid_buffer == a) { // println("dumping buf a") data_dram store a_sram } else { //...
Rather than use separate DSP blocks for multiplies and then adds, the Xilinx MACC IP can be used. This would reduce DSP utilization.
URAMs are 72 bits wide, which means that 2 32-bit words can fit in each entry (or 4 16-bit words, etc.) This can be done in the case of sequential...
Because fringe backpressure should be good for cmdStream now
Using this issue to track all initiation interval calculation issues - [ ] #151 - Should depend on length between read and write, e.g. s(i) = f( s(i - 1)...
The F1 has UltraRAMs which can be used for larger SRAMs. However, SRAMs need to be explicitly assigned to URAMs using the following syntax: `(* ram_style = "ultra" *) reg...
Stefan found this in vgg_1d, pasted below. I think this is a bug, unless if there is something logical I should do when dispatched to multiple duplicates? Issue is on...