Deep-Learning-Accelerator-SW

NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.

Results 9 Deep-Learning-Accelerator-SW issues

TensorRT 10.0.1.6 GA is out for Jetson AGX Orin. Is it compatible?

Hello, the README's description of the Mul operator says that DLA supports it natively. But when I use it, only by adjusting the Mul operator to fp16 can...

Platform: Jetson AGX Orin 64GB; OS: 5.1.2; DLA: 3.12.1. Sigmoid layers are used as the output of the model, and the input and output shape of the sigmoid is (8, 3,...

I am trying to convert an ONNX model to a TensorRT engine using the trtexec utility. The engine should run in INT8, so I generated a calibration file...

I have a Jetson AGX Orin 64GB and I’m testing it against the [Orin Dense Performance](https://github.com/NVIDIA/Deep-Learning-Accelerator-SW?tab=readme-ov-file#dla-dense-performance) section of the page. I have downloaded your models and used your commands...

Fix for #30: removed duplicates on lines 106 & 110

I noticed some duplicated entries in operators/README.md. Here are the details:
https://github.com/NVIDIA/Deep-Learning-Accelerator-SW/blob/ea61780e417c6e29380760cca651e4cb2b8b55cb/operators/README.md?plain=1#L60
https://github.com/NVIDIA/Deep-Learning-Accelerator-SW/blob/ea61780e417c6e29380760cca651e4cb2b8b55cb/operators/README.md?plain=1#L106
https://github.com/NVIDIA/Deep-Learning-Accelerator-SW/blob/ea61780e417c6e29380760cca651e4cb2b8b55cb/operators/README.md?plain=1#L74
https://github.com/NVIDIA/Deep-Learning-Accelerator-SW/blob/ea61780e417c6e29380760cca651e4cb2b8b55cb/operators/README.md?plain=1#L110
Thanks!

Hi, I am trying to run `cuDLALayerwiseStatsStandalone`. I ran into issues when compiling with `make`: ``` >>> WARNING - nvscibuf.h not found, Waiving the sample > WARNING - nvscisync.h...

My process is running, but tegrastats and nsys do not work.

```shell
cat /sys/devices/platform/host1x/15880000.nvdla0/power/runtime_status  # DLA0
cat /sys/devices/platform/host1x/158c0000.nvdla1/power/runtime_status  # DLA1
```

https://forums.developer.nvidia.com/t/nsight-systems-not-seeing-profile-ranges-when-dla-is-enabled/107382/6
https://docs.nvidia.com/nsight-systems/UserGuide/index.html
https://github.com/lix19937/history/tree/main/orin/nsight
LayerwiseStats: https://github.com/NVIDIA/Deep-Learning-Accelerator-SW/tree/main/samples/cuDLA/cuDLALayerwiseStatsStandalone
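The two `runtime_status` reads in the issue above could be wrapped in a small helper that reports both DLA cores at once. This is only a sketch: the function name `dla_runtime_status` and the `DLA_SYSFS_ROOT` override are hypothetical, introduced here for illustration, and the node names are copied from the issue, so they may differ on other boards or software releases.

```shell
#!/bin/sh
# Print the runtime power-management status of each DLA core.
# DLA_SYSFS_ROOT is a hypothetical override (not an NVIDIA variable) that lets
# the helper be pointed at another directory, for example in a test.
dla_runtime_status() {
    root="${DLA_SYSFS_ROOT:-/sys/devices/platform/host1x}"
    # Node names taken from the issue text; assumed, not verified on other boards.
    for node in 15880000.nvdla0 158c0000.nvdla1; do
        file="$root/$node/power/runtime_status"
        if [ -r "$file" ]; then
            printf '%s: %s\n' "$node" "$(cat "$file")"
        else
            # Missing or unreadable node: report it instead of failing.
            printf '%s: unavailable\n' "$node"
        fi
    done
}

dla_runtime_status
```

The Linux runtime PM interface reports values such as `active` or `suspended` in this file, so a core that stays `suspended` while the process runs suggests the workload is not actually reaching that DLA core.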