nntrainer
NNTrainer is a software framework for training neural network models on devices.
## In this PR This PR adds the is_NaN function to check whether a tensor contains a NaN value. This is used to detect NaN during mixed precision training....
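The NaN check described above can be sketched as follows. This is a minimal illustration, not nntrainer's actual implementation; it relies on the property that NaN is the only floating-point value not equal to itself:

```python
def tensor_has_nan(values):
    """Return True if any element is NaN (NaN is the only value where v != v)."""
    return any(v != v for v in values)

print(tensor_has_nan([1.0, 2.0, float("nan")]))  # prints True
print(tensor_has_nan([1.0, 2.0, 3.0]))           # prints False
```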
## In this PR This PR adds a mixed precision test case. Network: Input - FC - MSE, with "batch_size=2", "model_tensor_type=FP16-FP16", "loss_scale=128". **Self evaluation:** 1. Build test: [X]Passed [ ]Failed...
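For context, `loss_scale=128` refers to the standard loss-scaling trick in mixed precision training: the loss is multiplied by the scale before backpropagation so that small FP16 gradients do not underflow, and the gradients are divided by the same scale before the optimizer step. A minimal sketch under that assumption (the `grad_fn` callback is a hypothetical stand-in for a real backward pass):

```python
LOSS_SCALE = 128.0  # matches "loss_scale=128" in the test configuration above

def scaled_backward(loss, grad_fn):
    scaled_loss = loss * LOSS_SCALE         # scale up before backward
    grads = grad_fn(scaled_loss)            # gradients computed from the scaled loss
    return [g / LOSS_SCALE for g in grads]  # unscale before the optimizer step

# toy backward pass: gradient proportional to the (scaled) loss
grads = scaled_backward(2.0, lambda l: [l * 0.01])
```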
## In this PR This PR enables FP16 support for the layers below: input layer, MSE loss layer. Resolves: **Self evaluation:** 1. Build test: [X]Passed [ ]Failed...
- We have executed the LLaMA model (downloaded from HuggingFace: https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) using NNTrainer and obtained the following output by following these steps: 1. File changes made before running the LLaMA...
Some code can break the build with the `-Denable-fp16=true` option, as in https://github.com/nnstreamer/nntrainer/pull/2545. I think it would be helpful to add this build to CI to ease the burden on reviewers.
[ Wait for #2500 ] [ BLAS ] Refactor blas/math related files into cpu backend considering arch-dep
While implementing additional features in NEON, I found myself writing unnecessary code blocks. This is a draft proposal to refactor the current blas/math related files. **DONE** -...
Some activation types were missing from EnumList. Added the missing types to EnumList and reordered EnumList to match ActivationType. **Self evaluation:** 1. Build test: [X]Passed [...
## In this PR This PR finalizes the mixed precision support in NNTrainer. It modifies the network graph, layer node, and layer implementations. However, it does not support mixed...
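As background, the usual mixed precision scheme such a change builds on keeps an FP32 master copy of the weights, computes in FP16, and skips any update whose gradients are non-finite. A hypothetical sketch of that pattern (not nntrainer's actual API; `apply_update` and its parameters are illustrative names):

```python
import math

def apply_update(master_weights, grads, lr=0.1):
    # skip the step if any gradient overflowed to NaN/Inf;
    # a real trainer would typically also lower the loss scale here
    if any(not math.isfinite(g) for g in grads):
        return master_weights
    # the update is applied to the FP32 master copy, not the FP16 working copy
    return [w - lr * g for w, g in zip(master_weights, grads)]
```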
Add Mixed Precision example on Application - this example can guide developers in handling an fp16 example - we can test & evaluate our model end-to-end - we can optimize base...
- add tanh-based approximate GELU (tanh gelu) for vision transformer. - rename quick gelu to sigmoid gelu (a sigmoid-based approximate GELU) **Self evaluation:** 1. Build test: [X]Passed [ ]Failed [ ]Skipped...
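The two approximations named above are commonly written with the following standard formulas; this is an illustration, not the PR's exact code:

```python
import math

def tanh_gelu(x):
    # tanh-based approximation: 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    return 0.5 * x * (1.0 + math.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)))

def sigmoid_gelu(x):
    # sigmoid-based approximation ("quick GELU"): x * sigmoid(1.702 * x)
    return x / (1.0 + math.exp(-1.702 * x))
```

Both approximate the exact GELU `x * Phi(x)` (with `Phi` the standard normal CDF); the tanh form is the one typically used in vision transformer implementations.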