dave-rtzr comments

Results 4 comments of


                                            dave-rtzr

undefined symbol errors for libtransformer_server.so

Same here. Any updates?

학습 모델 로드시 질문

아마 모델을 분산학습 시키셔서 모든 weight들의 이름이 module.~ 이런식으로 되어 있을 겁니다. 따라서 그냥 model.load_state_dict(checkpoint['model_state_dict'])를 하면 key error가 발생합니다.

[optimizer.cpp::computeCosts::1981] Error Code 10: Internal Error (Could not find any implementation for node {ForeignNode[Reshape_419 + Transpose_420...Gather_2423]}

I had a similar issue in tensorrt:22.02-py3 as the following, ``` [optimizer.cpp::computeCosts::2011] Error Code 10: Internal Error (Could not find any implementation for node ``` and solved by increasing the...

RuntimeError: CUDA error: CUBLAS_STATUS_EXECUTION_FAILED when calling `cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)`

I faced with this problem multiple times and found out the version mismatch between CUDA and pytorch was the problem. Try to update your torch and CUDA version.