Paddle issues

paddlepaddle2.3.1训练wide&deep速度慢

3

### 请提出你的问题 Please ask your question paddlepaddle2.3.1训练wide&deep速度K40 比 V100慢了10倍具体的速度对比 K40 ![image](https://user-images.githubusercontent.com/6941942/182832513-4ebed7ec-52ad-48b2-9468-16da6f900f45.png) V100 ![image](https://user-images.githubusercontent.com/6941942/182832394-12c3e2fb-b573-4370-bc69-6ce7488c70d4.png) 可能是哪些问题导致的

veridone

status/following-up

type/question

While_loop OP 在训练中正常，在paddle inference中报错

6

### 请提出你的问题 Please ask your question 我们使用 paddle.static.nn.while_loop() 构造循环体时，如果在训练program取循环结果的OP，能正常输出；当我们通过paddle.static.save_inference_model() 将program 导出成推断模型时，取循环结果的 OP 会报错，而且都是GPU方面的错。后来我们参照 paddlenlp 中 GPT-3 [循环解码部分](https://github.com/PaddlePaddle/PaddleNLP/blob/develop/examples/language_model/gpt-3/static/modeling.py#L1230)，将paddle.static.nn.while_loop() 改为 paddle.fluid.layers.While() ，发现同样的现象，在训练中取循环结果都是正常的，但是导出成推断模型，取循环结果总是报GPU方面的错误。后面初步定位，在不开启paddle.inference.Config().collect_shape_range_info()的情况下不会报错，在开启的情况下会报错。如下是相关的信息： 1. 测试模型训练中正常取循环体结果，即类似GPT解码多步预测结果： ```python 2022-08-05 10:21:53,426 - INFO...

HoratioJSY

status/following-up

type/question

windows develop分支从源码编译 cmake报错

### bug描述 Describe the Bug 按照官网步骤，在windows develop分支下：执行到 cmake .. -GNinja -DWITH_GPU=ON 报错。 release/2.3 cmake 正常。 ![QQ图片20220805155829](https://user-images.githubusercontent.com/35183089/183116149-96085a19-0bd7-45a0-9d67-74a1161e047b.png) ### 其他补充信息 Additional Supplementary Information _No response_

xiaohemaikoo

status/following-up

type/bug-report

put_along_axis for int

2

### 请提出你的问题 Please ask your question 如果输入数组和value都是int，是否有put_along_axis 的替代函数，目前文档上写着put_along_axis只支持float32和float64

tangmingkai

status/following-up

type/question

paddle.cos 输出错误

2

### bug描述 Describe the Bug import paddle import math x = paddle.to_tensor([160.22123718, 7*math.pi]) out = paddle.cos(x) print(out) 输出out 第一项为 -1 ### 其他补充信息 Additional Supplementary Information _No response_

taylover2016

status/following-up

type/bug-report

Paddle Inference 将所有参数与 OP放在 GPU 上推理模型

14

### 请提出你的问题 Please ask your question 在 Paddle inference 中，如果配置 paddle.inference.Config.enable_use_gpu() 能够启动 GPU 推理，但是我们在启动 config.enable_profile() 的时候发现 CPU 和 GPU 之前存在不少通信，有一些变量也会放在 CPU 上，所以有没有一种方法强制在 paddle inference 将所有计算节点与参数都放在 GPU 上？部分性能 profile: ```python...

HoratioJSY

status/following-up

type/question

develop版本paddle.autograd.grad()报错

3

### bug描述 Describe the Bug develop版本下，paddle.autograd.grad()报错：SystemError: (Fatal) Null autograd_meta gotten from unsafe_autograd_meta() 2.3版本下可正常运行代码见：https://github.com/PaddlePaddle/PaddleScience/pull/142 [报错位置](https://github.com/PaddlePaddle/PaddleScience/blob/d98f626e22351f08fee8f411f860f176da627455/paddlescience/network/grad_norm.py#L84)： ```python for i in range(losses.shape[0]): grad = paddle.autograd.grad(losses[i], W, retain_graph=True) norms.append(paddle.norm(self.loss_weights[i] * grad[0], p=2)) ```...

Asthestarsfalll

status/following-up

type/bug-report

三机八卡频繁hang住

2

三机八卡的训练出现了频繁hang住的问题，基本上每训半天就会hang住，log中没有任何报错信息，每次hang在不同的step上，麻烦帮忙看一下。

Alittleegg

Fix reorder bug in Conv MKLDNN

6

### PR types Bug fixes ### PR changes OPs ### Describe Fix reorder bug in conv MKLDNN

yeliang2258

Fix mkldnn interpolate ops

3

### PR types Bug fixes ### PR changes Others ### Describe When the interpolate OP only specifies the output dim but does not specify the scale, the value of the...

yeliang2258

Paddle
Paddle copied to clipboard

Metadata

paddlepaddle2.3.1训练wide&deep速度慢

While_loop OP 在训练中正常，在paddle inference中报错

windows develop分支从源码编译 cmake报错

put_along_axis for int

paddle.cos 输出错误

Paddle Inference 将所有参数与 OP放在 GPU 上推理模型

develop版本paddle.autograd.grad()报错

三机八卡频繁hang住

Fix reorder bug in Conv MKLDNN

Fix mkldnn interpolate ops

← Metadata

Owner

Metadata

Paddle Paddle copied to clipboard

Metadata

← Metadata

Owner

Metadata

Paddle
Paddle copied to clipboard