Paddle issues

tensor.numpy()执行大量数据从GPU拷贝到CPU速度缓慢

2

### 需求描述 Feature Description tensor.numpy()执行大量数据从GPU拷贝到CPU速度缓慢，5M数据执行tensor.numpy()耗费了1.4s，完全不可接受！啥原因呢？ ### 替代实现 Alternatives _No response_

liukaiyueyuo

status/new-issue

type/feature-request

Optimize topk's performance when k is small and input_width is large

1

### PR types Performance optimization ### PR changes OPs ### Describe Optimize topk's performance when k is small and input_width is large dtype：FP32，循环测试10w次，取平均值。优化后在k值较小且input_width较大时速度提升3-4倍。

carryyu

Limit python version to >=3.7

1

### PR types Breaking changes ### PR changes Others ### Describe Fix https://github.com/PaddlePaddle/Paddle/issues/46314

gongweibao

it is a test ,test=ljd_test

1

### PR types Others ### PR changes Others ### Describe 这是个测试PR，验证在优化精准测map中的一些想法

risemeup1

Add bernoulli primitive op and support dropout op in new AD.

2

### PR types New features ### PR changes OPs ### Describe Add `bernoulli_p` autograd primitive op and support orig2prim for paddle original `dropout` op.

levi131

【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速

10

### PR types New features ### PR changes OPs ### Describe deformable_conv_v1 算子实现 float16 数据类型支持。通过benchmark中测试用例，float32与float16前向速度~~接近~~更快： | Case No. | x_shape|offset_shape|weight_shape|mask_shape | data_type | Paddle Perf(ms) | |---|---|---|---|---|---|---| | 1...

Rayman96

contributor

status: proposed

PaddlePaddle Hackathon 3 No.45 & 46】：为 Paddle cumsum和logcumsumexp 支持 float16 数据类型

5

thunder95

contributor

fix libpaddle.so to paddle.so which soname is libpaddle.so

1

### PR types Bug fixes ### PR changes Others ### Describe 上次core_avx.so 的名字变更成libpaddle.so了，现在遇到一个问题。soname是liblibpaddle.so，导致运行时找lib库的时候找不到。需要将 soname 修改成libpaddle.so就可以了。

zh794390558

[Paddle-TRT]Fix cast bug

1

### PR types Others ### PR changes Others ### Describe layer->getOutput()->setType() may fail, use layer->setOutputType() instead. in trt 8.4 setType fails， but ok in trt 8.2, so strange，so I use...

zhoutianzi666

contributor

status: not progressed

Unable to use Paddle Library

2

### 请提出你的问题 Please ask your question While importing Paddle library from paddle OCR getting an error since we do not have permission to create path in Home directory on analysis...

DeepanChakravarti

status/following-up

type/question

Paddle
Paddle copied to clipboard

Metadata

tensor.numpy()执行大量数据从GPU拷贝到CPU速度缓慢

Optimize topk's performance when k is small and input_width is large

Limit python version to >=3.7

it is a test ,test=ljd_test

Add bernoulli primitive op and support dropout op in new AD.

【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速

PaddlePaddle Hackathon 3 No.45 & 46】：为 Paddle cumsum和logcumsumexp 支持 float16 数据类型

fix libpaddle.so to paddle.so which soname is libpaddle.so

[Paddle-TRT]Fix cast bug

Unable to use Paddle Library

← Metadata

Owner

Metadata

Paddle Paddle copied to clipboard

Metadata

← Metadata

Owner

Metadata

Paddle
Paddle copied to clipboard