Paddle icon indicating copy to clipboard operation
Paddle copied to clipboard

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Results 1049 Paddle issues
Sort by recently updated
recently updated
newest added
trafficstars

### PR Category Execute Infrastructure ### PR Types Improvements ### Description 这里是继续 https://github.com/PaddlePaddle/Paddle/pull/73094 中 paddle.inner的修改 PaddleAPITest 测试通过 GPU ![image](https://github.com/user-attachments/assets/3aa9988f-0b88-44fb-a2b2-7a5cc38cad4a) CPU ![image](https://github.com/user-attachments/assets/ab733630-5b7a-4cca-9ad0-63cde4e0f6b0) 代码只增加一个reshape,没有其他增加,性能测试使用的是全为1的shape,实际中这样的情况不多,使用这个分支执行的也不多 修改了PaddleTest的测试用例,看是否可以修改 https://github.com/PaddlePaddle/PaddleTest/pull/3096

contributor

### PR Category Execute Infrastructure ### PR Types Improvements ### Description 17 paddle.geometric.segment_max 18 paddle.geometric.segment_mean 19 paddle.geometric.segment_min 20 paddle.geometric.segment_sum 31 paddle.incubate.segment_max 32 paddle.incubate.segment_mean 33 paddle.incubate.segment_min 34 paddle.incubate.segment_sum paddle.incubate.* 算子和paddle.geometric.*相同 修改前向和反向,CPU/GPU...

contributor

### PR Category Execute Infrastructure ### PR Types Improvements ### Description 原问题是输入小于等于0时 gammaln 的梯度会直接返回 0,参照 torch 的实现对 gammaln_grad kernel 进行了修改 回测结果: ![image](https://github.com/user-attachments/assets/f1b268e4-5385-4718-a082-ab30af3e2dcf)

contributor

### PR Category Execute Infrastructure ### PR Types Improvements ### Description 修改infermeta跳过0-size检查,同时修改符号推导 修改前向和反向, CPU/GPU/XPU,反向填充0,torch中没有对应接口,是使用silu PaddleAPITest 测试都通过 GPU ![image](https://github.com/user-attachments/assets/94c535f0-6b07-41a7-ace3-246342693e4f) CPU ![image](https://github.com/user-attachments/assets/627cae61-da9a-4be7-810e-d9d049dee2f7)

contributor

### PR Category User Experience ### PR Types Others ### Description profiler 不使用 fluid 头文件

contributor

### PR Category Environment Adaptation ### PR Types Others ### Description Add Auto-Parallel pcard-67164

skip-ci: sot
skip-ci: coverage
skip-ci: mac
skip-ci: xpu
skip-ci: inference
skip-ci: distribute
skip-ci: static-check

### PR Category Auto Parallel ### PR Types New features ### Description card-73263 自动并行动半流水并行基础组件 Schedules

### PR Category Auto Parallel ### PR Types New features ### Description add cp&sep strategy cp: ring attention sep: segment parallel strategy, similar as Deepspeed Ulyssess Pcard-91295

### PR Category Operator Mechanism ### PR Types Bug fixes ### Description Pcard-85711

### PR Category ### PR Types ### Description