PaddleNLP icon indicating copy to clipboard operation
PaddleNLP copied to clipboard

pipelines可以直接导入非txt的文件

Open lugangmail opened this issue 2 years ago • 1 comments

Feature request

对于pipelines,目前导入的数据似乎只能是目录下的txt文件。 而目前手头的资料,有一些是word和pdf文件。 因此希望能直接导入word和pdf文件。

Motivation

避免用户将精力放到处理非AI核心业务以外。

Your contribution

我觉得以我目前的水平,只能对产品提出改进意见作为我的贡献。 例如增加可视化部分。类似easydl,全程向导。

lugangmail avatar Sep 14 '22 09:09 lugangmail

please refer to this pr https://github.com/PaddlePaddle/PaddleNLP/pull/3439

w5688414 avatar Oct 11 '22 11:10 w5688414

This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。

github-actions[bot] avatar Dec 11 '22 00:12 github-actions[bot]

This issue was closed because it has been inactive for 14 days since being marked as stale. 当前issue 被标记为stale已有14天,即将关闭。

github-actions[bot] avatar Dec 26 '22 00:12 github-actions[bot]