towhee icon indicating copy to clipboard operation
towhee copied to clipboard

[Bug]: use pipeline, when the python program works for a while, the process hang on, the process exist, and has no exception. But client cannot access the interface anymore

Open angelapytao opened this issue 1 year ago • 8 comments

Is there an existing issue for this?

  • [X] I have searched the existing issues

Current Behavior

I use pipeline found another issue, when the python program works a while, the process hang on, the process exist, and has no exception. But client cannot access the interface anymore. image There is no _end_op ends with status: RunnerStatus.FINISHED, graph_context.py-graph_context:136 - INFO: Graph 25872 end., so I think why the program hangs on...

Expected Behavior

No response

Steps To Reproduce

No response

Environment

- Towhee version(e.g. v0.1.3 or 8b23a93):0.6.1
- OS(Ubuntu or CentOS):CentOS
- CPU/Memory:
- GPU:GeForce RTX 3090
- Others:

Anything else?

No response

angelapytao avatar Aug 01 '22 07:08 angelapytao

if the problem occurs again, please use stack or pstrace to find where the process hangs. I have run the pipeline 1000000 times, but the problem has not occurred.

junjiejiangjjj avatar Aug 03 '22 02:08 junjiejiangjjj

I use strace found image

angelapytao avatar Aug 08 '22 01:08 angelapytao

cat /proc/142637/stack image @junjiejiangjjj

angelapytao avatar Aug 08 '22 01:08 angelapytao

Any other infos? the stack is very similar to this https://stackoverflow.com/questions/68125101/python-multiprocessing-getting-stuck maybe because of multiprocessing.

junjiejiangjjj avatar Aug 08 '22 08:08 junjiejiangjjj

I raise up the single python process separately, within single python process, there is no multiprocessing.

angelapytao avatar Aug 09 '22 09:08 angelapytao

@angelapytao were you ever able to resolve your issue?

fzliu avatar Aug 24 '22 05:08 fzliu

@fzliu still exist. occasionally happened.

angelapytao avatar Aug 27 '22 02:08 angelapytao

image

angelapytao avatar Aug 27 '22 02:08 angelapytao

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. Rotten issues close after 30d of inactivity. Close the stale issues and pull requests after 7 days of inactivity. Reopen the issue with /reopen.

stale[bot] avatar Sep 26 '22 11:09 stale[bot]