cvat icon indicating copy to clipboard operation
cvat copied to clipboard

How to monitor/view logs during slow data import

Open yf-yang opened this issue 1 year ago • 2 comments

Actions before raising this issue

  • [X] I searched the existing issues and did not find anything similar.
  • [X] I read/searched the docs

Is your feature request related to a problem? Please describe.

We are mounting s3 via s3fs into a host machine, then deploy cvat via docker-compose and mount the s3 folder into the container (Connected File Share storage).

When using high level API to create a task,

with make_client() as client:
 task = client.tasks.create_from_data()

The procedure can succeed, so I can make sure the call is OK.

However, sometimes when creating a task with a list of ~10000 image files in the s3fs path, each file is 100KB, with option use_cache=True and image_quality=70 and chunk_size=100, the upload call stucks there without returning for hours, before I manually kill the process.

The problem is, I am unable to observe any charts or logs to see what happens.

Describe the solution you'd like

Provide docs and guidance on how to check CVAT's status. I can only make sure that container CPU and mem and IO usage rate are fine. Not sure if that's something related to the CVAT's internal mechanism or maybe the s3fs access is slow.

Describe alternatives you've considered

No response

Additional context

No response

yf-yang avatar Nov 09 '23 04:11 yf-yang

Let me summarize the demands:

  • When a task with multiple files is creating, how do I know what stage it is and how many files are already processed?
  • Where can I find the log for the task? Is it for the whole task or lines of logs for each file in the task?.
  • If I want to boost the performance, how do I know if the CPU or memory or network IO (of the CVAT service) is the bottleneck?

yf-yang avatar Nov 09 '23 05:11 yf-yang

On the top right corner there is a user dropdown menu. In the dropdown select the admin page and then you can view the DJANGO_RQ in which there is a queue having some data about the ongoing tasks. Screenshot 2024-04-01 at 18 16 43

Bin-Saleem avatar Apr 01 '24 12:04 Bin-Saleem