
Project is disconnected

Open spartajet opened this issue 1 year ago • 16 comments

Search before asking

  • [X] I have searched the HUB issues and found no similar bug report.

HUB Component

Projects

Bug

My project is disconnected and I cannot get the cloud training result. Model - 5 June 2024 18:48

Environment

No response

Minimal Reproducible Example

No response

Additional

No response

spartajet avatar Jun 06 '24 03:06 spartajet

👋 Hello @spartajet, thank you for raising an issue about Ultralytics HUB 🚀! Please visit our HUB Docs to learn more:

  • Quickstart. Start training and deploying YOLO models with HUB in seconds.
  • Datasets: Preparing and Uploading. Learn how to prepare and upload your datasets to HUB in YOLO format.
  • Projects: Creating and Managing. Group your models into projects for improved organization.
  • Models: Training and Exporting. Train YOLOv5 and YOLOv8 models on your custom datasets and export them to various formats for deployment.
  • Integrations. Explore different integration options for your trained models, such as TensorFlow, ONNX, OpenVINO, CoreML, and PaddlePaddle.
  • Ultralytics HUB App. Learn about the Ultralytics App for iOS and Android, which allows you to run models directly on your mobile device.
    • iOS. Learn about YOLO CoreML models accelerated on Apple's Neural Engine on iPhones and iPads.
    • Android. Explore TFLite acceleration on mobile devices.
  • Inference API. Understand how to use the Inference API for running your trained models in the cloud to generate predictions.

If this is a 🐛 Bug Report, please provide screenshots and steps to reproduce your problem to help us get started working on a fix.

If this is a ❓ Question, please provide as much information as possible, including dataset, model, environment details etc. so that we might provide the most helpful response.

We try to respond to all issues as promptly as possible. Thank you for your patience!

github-actions[bot] avatar Jun 06 '24 03:06 github-actions[bot]

Hello,

Thank you for reporting this issue with your project disconnection. To assist you better, could you please provide a bit more detail about the steps leading up to the disconnection? Additionally, any error messages or logs that were generated during the disconnection would be very helpful.

We'll work to resolve this as quickly as possible once we have a bit more information.

pderrenger avatar Jun 06 '24 05:06 pderrenger

Thanks. However, I just configured the classification task on your HUB website and paid, and then I could not get the training result, as shown in the picture Snipaste_2024-06-06_15-08-01.

spartajet avatar Jun 06 '24 07:06 spartajet

@spartajet Can you share the model ID so I can investigate this further?

The issue might be GPU availability as I just tested and everything works fine. Read more about Cloud Training: https://docs.ultralytics.com/hub/cloud-training

sergiuwaxmann avatar Jun 06 '24 07:06 sergiuwaxmann

I think "JADqyjS8ocsOQsULppOS" is my model ID, taken from the share dialog. Thank you!

spartajet avatar Jun 06 '24 08:06 spartajet

@spartajet Perfect, thank you! I will keep you updated.

sergiuwaxmann avatar Jun 06 '24 08:06 sergiuwaxmann

Hi @pderrenger 👋 I have the same issue.

When I train a model, it trains normally. I reach the point of "Optimizing Weights", but then it gets stuck in that state for around 5 minutes. After that I get the following screen saying the model is "Disconnected". image

It lets me retry training from the last checkpoint, but the same issue happens again. You can see here that I tried a couple of times. image

This is how it looks in the Models Panel image

I had a similar issue earlier today when uploading datasets. The dataset was not huge (2GB). I got an "Unknown Error". This happened to me twice with slightly different versions of my dataset. On the third try, I was able to upload it.

PS: Things were working fine for me 2 days ago. Maybe something changed yesterday or today.

emilio-balda avatar Jun 07 '24 13:06 emilio-balda

@emilio-balda My dataset is just 54M, and I do not get any error message ...

spartajet avatar Jun 07 '24 14:06 spartajet

@emilio-balda Hello! Can you please share your model ID so I can investigate this further?

sergiuwaxmann avatar Jun 08 '24 10:06 sergiuwaxmann

@sergiuwaxmann Hi! I saw above that the issue might be GPU availability, so I tried resuming training in the middle of the night and that worked 🚀

image

In case you want to take a look, here is my model ID: aASmZT80nXYK3VTG9rIw

emilio-balda avatar Jun 10 '24 06:06 emilio-balda

@emilio-balda Thank you! Yes, I will look into this but I am glad it worked in the end.

sergiuwaxmann avatar Jun 10 '24 08:06 sergiuwaxmann

Hey @sergiuwaxmann I am having the same issue over and over again. Sometimes it works when I retry, but the last couple of tries it unfortunately didn't.

Is there anything to keep in mind?

Here are two projects that failed before: Uqlbyy57FiLg394OlkXU nBWRPqNKwzLUPmDwfESX

jankalthoefer avatar Jul 09 '24 20:07 jankalthoefer

@jankalthoefer Hello! I believe the issue might be related to GPU availability. During the 15-minute timeout, we attempt to spin up the dedicated instance for Cloud Training, but it might fail due to GPU availability. This issue has been logged, and we are working on improving the error handling.
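For anyone scripting retries around this in the meantime, here is a minimal sketch of retrying with exponential backoff. It does not reflect how HUB handles the timeout internally; `InstanceUnavailable`, `retry_with_backoff`, and the `start` callable are hypothetical names standing in for whatever starts or resumes your training run, and the delay values are arbitrary assumptions.

```python
import time
from typing import Callable, Optional


class InstanceUnavailable(Exception):
    """Hypothetical error: no GPU instance could be started in time."""


def retry_with_backoff(
    start: Callable[[], str],
    max_attempts: int = 5,
    base_delay_s: float = 60.0,
    max_delay_s: float = 900.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call `start` until it succeeds or the attempts run out.

    The wait doubles after each failure and is capped at `max_delay_s`.
    Returns the value from `start`, or None if every attempt failed.
    """
    delay = base_delay_s
    for attempt in range(1, max_attempts + 1):
        try:
            return start()
        except InstanceUnavailable:
            if attempt == max_attempts:
                return None  # give up after the final failed attempt
            sleep(delay)
            delay = min(delay * 2, max_delay_s)
    return None
```

Spacing the attempts out may help because, as reported earlier in this thread, resuming at a quieter time of day succeeded where immediate retries did not.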

sergiuwaxmann avatar Jul 10 '24 09:07 sergiuwaxmann

Got it! Let me know if you have any updates. For now I am moving to Colab then.

jankalthoefer avatar Jul 10 '24 14:07 jankalthoefer

@jankalthoefer Sure thing! Apologies for the inconvenience.

sergiuwaxmann avatar Jul 10 '24 15:07 sergiuwaxmann

👋 Hello there! We wanted to give you a friendly reminder that this issue has not had any recent activity and may be closed soon, but don't worry - you can always reopen it if needed. If you still have any questions or concerns, please feel free to let us know how we can help.

For additional resources and information, please see the links below:

  • Docs: https://docs.ultralytics.com
  • HUB: https://hub.ultralytics.com
  • Community: https://community.ultralytics.com

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLO 🚀 and Vision AI ⭐

github-actions[bot] avatar Aug 10 '24 00:08 github-actions[bot]