roboflow-python icon indicating copy to clipboard operation
roboflow-python copied to clipboard

Incorrect Data Path in YOLOv8 Dataset Configuration

Open tomershukhman opened this issue 11 months ago • 3 comments

Bug Report: Incorrect Data Path in YOLOv8 Dataset Configuration

Description

While using the Roboflow Python client to download a YOLOv8 dataset for a tennis ball tracking project, I encountered an issue with the generated data.yaml file. The file's paths were incorrectly set, leading to a RuntimeError when attempting to train the YOLO model.

Steps to Reproduce

Download the dataset using the Roboflow Python client:

from roboflow import Roboflow
rf = Roboflow(api_key="YOUR_API_KEY")
project = rf.workspace("denica-tran").project("tennisballtracker")
version = project.version(9)
dataset = version.download("yolov8")

from ultralytics import YOLO
model = YOLO('yolov8n.pt')

dataset_path ="TennisBallTracker-9/data.yaml"
model.train(data=dataset_path, epochs=5)

Observe the downloaded TennisBallTracker-9 folder with the standard YOLOv8 dataset structure. Check the data.yaml file, which incorrectly references paths for training, validation, and test images. Expected Behavior The data.yaml file should correctly point to the respective image directories within the dataset folder.

Actual Behavior

The data.yaml file contains incorrect paths, leading to a RuntimeError when the YOLO model attempts to access the image directories:

RuntimeError: Dataset 'TennisBallTracker-9/data.yaml' error ❌ 
Dataset 'TennisBallTracker-9/data.yaml' images not found ⚠️, missing path '/.../android_figurine/datasets/TennisBallTracker-9/TennisBallTracker-9/valid/images'
Note dataset download directory is '/..../android_figurine/datasets'. You can update this in '/..../Ultralytics/settings.yaml'

TL;DR

The issue might be related to residual settings from a previous project (android_figurine) that seems to interfere with the current dataset configuration. It's unclear why the old dataset path is being referenced.

Temporary Solution

I modified the data.yaml file manually to correct the paths, which allowed the training to proceed successfully:

names:

ankle
court
elbow
eye
hand
head
hip
knee
net
nose
person
player
shoulder
tennis ball
tennis racket
nc: 15
path: /..../TennisBallTracker-9/
test: test/images
train: train/images
val: valid/images

Environment

Python version: 3.12



tomershukhman avatar Mar 13 '24 19:03 tomershukhman

same problem

xy-23 avatar Apr 09 '24 03:04 xy-23

Hey @tomershukhman . Did some digging around the source code and struggled to see if this was an Ultralytics issue or a pathing issue in Roboflow. Did changing the ultralytics settings affect anything? If not, can you check your environment variables for a DATASET_DIRECTORY? If you don't provide a location in version.download(), it will check the environment for this variable and use it for pathing, rather than defaulting on just the filename.

Ricerob avatar Apr 18 '24 22:04 Ricerob

I posted this error in ultralytics repository and this was their response.

guiaugustoga987 avatar May 20 '24 18:05 guiaugustoga987