dstack icon indicating copy to clipboard operation
dstack copied to clipboard

[Feature]: Better logging when a run terminates due to max_duration

Open james-boydell opened this issue 1 year ago • 2 comments

Problem

In the server console logs, it's unclear that a run was terminated due to max_duration. Attaching image showing when a run was started and the messaging shown 6 hours later (default max_duration).

image

Solution

No response

Workaround

No response

Would you like to help us implement this feature by sending a PR?

No

james-boydell avatar Sep 17 '24 14:09 james-boydell

@james-boydell, I agree we should improve the run failure reason in that case. Still, it's recommended to check run diagnostic logs on fails. They are available to users that don't have access to server logs and may contain more information than server logs:

Do dstack logs --diagnose run_name and you'd see:

...
time=2024-09-19T04:32:24.589936-04:00 level=error msg=Max duration exceeded max_duration=180
time=2024-09-19T04:32:24.590001-04:00 level=info msg=Job state changed new=terminated

r4victor avatar Sep 19 '24 08:09 r4victor

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Oct 20 '24 02:10 github-actions[bot]

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Nov 20 '24 02:11 github-actions[bot]

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Dec 21 '24 01:12 github-actions[bot]

This issue was closed because it has been inactive for 14 days since being marked as stale. Please reopen the issue if it is still relevant.

github-actions[bot] avatar Jan 05 '25 02:01 github-actions[bot]