Open
Labels: bug, discussion
Description
From #2807 (comment)
- Autosubmit version: 4.1.16+ (any dev version)
Reproducible Example
Run any experiment that takes a while, then kill the remote jobs. The _STAT file is not generated.
Autosubmit (AS) and the API expect every job to have an end time. For a killed job, the end time is only set when the user runs the workflow again: the log retrieval process then sets the end date to the time log retrieval was launched, which can be hours, days, weeks, or months after the job actually ended.
Expected Behaviour
This is open for discussion for now. cc @f-macchia, @dbeltrankyl, @LuiggiTenorioK
We have a few options, I think, e.g.:
- Get the API or one of its workers to update the end time when it's missing
- Get AS to query the scheduler (Slurm, etc.) for the time the remote job stopped, or, on platforms without Slurm, use the modification time of the last log
- Add a command for that
- Add a cron job for that
- Get a warning in the API so the user is aware it's missing an end date (maybe show that in the metrics page?)
- etc.,
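
To make the scheduler/log option more concrete, here is a minimal Python sketch of how a missing end time could be estimated. It is not Autosubmit code: the function names, the `log_dir`/`job_name` parameters, and the log naming pattern are all hypothetical. It also assumes `sacct` can be run from where the code executes (in practice AS would need to run it through its platform connection), and that the timestamps `sacct` prints are in the cluster's local time.

```python
from __future__ import annotations

import subprocess
from datetime import datetime
from pathlib import Path


def parse_sacct_end(output: str) -> datetime | None:
    """Parse the output of `sacct --format=End --noheader --parsable2`.

    Returns the first valid timestamp, or None if Slurm reports no end
    time (e.g. "Unknown") or the output is empty.
    """
    for line in output.splitlines():
        value = line.strip()
        if not value or value in ("Unknown", "None"):
            continue
        try:
            return datetime.strptime(value, "%Y-%m-%dT%H:%M:%S")
        except ValueError:
            continue
    return None


def end_time_from_slurm(job_id: str) -> datetime | None:
    """Ask Slurm accounting for the end time of a job (hypothetical helper)."""
    try:
        result = subprocess.run(
            ["sacct", "-j", job_id, "-X", "--noheader", "--parsable2", "--format=End"],
            capture_output=True, text=True, check=True, timeout=30,
        )
    except (OSError, subprocess.SubprocessError):
        # sacct is missing, failed, or timed out: let the caller fall back.
        return None
    return parse_sacct_end(result.stdout)


def end_time_from_logs(log_dir: Path, job_name: str) -> datetime | None:
    """Fallback: use the newest modification time among the job's log files.

    Assumes (hypothetically) that log file names start with the job name.
    """
    mtimes = [p.stat().st_mtime for p in log_dir.glob(f"{job_name}*") if p.is_file()]
    return datetime.fromtimestamp(max(mtimes)) if mtimes else None


def estimate_end_time(job_id: str, log_dir: Path, job_name: str) -> datetime | None:
    """Prefer the scheduler's answer; otherwise fall back to log mtimes."""
    return end_time_from_slurm(job_id) or end_time_from_logs(log_dir, job_name)
```

The same `estimate_end_time` function could be called from a new command, a cron job, or an API worker, so this sketch does not commit to any one of the options above. When it returns `None`, the API could show the "missing end date" warning instead of a wrong value.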