-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Global identifier for the job monitoring #12192
Comments
This is exactly why I was proposing usage of Data Tracers in recent USCMS monitoring talk. If we introduce traces into WM workflows we can easily correlate different sources. |
Hi all, Just to clarify: @nikodemas Concerning the string concat: I dont think it is too critical. We could also think of adding a scripted field that just concats the parts non persistent. For
I am not 100% sure, if the array accessing works like this, couldn't test it, as I do not have the permissions. So please see it more as an idea than a proper implementation😅 Cheers, |
@RHofsaess , I made a proposal to WM group about data traces, will it take into consideration is up to priority and stakeholds push. If you feel it is desired feature feel free to reach our WM L2's and express your desire to be included into WM plan for upcoming quarter(s). |
@nikodemas the actual order of monitoring information is:
Are you requesting to have these document ids (or A job id) common between these 2 monitoring systems? Can you please remind us again what is the global id defined in condor-based job information? |
yes, it would be nice to have it to be able to match data from the two different data sources. Currently we have a classad called |
Ok, then we would have to persist this And in ArchiveDataReporter, when we construct documents to upload to WMArchive, we need to also add this @nikodemas do you know when this |
@amaltaro I am not sure when, but it is available for all of the statuses such as Held and Idle, and spider script doesn't fill or modify this value at all and it is simply just taken as it is received from htcondor. |
Impact of the new feature
WMArchive
Is your feature request related to a problem? Please describe.
It is impossible to match Jobs between the htcondor job monitoring and WMArchive monitoring. @RHofsaess found a way to match the jobs on the
EOSLogURL
, but it only works for the failed jobs.Describe the solution you'd like
It would be nice to have a global identifier that would be available in the WMArchive and could also be propagated to htcondor monitoring.
Describe alternatives you've considered
EOSLogURL
could be created for all jobs (not only the failed ones), however, this would require some string concats on the htcondor monitoring side that seem to be prone to breaking. See picture from the presentation of @RHofsaess (link to the full presentation):FYI @leggerf
The text was updated successfully, but these errors were encountered: