-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Requesting February Release Cycle Priority Update from ARAs and KPs: Jaeger/OpenTel (telemetry) - Provide evidence that your tools can reliably support 20 sequential queries with a maximum of 10 seconds pause between each query (our current goal for performance in the short term) #7
Comments
Multiomics KPs are served by Service Provider. |
Confirmed. Exposure Provider's KPs are served by Automat. Note that the CQS should probably be added to your list of ARAs, with Jason R. assigned. |
Confirmed for Clinical Data Provider |
Confirmed for Unsecret Agent. We didn’t use Jaeger since we aren't making TRAPI calls to KPs, but we have verified that our ARA can handle 20 consecutive queries on both CI and TEST environments. Please refer to the attached files for detailed timings as evidence of this performance. |
As @kaiwenho says, we have confirmed that Unsecret can handle 20 consecutive queries, even though we aren't using Jaeger (since we aren't making external TRAPI calls). |
Our CHP API currently runs with an average runtime of 308ms per query. We've tested 20 sequential queries and complete in ~31 seconds. We've implemented auto instrumentation using Open Telemetry and uWSGI. However, we are facing issues with traces and metrics not exporting to the console, and collector support has not been tested yet. We're seeking assistance to understand why our implementation isn't aligning with the Open Telemetry documentation. |
Hi @GregHydeDartmouth , are you using other exporters besides the Console exporter? we have been using the Jaeger exporter, if it helps we can discuss on the slack jaeger channel. |
Here are updates for the TMKP services: The DocumentMetadata API has fully implemented OpenTelemetry and is sending traces to the SRI-run Jaeger collector in ITRB. The DocumentMetadata API does not accept TRAPI queries, but does meet the initial service level objective of 90% of requests within 150ms. This would also more than satisfy the requirement of 20 sequential requests if it applied. The Targeted Assertion API is hosted by the Service Provider. The Literature Cooccurrence API has OpenTelemetry implemented in it, but the format is not accepted by the current version of the ITRB collector. Per discussions, this will be addressed in a future version of the collector at which point it will be able to receive telemetry from the Literature Cooccurrence API. This API has been tested with 24 simultaneous queries and is able to complete them, more than satisfying the requirement of 20 sequential queries. |
Confirmed for Exploring Agent/Service Provider. We didn't test via Jaeger, instead, here is the output of this performance test script: Results (click to expand)
This is a test of 20 simultaneous one-hops to BTE CI, which should satisfy the 20 sequential queries requirement. Update: Below is test results of 20 simultaneous Creative Mode Treats queries, which all respond within the 5-minute time limit: Results (click to expand)
|
Confirmed that MolePro can handle support 20 sequential queries with a maximum of 10 seconds pause; however we were not able to make Jaeger work. |
I use locust for load testing, which showed that our systems (improving-agent) were able to handle 50 sequential queries with a max 10 second pause. By the way in the original message, improving agent is tagged with QuiPrimusAbOris, but I don't know who that is. |
Thank you for the update |
Hello TACT representatives for ARAs and KPs. This priority item for the current release cycle requires an update from each ARA and KP on the status of this work. The goal for Jaeger/OpenTel (telemetry) - Provide evidence that your tools can reliably support 20 sequential queries with a maximum of 10 seconds pause between each query (our current goal for performance in the short term) was this:
The goal is to support 20 queries. Jaeger is part of the process. KPs need to support 20 one-hops whenever a query is run. This is a minimum goal to show that your tool can handle a small amount of load. Future goals will need to be more component specific.
Individuals representing ARAs and KPs on TACT, please add a comment to this ticket with your status update on the goal of this task, check the box off here in the issue (if you can and remember). We would greatly appreciate your response by the end of Wednesday January 17, 2024
ARAs
KPs
Thank you.
The text was updated successfully, but these errors were encountered: