Allow to distinguish between dataloader fetched fields in datafetcher observations and offer dataloader instrumentation #1034

mmuth · 2024-07-22T11:47:09Z

Feature Request
Hi there - I am currently migrating from graphql-kickstart to spring-graphql. The new observations in the GraphQlObservationInstrumentation are really great!

However I suffer from a huge amount of "noise spans" when using dataloaders, as the traces contain a Span for every resolved Child node (with an graphql.field.path in an array-like iteration /myfield/edges[x]). That means for a GQL request with a 5000 page size, 5000 spans are added even though only one single database request happens in the end and this is the interesting part to investigate on the resulting trace.

I'd propose that we can somehow recognize the "dataloaded fields" (e.g by attaching a graphql.datafetcher.completeablefuture=true KevValue as an extra highCardinalityKeyValue for each observation; or something similar), then one would be able to filter them later. To make this complete, an Observation for all called dataloaders would be very helpful.

The text was updated successfully, but these errors were encountered:

bclozel · 2024-08-20T14:04:00Z

Thanks for reaching out. I had a look at our instrumentation and wrote a test case for this. We can consider several changes here and some of them are quite involved, so I have scheduled this for our next feature release.

As far as I can see, it is completely transparent to a DataFetcher caller whether it is using a DataLoader internally or not. Also, there are various strategies that can trigger the DataLoader dispatch - by default this is called at each depth level of the query.

We can extend our SelfDescribingDataFetcher to expose whether this is a batching operation. I believe we could automatically expose this for @BatchMapping handlers, but of course not for custom data fetcher registrations.
With that extra information, we could then make it available to our DataFetcherObservationContext and turn this information into a low cardinality key-value for the data fetcher observation. We would need to think about a proper name, but graphql.datafetcher.batching=true comes to mind.

We can also consider instrumenting DataLoader operations, but this is more challenging. Instrumenting DataLoader as they are being registered in the DataLoaderRegistry is complex, because we would need to involve the Observation API there and get the actual ObservationRegsitry instance at that point. Right now, instrumentation is neatly separated in its GraphQlObservationInstrumentation class.

We could, when the request execution starts, get the DataLoaderRegistry and for each registered data loader:

get the DataLoader from the registry
unregister it
wrap it for instrumentation purposes
re-register it under the original key

We would then need to create a new context/convention/key-values set for this new graphql.dataloader observation.
This would certainly make the observability better but we should also consider instrumentation overhead.

rstoyanchev · 2024-09-05T08:59:40Z

For the route with SelfDescribingDataFetcher and @BatchMapping methods, we can also support cases where DataLoader is injected into a controller method. That would cover all of @Controller usage, which would be a good place to be.

spring-projects-issues added the status: waiting-for-triage An issue we've not yet triaged label Jul 22, 2024

bclozel added the in: core Issues related to config and core support label Jul 22, 2024

bclozel self-assigned this Aug 20, 2024

bclozel added type: enhancement A general enhancement and removed status: waiting-for-triage An issue we've not yet triaged labels Aug 20, 2024

bclozel added this to the 1.4.x milestone Aug 20, 2024

bclozel mentioned this issue Sep 4, 2024

Fields beind solved sequentially in the same VirtualThread #1051

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow to distinguish between dataloader fetched fields in datafetcher observations and offer dataloader instrumentation #1034

Allow to distinguish between dataloader fetched fields in datafetcher observations and offer dataloader instrumentation #1034

mmuth commented Jul 22, 2024

bclozel commented Aug 20, 2024

rstoyanchev commented Sep 5, 2024 •

edited

Loading

Allow to distinguish between dataloader fetched fields in datafetcher observations and offer dataloader instrumentation #1034

Allow to distinguish between dataloader fetched fields in datafetcher observations and offer dataloader instrumentation #1034

Comments

mmuth commented Jul 22, 2024

bclozel commented Aug 20, 2024

rstoyanchev commented Sep 5, 2024 • edited Loading

rstoyanchev commented Sep 5, 2024 •

edited

Loading