(Large) Provenance MVP #630

SeanDuHare · 2026-01-07T17:37:43Z

Context

We want to provide users the ability to show context about how files relate to each other AKA "provenance" this includes things like how a file was generated (ex. script) or more conceptual things how a file relates to a publication.

(Shoutout @aswallace started this implementation :) )

Changes

Lots of changes! New components for generating the provenance graph were created, though much of the rendering is managed by xyflow (see components/NetworkGraph). Logical entities were also created to hold complexity (see entity/Graph). Those are the major changes you should check out. However, additional cleanup to some busy components was done to alleviate some duplicate work that needed to happen in the graph components & to make testing easier.

Testing

Lots of manual input of provenance & new unit tests. The lower level components like FileNode and MetadataNode can't be tested using our current testing system unfortunately - hopefully this can be remedied in the upcoming improvements to our infrastructure.

Try it here using a small, but high fidelity provenance dataset. Working to get a provenance table going for using with the AICS FMS source directly

Example using the above linked dataset:

This was created using this provenance source:

& this dataset:

…feature/network-graph

SeanDuHare · 2026-01-07T19:46:38Z

packages/web/src/components/ErrorPage/index.tsx

This came up in my testing quite a bit since some provenance actions were causing this (they shouldn't and I fixed each instance of it happening as it they came but they were and an unseemly default error page was appearing) so this is just a simple error page I added to replace the default from react-router that we can improve on later should we want to.

SeanDuHare · 2026-01-07T19:48:43Z

packages/core/hooks/useRemoteFileUpload.ts

This hook is very much changed. This previously was checking the remote server itself then returning a callback. Instead, all this does is return a callback and the check for the remote server now happens in interaction/logics to avoid this check happening more than once per application load. Previously it was happening several times since the component this hook lived in was tied to what created the context menu for file interactions which gets destroyed in certain scenarios. This got updated in this changeset because the sheer amount of logs this was generating was very distracting

kmitcham · 2026-01-09T17:25:03Z

How would this look if each zarr had 10 segmented cell images? Seems like it would be hard to fit/display that. That is more a UX question that a code one, though.

packages/core/components/NetworkGraph/Nodes/nodeMenuItems.ts

SeanDuHare · 2026-01-12T18:11:44Z

How would this look if each zarr had 10 segmented cell images? Seems like it would be hard to fit/display that. That is more a UX question that a code one, though.

This would depend on the organization strategy chosen. The default is tree which is I assume what you're asking since it is likely what most people will use. In this case it would become a rather large tree with lots of siblings like you see in the example with Plate Barcode having many children. Currently, the amount of nodes present at any given time is limited to ~75 unless the user requests more get loaded in. Definitely a concern though & UX is collecting feedback from users on various aspects of this - I'll make sure to pass along this concern as well !

packages/core/components/CoreContent/CoreContent.tsx

packages/core/components/FileDetailPanel/index.tsx

packages/core/components/NetworkGraph/Edges/DefaultEdge.tsx

packages/core/components/NetworkGraph/Nodes/FileNode.tsx

BrianWhitneyAI · 2026-01-12T22:57:59Z

packages/core/entity/Graph/index.ts

+     * on a previous run used up all the node search afforded
+     */
+    public get hasMoreToSearch(): boolean {
+        return this.numberOfNodesAfforded <= 0;


Is this right? we are saying there are more nodes to search if == 0 (or less but I dont think thats possible)

return this.numberOfNodesAfforded > 0;

Ah yea this is a bit confusing naming wise (open to suggestions) this is answering the question of whether or not there could be more nodes to search not if we can search more nodes during this run. The idea here is that if we exhaust our searching without running out of afforded nodes then we can assume if have searched everything we possibly can

I think theres two things that are confusing here. 1) the name of the function hasMoreToSearch doesnt refrence what its searching and 2) numberOfNodesAfforded seems like it should be a constant from the name. Maybe another value like this.remainingNodeAllocation combined with hasCompletedNodeSearch?

I changed hasMoreToSearch to hasMoreNodesToSearch - does that help for point 1? For point 2 I'm unsure of a better name - we need it to be a state variable as opposed to a constant because it tracks how many nodes we have checked that are novel

packages/core/entity/Graph/index.ts

packages/core/services/DatabaseService/index.ts

packages/web/src/components/ErrorPage/index.tsx

aswallace and others added 30 commits October 1, 2025 11:15

Create a network graph component with reactflow and dagre

541b5fe

Set up database service to be able to process provenance source

585678c

Install reactflow and dagre

cec133c

Create provenance state branch

ac942af

Add a full-screen modal for the network graph

7869372

Modify the data source modal to upload provenance data

e384897

Update description for provenance modal

8e1d536

Include provenance source in search params

25f2312

Fix broken searchparams unit tests

8201b51

Add comments to custom graph edge

964f165

Remove unnecessary undefined row check

45315a6

Make full screen modal take up more vertical space

79303c5

Use uid instead of optional id

65e3885

Move graph construction step to load later

3941d4f

Merge branch 'main' of github.com:AllenInstitute/biofile-finder into …

673320f

…feature/network-graph

Add custom file nodes

4d0de23

Allow markdown rendering in graph edges

2adb818

Merge branch 'main' of github.com:AllenInstitute/biofile-finder into …

7836f08

…feature/network-graph

Separate file and non-file node generation

9a1239f

Add explicit typing of parent/child

d695e4c

Convert graph logic into object oriented class

ae65ebf

AnnotationNode -> MetadataNode

228184f

Tidy up; add comments

5adabb9

Add unit tests

044e4f8

Get graph surveying entire network

a793f0d

Clean up with enums; file org; and test outlines

25c0027

Clean up with enum; add thumbnail view

7f24cb8

Try calculating rank

f075d00

Avoid unnecessary edges

ea9391e

Clean up unnecessary edge fix

a5d0ea6

SeanDuHare added 5 commits January 7, 2026 10:35

Add error handling to failed graph generation

570055e

Prevent duplicating remote server checks after init

2cbb1e2

Catch uncaught errors in react-router

7819f23

Avoid timeout for getting remote server status

281321a

Merge branch 'main' into feature/network-graph-2

1da31e8

SeanDuHare commented Jan 7, 2026

View reviewed changes

SeanDuHare added 2 commits January 7, 2026 11:56

Merge branch 'main' into feature/network-graph-2

a3b4e74

Prevent bad URL objects

f8a2f6c

SeanDuHare marked this pull request as ready for review January 7, 2026 20:31

SeanDuHare requested review from BrianWhitneyAI and aswallace as code owners January 7, 2026 20:31

SeanDuHare requested review from ascibisz, hughes036, kmitcham and tyler-foster January 7, 2026 20:32

SeanDuHare temporarily deployed to staging January 7, 2026 20:32 — with GitHub Actions Inactive

kmitcham reviewed Jan 12, 2026

View reviewed changes

packages/core/components/NetworkGraph/Nodes/nodeMenuItems.ts Show resolved Hide resolved

BrianWhitneyAI reviewed Jan 12, 2026

View reviewed changes

BrianWhitneyAI approved these changes Jan 12, 2026

View reviewed changes

aswallace temporarily deployed to staging January 13, 2026 00:36 — with GitHub Actions Inactive

aswallace deployed to staging January 13, 2026 20:04 — with GitHub Actions Active

SeanDuHare and others added 6 commits January 14, 2026 09:53

Minor PR feedback

1e337a8

Remove duplicate operation

f483f47

Add comments & fix charcode translation

421bd37

Fix first level of translation

304c611

Clean up error element ordering and text

91ddb93

Merge branch 'main' into feature/network-graph-2

61f30fe

(Large) Provenance MVP #630

Are you sure you want to change the base?

(Large) Provenance MVP #630

Conversation

SeanDuHare commented Jan 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Changes

Testing

Uh oh!

SeanDuHare Jan 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SeanDuHare Jan 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kmitcham commented Jan 9, 2026

Uh oh!

Uh oh!

SeanDuHare commented Jan 12, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

BrianWhitneyAI Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

BrianWhitneyAI Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

SeanDuHare Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

BrianWhitneyAI Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

SeanDuHare Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

SeanDuHare commented Jan 7, 2026 •

edited

Loading

SeanDuHare Jan 7, 2026 •

edited

Loading

SeanDuHare Jan 7, 2026 •

edited

Loading

SeanDuHare Jan 15, 2026 •

edited

Loading