Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorporate spatial organization into pipeline #46

Open
ngreenwald opened this issue May 7, 2024 · 0 comments
Open

Incorporate spatial organization into pipeline #46

ngreenwald opened this issue May 7, 2024 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@ngreenwald
Copy link
Member

ngreenwald commented May 7, 2024

  • Occupancy statistics
    • Currently, we don't have a spatial metric that describes how spread out cells are in an image. The goal of this metric would be to summarize whether a given cell population tends to be concentrated or dispersed.
    • The simplest way to calculate this is to divide each image up into a grid, and look within each tile of the grid to see how many cells are present. We then set a threshold for how many cells need to be in a tile for it to be marked as positive, then count the number of tiles that are positive.
      • For this approach, we should play around with the tile size and threshold for positivity in each tile to see how it impacts the results. Create summary plots showing the distribution of the score across the cohort, look at images on either end of the spectrum to make sure the score is capturing the visual impact in the image, and see how all of that changes as the two parameters above are varied.
      • After deciding on a single threshold for raw counts across specific cell types, next play around with normalized cell counts. This is important because this can allow for the selection of a specific threshold across all cell types, as opposed to a specific threshold for each cell type. Normalization can be done:
        • Per image
        • Across the cohort
    • The different normalization techniques to try are:
      • Raw z-score
      • Mean centering
    • A slightly more complex way would be to look at the actual distribution of cell counts across the tiles, rather than a binary yes/no. If the binary assignment seems to be doing a good job of capturing the relevant dynamics, we can skip this one, but if it doesn't do a great job, we should try this.
      • Some options would be looking at the entropy in cell counts across the tiles, either calculated in a spatially agnostic way, or taking into account neighboring tiles to produce a more holistic metric of the distribution of cells.
  • Look into RQI
  • Look into Moran's I spatial autocorrelation
  • Look into recent publication https://pubmed.ncbi.nlm.nih.gov/38417823/
@ngreenwald ngreenwald added the enhancement New feature or request label May 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants