Lineage tracing of Shh+ floor plate cells and dynamics of dorsal-ventral gene expression in the regenerating axolotl spinal cord

This project involves the analysis, modeling and visualization of gene expression data from multiple genes, conditions and replicas using a piecewise constant fitting approach.

Prerequisites

The following Python packages are required:

pandas
numpy
scipy
matplotlib
seaborn

You can install these packages using pip:

pip install pandas numpy scipy matplotlib seaborn

Data preprocessing

Loading data

The data is loaded from an Excel file (quantifications_all-n.xlsx). Each sheet in the Excel file contains distance and expression data for various genes, conditions and replicas.

Normalizing and binning data

Distances are normalized to a range of 0 to 100 and binned into specified intervals. The average expression within each bin is calculated.

Categorizing data by gene and condition

Data is organized into dictionaries based on genes and conditions, and statistical metrics are computed.

Visualizing data

Expression profiles: plots of gene expression profiles along the distance axis for each gene and condition.
Piecewise constant fitting: fitting the data using two-step and three-step piecewise constant functions to identify changes in expression levels.

Piecewise constant fitting

Two-step piecewise constant fitting

The two-step piecewise constant function fits the data with one switchpoint for most of the genes. The best fit is determined by evaluating the SSE for all potential switchpoints.

Three-step piecewise constant fitting

The three-step piecewise constant function is used to fit the data with two switchpoints for Pax6. The function evaluates different pairs of switchpoints to minimize the sum of squared errors (SSE).

Visualizing fitting results

SSE plots: scatter plots showing the SSE values for different switchpoints.
Expression plots: scatter plots of the data with the fitted piecewise constant function overlaid.

Statistical comparison

Intensity and size comparison: box plots comparing the corrected HCR signal intensity and domain size between conditions for each gene.

Statistical annotations

Statistical tests (Mann-Whitney and t-test) are performed to compare the differences between conditions, and the results are annotated on the plots.

Summary statistics

Summary statistics for gene expression data, including mean, median and std of intensity and size, are calculated and saved to a CSV file (summary_stats.csv).

Running the analysis

To run the analysis, execute the script in a Python notebooks with the necessary packages installed. The script will generate visualizations and save the results to the specified output files.

Contact

For questions or further information, please contact [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
conditions_comparison		conditions_comparison
data		data
fitting_mean		fitting_mean
individual_fittings		individual_fittings
mean_fits_with_data		mean_fits_with_data
.gitignore		.gitignore
DV_expression_analysis.ipynb		DV_expression_analysis.ipynb
LICENSE		LICENSE
README.md		README.md
quantifications_all-n.xlsx		quantifications_all-n.xlsx
summary_stats.csv		summary_stats.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lineage tracing of Shh+ floor plate cells and dynamics of dorsal-ventral gene expression in the regenerating axolotl spinal cord

Prerequisites

Data preprocessing

Loading data

Normalizing and binning data

Categorizing data by gene and condition

Visualizing data

Piecewise constant fitting

Two-step piecewise constant fitting

Three-step piecewise constant fitting

Visualizing fitting results

Statistical comparison

Statistical annotations

Summary statistics

Running the analysis

Contact

About

Releases 1

Packages

Languages

License

ecuracosta/dorsal-ventral_gene_expression_in_the_regenerating_axolotl_spinal_cord

Folders and files

Latest commit

History

Repository files navigation

Lineage tracing of Shh+ floor plate cells and dynamics of dorsal-ventral gene expression in the regenerating axolotl spinal cord

Prerequisites

Data preprocessing

Loading data

Normalizing and binning data

Categorizing data by gene and condition

Visualizing data

Piecewise constant fitting

Two-step piecewise constant fitting

Three-step piecewise constant fitting

Visualizing fitting results

Statistical comparison

Statistical annotations

Summary statistics

Running the analysis

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages