Install & Guide
๐๐ถ๐ฑ๐ฑ๐ฐ๐ณ๐ต ๐ต๐ฉ๐ฆ ๐ฐ๐ฑ๐ฆ๐ฏ ๐ด๐ฐ๐ถ๐ณ๐ค๐ฆ ๐ฎ๐ฐ๐ท๐ฆ๐ฎ๐ฆ๐ฏ๐ต ๐ต๐ฐ ๐ข๐ญ๐ญ๐ฆ๐ท๐ช๐ข๐ต๐ฆ ๐ธ๐ฐ๐ณ๐ฌ ๐ฑ๐ณ๐ฆ๐ด๐ด๐ถ๐ณ๐ฆ ๐ง๐ฐ๐ณ ๐ด๐ค๐ช๐ฆ๐ฏ๐ต๐ช๐ง๐ช๐ค ๐ฑ๐ฆ๐ฆ๐ณ๐ด, ๐ฆ๐ญ๐ช๐ฎ๐ช๐ฏ๐ข๐ต๐ฆ ๐ต๐ฆ๐ค๐ฉ๐ฏ๐ช๐ค๐ข๐ญ ๐ฃ๐ข๐ณ๐ณ๐ช๐ฆ๐ณ๐ด, ๐ข๐ฏ๐ฅ ๐ฑ๐ณ๐ฐ๐ฎ๐ฐ๐ต๐ฆ ๐ช๐ฏ๐ต๐ฆ๐ณ๐ฅ๐ช๐ด๐ค๐ช๐ฑ๐ญ๐ช๐ฏ๐ข๐ณ๐บ ๐ค๐ฐ๐ญ๐ญ๐ข๐ฃ๐ฐ๐ณ๐ข๐ต๐ช๐ฐ๐ฏ.
Note: To protect the originality of the manuscript, we will release the password for the key functional package during peer review, public person will have it after peer review has been completed.
1.Dependencies
Python = 3.9.18
(No compatibility testing performed)
Matlab = R2024a
(In theory, all versions are supported, because Matlab is backward compatible)
โโMatlab Tool Box = Parllel computing toolbox
Statics and machine learning toolbox
R = 4.3.2
Anaconda/Miniconda (Highly recommend)
2.PIP Install
pip install LittleSnowFox
3.Download the database
After installing LittleSnowFox using pip install LittleSnowFox
, the Matlab program will be successfully downloaded. However, the package does not include the accompanying data files. To avoid potential errors in downloading data files through the Python component, you need to manually download the data files. You can directly download the entire database
folder.
If the folder is placed correctly, you will be prompted with โDo you want to overwrite the database
folder?โ Select โYesโ to write the data files. Ensure that the data directory is correctly overwritten in the Anaconda environment during the process.
#Database folder Structure๏ผ
---[LittleSnowFox]
---[kl_sample]
---[workcatalogue]
---[database]
---[Clustering_sample]
---[e-cigarettes], [e-cigarettes_difference_expression], [fibroblasts]
---[data] #File processed by Jupyter Notebook
---[Rdata] #File processed by R
---[result] #File processed by Matlab
---main_v3_[Sample Name].m #Matlab Main Coding
---[Tracing_sample]
---[Hematopoiesis], [Nerveferroptosis], [Reprogramming]
---[data] #File processed by Jupyter Notebook
---[Rdata] #File processed by R
---[result] #File processed by Matlab
---[R_processing] #Preprocessing with R
---Hematopoiesis_all.R
---Hematopoiesis_prog.R
---R-Mature-Myocardial.R
---R-Mature-MI Tissue.R
---R-Mature-Ferroptosis.R
---Reprogramming_all.R
---Reprogramming_prog.R
---cigarette.ipynb #First step for e-cigarette analysis
---fibroblasts.ipynb #First step for fibroblasts analysis
---neuron.ipynb #First step for neuron analysis
---tracing.ipynb #First step for tracing analysis
3.1.Database & Matlab processing codes
#link
https://biocomputing.cowtransfer.com/s/090b3801dedd44
#Password
Please see the password attached to the manuscript. It will be made publicly accessible upon publication.
When installing TOGGLE (KALIN) using Anaconda, the path is typically:
./anaconda3/envs/[Your Environment Name]/Lib/site-packages/LittleSnowFox
Overwrite the files in
./LittleSnowFox/
3.2.R code
#link
https://biocomputing.cowtransfer.com/s/eee620f9e5124a
#password
Please see the password attached to the manuscript. It will be made publicly accessible upon publication.
3.2.R package
#link
https://drive.google.com/drive/folders/15NtoVXZbAn05JSMTRWEnWAn6eyNqXrng?usp=drive_link
We provide GEO dataset backup and R language library file backup
4.Cover Letter
๐ฉ๐๐๐๐๐๐ ๐ช๐๐๐๐๐๐๐๐๐:
Terminally differentiated cells undergo programmed cell death through distinct molecular pathways, including ferroptosis, apoptosis, and pyroptosis. However, identifying specific types remains challenging due to the complexity of signaling pathways, marker genes, and the limitations of current tools.
๐จ๐๐๐๐๐๐๐๐ ๐ช๐๐๐๐๐๐๐๐๐:
It is necessary to use unsupervised algorithms because the function and lineage changes of cells are a continuous process. If it is possible to obtain a broadly representative set of labeled samples that cover the entire fate map, then the status of cells on the fate map is known, eliminating the need for differentiation between function and lineage. The current cell tracking markers are not only expensive but also cannot be used on mature cells that no longer divide.
Even when addressing the same problem repeatedly, it is difficult to combine single-cell RNA transcription maps due to the inherent systematic bias characteristics of biological experiments, which often make supervised training infeasible.
which presents the following issues:
โข Endpoint loss (e.g., missing terminal samples like dead neurons in stroke).
โข Time loss (difficulty in sampling at multiple intervals).
โข Sample imbalance (uneven representation of cell stages), and the absence of clear fate boundaries (gradual transitions in processes like ferroptosis).
โข Lack of Boundaries: cell soft and same type and subtype lack type boundaries, fate prediction of neurons lacks clear boundaries between cell states
๐ฐ๐๐๐๐๐๐๐๐๐ ๐๐ ๐ด๐๐๐๐๐ ๐:
- Based on the ability to perform mature cell subtype/type classification and lineage tracing prediction, we extended functionality to enable classification within the same subtype/type of cells based on their specific functional states.
- While existing techniques can only conduct subtype classification and lineage tracing, we enhanced the resolution of these principles to enable programmed fate differentiation, distinguishing different stages of cell fate.
- We demonstrated the potential to detect RNA pathway alterations in the e-cigarette dataset and the ability to classify functional cell interactions in the rat myocardial infarction (MI) model dataset.
๐ป๐๐๐๐๐๐๐๐๐๐ ๐ฌ๐๐๐๐๐๐๐๐๐๐:
โข Subtype Classification: A method of categorizing cell types. Specific mature cells can differentiate into different subtypes under various conditions. For instance, CD4โบ T cells can transition into Th1 cells, Th2 cells, Treg cells, or Tfh cells. These all belong to the broader category of T cells but represent distinct subcategories. For clarity, we define that if a cell type does not generate other subtypes, it is considered to have only one subtype.
โข Lineage Tracing: A functional classification method for cells. It predicts the types of cells that a progenitor cell with differentiation potential will develop into. For example, hematopoietic cells may differentiate into monocytes or neutrophils.
โข Programmed Fate Differentiation: A classification of cell fate stages. Within the same cell subtype or type, specific programmed events are initiated for a particular purpose, such as healthy cells undergoing apoptosis or ferroptosis. This is a new topic we trying to talk for single cell.
Discover of Biology:
โข We mapped the trajectory of neuronal programmed cell death and identified neuron populations primarily undergoing ferroptosis.
โข Key ferroptosis-driving genes, including Ctsb, Mtdh, Ndrg1, Smad7, and others, were highlighted as potential therapeutic targets. By characterizing these pathways, we provide new insights into the regulation of ferroptosis, which is essential for rescuing neurons and mitigating damage in affected brain areas.
๐จ๐
๐๐๐๐๐๐
๐ช๐๐๐๐๐๐๐๐๐:
5.Raw data
We provide a format-converted dataset on the cloud drive.
If you wish to access the original dataset, you can do so through the following link.
๐๐๐๐ฃ๐ ๐จ ๐ฉ๐ค ๐ฉ๐๐๐จ๐ ๐๐ช๐ฉ๐๐ค๐ง๐จ ๐๐ค๐ง ๐ฉ๐๐๐๐ง ๐จ๐ฅ๐๐ง๐๐ฉ ๐ค๐ ๐จ๐๐๐ง๐๐ฃ๐. ๐๐๐๐ฎ ๐๐ง๐ ๐๐ง๐๐๐ฉ ๐ฅ๐๐ค๐ฅ๐ก๐.
Hematopoietic Development (GSE140802)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE140802
Fibroblast Reprogramming (GSE99915)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE99915
E-Cigarette and Nicotine Exposures (GSE183614)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE183614
Rat Myocardial Infarction (MI) model (GSE253578)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE253578
Programmed Cell Death Trajectories Prediction in Neurons (GSE232429)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE232429