๐˜š๐˜ถ๐˜ฑ๐˜ฑ๐˜ฐ๐˜ณ๐˜ต ๐˜ต๐˜ฉ๐˜ฆ ๐˜ฐ๐˜ฑ๐˜ฆ๐˜ฏ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ฎ๐˜ฐ๐˜ท๐˜ฆ๐˜ฎ๐˜ฆ๐˜ฏ๐˜ต ๐˜ต๐˜ฐ ๐˜ข๐˜ญ๐˜ญ๐˜ฆ๐˜ท๐˜ช๐˜ข๐˜ต๐˜ฆ ๐˜ธ๐˜ฐ๐˜ณ๐˜ฌ ๐˜ฑ๐˜ณ๐˜ฆ๐˜ด๐˜ด๐˜ถ๐˜ณ๐˜ฆ ๐˜ง๐˜ฐ๐˜ณ ๐˜ด๐˜ค๐˜ช๐˜ฆ๐˜ฏ๐˜ต๐˜ช๐˜ง๐˜ช๐˜ค ๐˜ฑ๐˜ฆ๐˜ฆ๐˜ณ๐˜ด, ๐˜ฆ๐˜ญ๐˜ช๐˜ฎ๐˜ช๐˜ฏ๐˜ข๐˜ต๐˜ฆ ๐˜ต๐˜ฆ๐˜ค๐˜ฉ๐˜ฏ๐˜ช๐˜ค๐˜ข๐˜ญ ๐˜ฃ๐˜ข๐˜ณ๐˜ณ๐˜ช๐˜ฆ๐˜ณ๐˜ด, ๐˜ข๐˜ฏ๐˜ฅ ๐˜ฑ๐˜ณ๐˜ฐ๐˜ฎ๐˜ฐ๐˜ต๐˜ฆ ๐˜ช๐˜ฏ๐˜ต๐˜ฆ๐˜ณ๐˜ฅ๐˜ช๐˜ด๐˜ค๐˜ช๐˜ฑ๐˜ญ๐˜ช๐˜ฏ๐˜ข๐˜ณ๐˜บ ๐˜ค๐˜ฐ๐˜ญ๐˜ญ๐˜ข๐˜ฃ๐˜ฐ๐˜ณ๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ.

Note: To protect the originality of the manuscript, we will release the password for the key functional package during peer review, public person will have it after peer review has been completed.

1.Dependencies

Python = 3.9.18(No compatibility testing performed)
Matlab = R2024a(In theory, all versions are supported, because Matlab is backward compatible)
โ€”โ€“Matlab Tool Box = Parllel computing toolbox
Statics and machine learning toolbox
R = 4.3.2
Anaconda/Miniconda (Highly recommend)

2.PIP Install

pip install LittleSnowFox

3.Download the database

After installing LittleSnowFox using pip install LittleSnowFox, the Matlab program will be successfully downloaded. However, the package does not include the accompanying data files. To avoid potential errors in downloading data files through the Python component, you need to manually download the data files. You can directly download the entire database folder.

If the folder is placed correctly, you will be prompted with โ€œDo you want to overwrite the database folder?โ€ Select โ€œYesโ€ to write the data files. Ensure that the data directory is correctly overwritten in the Anaconda environment during the process.

#Database folder Structure๏ผš
---[LittleSnowFox]
     ---[kl_sample]
     ---[workcatalogue]
     ---[database]
       ---[Clustering_sample]
              ---[e-cigarettes], [e-cigarettes_difference_expression], [fibroblasts]
                      ---[data] #File processed by Jupyter Notebook
                      ---[Rdata] #File processed by R
                      ---[result] #File processed by Matlab
                      ---main_v3_[Sample Name].m #Matlab Main Coding
       ---[Tracing_sample] 
              ---[Hematopoiesis], [Nerveferroptosis], [Reprogramming]
                      ---[data] #File processed by Jupyter Notebook
                      ---[Rdata] #File processed by R
                      ---[result] #File processed by Matlab
       ---[R_processing] #Preprocessing with R
              ---Hematopoiesis_all.R
              ---Hematopoiesis_prog.R
              ---R-Mature-Myocardial.R
              ---R-Mature-MI Tissue.R
              ---R-Mature-Ferroptosis.R
              ---Reprogramming_all.R
              ---Reprogramming_prog.R
       ---cigarette.ipynb #First step for e-cigarette analysis
       ---fibroblasts.ipynb #First step for fibroblasts analysis
       ---neuron.ipynb #First step for neuron analysis
       ---tracing.ipynb #First step for tracing analysis

3.1.Database & Matlab processing codes

#link
https://biocomputing.cowtransfer.com/s/090b3801dedd44

#Password  
Please see the password attached to the manuscript. It will be made publicly accessible upon publication.

Password.jpg

When installing TOGGLE (KALIN) using Anaconda, the path is typically:

./anaconda3/envs/[Your Environment Name]/Lib/site-packages/LittleSnowFox

Overwrite the files in

./LittleSnowFox/

3.2.R code

#link
https://biocomputing.cowtransfer.com/s/eee620f9e5124a

#password
Please see the password attached to the manuscript. It will be made publicly accessible upon publication.

Password.jpg

3.2.R package

#link
https://drive.google.com/drive/folders/15NtoVXZbAn05JSMTRWEnWAn6eyNqXrng?usp=drive_link

We provide GEO dataset backup and R language library file backup

4.Cover Letter

๐‘ฉ๐’Š๐’๐’๐’๐’ˆ๐’š ๐‘ช๐’‰๐’‚๐’๐’๐’†๐’๐’ˆ๐’†๐’”:
Terminally differentiated cells undergo programmed cell death through distinct molecular pathways, including ferroptosis, apoptosis, and pyroptosis. However, identifying specific types remains challenging due to the complexity of signaling pathways, marker genes, and the limitations of current tools.

๐‘จ๐’๐’ˆ๐’๐’“๐’Š๐’•๐’‰๐’Ž ๐‘ช๐’‰๐’‚๐’๐’๐’†๐’๐’ˆ๐’†๐’”:
It is necessary to use unsupervised algorithms because the function and lineage changes of cells are a continuous process. If it is possible to obtain a broadly representative set of labeled samples that cover the entire fate map, then the status of cells on the fate map is known, eliminating the need for differentiation between function and lineage. The current cell tracking markers are not only expensive but also cannot be used on mature cells that no longer divide.
Even when addressing the same problem repeatedly, it is difficult to combine single-cell RNA transcription maps due to the inherent systematic bias characteristics of biological experiments, which often make supervised training infeasible.
which presents the following issues:
โ€ข Endpoint loss (e.g., missing terminal samples like dead neurons in stroke).
โ€ข Time loss (difficulty in sampling at multiple intervals).
โ€ข Sample imbalance (uneven representation of cell stages), and the absence of clear fate boundaries (gradual transitions in processes like ferroptosis).
โ€ข Lack of Boundaries: cell soft and same type and subtype lack type boundaries, fate prediction of neurons lacks clear boundaries between cell states

๐‘ฐ๐’๐’๐’๐’—๐’‚๐’•๐’Š๐’๐’ ๐’๐’‡ ๐‘ด๐’†๐’•๐’‰๐’๐’…๐’”:

  1. Based on the ability to perform mature cell subtype/type classification and lineage tracing prediction, we extended functionality to enable classification within the same subtype/type of cells based on their specific functional states.
  2. While existing techniques can only conduct subtype classification and lineage tracing, we enhanced the resolution of these principles to enable programmed fate differentiation, distinguishing different stages of cell fate.
  3. We demonstrated the potential to detect RNA pathway alterations in the e-cigarette dataset and the ability to classify functional cell interactions in the rat myocardial infarction (MI) model dataset.

๐‘ป๐’†๐’“๐’Ž๐’Š๐’๐’๐’๐’๐’ˆ๐’š ๐‘ฌ๐’™๐’‘๐’๐’‚๐’๐’‚๐’•๐’Š๐’๐’:
โ€ข Subtype Classification: A method of categorizing cell types. Specific mature cells can differentiate into different subtypes under various conditions. For instance, CD4โบ T cells can transition into Th1 cells, Th2 cells, Treg cells, or Tfh cells. These all belong to the broader category of T cells but represent distinct subcategories. For clarity, we define that if a cell type does not generate other subtypes, it is considered to have only one subtype.
โ€ข Lineage Tracing: A functional classification method for cells. It predicts the types of cells that a progenitor cell with differentiation potential will develop into. For example, hematopoietic cells may differentiate into monocytes or neutrophils.
โ€ข Programmed Fate Differentiation: A classification of cell fate stages. Within the same cell subtype or type, specific programmed events are initiated for a particular purpose, such as healthy cells undergoing apoptosis or ferroptosis. This is a new topic we trying to talk for single cell. Discover of Biology:
โ€ข We mapped the trajectory of neuronal programmed cell death and identified neuron populations primarily undergoing ferroptosis.
โ€ข Key ferroptosis-driving genes, including Ctsb, Mtdh, Ndrg1, Smad7, and others, were highlighted as potential therapeutic targets. By characterizing these pathways, we provide new insights into the regulation of ferroptosis, which is essential for rescuing neurons and mitigating damage in affected brain areas.

๐‘จ๐’…๐’—๐’‚๐’๐’„๐’†๐’… ๐‘ช๐’๐’Ž๐’‘๐’‚๐’“๐’Š๐’”๐’๐’: Advanced.png

5.Raw data

We provide a format-converted dataset on the cloud drive.
If you wish to access the original dataset, you can do so through the following link.
๐™๐™๐™–๐™ฃ๐™ ๐™จ ๐™ฉ๐™ค ๐™ฉ๐™๐™š๐™จ๐™š ๐™–๐™ช๐™ฉ๐™๐™ค๐™ง๐™จ ๐™›๐™ค๐™ง ๐™ฉ๐™๐™š๐™ž๐™ง ๐™จ๐™ฅ๐™ž๐™ง๐™ž๐™ฉ ๐™ค๐™› ๐™จ๐™๐™–๐™ง๐™ž๐™ฃ๐™œ. ๐™๐™๐™š๐™ฎ ๐™–๐™ง๐™š ๐™œ๐™ง๐™š๐™–๐™ฉ ๐™ฅ๐™š๐™ค๐™ฅ๐™ก๐™š.

Hematopoietic Development (GSE140802)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE140802

Fibroblast Reprogramming (GSE99915)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE99915

E-Cigarette and Nicotine Exposures (GSE183614)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE183614

Rat Myocardial Infarction (MI) model (GSE253578)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE253578

Programmed Cell Death Trajectories Prediction in Neurons (GSE232429)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE232429