runDeconvolution
- -h, --help
Print help messages.
- -v, --version
Print version of SDePER.
General options
- -n, --n_cores
Number of CPU cores used for parallel computing.
- Type:
integer
- Default:
1, i.e. no parallel computing
- --threshold
Threshold for hard thresholding the estimated cell type proportions, i.e. for one spot, estimated cell type proportions smaller than this threshold value will be set to 0, then re-normalize all proportions of this spot to sum as 1.
- Type:
float
- Default:
0, which means no hard thresholding
- --use_cvae
Control whether to build Conditional Variational Autoencoder (CVAE) to remove the platform effect between spatial transcriptomics and reference scRNA-seq data (true/false).
Building CVAE requires raw nUMI counts and corresponding cell type annotation of scRNA-seq data specified.
- Type:
boolean
- Default:
true
Tip
It is recommended to enable CVAE when there is an anticipated presence of platform effect between the spatial transcriptomics and reference scRNA-seq data.
- --use_imputation
Control whether to perform imputation (true/false).
Imputation requires the spot diameter (µm) at higher resolution to be specified.
- Type:
boolean
- Default:
false
- --diagnosis
If true, provide more output files related to CVAE building and hyperparameter selection for diagnosis.
- Type:
boolean
- Default:
false
- --verbose
Control whether to print more info such as output of each ADMM iteration step during program running (true/false).
- Type:
boolean
- Default:
true
Cell type marker identification options
Changed in version 1.1.0: Cell-type specific markers are identified by Differential analysis (DE) across cell-types in reference scRNA-seq data. We also perform cell and/or gene filtering before DE. Each time we ONLY compare the normalized gene expression (raw nUMI counts divided by sequencing depth) one cell-type (1st) vs another one cell-type (2nd) using Wilcoxon Rank Sum Test, then take the UNION of all identified markers for downstream analysis.
Before version 1.1.0, for each comparison genes with a FDR adjusted p value < 0.05 will be selected first, then these marker genes will be sorted by a combined rank of log fold change and pct.1/pct.2, and finally pick up specified number of genes with TOP ranks.
In version 1.1.0, the ranking strategy has been revised. Now we filter the marker genes with pre-set thresholds of p value (or FDR), fold change, pct.1 (percentage of cells expressed this marker in 1st cell-type) and pct.2 (percentage of cells expressed this marker in 2nd cell-type). Next we sort the marker genes by p value (or FDR) or fold change, and select the TOP ones.
- --n_marker_per_cmp
Number of selected TOP marker genes for each comparison of ONE cell-type against another ONE cell-type using Wilcoxon Rank Sum Test. For each comparison, genes passing filtering will be selected first, then these marker genes will be sorted by fold change or p value (or FDR), and finally pick up specified number of genes with TOP ranks. If the number of available genes is less than the specified number, a WARNING will be shown in the program running log file.
- Type:
integer
- Default:
20
Changed in version 1.2.1: Default value changed from 30 to 20.
- --use_fdr
Whether to use FDR adjusted p value for filtering and sorting. If true use FDR adjusted p value; if false orginal p value will be used instead.
- Type:
boolean
- Default:
true
Added in version 1.1.0.
- --p_val_cutoff
Threshold of p value (or FDR if
--use_fdris true) in marker genes filtering. By default only genes with p value (or FDR if--use_fdris true) <= 0.05 will be kept.- Type:
float
- Default:
0.05
Added in version 1.1.0.
- --fc_cutoff
Threshold of fold change (without log transform!) in marker genes filtering. By default only genes with fold change >= 1.2 will be kept.
- Type:
float
- Default:
1.2
Added in version 1.1.0.
- --pct1_cutoff
Threshold of pct.1 (percentage of cells expressed this marker in 1st cell-type) in marker genes filtering. By default only genes with pct.1 >= 0.3 will be kept.
- Type:
float
- Default:
0.3
Added in version 1.1.0.
- --pct2_cutoff
Threshold of pct.2 (percentage of cells expressed this marker in 2nd cell-type) in marker genes filtering. By default only genes with pct.2 <= 0.1 will be kept.
- Type:
float
- Default:
0.1
Added in version 1.1.0.
- --sortby_fc
Whether to sort marker genes by fold change. If true sort marker genes by fold change then select TOP ones. If false, p value (or FDR if
--use_fdris true) will be used to sort marker genes instead.- Type:
boolean
- Default:
true
Added in version 1.1.0.
- --filter_cell
Whether to filter cells with <200 genes for reference scRNA-seq data before differential analysis. NOTE we only apply cell filtering on reference data.
- Type:
boolean
- Default:
true
Added in version 1.1.0.
- --filter_gene
Whether to filter genes presented in <10 cells for reference scRNA-seq data and <3 spots for spatial data before differential analysis.
- Type:
boolean
- Default:
true
Added in version 1.1.0.