runDeconvolution

-h, --help

Print help messages.

-v, --version

Print version of SDePER.

General options

-n, --n_cores

Number of CPU cores used for parallel computing.

Type:

integer

Default:

1, i.e. no parallel computing

--threshold

Threshold for hard thresholding the estimated cell type proportions, i.e. for one spot, estimated cell type proportions smaller than this threshold value will be set to 0, then re-normalize all proportions of this spot to sum as 1.

Type:

float

Default:

0, which means no hard thresholding

--use_cvae

Control whether to build Conditional Variational Autoencoder (CVAE) to remove the platform effect between spatial transcriptomics and reference scRNA-seq data (true/false).

Building CVAE requires raw nUMI counts and corresponding cell type annotation of scRNA-seq data specified.

Type:

boolean

Default:

true

Tip

It is recommended to enable CVAE when there is an anticipated presence of platform effect between the spatial transcriptomics and reference scRNA-seq data.

--use_imputation

Control whether to perform imputation (true/false).

Imputation requires the spot diameter (µm) at higher resolution to be specified.

Type:

boolean

Default:

false

--diagnosis

If true, provide more output files related to CVAE building and hyperparameter selection for diagnosis.

Type:

boolean

Default:

false

--verbose

Control whether to print more info such as output of each ADMM iteration step during program running (true/false).

Type:

boolean

Default:

true

Cell type marker identification options

Changed in version 1.1.0: Cell-type specific markers are identified by Differential analysis (DE) across cell-types in reference scRNA-seq data. We also perform cell and/or gene filtering before DE. Each time we ONLY compare the normalized gene expression (raw nUMI counts divided by sequencing depth) one cell-type (1st) vs another one cell-type (2nd) using Wilcoxon Rank Sum Test, then take the UNION of all identified markers for downstream analysis.

Before version 1.1.0, for each comparison genes with a FDR adjusted p value < 0.05 will be selected first, then these marker genes will be sorted by a combined rank of log fold change and pct.1/pct.2, and finally pick up specified number of genes with TOP ranks.

In version 1.1.0, the ranking strategy has been revised. Now we filter the marker genes with pre-set thresholds of p value (or FDR), fold change, pct.1 (percentage of cells expressed this marker in 1st cell-type) and pct.2 (percentage of cells expressed this marker in 2nd cell-type). Next we sort the marker genes by p value (or FDR) or fold change, and select the TOP ones.

--n_marker_per_cmp

Number of selected TOP marker genes for each comparison of ONE cell-type against another ONE cell-type using Wilcoxon Rank Sum Test. For each comparison, genes passing filtering will be selected first, then these marker genes will be sorted by fold change or p value (or FDR), and finally pick up specified number of genes with TOP ranks. If the number of available genes is less than the specified number, a WARNING will be shown in the program running log file.

Type:

integer

Default:

20

Changed in version 1.2.1: Default value changed from 30 to 20.

--use_fdr

Whether to use FDR adjusted p value for filtering and sorting. If true use FDR adjusted p value; if false orginal p value will be used instead.

Type:

boolean

Default:

true

Added in version 1.1.0.

--p_val_cutoff

Threshold of p value (or FDR if --use_fdr is true) in marker genes filtering. By default only genes with p value (or FDR if --use_fdr is true) <= 0.05 will be kept.

Type:

float

Default:

0.05

Added in version 1.1.0.

--fc_cutoff

Threshold of fold change (without log transform!) in marker genes filtering. By default only genes with fold change >= 1.2 will be kept.

Type:

float

Default:

1.2

Added in version 1.1.0.

--pct1_cutoff

Threshold of pct.1 (percentage of cells expressed this marker in 1st cell-type) in marker genes filtering. By default only genes with pct.1 >= 0.3 will be kept.

Type:

float

Default:

0.3

Added in version 1.1.0.

--pct2_cutoff

Threshold of pct.2 (percentage of cells expressed this marker in 2nd cell-type) in marker genes filtering. By default only genes with pct.2 <= 0.1 will be kept.

Type:

float

Default:

0.1

Added in version 1.1.0.

--sortby_fc

Whether to sort marker genes by fold change. If true sort marker genes by fold change then select TOP ones. If false, p value (or FDR if --use_fdr is true) will be used to sort marker genes instead.

Type:

boolean

Default:

true

Added in version 1.1.0.

--filter_cell

Whether to filter cells with <200 genes for reference scRNA-seq data before differential analysis. NOTE we only apply cell filtering on reference data.

Type:

boolean

Default:

true

Added in version 1.1.0.

--filter_gene

Whether to filter genes presented in <10 cells for reference scRNA-seq data and <3 spots for spatial data before differential analysis.

Type:

boolean

Default:

true

Added in version 1.1.0.