CLI Reference
mokume provides six main commands. In practice, most users start with features2proteins for quantification workflows or tissuemap for tissue atlas workflows.
Use mokume --help or mokume <command> --help for details.
features2proteins
The unified pipeline: features to protein quantification in one step.
mokume features2proteins [OPTIONS]
Required Options
| Option |
Description |
-p/--parquet |
Input parquet file (quantms.io/qpx format) |
-o/--output |
Output protein intensities CSV |
| Option |
Default |
Description |
-s/--sdrf |
none |
SDRF file for sample metadata |
--min-aa |
7 |
Minimum amino acid length |
--min-unique |
2 |
Minimum unique peptides per protein |
--remove-contaminants |
on |
Remove contaminants and decoys |
--keep-contaminants |
off |
Keep contaminants and decoys |
Quantification
| Option |
Default |
Description |
--quant-method |
maxlfq |
Method: maxlfq, directlfq, ibaq, topn, sum, median, ratio |
--fasta |
none |
FASTA file (required for iBAQ) |
--topn |
3 |
N for TopN quantification |
--ion-alignment |
none |
Ion alignment: none or hierarchical |
--directlfq-cores |
auto |
CPU cores for DirectLFQ |
--directlfq-min-nonan |
1 |
Min non-NaN values for DirectLFQ |
Normalization
| Option |
Default |
Description |
--run-normalization |
median |
Run-level: median, mean, max, global, max_min, iqr, none |
--sample-normalization |
globalMedian |
Sample-level: globalMedian, conditionMedian, hierarchical, tmm, none |
--normalization-proteins |
none |
File with protein IDs for normalization |
IRS (Multi-Plex TMT)
| Option |
Default |
Description |
--irs |
off |
Enable IRS normalization |
--irs-reference-samples |
auto |
Comma-separated reference sample names |
--irs-sdrf-column |
auto |
SDRF column for reference detection |
--irs-sdrf-values |
auto |
Values indicating reference samples |
--irs-reference-regex |
pool\|powder\|ref\|reference\|bridge |
Regex for reference auto-detection |
--irs-stat |
median |
Plex reference statistic: median or mean |
--irs-remove-reference |
off |
Remove reference samples from output |
Coverage Filter
| Option |
Default |
Description |
--coverage-threshold |
none |
Min fraction of non-missing values per condition |
Ratio Quantification
| Option |
Default |
Description |
--ratio-fraction-merge |
mean |
Fraction merge strategy: mean or max |
Batch Correction
| Option |
Default |
Description |
--batch-correction |
off |
Enable ComBat batch correction |
--batch-method |
sample_prefix |
Detection method: sample_prefix, run, column |
--batch-column |
none |
SDRF column used when --batch-method=column |
--batch-covariates |
none |
Comma-separated SDRF columns to preserve |
--batch-parametric / --batch-nonparametric |
parametric |
ComBat estimation mode |
--batch-mean-only |
off |
Only adjust batch means |
--batch-ref |
none |
Reference batch ID |
Differential Expression
| Option |
Default |
Description |
--de |
off |
Enable differential expression analysis |
--de-contrasts |
— |
Comma-separated contrasts (e.g., "A vs B,A vs C") |
--de-contrasts-file |
— |
TSV file with columns group1, group2 |
--de-method |
auto |
Method: auto, limrots, deqms, or proda |
--de-log2fc |
0.5 |
Minimum absolute log2 fold change |
--de-fdr |
0.05 |
Maximum FDR threshold |
--de-fdr-method |
bh |
FDR correction: bh or ihw |
--de-output |
auto |
Output file for DE results |
Contrasts must be explicitly provided via --de-contrasts and/or --de-contrasts-file. Both can be combined.
--de-method auto selects deqms for directlfq quantification and limrots for other quantification methods.
Plots & Reports
| Option |
Default |
Description |
--plot-dir |
none |
Output directory for plots |
--plot-volcano |
off |
Generate volcano plots |
--plot-heatmap |
off |
Generate per-contrast DE heatmaps (top 50 by |log2FC|) |
--plot-pca |
off |
Generate PCA plots |
--highlight-genes |
none |
Comma-separated gene names to highlight |
--interactive-report |
off |
Generate interactive HTML QC report |
--report-output |
auto |
Output path for HTML report |
Export
| Option |
Default |
Description |
--export-peptides |
none |
Export normalized peptides to file |
--export-ions |
none |
Export normalized ions (DirectLFQ only) |
features2peptides
Feature-level to peptide-level normalization.
mokume features2peptides [OPTIONS]
Core Options
| Option |
Default |
Description |
-p/--parquet |
required |
Input parquet file |
-s/--sdrf |
none |
SDRF file for metadata |
-o/--output |
required |
Output peptide intensity file |
--min_aa |
7 |
Minimum amino acid length |
--min_unique |
2 |
Minimum unique peptides per protein |
--remove_ids |
none |
File with protein IDs to exclude |
--remove_decoy_contaminants |
off |
Remove decoys and contaminants |
--remove_low_frequency_peptides |
off |
Remove peptides in <20% of samples |
Normalization
| Option |
Default |
Description |
--run-normalization |
median |
Feature normalization: median, mean, max, global, max_min, iqr, none |
--sample-normalization |
globalMedian |
Sample normalization: globalMedian, conditionMedian, hierarchical, tmm, none |
--skip_normalization |
off |
Skip all normalization |
--log2 |
off |
Log2 transform output |
--save_parquet |
off |
Save output as parquet |
TMT / ITRAQ
| Option |
Default |
Description |
--irs_channel |
none |
Explicit pooled/reference channel label |
--irs_autodetect_regex |
none |
Regex to detect pooled samples from SDRF |
--irs_stat |
median |
IRS per-run statistic |
--irs_scope |
global |
IRS scaling scope: global, by_mixture, or two_stage |
--aggregation_level |
sample |
Aggregate at sample or run level |
Filter Configuration
| Option |
Default |
Description |
--filter-config |
none |
YAML/JSON filter configuration file |
--generate-filter-config |
none |
Generate example config and exit |
--filter-min-intensity |
none |
Min intensity threshold (override) |
--filter-cv-threshold |
none |
Max CV across replicates (override) |
--filter-charge-states |
none |
Comma-separated charge states (override) |
--filter-max-missed-cleavages |
none |
Max missed cleavages (override) |
--filter-exclude-modifications |
none |
Comma-separated modifications (override) |
--filter-min-unique-peptides |
none |
Min unique peptides (override) |
--filter-min-features |
none |
Min features per run (override) |
--filter-max-missing-rate |
none |
Max missing rate (override) |
peptides2protein
Protein quantification from normalized peptide data.
mokume peptides2protein [OPTIONS]
| Option |
Default |
Description |
-p/--peptides |
required |
Input peptide intensity file |
-f/--fasta |
none |
FASTA file (required for iBAQ) |
--method |
ibaq |
Method: ibaq, top3, topn, maxlfq, sum, directlfq |
-e/--enzyme |
Trypsin |
Enzyme for in-silico digestion |
-n/--normalize |
off |
Normalize quantification values |
--min_aa |
7 |
Min amino acid length |
--max_aa |
30 |
Max amino acid length |
-t/--tpa |
off |
Calculate TPA (iBAQ only) |
-r/--ruler |
off |
ProteomicRuler (iBAQ only) |
-i/--ploidy |
2 |
Ploidy number |
-m/--organism |
human |
Organism for histone data |
-c/--cpc |
200 |
Cellular protein concentration (g/L) |
--topn_n |
3 |
N for TopN quantification |
--threads |
-1 |
Threads for MaxLFQ (-1 = all cores) |
--min_nonan |
1 |
Min non-NaN for DirectLFQ |
-o/--output |
none |
Output file path |
--verbose |
off |
Print distribution info |
--qc_report |
QCprofile.pdf |
QC report PDF path |
Use --method topn --topn_n 5 or --method topn --topn_n 10 for Top5 or Top10-style quantification. -o/--output is effectively required for ibaq; for the other methods, omitting it prints the result table to stdout.
correct-batches
Standalone batch correction for pre-quantified data.
mokume correct-batches [OPTIONS]
| Option |
Default |
Description |
-f/--folder |
required |
Folder with TSV files |
-p/--pattern |
*ibaq.tsv |
File matching pattern |
-o/--output |
required |
Output file path |
-sid/--sample_id_column |
SampleID |
Sample ID column |
-pid/--protein_id_column |
ProteinName |
Protein ID column |
--ibaq_raw_column |
IBAQ |
Raw intensity column |
--ibaq_corrected_column |
IBAQ_BEC |
Corrected intensity column |
--comment |
# |
Comment character |
--sep |
\t |
Field separator |
--export_anndata |
off |
Export to AnnData h5ad |
tissuemap
Per-dataset tissue proteome atlas analysis from QPX outputs.
mokume tissuemap [OPTIONS]
| Option |
Default |
Description |
--scan-dir |
required unless --generate-config |
Dataset directory or parent directory containing datasets |
--output-dir |
tissuemap_output |
Output directory for results |
--config |
none |
YAML configuration file |
--generate-config |
none |
Generate a default YAML template and exit |
--tmt-dataset |
auto |
Mark one or more dataset IDs as TMT |
--n-jobs |
8 |
Threads for dataset processing and embedding |
--dpi |
250 |
Plot resolution override |
Note
Install the optional dependencies first with pip install mokume[tissuemap].
tsne-visualization
t-SNE dimensionality reduction visualization.
mokume tsne-visualization [OPTIONS]
| Option |
Default |
Description |
-f/--folder |
required |
Folder with protein files |
-o/--pattern |
proteins.tsv |
File matching pattern |