SPACER

Heuristic Quality

Sequence composition scoring combining GC content, homopolymer runs, and PFS preference into a single normalized component.

Overview

The heuristic quality component combines three sub-scores — GC content quality, homopolymer penalty, and PFS (Protospacer Flanking Sequence) preference — into a single [0, 1] value. It carries a default weight of 0.10 in the assay score and is always active (it requires no optional pipeline stages).

Enzyme-Specific Sub-Weights

The three sub-scores are combined with enzyme-specific weights:

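| Enzyme | GC Weight | Homopolymer Weight | PFS Weight | Formula |
|--------|-----------|--------------------|------------|---------|
| Cas12 | 0.60 | 0.40 | not used | 0.60 × gc + 0.40 × homo |
| Cas13 | 0.50 | 0.30 | 0.20 | 0.50 × gc + 0.30 × homo + 0.20 × pfs |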

Cas12 does not use PFS because it relies on PAM recognition rather than protospacer flanking sequences. Cas13 variants may have PFS preferences that affect guide efficacy.
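
As a minimal sketch of how the weighted combination could be computed, the following uses the sub-weights from the table above. The names `SUB_WEIGHTS` and `heuristic_quality` are hypothetical and are not taken from the SPACER codebase; each sub-score is assumed to already lie in [0, 1].

```python
# Sub-weights copied from the table above. The dictionary layout and the
# function name are illustrative assumptions, not SPACER's actual API.
SUB_WEIGHTS = {
    "cas12": {"gc": 0.60, "homo": 0.40, "pfs": 0.00},  # PFS is not used for Cas12
    "cas13": {"gc": 0.50, "homo": 0.30, "pfs": 0.20},
}


def heuristic_quality(enzyme: str, gc: float, homo: float, pfs: float = 0.0) -> float:
    """Combine the sub-scores (each in [0, 1]) into one [0, 1] value."""
    w = SUB_WEIGHTS[enzyme.lower()]
    return w["gc"] * gc + w["homo"] * homo + w["pfs"] * pfs


if __name__ == "__main__":
    # Cas12 guide with optimal GC and a homopolymer sub-score of 0.5
    print(round(heuristic_quality("cas12", gc=1.0, homo=0.5), 3))
    # Cas13 guide with the same sub-scores and a PFS sub-score of 0.5
    print(round(heuristic_quality("cas13", gc=1.0, homo=0.5, pfs=0.5), 3))
```

Because the weights in each row sum to 1.0, the combined value stays in [0, 1] whenever the sub-scores do.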

GC Content Quality

GC content is scored with a piecewise linear function. The optimal range is 40–60%, which scores 1.0. Outside this range, the score ramps linearly down to 0.0, reaching it at 20% on the low side and 80% on the high side:

| GC Range | Score | Formula |
|----------|-------|---------|
| 40–60% | 1.0 | Optimal, no penalty |
| 20–40% | 0.0 → 1.0 (linear ramp) | (gc − 0.20) / 0.20 |
| 60–80% | 1.0 → 0.0 (linear ramp) | (0.80 − gc) / 0.20 |
| ≤20% or ≥80% | 0.0 | Extreme GC, maximum penalty |

Quality flags are raised at the boundaries: LowGc when GC < 40%, HighGc when GC > 60%.
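
The piecewise function and the flag thresholds can be sketched as follows. This is an illustration only: the function names are hypothetical, GC is expressed as a fraction (0.0 to 1.0) as in the formulas above, and only the flag names `LowGc` and `HighGc` come from this page.

```python
def gc_fraction(spacer: str) -> float:
    """Fraction of G and C bases in the spacer (case-insensitive)."""
    seq = spacer.upper()
    if not seq:
        raise ValueError("spacer must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_quality(gc: float) -> float:
    """Piecewise linear GC score: 1.0 in [0.40, 0.60], 0.0 at or beyond 0.20/0.80."""
    if gc <= 0.20 or gc >= 0.80:
        return 0.0
    if gc < 0.40:
        return (gc - 0.20) / 0.20  # rising ramp on the low side
    if gc > 0.60:
        return (0.80 - gc) / 0.20  # falling ramp on the high side
    return 1.0


def gc_flags(gc: float) -> list[str]:
    """Flags raised outside the optimal range, per the thresholds above."""
    flags = []
    if gc < 0.40:
        flags.append("LowGc")
    if gc > 0.60:
        flags.append("HighGc")
    return flags


if __name__ == "__main__":
    gc = gc_fraction("ATATATATGCATATATATAT")  # 2 of 20 bases are G or C
    print(gc, gc_quality(gc), gc_flags(gc))
```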

Homopolymer Quality

Homopolymer runs (consecutive identical nucleotides) are penalized based on the longest run in the spacer sequence:

| Max Run Length | Score | Notes |
|----------------|-------|-------|
| 0–3 | 1.0 | No penalty; short runs are acceptable |
| 4 | 0.75 | Mild penalty |
| 5 | 0.50 | Moderate penalty; flag raised at ≥5 |
| 6 | 0.25 | Severe penalty |
| ≥7 | 0.0 | Maximum penalty |

For runs of 4–6, the formula is: 1.0 - (max_run - 3.0) / 4.0. This produces a linear ramp from 1.0 (at 3) to 0.0 (at 7). Runs of 7 or more always score 0.0.
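
A short sketch of the run detection and the ramp formula above. The function names are hypothetical; only the formula and thresholds come from this page.

```python
from itertools import groupby


def max_homopolymer_run(spacer: str) -> int:
    """Length of the longest run of identical consecutive nucleotides."""
    return max((len(list(group)) for _, group in groupby(spacer.upper())), default=0)


def homopolymer_quality(max_run: int) -> float:
    """1.0 for runs up to 3, linear ramp for 4-6, 0.0 for runs of 7 or more."""
    if max_run <= 3:
        return 1.0
    if max_run >= 7:
        return 0.0
    return 1.0 - (max_run - 3.0) / 4.0


if __name__ == "__main__":
    spacer = "ACGGGGGTACGTACGTACGT"  # longest run is five consecutive G
    run = max_homopolymer_run(spacer)
    print(run, homopolymer_quality(run))  # prints: 5 0.5
```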

Poly-T/U Stretches

Poly-T stretches (4+ consecutive T/U) are detected and flagged with Critical severity, but they do not contribute to any score component. Poly-T acts as a transcription terminator for RNA Polymerase III (Pol III), which is used to express guide RNAs in vivo. A poly-T stretch in the spacer would cause premature termination of the crRNA transcript, making the guide non-functional in Pol III expression systems.

Warning
Poly-T/U stretches do not affect the heuristic quality score or any other assay score component. They are flagged as a critical quality issue because they prevent crRNA expression via Pol III promoters, but the scoring system treats them as a binary pass/fail filter rather than a graded penalty.
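
As an illustration, a poly-T/U check of this kind can be written as a simple pattern match, separate from any scoring. The name `has_poly_t` is hypothetical, and accepting mixed T/U runs is an assumption of this sketch.

```python
import re

# Four or more consecutive T or U bases, in either case.
# Matching mixed T/U runs is an assumption made for this sketch.
POLY_T = re.compile(r"[TU]{4,}", re.IGNORECASE)


def has_poly_t(spacer: str) -> bool:
    """True if the spacer contains a stretch of 4+ consecutive T/U."""
    return POLY_T.search(spacer) is not None


if __name__ == "__main__":
    print(has_poly_t("ACGTTTTACGGACTGACGTA"))  # True: contains TTTT
    print(has_poly_t("ACGTTTACGGACTGACGTAC"))  # False: longest T run is 3
```

The result is a boolean flag rather than a number, which matches the pass/fail treatment described above.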

PFS Preference (Cas13 Only)

Some Cas13 variants have preferences for the nucleotide immediately flanking the protospacer. The PFS sub-score uses three levels:

| PFS Status | Score | Meaning |
|------------|-------|---------|
| Favorable | 1.0 | Flanking nucleotide matches variant preference |
| Unknown | 0.5 | No flanking context available; neutral assumption |
| Unfavorable | 0.0 | Flanking nucleotide is disfavored by the variant |

Variant-Specific Rules

| Variant | Rule | Favorable | Unfavorable |
|---------|------|-----------|-------------|
| LwaCas13a | Avoid G at 3′ of protospacer | A, C, T/U at 3′ | G at 3′ |
| LbuCas13a | Avoid G at 3′ of protospacer | A, C, T/U at 3′ | G at 3′ |
| PsmCas13b | Avoid C at 5′ of protospacer | A, G, T/U at 5′ | C at 5′ |
| Generic | No PFS requirement | Always favorable | none |

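When flanking context is not available (e.g., the spacer is at the edge of the input sequence), the PFS check defaults to favorable (it passes), so the guide is not rejected on PFS grounds; in the sub-score table above, this Unknown status is assigned the neutral value 0.5. For Cas12 enzymes, PFS is not evaluated and does not contribute to the heuristic quality score.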
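
The variant rules can be sketched as a small lookup. This is a hedged illustration: `PFS_RULES` and `pfs_score` are hypothetical names, the flanking bases are assumed to be passed in as single characters, and a missing flank is scored 0.5 following the Unknown row of the sub-score table.

```python
from typing import Optional

# variant -> (side that is checked, disfavored base); None means no PFS requirement.
# The layout of this table is an assumption made for illustration.
PFS_RULES = {
    "LwaCas13a": ("3prime", "G"),
    "LbuCas13a": ("3prime", "G"),
    "PsmCas13b": ("5prime", "C"),
    "Generic": None,
}


def pfs_score(variant: str,
              flank_5: Optional[str] = None,
              flank_3: Optional[str] = None) -> float:
    """Return 1.0 (favorable), 0.5 (unknown), or 0.0 (unfavorable)."""
    rule = PFS_RULES[variant]  # raises KeyError for variants not listed above
    if rule is None:
        return 1.0  # Generic: always favorable
    side, disfavored = rule
    base = flank_3 if side == "3prime" else flank_5
    if not base:
        return 0.5  # no flanking context available
    return 0.0 if base.upper() == disfavored else 1.0


if __name__ == "__main__":
    print(pfs_score("LwaCas13a", flank_3="G"))  # 0.0
    print(pfs_score("PsmCas13b", flank_5="A"))  # 1.0
    print(pfs_score("LbuCas13a"))               # 0.5
```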

Info
The heuristic quality component provides a baseline sequence composition signal that is always available, even without ML models or structure predictors. For the full scoring breakdown, see the Assay Score overview.

Open-source CRISPR guide RNA design and scoring for Cas12 and Cas13 diagnostic systems.

Developed at Fiocruz Parana (Fundacao Oswaldo Cruz - Parana), Instituto Carlos Chagas

© 2026 SPACER · v0.1.0 · hwalflor (GitHub)