VeriVote

VeriVote Indicator Protocol — Evaluator Guide v2.0

Scoring consistently, transparently,
and defensibly.

This guide ensures every indicator is scored using only verifiable public sources. Read it alongside the VeriVote Indicator Protocol v2.0. This guide covers how to apply those definitions in practice — not restate them.

Version 2.0 • March 2026 • 7 sections For VeriVote evaluators and data integrity team

Purpose: This guide ensures every indicator is scored consistently, transparently, and defensibly using only verifiable public sources. It is to be read alongside the VeriVote Indicator Protocol v2.0, which is the canonical reference for all 80 indicator definitions, measurement rules, and scoring band criteria.

1. General Scoring Rules

The following rules apply to every indicator across all five categories and must be applied consistently by all evaluators.

1.1 The Scoring Scale

Every indicator is scored on a 0–25 point scale. The five standardised performance bands are:

Band Performance Label General Criteria
19–25 Excellent performance Proactive compliance; exemplary, fully documented record with no significant gaps.
13–18 Strong performance Consistent evidence with only minor, resolved gaps.
7–12 Moderate performance Compliance mixed with neutral or weak signals; some gaps present.
1–6 Weak performance Documented lapses, insufficient evidence, or notable failures on record.
0 No evidence / serious violation No verifiable evidence provided, active conviction, or confirmed serious violation.

1.2 Evidence Requirements

Eligible evidence: Must be public, attributable, timestamped, and sourced from official or credible institutions (EACC, KRA, OAG, Hansard, official gazettes, verified media fact-checkers including Africa Check and PesaCheck, etc.).
Self-reported claims: Score 0 unless independently verified by a recognised institution or credible third party.
No controversies / no applicable events: Do not penalise absence. If there are no recorded events against which an indicator can be measured, award a neutral-to-high score unless evidence of avoidance is documented.
Refused or withheld data: If a candidate or institution actively refuses to provide data that is reasonably available and request-able, treat this as Score 0. This is distinct from structural inapplicability (see N/A Rule below). Refusal must be documented in the audit trail.
Composite indices: For indicators that aggregate multiple sub-signals, sub-signal weights are visible to evaluators in the Indicator Protocol. The final aggregated score counts as 25% of the subcategory like any other indicator.

1.3 Proxy Allowance — Private Sector Records

Where a candidate has no public office history for a given indicator, a private-sector equivalent may be used as a proxy, subject to all four of the following conditions:

# Condition
(a) The proxy record has been independently audited by a recognised professional body (ICPAK, LSK, CMA, or equivalent).
(b) The proxy measures the same underlying construct as the primary indicator (e.g., a corporate audit as a proxy for an OAG audit; ICPAK disciplinary clearance as a proxy for EACC clearance).
(c) The proxy is scored on the same 0–25 scale using the same band thresholds as the primary indicator.
(d) The proxy does NOT apply to Category 1 constitutional-specific indicators where the constitutional source is an official government body only (EACC, KRA, Judiciary). These require primary evidence.
Audit trail requirement: The evaluator must document the original indicator, the proxy document used, and the explicit basis for equivalence. Proxy use must be flagged in the score record.

1.4 N/A Rule — Structural Inapplicability

When N/A applies: An indicator is marked N/A only when it is structurally inapplicable to the candidate — i.e., the indicator presupposes a type of prior role or office the candidate has never held, and no proxy evidence exists.

Redistribution: An N/A score redistributes that indicator's 25-point weight equally across the other three indicators in the same subcategory. Each remaining indicator's maximum therefore increases by 8.33 points.

N/A vs. Refused: N/A is for structural impossibility (the candidate could not have this record). Refused/withheld is for deliberate non-disclosure of evidence that could exist. These must never be conflated.

1.5 Conditional Indicators

Five indicators are conditional — they only apply if the candidate meets the stated prior-role threshold. If the condition is not met, the indicator is automatically marked N/A and the redistribution rule applies.

ID Indicator Name Condition for Applicability
1.3.3 Legislative or Executive Decision Consistency Record Candidate must have held prior legislative or executive office.
2.3.1 Audit Clean Opinion Certification Candidate must have held a budget-responsible public office.
2.3.4 Budget Variance Management and Waste Prevention Index Candidate must have had direct budget oversight responsibility.
5.1.1 Tenure Completion Rate Candidate must have held any prior elected or appointed public office.
5.1.3 Crisis Management History and Success Rate Candidate must have held an executive, managerial, or command-level office.

1.6 Demagogue Penalty — Category 3 Negative Trigger

Category 3 (Public Engagement) indicators are subject to a deduction if verified negative triggers are present. These are applied at subcategory level before the final category score is computed.

Hate Speech / Incitement: A verified NCIC formal finding triggers a deduction of up to 10 points from the affected subcategory score. Each finding requires a source reference in the audit trail.

Disinformation: A verified Africa Check, PesaCheck, or VeriVote platform flagged-statement finding triggers a deduction of up to 5 points per verified pattern. Each deduction requires a source reference.

Purpose: This prevents high engagement volume from masking Chapter Six violations. A candidate cannot offset an NCIC finding with high attendance at public forums.


2. The Indicator Protocol Template

Every one of the 80 indicators in the VeriVote Indicator Protocol v2.0 is specified using the following nine-field structure. Evaluators must populate each field for every score they record.

Field Content and Evaluator Obligation
1. Indicator Name The canonical name as defined in Protocol v2. Do not paraphrase or abbreviate.
2. Constitutional Anchor The specific Article(s) of the Kenya Constitution that ground this indicator. Cited per indicator in Protocol v2.
3. Definition The precise scope of what is being measured. Determines what counts as evidence and what does not.
4. Primary Evidence Sources The named institutions, document types, and platforms from which evidence must be drawn. Evaluators must cite a source within this list or justify any departure.
5. Measurement Method The formula, rate, ratio, or procedure used to convert raw evidence into a score. For composite indices, sub-signal weights are stated here.
6. Scoring Guidance (bands) The five-band criteria (Excellent → No evidence/violation) specifying what evidence is required at each level. Evaluators must match the candidate's evidence to the correct band.
7. Bias / Consistency Controls Rules that prevent evaluator discretion from distorting scores — e.g., minimum source requirements, what counts as corroboration, excluded evidence types.
8. Evidence Type Classification of the evidence used, drawn from the six-type schema defined in Section 3 of this guide.
9. Verification Level The 1–5 verification tier assigned to the evidence used, as defined in Section 3 of this guide.

Where to find indicator definitions

The complete specification for all 80 indicators — including definition, evidence sources, measurement method, scoring bands, and bias controls — is contained in the VeriVote Indicator Protocol v2.0.

This Evaluator Guide does not replicate individual indicator definitions. Evaluators must keep a copy of Protocol v2.0 open when scoring. Any conflict between this guide and Protocol v2.0 is resolved in favour of Protocol v2.0.


3. Evidence Classification Schema

Every score submitted must include an Evidence Type and a Verification Level. These are stored in the database and displayed to voters on candidate scorecards.

3.1 Evidence Types

Assign the single most accurate Evidence Type from the six categories below:

Evidence Type Description Kenyan Examples
Official Institutional Records Direct records from a government body with legal authority to maintain them. Court judgments, EACC clearance certificates, KRA compliance certificates, official gazettes, ODPP filings.
Independent Oversight Reports Reports produced by independent bodies mandated to audit or oversee public institutions. OAG Auditor-General reports, PPRA procurement audits, CAJ decisions, Ombudsman reports.
Legislative / Government Records Parliamentary or executive records documenting decisions, debates, and proceedings. Hansard transcripts, committee minutes, executive orders, presidential/governor's assent records.
Verified Public Communications Official statements, manifestos, or disclosures published through verifiable institutional channels. Candidate's official website (archived), IEBC-registered manifesto, official press releases, government portal announcements.
Independent Data Sources Evidence produced by credible independent organisations with no stake in the candidate's outcome. Africa Check fact-check reports, PesaCheck analyses, KNBS statistical publications, academic peer-reviewed studies.
Structured Surveys Quantitative data from formally designed surveys with documented methodology. Use sparingly; only where no primary source exists. Afrobarometer, IPSOS Kenya polling with published methodology, CAJ perception surveys.

3.2 Verification Levels

Assign the single Verification Level that best reflects the independence and authority of the primary evidence used for a given score:

Level Name Standard and Examples
Level 1 Official Verified Evidence Highest tier. Direct records from a government body or court with legal authority. Examples: court judgments, EACC certificates, KRA compliance records, gazette notices.
Level 2 Independent Institutional Evidence Third-party institutional reports from bodies mandated to audit or oversee. Examples: OAG audit reports, EACC clearance reports, PPRA findings, CAJ decisions.
Level 3 Documented Public Evidence Officially published records in the public domain. Examples: Hansard transcripts, gazetted notices, registered manifesto documents, official press releases on institutional portals.
Level 4 Analytical or Survey Evidence Aggregated or modelled evidence. Used sparingly and only where Level 1–3 evidence is unavailable. Examples: fact-check aggregates, perception survey results with documented methodology.
Level 5 Indirect / Proxy Evidence Lowest tier. Inferred or equivalent evidence where no primary source exists. Must document the basis for equivalence. Examples: private-sector audit standing in for OAG audit; corporate disciplinary clearance as EACC proxy.

3.3 UI Translation for Voter Scorecards

The evidence classifications above are translated into plain-language confidence labels for voters on the VeriVote platform:

Confidence Label Verification Level(s) Meaning for Voter
Evidence Confidence: High Levels 1–2 Score is grounded in official government or independent oversight records. Highest reliability.
Evidence Confidence: Moderate Level 3 Score is grounded in documented public records. Reliable but not from a primary government source.
Evidence Confidence: Limited Levels 4–5 Score relies on analytical, survey, or proxy evidence. Confidence is Limited — treat with appropriate caution and review the audit trail.

3.4 Database Schema — Required Columns per Score Record

Every score record in the VeriVote database must include the following fields in addition to the raw score:

Column Data Type Description
evidence_type ENUM / string One of the six Evidence Types defined in Section 3.1.
verification_level INT (1–5) Verification tier assigned per Section 3.2.
source_institution String Named institution that produced the primary evidence (e.g., "OAG", "EACC", "Africa Check").
source_document_ref String / URL Document title, file reference, or stable URL linking to the evidence. Screenshots must be archived.
score_date DATE Date the score was assigned or last verified.
evaluator_notes Text Brief justification for the band assigned, including any proxy use, N/A declaration, or refusal flag.
safeguard_triggered BOOLEAN Set to TRUE if the Integrity Veto (Step 5) or Category Floor Rule (Step 6) was applied to produce the candidate's final score. The UI reads this flag to determine whether to display a safeguard banner adjacent to the score.

4. Audit Trail Requirements

Every score must be defensible. The audit trail is the mechanism that makes a score disputable, correctable, and transparent to voters. These requirements are mandatory, not advisory.

Source links or screenshots: Every score must include either a stable URL to the primary evidence document, a document reference number, or an archived screenshot. Ephemeral links (e.g., social media posts without archiving) are not acceptable as standalone sources.
Brief justification: The evaluator must record, in plain language, why the evidence warrants the band selected. One to three sentences is sufficient. The justification must address the specific band threshold, not just describe the evidence.
Composite index documentation: For indicators that use sub-signals (Calculated Fields), the evaluator must record the score awarded to each sub-signal, not just the final aggregated score. This enables independent verification of the composite.
Proxy documentation: If a private-sector proxy was used (Section 1.3), the evaluator must state: the original indicator, the proxy document, the institution that audited it, and the explicit basis for equivalence.
N/A declaration: If an indicator is marked N/A, the evaluator must state why the indicator is structurally inapplicable and confirm that no proxy evidence was available or eligible.
Refusal flag: If a candidate or institution refused to provide evidence, the evaluator must document the date and method of the request, the party from whom evidence was sought, and the nature of the refusal or non-response.

Audit Trail Retention

All audit trail records must be retained for a minimum of five years from the date of scoring, or for the duration of the elected term plus two years, whichever is longer.

Records must be stored in a format accessible to the VeriVote compliance team for independent review. Evaluators cannot unilaterally delete or amend a submitted audit trail entry. Corrections must be logged as a new entry with reference to the original.


5. Newcomer Scoring Rules

Candidates without prior public office history are not penalised by design. The following rules govern how evaluators handle indicators where no public record can exist.

5.1 Category 2 — Institutional Accountability (Pillar-Level Threshold)

Category 2 relies heavily on public office records. For newcomers, a four-stage process applies:

Stage Name Evaluator Action
Stage 1 Proxy Mapping Search for private-sector equivalents for each null indicator (e.g., ICPAK-audited corporate accounts as proxy for OAG audit). Score eligible proxies using the same band thresholds.
Stage 2 Indicator Nullification Mark remaining indicators NULL if no proxy exists after Stage 1.
Stage 3 Threshold Evaluation Count NULL indicators in Category 2. If NULL count ≤ 75%: score the pillar normally; remaining indicators fill the category proportionally.
Stage 4 Structural Redistribution If NULL count exceeds 75%: zero Category 2 and redistribute its 25% weight to the remaining four categories using the standard matrix (see How We Calculate Scores v2, Section 4).

5.2 Category 3 — Public Engagement (Newcomer Proxy)

Candidates without a constituency office are scored using leadership proxies on the same 0–25 scale:

  • Community service records (verified by a registered NGO or community organisation)
  • Pro-bono work (verified by a professional body such as LSK)
  • NGO board engagement (verified by NGO Coordination Board registration records)
  • Professional body participation (verified by the relevant professional body)

Evaluators must document which proxy type was used and confirm independent verification of the evidence.

5.3 Category 4 — Policy Development (Newcomer Proxy)

Candidates without a parliamentary record may be scored using the following policy proxy types, each on the same 0–25 scale:

  • White Papers / Publications: Peer-reviewed articles, policy reports, or academic publications authored by the candidate.
  • Organisational Strategy Records: Strategic plans, restructuring frameworks, or growth strategies the candidate authored or led in a corporate, NGO, or institutional role.
  • Advocacy Frameworks: Formal legislative petitions, policy memos, or submissions to government bodies authored by the candidate.
  • Professional Thought Leadership: Expert panel participation or published governance commentary, verified by the convening body or publication.

All proxies must be independently verifiable. Evaluators must identify the original indicator, the proxy document, and the basis for equivalence in the audit trail.


6. Worked Examples

The following five examples illustrate correct application of the evaluation template. One example is drawn from each category. Each shows the full nine-field structure as an evaluator would complete it. For the indicator's canonical definition and measurement rule, refer to the corresponding entry in Protocol v2.0.

Indicator ID 1.4.4
Constitutional Anchor Art. 73(1)(a), 10(2)(a)
Definition Evidence of publicly recognising and formally rectifying errors, misstatements, or ethical lapses. A Documented Retraction is a formal, publicly traceable correction of a specific previously identified false or erroneous statement. Informal social media posts without an official correction notice do not qualify.
Primary Evidence Sources Africa Check / PesaCheck retraction logs; VeriVote platform flagged-statement log; official press releases; candidate's institutional website (archived); EACC resolutions.
Measurement Method Rate = Documented Retractions / Identified Major Errors or Controversies. Normalise by public exposure level (high-profile candidates face more scrutiny; adjust threshold accordingly).
Scoring Guidance Excellent (19–25): Timely formal retraction for every identified major error; proactive correction with documented follow-up action. Strong (13–18): Formal retraction for most errors; minor delays or one unaddressed minor error. Moderate (7–12): Partial or delayed retractions; some errors publicly identified but not formally addressed. Weak (1–6): Few or no formal retractions despite multiple identified errors. No evidence / violation (0): No corrections despite documented controversies; or active misrepresentation.
Bias / Consistency Controls Require ≥1 verifiable public record per correction claimed. Neutral score if no controversies are on record — do not penalise absence of errors. Exclude informal social media acknowledgements that lack an official correction notice.
Evidence Type Independent Data Sources (Africa Check / PesaCheck) — or Official Institutional Records if sourced from EACC resolutions.
Verification Level Level 2 (Independent Institutional Evidence) if from Africa Check / PesaCheck. Level 1 if from EACC resolutions.

Indicator ID 2.4.3
Constitutional Anchor Art. 232(1)(e), 10(2)(c)
Definition Effectiveness of institutional systems for complaints, oversight, and enforcement. Scored as a Calculated Field using four equally weighted sub-signals.
Primary Evidence Sources OAG audit reports; oversight body recommendations and implementation status records; public complaint logs; CAJ decisions; disciplinary records.
Measurement Method Weighted average of four sub-signals: (30%) complaint tracking system existence and resolution rate; (30%) oversight body recommendation follow-up compliance rate; (20%) public reporting transparency score; (20%) documented enforcement evidence. Final score = weighted average normalised to 0–25.
Scoring Guidance Excellent (19–25): All four sub-signals fully operational with documented evidence; enforcement actions on record. Strong (13–18): Three sub-signals strong; one partial. Moderate (7–12): At least two sub-signals present; partial compliance. Weak (1–6): Minimal or non-functional systems. No evidence / violation (0): No accountability mechanisms in place or documented obstruction of oversight.
Bias / Consistency Controls Use only official reports and verified logs; exclude perception surveys. Document each sub-signal score separately. If this indicator is N/A (candidate has no public institutional history), apply proxy allowance rules from Section 1.3.
Evidence Type Independent Oversight Reports (OAG, CAJ) for primary; Official Institutional Records for complaint logs.
Verification Level Level 2 for OAG / CAJ reports. Level 1 if sourced from official institutional complaint registries.

Indicator ID 3.1.1
Constitutional Anchor Art. 10(2)(a), 38(3)
Definition Frequency and geographic reach of documented two-way public engagement sessions. Office Open Days are the primary session type. Campaign Rallies are explicitly excluded (one-way communication, not two-way dialogue).
Primary Evidence Sources Official constituency office logs; georeferenced event records with attendance registers; meeting minutes; verified media coverage with timestamps; ward/sub-county administration records.
Measurement Method Score = (Frequency Score × 0.5) + (Geographic Reach Score × 0.5). Frequency benchmarks are role-specific: MCAs = monthly; Senators, Women Reps, Governors = quarterly; President = national coverage. Geographic Reach Rule: >75% of applicable sub-units covered = 25 pts maximum; <30% covered = 12 pts maximum.
Scoring Guidance Excellent (19–25): Meets or exceeds frequency benchmark; Office Open Days documented in >75% of applicable sub-units. Strong (13–18): Meets benchmark in most periods; coverage 50–74% of sub-units. Moderate (7–12): Below benchmark frequency; coverage 30–49% of sub-units. Weak (1–6): Rare Office Open Days; coverage <30% of sub-units. No evidence / violation (0): No documented Office Open Days.
Bias / Consistency Controls Require attendance register or official minutes per session. Campaign rally records must be excluded. Newcomer proxy: community leadership forums, civic society facilitation, or professional body engagement — scored on same scale with geographic reach adjusted to the candidate's relevant professional domain.
Evidence Type Official Institutional Records (ward/sub-county administration logs); Verified Public Communications (official office schedules).
Verification Level Level 1 for ward/sub-county administration records. Level 3 for verified media coverage with timestamps.

Indicator ID 4.4.2
Constitutional Anchor Art. 232(1)(d), 35(3)
Definition The extent to which a candidate's policy proposals or manifesto cite verifiable evidence sources for major claims. Applies the Feasibility Multiplier: Vague (0.0) / Specific (0.8) / Costed (1.0).
Primary Evidence Sources Published manifesto (IEBC-registered copy); bills or legislative proposals (Hansard); executive orders; official policy documents with date and authorship verified.
Measurement Method Citation Rate = (Major claims with cited, verifiable sources) / (Total major claims). Apply Feasibility Multiplier per claim: 0.0 = vague/no source; 0.8 = specific mechanism named; 1.0 = full cost-benefit breakdown with verified revenue source. Final score normalised to 0–25.
Scoring Guidance Excellent (19–25): >80% of major claims cite verifiable sources; majority are Costed (1.0). Strong (13–18): 60–80% of claims cited; mix of Specific and Costed. Moderate (7–12): 40–59% cited; mostly Specific (0.8). Weak (1–6): <40% cited; mostly Vague (0.0). No evidence / violation (0): No citations; or document cannot be authenticated.
Bias / Consistency Controls Document-based only; evaluate each major claim independently. Binary per claim (cited or not). Newcomer proxy: white papers, published policy commentary, or advocacy framework submissions — scored on same scale.
Evidence Type Verified Public Communications (IEBC-registered manifesto); Legislative / Government Records (Hansard for bills).
Verification Level Level 3 for IEBC-registered manifesto. Level 1 for Hansard-recorded legislative proposals.

Indicator ID 5.1.4
Constitutional Anchor Art. 73(1), 232(1)(h)
Definition Degree of confidence demonstrated by credible stakeholders in the candidate's professional leadership. Measured as a Calculated Field using four equally weighted signals.
Primary Evidence Sources Signal 1: Senior staff retention rate (HR records / official org charts). Signal 2: Documented organisational endorsements (formal letters, joint statements from registered bodies). Signal 3: Verified staff turnover records (official HR or payroll records). Signal 4: Independent leadership evaluations (published 360-degree reviews, board evaluations, or professional body assessments).
Measurement Method Final Index = (S1 + S2 + S3 + S4) ÷ 4. Each signal scored 0–25 independently before averaging. Exclude partisan endorsements and anonymous testimonials. Only attributable, public sources count.
Scoring Guidance Excellent (19–25): Strong, documented signals across all four; no credible condemnations on record. Strong (13–18): Three signals strong; one moderate. Moderate (7–12): Mixed signals; some credible endorsements, some neutral. Weak (1–6): Few endorsements; documented credible criticism. No evidence / violation (0): Widespread credible condemnation or no verifiable signals.
Bias / Consistency Controls Exclude political party endorsements and endorsements from direct family members. Require each endorsement to be attributable to a named, registered organisation. Document each of the four signal scores separately in the audit trail.
Evidence Type Official Institutional Records (HR / org chart records for S1 and S3); Verified Public Communications (endorsement letters for S2); Independent Data Sources (published leadership evaluations for S4).
Verification Level Level 1 for HR records. Level 2 for published independent evaluations. Level 3 for formal endorsement letters from registered organisations.

7. VeriVote Document Hierarchy

This guide operates within a suite of VeriVote documents. Evaluators should understand how they relate:

Document Version Evaluator Relevance
VeriVote Indicator Protocol v2.0 v2.0 (current) PRIMARY REFERENCE. Contains all 80 indicator definitions, measurement rules, and scoring bands. Keep open when scoring.
How We Calculate Scores v2.0 (current) Explains how individual indicator scores combine into the final Suitability Index. Includes Trust Coefficients, redistribution logic, and score range interpretation.
Master Technical Dictionary & Governance Core v2.2 (current) Defines all technical terms used across the VeriVote framework. First point of reference for any term or concept requiring clarification.
VeriVote Indicator Protocol — Evaluator Guide v2.0 (current) This document. Operational instructions for evaluators. Does not restate indicator definitions.

Conflict Resolution

In the event of any conflict between this Evaluator Guide and the VeriVote Indicator Protocol v2.0, the Protocol takes precedence on all matters of indicator definition, measurement method, and scoring band criteria.

This guide takes precedence on matters of evaluator procedure: how to document evidence, how to declare N/A, how to apply the proxy allowance, and how to record the audit trail.

VeriVote: Empowering the Electorate through Data-Driven Accountability.

VeriVote Indicator Protocol — Evaluator Guide v2.0 • Updated March 2026

We value your privacy

VeriVote uses strictly necessary cookies (session, CSRF protection) to keep the platform running. We also use optional analytics cookies to understand how visitors use the site. Privacy Policy