← Back to country index
Synthestat · Latvia · population QA

LV 1:1 population synthesis QA cycle

Country-specific layer for synthetic people in households, dwellings, real building stock where available, hidden-population overlays, and work/school assignment evidence.

Board: synthestat-population-qa · Tenant: synthestat · Country: LV · Task workflow status: ready · Artifact completion: review_bundle_metrics_partial

Ideal-country quality criteria: impossible 1:1 benchmark

This is the common gold-standard benchmark for an ideal country. It is intentionally impossible to fully satisfy: complete success would mean a 1:1 replica of the real population where every person, household, dwelling, attribute, and assignment is exactly represented. The QA page uses it as an asymptote and gap taxonomy, not as a release promise.

Apply this same rubric to this country’s latest run, then report which needs are measured, constrained, modelled, unavailable, or blocked.

NeedUnachievable idealQA evidence we require insteadWhy perfection cannot be achieved
Complete de jure resident coverageEvery real resident represented exactly once in the right country, municipality, small area, household, and dwelling.Synthetic person count equals official population at all enforced geographies; no unexplained duplicate, missing, or out-of-universe people.A true 1:1 resident list is a confidential population register and changes continuously; Synthestat can only match official aggregates and declared source universes.
Complete attribute truthEach synthetic person has the same age, sex, household role, education, occupation, industry, origin, health proxy, income proxy, and lifecycle state as the corresponding real person.Published marginal and cross-tab constraints pass within HARD/FIRM/SOFT tolerances; modelled fields carry uncertainty and measured/constrained/modelled provenance.Official releases do not expose a complete individual joint distribution, and many attributes are survey-derived, lagged, suppressed, or unavailable at fine geography.
Perfect household and family structureEvery household contains the exact real members and relationships, including multi-generation, partnership, child, shared, institutional, and edge-case arrangements.Household totals, household-type distributions, age/sex/role consistency, fertility/child constraints, and structural invariants pass with explicit residuals.Household membership is sensitive microdata; public sources usually expose only aggregate household/family tables and partial cross-tabs.
Exact dwelling and building groundingEvery household is assigned to its real dwelling and building with exact occupancy, vacancy, dwelling type, floor area, tenure, and address-level geography.Dwelling/building capacity checks pass; vacancy/second-home/institutional dwellings are represented or explicitly unavailable; building links have source provenance.Many countries lack open address-level registers; dwelling occupancy is confidential and time-varying.
Complete de facto and hidden-population overlaysHomeless, undocumented, refugees, students away from home, seasonal, institutional, tourists, and daytime populations are all represented with exact location and timing.Overlay layers use interval estimates, source-specific quality flags, and never silently modify de jure HARD constraints.Hidden populations are partly unobserved by definition; ethical/privacy constraints forbid exact person-level labels.
Exact school, workplace, facility, and mobility assignmentEvery person is assigned to the real school, workplace, care provider, commute, and daily activity chain they use.Assignment layers use official registers/OD flows where available; modelled assignments are flagged and validated only against aggregate flows/capacities.Operational assignments are usually protected registers or dynamic behavioural data; Phase 1 must not imply they are known.
Full joint-distribution realismThe full multivariate joint distribution is identical to reality across all attributes, households, geography, and rare subgroups.High-priority marginals/cross-tabs pass; sparse zones and prior-dominated attributes are clearly marked with quality tiers and credible intervals.The joint distribution is non-identifiable from published marginals; IPF/BN/hierarchical pooling choose plausible distributions, not truth.
Zero uncertainty and zero lagAll values are current today and known without error.Every output records reference period, retrieval timestamp, lag, confidence, uncertainty bounds, and degradation decisions.Official statistics are lagged, revised, sampled, suppressed, and harmonized after collection.
Privacy-safe yet maximally detailed releaseThe system releases maximum useful detail while creating zero re-identification risk.Release mode, k-anonymity/cell safeguards, perturbation/aggregation policy, and sensitive-field treatment are explicit.Fine-area synthetic microdata can still create structurally unique records; synthetic does not mean anonymous.
Perfect reproducibility and auditabilityAny user can trace every output record to exact source snapshots, transformations, constraints, relaxations, seeds, and code versions.Run manifests, source provenance, checksums, frozen extracts, seeds, versioned crosswalks, validation reports, and relaxation logs are complete.This is approachable but never final: source portals, classifications, geography, and code keep changing, so audits must be continuously renewed.

Population artifact output status (separate from task status)

Artifact completionRow count sourcePeopleTarget populationNational coverageAbsolute shortfallHouseholdsDwellingsHouses/buildingsMax marginal deviationHARD statusRun
review_bundle_metrics_partialparquet_metadata_review_bundle843,907lv_population_review_national_candidate_2025_csb_freeze_44d91be2_seed420987

This table describes emitted population artifacts only. It is intentionally independent from the Kanban task workflow status below: a country can have all tasks done while its artifact is still only a seeded slice, or a passing review bundle can still lack national target completion. Deviation is the maximum absolute relative error across collected HARD/FIRM/SOFT marginal constraints in the latest review bundle. GUIDE/INFORMATIONAL priors are excluded. National target/coverage are read from build_manifest.json when available and override any visual impression of completion.

Kanban task workflow status (not artifact completion)

ready
1
done
8

These cards count board tasks only. They do not certify that the country-level population artifact is nationally complete or reviewer-approved.

Datasets and distributions

Lists come from the latest run bundle: source_provenance.json, distribution_diagnostics.json, and build_manifest.json.

Summary

Datasets used0
Distributions available0
Constraints/distributions used in synthesis4
Constraint types
Dataset variants
Finest-geography status

Source gaps

  • No source gaps listed.

Datasets used

Dataset/source ID
None listed yet.

Best source by distribution family

Distribution familyDataset/source ID
None listed yet.

Available distributions / priors in registry

SpecLabelTypeGeoStatusVariantConfidenceData URI
None listed yet.

Constraints/distributions used in synthesis manifest

Constraint or distribution ID
LV_CSP_IRD041 HARD all-resident national age-sex
LV_CSP_MVS041 HARD/FIRM household size/count and private-household population
LV_CSP_MVS031 FIRM household type composition
LV_CSP_MAS010 GUIDE dwellings/collective-living historical baseline

Current country tasks

IDTitleAssigneeStatusCreatedLatest summary
t_af44fbbcLV human acceptance decision for national-only PASS candidatesynth-managerready2026-05-19 23:03:47 CEST
t_14da0e6eLV final human/internal review note for national candidate PASSsynth-reviewerdone2026-05-19 22:03:38 CESTProduced LV final human/internal review note for national candidate PASS at /home/synthestat/output/reviews/LV_final_human_internal_review_note_national_candidate_PASS_2026-05-19.md. The note states the +26,783 / 1.4605% private-household-population relaxation, national-only scope, unavailable build
t_093ef8e8LV review: verify household-realism-fixed national candidate bundlesynth-reviewerdone2026-05-19 21:24:00 CESTReviewed LV fixed national candidate bundle at /home/synthestat/output/runs/LV/lv_population_review_national_candidate_2025_csb_freeze_44d91be2_seed420987 and returned PASS for the declared national/private-household candidate scope. Independent audits verified exact 1,860,565 person rows, 843,907 h
t_ed1b43deLV model fix: repair national candidate household realism before internal review PASSsynth-modelerdone2026-05-19 21:03:40 CESTFixed and rebuilt the LV national candidate household model at /home/synthestat/output/runs/LV/lv_population_review_national_candidate_2025_csb_freeze_44d91be2_seed420987. The reviewer blockers now audit to zero minor-only households, zero under-15/no-adult households, zero reference persons under 2
t_b7d00b12LV population QA reviewer: inspect national-scale non-fixture rerunsynth-reviewerdone2026-05-19 20:17:38 CESTReviewed LV national candidate bundle: it is national-scale, not the old 8-person fixture, and exact HARD age-sex/population/household row-count controls are verified. Verdict NEEDS_MODEL_FIX because household/family realism is structurally invalid at scale (343,555 minor-only households; 280,878 un
t_01713ba2LV population QA modeler: national-scale non-fixture synthesis rerunsynth-modelerdone2026-05-19 20:17:37 CESTBuilt LV non-fixture national-scale review candidate at /home/synthestat/output/runs/LV/lv_population_review_national_candidate_2025_csb_freeze_44d91be2_seed420987, replacing the prior 8-person seeded LV slice for review purposes. Candidate matches official 2025 all-resident population (1,860,565) a
t_0b003eeaLV population QA downloader: freeze exact official payloads for national-scale rerunsynth-downloaderdone2026-05-19 20:17:36 CESTFroze/catalogued 36/36 Latvia CSB PxWeb official payloads requested by marginals/distribution researchers for national population QA. Wrote downloader log, catalogue update, latest snapshot, manager update, and an effective raw-freeze catalogue; large-source HTTP 403s were resolved with bounded geog
t_773c68baLV population QA distribution closure: joint priors for non-fixture synthesissynth-distributions-researcherdone2026-05-19 20:17:35 CESTCompleted Latvia distribution-prior source closure for non-fixture population QA. Wrote LV findings and downloader extraction specs, refreshed the distribution evidence board, and appended manager updates; verdict is DISTRIBUTION_READY_FOR_MODEL_FIX with current dwelling occupancy and relationship-l
t_330752b6LV population QA source closure: exact national marginals for non-fixture synthesissynth-marginals-researcherdone2026-05-19 20:17:34 CESTCompleted LV national population marginal/source closure and wrote the required handoffs under /home/synthestat/workspace/manager_handoffs/marginals/. Identified CSB IRD041 2025 population target 1,860,565, CSB MVS031/MVS041 private-household target 843,907, explicit 26,783 non-private-household res

Process

Manager kickoff

synth-manager creates and controls the country loop.

Model build

synth-modeler generates the review bundle: people, households, dwellings/buildings or unavailable markers, overlays, assignments, manifests, residuals, diagnostics, uncertainty, provenance.

Reviewer gate

synth-reviewer audits constraints, marginals, household/family realism, hidden populations, dwelling/building grounding, work/school assignment, uncertainty, provenance, and privacy.

Branch

PASS finalizes; NEEDS_MODEL_FIX routes back to modeler; NEEDS_MORE_SOURCES routes to marginal/distribution researchers then downloader; exhausted evidence/model plateau stops for human decision.

Quality gates and stop conditions