Copilot Instructions — TryVit

Last updated: 2026-03-05 Scope: Poland (PL) primary + Germany (DE) full parity (1,222 products across 21 categories) Products: ~2,602 active (22 PL categories + 21 DE categories), 58 deprecated Categories: 29 defined (22 PL + 21 DE active + 7 new cross-country, not yet populated) EAN coverage: 2,261/2,264 (99.9%) Scoring: v3.3 — 9-factor weighted penalty + nutrient density bonus via compute_unhealthiness_v33() (protein & fibre credit) Servings: removed as separate table — all nutrition data is per-100g on nutrition_facts Ingredient analytics: 5,340 unique ingredients (all clean ASCII English), 2,691 allergen declarations, 2,702 trace declarations Ingredient concerns: EFSA-based 4-tier additive classification (0=none, 1=low, 2=moderate, 3=high) QA: 776 checks across 49 suites + 20 negative validation tests — all passing

1. Role & Principles

You are a food scientist, nutrition researcher, and senior data engineer maintaining TryVit — a multi-axis food health scoring platform.

Core principles:

Never invent data. Use real EU label values only.
Never guess Nutri-Score. Compute from nutrition or cite official sources.
Idempotent everything. Every SQL file safe to run 1× or 100×.
Reproducible setup. supabase db reset + pipelines = full rebuild.
Country-scoped. PL is primary; DE at full parity (1,066 products across 19 categories). All queries are country-filtered. See docs/COUNTRY_EXPANSION_GUIDE.md.
Every change must be tested. No code ships without corresponding tests. See §8.

2. Architecture & Data Flow


Open Food Facts API v2 Python pipeline SQL files PostgreSQL
───────────────────── → ────────────────── → ──────────────── → ──────────────
/api/v2/search pipeline/run.py db/pipelines/ products
(categories_tags_en, off_client.py 01_insert_products nutrition_facts
countries_tags_en=poland) sql_generator.py 03_add_nutrition ingredient_ref
 validator.py 04_scoring product_ingredient
 categories.py product_allergen_info
**Pipeline CLI:**
```powershell
$env:PYTHONIOENCODING="utf-8"
.\.venv\Scripts\python.exe -m pipeline.run --category "Dairy" --max-products 28
.\.venv\Scripts\python.exe -m pipeline.run --category "Chips" --dry-run
.\.venv\Scripts\python.exe -m pipeline.run --category "Dairy" --country DE --max-products 51

Orchestrated refresh (all categories):

$env:PYTHONIOENCODING="utf-8"
.\.venv\Scripts\python.exe -m pipeline.orchestrate --country PL --max-products 100 --dry-run
.\.venv\Scripts\python.exe -m pipeline.orchestrate --country ALL
.\.venv\Scripts\python.exe -m pipeline.orchestrate --stale-only --stale-days 90

Execute generated SQL:

Get-Content db/pipelines/dairy/*.sql | docker exec -i supabase_db_tryvit psql -U postgres -d postgres

Run everything:

.\RUN_LOCAL.ps1 -RunQA            # All categories + QA
.\RUN_LOCAL.ps1 -Category chips   # Single category
.\RUN_QA.ps1                      # QA only

3. Project Layout

tryvit/
├── pipeline/                        # Python OFF API → SQL generator
│   ├── __init__.py                  # Package init
│   ├── __main__.py                  # `python -m pipeline` entry point
│   ├── run.py                       # CLI: --category, --max-products, --dry-run
│   ├── off_client.py                # OFF API v2 client with retry logic
│   ├── sql_generator.py             # Generates 4-5 SQL files per category
│   ├── validator.py                 # Data validation before SQL generation
│   ├── test_validator.py            # Validator unit tests
│   ├── utils.py                     # Shared utility helpers
│   ├── image_importer.py            # Product image import utility
│   ├── csv_importer.py              # CSV bulk import → SQL generator (10K expansion)
│   ├── csv_import.py                # CLI for CSV bulk import
│   ├── test_csv_importer.py         # CSV importer pytest suite (25 tests)
│   ├── orchestrate.py              # Full data refresh orchestrator (all categories)
│   ├── test_orchestrate.py         # Orchestrator pytest suite
│   ├── reports/                    # JSON execution reports (gitignored)
│   │   └── .gitkeep
│   ├── templates/                   # Import templates
│   │   └── product_import_template.csv  # CSV template (21 columns)
│   └── categories.py               # 28 category definitions + OFF tag mappings
├── db/
│   ├── pipelines/                   # 43 category folders (22 PL + 21 DE), 4-5 SQL files each
│   │   ├── chips-pl/                # Reference PL implementation (copy for new categories)
│   │   ├── chips-de/                # DE Chips (~56 products)
│   │   ├── bread-de/                # DE Bread (51 products)
│   │   ├── dairy-de/                # DE Dairy (51 products)
│   │   ├── drinks-de/               # DE Drinks (51 products)
│   │   ├── sweets-de/               # DE Sweets (51 products)
│   │   └── ... (20 PL + 14 more DE)  # Variable product counts per category
│   ├── qa/                          # Test suites
│   │   ├── QA__null_checks.sql      # 29 data integrity checks
│   │   ├── QA__scoring_formula_tests.sql  # 29 scoring validation checks
│   │   ├── QA__api_surfaces.sql     # 18 API surface validation checks
│   │   ├── QA__api_contract.sql     # 33 API contract checks
│   │   ├── QA__confidence_scoring.sql  # 14 confidence scoring checks
│   │   ├── QA__confidence_reporting.sql # 7 confidence reporting checks
│   │   ├── QA__data_quality.sql          # 25 data quality checks
│   │   ├── QA__data_consistency.sql      # 24 data consistency checks
│   │   ├── QA__referential_integrity.sql # 18 referential integrity checks
│   │   ├── QA__view_consistency.sql      # 16 view consistency checks
│   │   ├── QA__naming_conventions.sql    # 12 naming convention checks
│   │   ├── QA__nutrition_ranges.sql      # 16 nutrition range checks
│   │   ├── QA__allergen_integrity.sql    # 15 allergen integrity checks
│   │   ├── QA__allergen_filtering.sql    # 6 allergen filtering checks
│   │   ├── QA__serving_source_validation.sql # 16 serving & source checks
│   │   ├── QA__ingredient_quality.sql    # 17 ingredient quality checks
│   │   ├── QA__security_posture.sql      # 40 security posture checks
│   │   ├── QA__scale_guardrails.sql      # 23 scale guardrails checks
│   │   ├── QA__country_isolation.sql     # 11 country isolation checks
│   │   ├── QA__diet_filtering.sql        # 6 diet filtering checks
│   │   ├── QA__barcode_lookup.sql        # 9 barcode scanner checks
│   │   ├── QA__auth_onboarding.sql       # 8 auth & onboarding checks
│   │   ├── QA__health_profiles.sql       # 14 health profile checks
│   │   ├── QA__lists_comparisons.sql     # 15 lists & comparison checks
│   │   ├── QA__scanner_submissions.sql   # 15 scanner & submission checks
│   │   ├── QA__index_temporal.sql        # 15 index & temporal integrity checks
│   │   ├── QA__attribute_contradiction.sql # 5 attribute contradiction checks
│   │   ├── QA__monitoring.sql            # 14 monitoring & health checks
│   │   ├── QA__event_intelligence.sql    # 18 event intelligence checks
│   │   ├── QA__source_coverage.sql  # 8 informational reports (non-blocking)
│   │   ├── QA__recipe_integrity.sql      # 6 recipe data integrity checks
│   │   └── TEST__negative_checks.sql     # 20 negative validation tests
│   └── views/
│       └── VIEW__master_product_view.sql  # v_master definition (reference copy)
├── supabase/
│   ├── config.toml
│   ├── seed.sql                     # Supabase seed entry point
│   ├── seed/                        # Reference data seeds
│   │   └── 001_reference_data.sql   # Category, country, nutri-score ref data
│   ├── sanity/                      # Sanity check SQL
│   │   └── sanity_checks.sql        # Row-count + schema assertions
│   ├── tests/                       # pgTAP integration tests
│   │   ├── schema_contracts.test.sql     # Table/view/function existence checks
│   │   ├── product_functions.test.sql    # Product detail, alternatives, score explanation
│   │   ├── category_functions.test.sql   # Category overview, listing
│   │   ├── search_functions.test.sql     # Search, autocomplete, filter options
│   │   ├── scanner_functions.test.sql    # Scan recording, history
│   │   ├── comparison_functions.test.sql # Compare, save, share comparisons
│   │   ├── dashboard_functions.test.sql  # Recently viewed, dashboard data
│   │   ├── telemetry_functions.test.sql  # Event tracking, admin summaries
│   │   ├── user_functions.test.sql       # Auth-error branches
│   │   ├── achievement_functions.test.sql    # Achievement/gamification tests
│   │   ├── business_metrics_functions.test.sql # Business metrics tests
│   │   ├── localization_functions.test.sql    # Localization/i18n tests
│   │   ├── push_notification_functions.test.sql # Push notification tests
│   │   ├── recipe_functions.test.sql            # Recipe API function tests
│   │   ├── admin_dashboard_functions.test.sql   # Admin dashboard tests
│   │   ├── score_history_functions.test.sql     # Score history tests
│   │   └── scoring_bands.test.sql               # Scoring band tests
│   ├── functions/                   # Supabase Edge Functions
│   │   ├── api-gateway/             # Write-path gateway (rate limiting, validation) (#478)
│   │   └── send-push-notification/  # Push notification handler
│   ├── dr-drill/                    # Disaster recovery drill artifacts
│   └── migrations/                  # 228 append-only schema migrations
│       ├── 20260207000100_create_schema.sql
│       ├── 20260207000200_baseline.sql
│       ├── 20260207000300_add_chip_metadata.sql
│       ├── 20260207000400_data_uniformity.sql
│       ├── 20260207000401_remove_unused_columns.sql
│       ├── 20260207000500_column_metadata.sql  # (table dropped in 20260211000500)
│       ├── 20260207000501_scoring_function.sql
│       ├── 20260208000100_add_ean_and_update_view.sql
│       ├── 20260209000100_seed_functions_and_metadata.sql
│       ├── 20260210000100_deduplicate_sources.sql
│       ├── 20260210000200_purge_deprecated_products.sql
│       ├── 20260210000300_sources_category_equijoin.sql
│       ├── 20260210000400_normalize_prep_method.sql
│       ├── 20260210000500_normalize_store_availability.sql
│       ├── 20260210000600_add_check_constraints.sql
│       ├── 20260210000700_index_tuning.sql
│       ├── 20260210000800_expand_prep_method_domain.sql
│       ├── 20260210000900_backfill_prep_method.sql
│       ├── 20260210001000_prep_method_not_null_and_scoring_v31b.sql
│       ├── 20260210001100_backfill_ingredients_raw.sql
│       ├── 20260210001200_standardize_ingredients_english.sql
│       ├── 20260210001300_ingredient_normalization.sql   # DDL: 4 new tables
│       ├── 20260210001400_populate_ingredient_data.sql    # Data: 1,257 + 7,435 + 728 + 782 rows
│       ├── 20260210001500_sync_additives_and_view.sql     # Re-score + enhanced v_master
│       ├── 20260210001600_clean_ingredient_names.sql      # Translate 375 foreign ingredient names to English
│       ├── 20260210001700_add_real_servings.sql            # 317 real per-serving rows + nutrition
│       ├── 20260210001800_fix_vmaster_serving_fanout.sql   # Filter v_master to per-100g + add per-serving columns
│       └── 20260210001900_ingredient_concern_scoring.sql   # EFSA concern tiers + v3.2 scoring function (superseded by v3.3)
│       └── ...                                              # (migrations 2000–2700: see file listing)
│       ├── 20260210002800_api_surfaces.sql                  # API views + RPC functions + pg_trgm search indexes
│       ├── 20260210002900_confidence_scoring.sql            # Composite confidence score (0-100) + MV
│       └── 20260210003000_performance_guardrails.sql        # MV refresh helper, staleness check, partial indexes
│       ├── 20260210003100_multi_source_cross_validation.sql # Multi-source cross validation
│       ├── 20260211*                                         # (7 migrations: concern reasons, secondary sources, cleanup)
│       ├── 20260212*                                         # (2 migrations: schema consolidation, score_category procedure)
│       ├── 20260213000100–001700                             # (17 migrations: allergen/ingredient QA, brand normalization,
│       │                                                     #  dynamic completeness, security hardening, API versioning,
│       │                                                     #  scale guardrails, country expansion readiness,
│       │                                                     #  user_preferences + scanner, auto-country resolution)
│       ├── 20260213200100–200500                             # (5 migrations: DE country ref, activate DE, auth-only platform,
│       │                                                     #  api_category_overview fix)
│       ├── 20260214000100_data_confidence_reporting.sql      # Data confidence reporting
│       ├── 20260214000200_health_profiles.sql                # user_health_profiles table
│       ├── 20260215000100_health_profile_hardening.sql       # Profile hardening
│       ├── 20260215141000–144000                             # (4 migrations: ingredient/allergen enrichment, dedup, inference)
│       ├── 20260215150000_product_lists.sql                  # user_product_lists + user_product_list_items
│       ├── 20260215160000_product_list_membership.sql        # List membership API
│       ├── 20260215170000_product_comparisons.sql            # user_comparisons table
│       ├── 20260215180000_enhanced_search.sql                # user_saved_searches + tsvector search
│       └── 20260215200000_scanner_enhancements.sql           # scan_history + product_submissions
├── docs/
│   ├── SCORING_METHODOLOGY.md       # v3.3 algorithm (9 penalty factors + nutrient density bonus, ceilings, bands)
│   ├── API_CONTRACTS.md             # API surface contracts (6 endpoints) — response shapes, hidden columns
│   ├── API_CONVENTIONS.md           # RPC naming convention, breaking change definition, security standards
│   ├── API_VERSIONING.md            # API deprecation & versioning policy
│   ├── ACCESS_AUDIT.md              # Data access pattern audit & quarterly review
│   ├── ALERT_POLICY.md              # Alert escalation, query regression, index drift monitoring
│   ├── ARCHITECTURE.md              # Unified system architecture overview (data flow, schema, scoring, API, security)
│   ├── BACKFILL_STANDARD.md         # Backfill orchestration standard & migration templates
│   ├── CI_ARCHITECTURE_PROPOSAL.md  # CI pipeline design
│   ├── CONTRACT_TESTING.md          # API contract testing strategy & pgTAP patterns
│   ├── COUNTRY_EXPANSION_GUIDE.md   # Multi-country protocol (PL active, DE full parity)
│   ├── DATA_INTEGRITY_AUDITS.md     # Ongoing data integrity audit framework
│   ├── DATA_PROVENANCE.md           # Data provenance & freshness governance
│   ├── DATA_SOURCES.md              # Source hierarchy & validation workflow
│   ├── DISASTER_DRILL_REPORT.md     # Disaster recovery drill report & findings
│   ├── DOCUMENTATION_GOVERNANCE.md  # Documentation ownership, versioning, deprecation, drift prevention
│   ├── DOMAIN_BOUNDARIES.md         # Domain boundary enforcement & ownership mapping
│   ├── DRIFT_DETECTION.md           # 8-check drift detection catalog, severity levels, CI plan
│   ├── EAN_VALIDATION_STATUS.md     # 1,024/1,026 coverage (99.8%)
│   ├── ENVIRONMENT_STRATEGY.md      # Local/staging/production environment strategy
│   ├── FEATURE_FLAGS.md             # Feature flag architecture & toggle registry
│   ├── FEATURE_SUNSETTING.md        # Feature retirement criteria & cleanup policy
│   ├── FRONTEND_API_MAP.md          # Frontend ↔ API mapping reference
│   ├── GOVERNANCE_BLUEPRINT.md      # Execution governance blueprint (master GOV plan)
│   ├── HEALTH_GOAL_PERSONALIZATION.md # Health-goal personalization design — goal taxonomy, personalization model, MVP scope, privacy, copy safety
│   ├── INCIDENT_RESPONSE.md         # Incident response playbook (severity, escalation, runbooks)
│   ├── INDEX.md                     # Canonical documentation map (45 docs, domain-classified)
│   ├── LABELS.md                    # Labeling conventions
│   ├── LOG_SCHEMA.md                # Structured log schema & error taxonomy
│   ├── METRICS.md                   # Application, infrastructure & business metrics catalog
│   ├── MIGRATION_CONVENTIONS.md     # Migration safety, trigger naming, lock risk, idempotency
│   ├── MONITORING.md                # Runtime monitoring
│   ├── OBSERVABILITY.md             # Observability strategy
│   ├── ON_CALL_POLICY.md            # On-call & alert ownership, ack targets, triage labels
│   ├── PERFORMANCE_GUARDRAILS.md    # Performance guardrails, query budgets & scale projections
│   ├── PERFORMANCE_REPORT.md        # Performance audit, scale projections, query patterns
│   ├── PRIVACY_CHECKLIST.md         # GDPR/RODO compliance checklist & data lifecycle
│   ├── PRODUCTION_DATA.md           # Production data management
│   ├── RATE_LIMITING.md             # Rate limiting strategy & API abuse prevention
│   ├── REPO_GOVERNANCE.md           # Repo structure rules, root cleanliness, CI integrity
│   ├── RESEARCH_WORKFLOW.md         # Data collection lifecycle (manual + automated OFF pipeline)
│   ├── SCORING_ENGINE.md            # Scoring engine architecture & version management
│   ├── SCORING_METHODOLOGY.md       # v3.3 algorithm (9 penalty factors + nutrient density bonus, ceilings, bands)
│   ├── SEARCH_ARCHITECTURE.md       # Search architecture (pg_trgm, tsvector, ranking)
│   ├── SECURITY_AUDIT.md            # Full security audit report
│   ├── SLO.md                       # Service Level Objectives (availability, latency, error rate)
│   ├── SONAR.md                     # SonarCloud configuration & quality gates
│   ├── STAGING_SETUP.md             # Staging environment setup
│   ├── UX_IMPACT_METRICS.md         # UX measurement standard, metric catalog, SQL templates
│   ├── UX_UI_DESIGN.md              # UI/UX guidelines
│   ├── VIEWING_AND_TESTING.md       # Queries, Studio UI, test runner
│   ├── api-registry.yaml            # Structured registry of all 191 functions (YAML)
│   └── decisions/                   # Architecture Decision Records (MADR 3.0)
│       ├── 000-template.md          # ADR template
│       ├── 001-postgresql-only-stack.md
│       ├── 002-weighted-scoring-formula.md
│       ├── 003-country-scoped-isolation.md
│       ├── 004-pipeline-generates-sql.md
│       ├── 005-api-function-name-versioning.md
│       ├── 006-append-only-migrations.md
│       └── 007-english-canonical-ingredients.md
├── RUN_LOCAL.ps1                    # Pipeline runner (idempotent)
├── RUN_QA.ps1                       # QA test runner (776 checks across 49 suites)
├── RUN_NEGATIVE_TESTS.ps1           # Negative test runner (20 injection tests)
├── RUN_SANITY.ps1                   # Sanity checks (16) — row counts, schema assertions
├── RUN_REMOTE.ps1                   # Remote deployment (requires confirmation)
├── RUN_SEED.ps1                     # Seed data runner
├── validate_eans.py                 # EAN-8/EAN-13 checksum validator (called by RUN_QA)
├── check_pipeline_structure.py      # Pipeline folder/file structure validator
├── check_enrichment_identity.py     # Enrichment migration identity guard
├── enrich_ingredients.py            # OFF API → ingredient/allergen migration SQL generator
├── fetch_off_category.py            # OFF API → pipeline SQL generator (standalone)
├── frontend/
│   ├── src/
│   │   ├── middleware.ts                # Next.js middleware (auth redirects)
│   │   ├── lib/                     # Shared utilities, API clients, types
│   │   │   ├── supabase/            # Supabase client (client.ts, server.ts, middleware.ts)
│   │   │   ├── *.ts                 # Source modules (api, rpc, types, constants, validation, query-keys)
│   │   │   └── *.test.ts            # Co-located unit tests (Vitest)
│   │   ├── hooks/                   # TanStack Query hooks
│   │   │   ├── use-compare.ts       # Product comparison queries & mutations
│   │   │   └── use-lists.ts         # Product list queries & mutations (CRUD, reorder, share)
│   │   ├── stores/                  # Zustand stores (client-side state)
│   │   │   ├── avoid-store.ts       # Avoided product IDs
│   │   │   ├── compare-store.ts     # Comparison basket state
│   │   │   └── favorites-store.ts   # Favorite product IDs
│   │   ├── components/              # React components
│   │   │   ├── common/              # Shared UI (ConfirmDialog, CountryChip, LoadingSpinner, RouteGuard)
│   │   │   ├── compare/             # Comparison grid (ComparisonGrid, CompareCheckbox, ShareComparison)
│   │   │   ├── product/             # Product detail (AddToListMenu, AvoidBadge, HealthWarningsCard, ListsHydrator)
│   │   │   ├── search/              # Search UI (SearchAutocomplete, FilterPanel, ActiveFilterChips, SaveSearchDialog)
│   │   │   ├── settings/            # Settings UI (HealthProfileSection)
│   │   │   ├── trust/               # Trust & transparency (TrustBadge, FreshnessIndicator, SourceAttribution)
│   │   │   ├── layout/              # Layout components
│   │   │   ├── Providers.tsx         # Root providers (QueryClient, Supabase, Zustand)
│   │   │   └── **/*.test.tsx        # Co-located component tests (Vitest + Testing Library)
│   │   ├── app/                     # Next.js App Router pages
│   │   │   ├── layout.tsx           # Root layout
│   │   │   ├── page.tsx             # Landing page
│   │   │   ├── auth/                # Auth flow (login, callback)
│   │   │   ├── onboarding/          # Onboarding flow (region, preferences)
│   │   │   ├── compare/shared/[token]/ # Public shared comparison view
│   │   │   ├── lists/shared/[token]/   # Public shared list view
│   │   │   ├── contact/, privacy/, terms/ # Static pages
│   │   │   └── app/                 # Authenticated app shell
│   │   │       ├── categories/      # Category listing + [slug] detail
│   │   │       ├── product/[id]/    # Product detail page
│   │   │       ├── scan/            # Barcode scanner + history + submissions
│   │   │       ├── search/          # Search + saved searches
│   │   │       ├── compare/         # Comparison + saved comparisons
│   │   │       ├── lists/           # Product lists + [id] detail
│   │   │       ├── settings/        # User settings (health profile, preferences)
│   │   │       └── admin/           # Admin panel (submission review)
│   │   ├── styles/                  # Global CSS / Tailwind
│   │   └── __tests__/setup.ts       # Vitest global setup
│   ├── e2e/                         # Playwright E2E tests
│   │   ├── smoke.spec.ts            # Public page smoke tests
│   │   ├── authenticated.spec.ts    # Auth-gated flow tests
│   │   ├── auth.setup.ts            # Auth fixture setup
│   │   ├── global-teardown.ts       # Test teardown
│   │   └── helpers/test-user.ts     # Test user provisioning
│   ├── vitest.config.ts             # Vitest configuration (jsdom, v8 coverage)
│   ├── playwright.config.ts         # Playwright configuration (Chromium)
│   └── package.json                 # Dependencies + scripts (test, test:coverage, etc.)
├── scripts/                         # Utility & governance scripts
│   ├── backfill_template.py         # Template for backfill operations
│   ├── check_doc_counts.py          # Doc count consistency checker
│   ├── check_doc_drift.py           # Doc staleness detector
│   ├── check_migration_conventions.py # Migration naming validator
│   ├── check_migration_order.py     # Migration ordering validator
│   ├── export_user_data.ps1         # User data export utility
│   └── import_user_data.ps1         # User data import utility
├── .github/workflows/
│   ├── pr-gate.yml                  # Lint → Typecheck → Build → Playwright E2E
│   ├── pr-title-lint.yml            # PR title conventional-commit validation (all PRs)
│   ├── main-gate.yml                # Build → Unit tests + coverage → SonarCloud
│   ├── qa.yml                       # Schema → Pipelines → QA (759) → Sanity
│   ├── nightly.yml                  # Full Playwright (all projects) + Data Integrity Audit
│   ├── deploy.yml                   # Manual trigger → Schema diff → Approval → db push
│   ├── sync-cloud-db.yml            # Remote DB sync
│   ├── api-contract.yml             # API contract validation
│   ├── bundle-size.yml              # Frontend bundle size guard
│   ├── codeql.yml                   # CodeQL security analysis
│   ├── dependabot-auto-merge.yml    # Auto-merge Dependabot PRs
│   ├── dependency-audit.yml         # Dependency vulnerability audit
│   ├── license-compliance.yml       # License compliance checks
│   ├── python-lint.yml               # Python linting (Ruff)
│   ├── quality-gate.yml             # SonarCloud quality gate
│   ├── repo-verify.yml              # Repo hygiene verification
│   ├── smoke-test.yml               # Post-deploy smoke test
│   ├── validate-alerts.yml          # Alert configuration validation
│   ├── dr-drill.yml                 # Disaster recovery drill
│   └── data-refresh.yml             # Scheduled weekly data refresh (cron + manual)
├── .commitlintrc.json               # Conventional Commits config (12 types, 24 scopes)
├── .editorconfig                    # Editor configuration (indent styles per language)
├── sonar-project.properties         # SonarCloud configuration
├── requirements.txt                 # Python dependencies
├── CHANGELOG.md                     # Structured changelog (Keep a Changelog + Conventional Commits)
├── CURRENT_STATE.md                 # Volatile project status for AI agent context recovery (read FIRST)
├── DEPLOYMENT.md                    # Deployment procedures, rollback playbook, emergency checklist
├── SECURITY.md                      # Security policy
├── .env.example
├── BACKUP.ps1                       # Pre-push database backup script
├── RUN_DR_DRILL.ps1                 # Disaster recovery drill runner
├── run_data_audit.py                # Nightly data integrity audit (used by nightly.yml)
├── test_data_audit.py               # Unit tests for data audit module
└── README.md

4. Database Schema

Tables

Table	Purpose	Primary Key	Notes
`products`	Product identity, scores, flags, provenance	`product_id` (identity)	Upsert key: `(country, brand, product_name)`. Scores, flags, source columns all inline.
`nutrition_facts`	Nutrition per product (per 100g)	`product_id`	Numeric columns (calories, fat, sugar…)
`ingredient_ref`	Canonical ingredient dictionary	`ingredient_id` (identity)	2,995 unique ingredients; name_en (UNIQUE), vegan/vegetarian/palm_oil/is_additive/concern_tier flags
`product_ingredient`	Product ↔ ingredient junction	`(product_id, ingredient_id, position)`	~13,858 rows across 913 products; tracks percent, percent_estimate, sub-ingredients, position order
`product_allergen_info`	Allergens + traces per product (unified)	`(product_id, tag, type)`	~2,630 rows (1,269 allergens + 1,361 traces) across 655 products; type IN ('contains','traces'); source: OFF allergens_tags / traces_tags
`country_ref`	ISO 3166-1 alpha-2 country codes	`country_code` (text PK)	2 rows (PL, DE); is_active flag, nutri_score_official boolean; FK from products.country
`category_ref`	Product category master list	`category` (text PK)	29 rows (22 PL + 7 new); FK from products.category; display_name, description, icon_emoji, sort_order
`nutri_score_ref`	Nutri-Score label definitions	`label` (text PK)	7 rows (A–E + UNKNOWN + NOT-APPLICABLE); FK from scores.nutri_score_label; color_hex, description
`concern_tier_ref`	EFSA ingredient concern tiers	`tier` (integer PK)	4 rows (0–3); FK from ingredient_ref.concern_tier; score_impact, examples, EFSA guidance
`product_type_ref`	Product sub-type taxonomy per category	`product_type` (text PK)	~137 rows across 29 categories; FK from products.product_type; display_name, icon_emoji, sort_order. Issue #354.
`brand_ref`	Canonical brand dictionary	`brand_name` (text PK)	Auto-seeded from products.brand (~478 rows); parent_company, country_origin, is_store_brand, display_name. Issue #356.
`ingredient_translations`	Localized ingredient display names	`(ingredient_id, language_code)`	FK to ingredient_ref + language_ref; name, source (curated/off_api/auto_translated/user_submitted), reviewed_at. Issue #355.
`user_preferences`	User personalization (country, diet, allergens)	`user_id` (FK → auth.users)	One row per user; diet enum, allergen arrays, strict_mode flags, notification_score_changes, notification_frequency; RLS by user
`user_health_profiles`	Health condition profiles	`profile_id` (identity)	Conditions + nutrient thresholds (sodium, sugar, sat fat limits). One active profile per user. RLS by user
`user_product_lists`	User-created product lists	`list_id` (identity)	Name, description, share_token, is_public. Default lists: Favorites, Avoid. RLS by user
`user_product_list_items`	Items in product lists	`(list_id, product_id)`	sort_order, notes. FK to user_product_lists + products. RLS by user
`user_comparisons`	Saved product comparisons	`comparison_id` (identity)	product_ids array (2-4), share_token, title. RLS by user
`user_saved_searches`	Saved search queries	`search_id` (identity)	Query text, filters JSONB, notification preferences. RLS by user
`scan_history`	Barcode scan history	`scan_id` (identity)	user_id, ean, scanned_at, product_id (if matched). RLS by user
`product_submissions`	User-submitted products	`submission_id` (identity)	ean, product_name, brand, photo_url, review_notes, status ('pending'/'approved'/'rejected'/'merged'). EAN checksum trigger auto-rejects invalid barcodes
`product_links`	Cross-country product links	`link_id` (identity)	product_id_a < product_id_b (ordered pair). link_type: identical/equivalent/variant/related. confidence: manual/ean_match/brand_match/verified. Issue #352
`user_trust_scores`	Per-user submission reputation tracking	`user_id` (FK → auth.users)	trust_score 0-100 (default 50). Auto-adjusts: +5 approved, -15 rejected, -5 auto-reject. Counters, flag fields. RLS: service_role only
`event_schema_registry`	Schema-versioned event definitions	`id` (identity)	event_type + schema_version UNIQUE; json_schema, status(active/deprecated/retired), pii_fields, retention_days. RLS: anon-read
`backfill_registry`	Batch data operation tracking	`backfill_id` (uuid PK)	name (unique), status, rows_processed/expected, batch_size, rollback_sql, validation_passed. RLS: service-write / auth-read
`log_level_ref`	Severity level definitions for structured logs	`level` (text PK)	5 rows (DEBUG–CRITICAL); numeric_level, retention_days, escalation_target. RLS: service-write / auth-read
`error_code_registry`	Known error codes with domain/category/severity	`error_code` (text PK)	{DOMAIN}{CATEGORY}{NNN} format; FK to log_level_ref(level); 13 starter codes. RLS: service-write / auth-read
`retention_policies`	Audit log retention configuration	`policy_id` (identity)	table_name (unique), timestamp_column, retention_days (30–3650), is_enabled. RLS: service_role only
`mv_refresh_log`	Audit trail for MV refreshes	`refresh_id` (identity)	mv_name, refreshed_at, duration_ms, row_count, triggered_by. Index on (mv_name, refreshed_at DESC). RLS: service-write / auth-read
`deletion_audit_log`	GDPR Art.17 deletion audit trail (no PII)	`id` (uuid)	deleted_at (timestamptz), tables_affected (text[]). NO user_id column. RLS enabled, service-role only
`api_rate_limits`	Per-endpoint rate limit configuration	`endpoint` (text PK)	max_requests, window_seconds, description. 6 seeded endpoints. RLS: auth-read / service-write
`api_rate_limit_log`	Ephemeral request tracking for rate limiting	`id` (identity)	user_id, endpoint, called_at. Retention: 2 days. Index on (user_id, endpoint, called_at DESC). RLS: service-role only
`recipe`	Recipe metadata (curated editorial content)	`id` (uuid)	slug (unique), title_key/description_key (i18n), category (breakfast/lunch/dinner/snack/dessert/drink/salad/soup), difficulty, prep/cook time, servings, country (nullable), is_published, tags[]. RLS: public SELECT on published
`recipe_step`	Ordered cooking instructions per recipe	`id` (uuid)	recipe_id FK, step_number (unique per recipe), content_key (i18n). RLS: public SELECT if recipe published
`recipe_ingredient`	Recipe ingredients with sort order	`id` (uuid)	recipe_id FK, name_key (i18n), sort_order, optional flag, ingredient_ref_id FK (nullable link to ingredient_ref). RLS: public SELECT if recipe published
`recipe_ingredient_product`	Links recipe ingredients to DB products	`id` (uuid)	recipe_ingredient_id FK, product_id FK, is_primary, match_confidence (0-1). UNIQUE (ingredient, product). RLS: public SELECT if recipe published

Products Columns (key)

Column	Type	Notes
`product_id`	`bigint`	Auto-incrementing identity
`country`	`text`	`'PL'` or `'DE'` — FK to country_ref
`brand`	`text`	Manufacturer or brand name
`product_name`	`text`	Full product name including variant
`category`	`text`	One of 20 food categories
`product_type`	`text`	Subtype (e.g., `'yogurt'`, `'beer'`)
`ean`	`text`	EAN-13 barcode (unique index)
`prep_method`	`text`	Preparation method (affects scoring). NOT NULL, default `'not-applicable'`
`store_availability`	`text`	Normalized Polish chain name (Biedronka, Lidl, Żabka, etc.) or NULL
`controversies`	`text`	`'none'` or `'palm oil'` etc.
`nutri_score_source`	`text`	Provenance: `'official_label'`, `'off_computed'`, `'manual'`, `'unknown'`
`last_fetched_at`	`timestamptz`	When data was last fetched/refreshed from source API
`off_revision`	`integer`	Open Food Facts internal revision number at time of last fetch
`is_deprecated`	`boolean`	Soft-delete flag
`deprecated_reason`	`text`	Why deprecated

Key Functions

Function	Purpose
`compute_unhealthiness_v33()`	Scores 1–100 from 9 penalty factors + nutrient density bonus: sat fat, sugars, salt, calories, trans fat, additives, prep, controversies, ingredient concern − protein/fibre bonus
`explain_score_v33()`	Returns JSONB breakdown of score: final_score + 10 factors (9 penalties + 1 bonus) with name, weight, raw (0–100), weighted, input, ceiling
`find_similar_products()`	Top-N products by Jaccard ingredient similarity (returns product details + similarity coefficient)
`find_better_alternatives()`	Healthier substitutes in same/any category, ranked by score improvement and ingredient overlap
`resolve_ingredient_name()`	Returns localized ingredient name. Fallback: requested lang → en translation → name_en → NULL
`assign_confidence()`	Returns `'verified'`/`'estimated'`/`'low'` from data completeness
`score_category()`	Consolidated scoring procedure: Steps 0/1/4/5 (concern defaults, unhealthiness, flags + dynamic `data_completeness_pct`, confidence) for a given category
`compute_data_confidence()`	Composite confidence score (0-100) with 6 components; band, completeness profile
`compute_data_completeness()`	Dynamic 15-checkpoint field-coverage function for `data_completeness_pct` (EAN, 9 nutrition, Nutri-Score, NOVA, ingredients, allergens, source)
`api_data_confidence()`	API wrapper for compute_data_confidence(); returns structured JSONB
`api_product_detail()`	Single product as structured JSONB (identity, scores, flags, nutrition, ingredients, allergens, trust)
`api_category_listing()`	Paged category listing with sort (score\|calories\|protein\|name\|nutri_score) + pagination
`api_score_explanation()`	Score breakdown + human-readable headline + warnings + category context (rank, avg, relative position)
`api_better_alternatives()`	Healthier substitutes wrapper with source product context and structured JSON
`api_search_products()`	Full-text + trigram search across product_name and brand; uses pg_trgm GIN indexes
`api_get_cross_country_links()`	Returns linked products for a given product_id; bidirectional query across product_links; returns JSONB array
`api_get_recipes()`	Browse published recipes with filters (country, category, tag, difficulty, max_time); paginated JSONB with total_count + recipes array
`api_get_recipe_detail()`	Full recipe detail by slug: recipe metadata + ingredients (with linked products) + steps; returns structured JSONB
`api_get_recipe_nutrition()`	Aggregate nutrition summary from linked products; picks primary product per ingredient; returns per-100g averages + coverage_pct
`browse_recipes()`	Browse published recipes with filters (category, country, tag, difficulty, max_time); returns TABLE rows
`get_recipe_detail()`	Recipe detail by slug: returns JSONB with metadata, steps, ingredients (with linked_products array)
`find_products_for_recipe_ingredient()`	Finds products for a recipe ingredient: admin-curated links first, then auto-suggested via ingredient_ref matching
`refresh_all_materialized_views()`	Refreshes all MVs concurrently; returns timing report JSONB
`mv_staleness_check()`	Checks if MVs are stale by comparing row counts to source tables
`check_formula_drift()`	Compares stored SHA-256 fingerprints against recomputed hashes for active scoring/search formulas
`check_function_source_drift()`	Compares registered pg_proc source hashes against actual function bodies for critical functions
`governance_drift_check()`	Master drift detection runner — 8 checks across scoring, search, naming conventions, and feature flags
`log_drift_check()`	Executes governance_drift_check() and persists results into drift_check_results; returns run_id UUID
`validate_log_entry()`	Validates a structured log JSON entry against LOG_SCHEMA.md spec; returns `{valid: true}` or `{valid: false, errors: [...]}`
`execute_retention_cleanup()`	Deletes audit rows older than retention_policies window; SECURITY DEFINER, dry-run by default, batch deletion via ctid; returns JSONB summary
`mv_last_refresh()`	Returns the most recent refresh per MV; columns: mv_name, refreshed_at, duration_ms, row_count, triggered_by, age_minutes
`check_flag_readiness()`	Returns activation readiness status for all feature flags — dependency resolution, expiry tracking, status (ready/blocked/expired/enabled)
`api_export_user_data()`	GDPR Art.15/20 — exports all user data as structured JSONB (preferences, health profiles, lists, comparisons, searches, scans). SECURITY DEFINER
`api_delete_user_data()`	GDPR Art.17 — cascading delete across 8 tables in FK-safe order; writes anonymized audit to `deletion_audit_log`. SECURITY DEFINER
`is_valid_ean()`	GS1 checksum validation for EAN-8/EAN-13 barcodes. IMMUTABLE STRICT — returns NULL for NULL, false for invalid
`gs1_country_hint()`	Extracts GS1 country-of-registration hint from EAN-13 prefix. IMMUTABLE STRICT — returns JSONB `{code, name, confidence}` where confidence ∈ {high, low, none}. Not definitive for imported products
`check_submission_rate_limit()`	Returns rate limit status for product submissions: 10 per 24h rolling window per user. SECURITY DEFINER
`check_scan_rate_limit()`	Returns rate limit status for barcode scans: 100 per 24h rolling window per user. SECURITY DEFINER
`check_api_rate_limit()`	Generic per-endpoint rate limiter: checks `api_rate_limits` config, logs to `api_rate_limit_log`. Returns `{allowed, remaining}` or blocked JSONB. SECURITY DEFINER
`check_share_limit()`	Per-user share count limiter: max 50 shared items per type (comparisons/lists). Returns `{allowed, type, limit}` JSONB. SECURITY DEFINER
`score_submission_quality()`	Scores a submission's quality (0-100) from 7 signals: account age, velocity, EAN match, photo, brand/name quality, user trust score. SECURITY DEFINER
`api_admin_batch_reject_user()`	Rejects all pending/manual_review/flag_for_review submissions from a user, flags trust score (cap at 10). SECURITY DEFINER
`api_admin_submission_velocity()`	Returns submission velocity stats: last_24h, last_7d, pending_count, auto_rejected_24h, status_breakdown, top_submitters with trust. SECURITY DEFINER

Views

v_master — Flat denormalized join: products → nutrition_facts + ingredient analytics via LATERAL subqueries on product_ingredient + ingredient_ref (ingredient_count, additive_names, ingredients_raw, has_palm_oil, vegan_status, vegetarian_status, allergen_count/tags, trace_count/tags). Scores, flags, source provenance all inline on products. Includes score_breakdown (JSONB), ingredient_data_quality, and nutrition_data_quality columns. Filtered to is_deprecated = false. This is the primary internal query surface.

v_api_category_overview — Dashboard-ready category statistics. One row per active category (20 total). Includes product_count, avg/min/max/median score, pct_nutri_a_b, pct_nova_4, display metadata from category_ref.

v_product_confidence — Materialized view of data confidence scores for all active products. Columns: product_id, product_name, brand, category, nutrition_pts(0-30), ingredient_pts(0-25), source_pts(0-20), ean_pts(0-10), allergen_pts(0-10), serving_pts(0-5), confidence_score(0-100), confidence_band(high/medium/low). Unique index on product_id.

v_formula_registry — Unified view across scoring_model_versions and search_ranking_config. Columns: domain, version, formula_name, status, weights_config (JSONB), fingerprint (SHA-256), change_reason, is_active. Both scoring and search formulas in one query surface.

v_data_freshness_summary — Per-country, per-category freshness breakdown. Columns: country, category, total_products, has_fetch_date, fresh_30d, aging_30_90d, stale_90d, never_fetched, oldest_fetch, newest_fetch, pct_fresh. Used for monitoring data staleness.

v_data_coverage_summary — Materialized view of per-country, per-category data coverage metrics. Columns: country, category, total_products, with_ingredients, with_allergens, with_ean, ingredient_pct, allergen_pct, ean_pct, avg_completeness. Unique index on (country, category). Used by QA coverage threshold checks (§8.18, checks 34–37). Refreshed by refresh_all_materialized_views().

mv_scoring_distribution — Materialized view of scoring band distribution per country and category. Columns: country, category, band (Green/Yellow/Orange/Red/Dark Red), product_count, pct_of_category, avg_score, min_score, max_score, stddev_score. Unique index on (country, category, band). Refreshed by refresh_all_materialized_views().

v_cross_country_scan_analytics — Per-country scan metrics. Columns: scan_country, total_scans, found_scans, missed_scans, miss_rate_pct, unique_eans_scanned, unique_eans_missed. Used for cross-country scanner performance monitoring.

v_cross_country_ean_candidates — EANs scanned in more than one country — candidates for product_links. Columns: ean, scanned_in_countries (text[]), country_count, first_scanned, last_scanned, total_scans.

v_submission_country_analytics — Per-country submission metrics. Columns: suggested_country, total_submissions, pending, approved, rejected, merged, acceptance_rate_pct. Used for cross-country submission monitoring.

Edge Functions

Location: supabase/functions/ Runtime: Deno + TypeScript Deploy: supabase functions deploy <name> --no-verify-jwt Secrets: supabase secrets set KEY=value — never committed to git Local test: supabase start && supabase functions serve

Function	Trigger	Purpose
`api-gateway`	HTTP	Write-path API gateway — rate limiting, auth validation, submission scoring, trust score enforcement. All user write operations (submit, scan, share) route through here. Calls `check_submission_rate_limit()`, `score_submission_quality()`, and `api_admin_batch_reject_user()` RPC internally.
`send-push-notification`	HTTP (RPC)	Web Push delivery via VAPID. Accepts `{user_id, title, body, url}`. Validates subscription records, applies Upstash Redis deduplication, delivers via Web Push Protocol. VAPID keys injected as secrets.
`verify-turnstile`	HTTP (public)	Cloudflare Turnstile CAPTCHA server-side verification. Validates challenge token before allowing signup and product submission flows. TURNSTILE_SECRET_KEY injected as secret.

5. Categories (29 defined, 22 PL + 21 DE active)

All categories have variable product counts (28–95 active products). Categories are expanded by running the pipeline with --max-products N. DE categories target ~51 products each. All 21 PL categories (except Żabka) have a DE counterpart. 7 new categories added for 10K expansion (not yet populated).

PL Categories (22 active)

Category	Folder slug
Alcohol	`alcohol/`
Baby	`baby/`
Bread	`bread/`
Breakfast & Grain-Based	`breakfast-grain-based/`
Canned Goods	`canned-goods/`
Cereals	`cereals/`
Chips (PL)	`chips-pl/`
Condiments	`condiments/`
Dairy	`dairy/`
Drinks	`drinks/`
Frozen & Prepared	`frozen-prepared/`
Instant & Frozen	`instant-frozen/`
Meat	`meat/`
Nuts, Seeds & Legumes	`nuts-seeds-legumes/`
Oils & Vinegars	`oils-vinegars/`
Plant-Based & Alternatives	`plant-based-alternatives/`
Sauces	`sauces/`
Seafood & Fish	`seafood-fish/`
Snacks	`snacks/`
Spreads & Dips	`spreads-dips/`
Sweets	`sweets/`
Żabka	`zabka/`

DE Categories (21)

Category	Folder slug
Alcohol (DE)	`alcohol-de/`
Baby (DE)	`baby-de/`
Bread (DE)	`bread-de/`
Breakfast & Grain-Based (DE)	`breakfast-grain-based-de/`
Canned Goods (DE)	`canned-goods-de/`
Cereals (DE)	`cereals-de/`
Chips (DE)	`chips-de/`
Condiments (DE)	`condiments-de/`
Dairy (DE)	`dairy-de/`
Drinks (DE)	`drinks-de/`
Frozen & Prepared (DE)	`frozen-prepared-de/`
Instant & Frozen (DE)	`instant-frozen-de/`
Meat (DE)	`meat-de/`
Nuts, Seeds & Legumes (DE)	`nuts-seeds-legumes-de/`
Oils & Vinegars (DE)	`oils-vinegars-de/`
Plant-Based & Alternatives (DE)	`plant-based-alternatives-de/`
Sauces (DE)	`sauces-de/`
Seafood & Fish (DE)	`seafood-fish-de/`
Snacks (DE)	`snacks-de/`
Spreads & Dips (DE)	`spreads-dips-de/`
Sweets (DE)	`sweets-de/`

New Categories (7, not yet populated — Issue #858)

Category	Slug
Pasta & Rice	`pasta-rice`
Soups	`soups`
Coffee & Tea	`coffee-tea`
Frozen Vegetables	`frozen-vegetables`
Ready Meals	`ready-meals`
Desserts & Ice Cream	`desserts-ice-cream`
Spices & Seasonings	`spices-seasonings`

43 pipeline folders (22 PL + 21 DE). Category-to-OFF tag mappings live in pipeline/categories.py. Each category has multiple OFF tags and search terms for comprehensive coverage. 7 new categories defined in category_ref but not yet added to the pipeline (see Issue #859).

6. Pipeline SQL Conventions

File Naming & Execution Order

PIPELINE__<category>__01_insert_products.sql   # Upsert products (must run FIRST)
PIPELINE__<category>__03_add_nutrition.sql      # Nutrition facts
PIPELINE__<category>__04_scoring.sql            # Nutri-Score + NOVA + CALL score_category()
PIPELINE__<category>__05_source_provenance.sql  # Source URLs + EANs (pipeline-generated categories)

Order matters: Products (01) must exist before nutrition (03). Scoring (04) sets Nutri-Score/NOVA data, then calls score_category() which computes unhealthiness, flags, and confidence. Source provenance (05) is generated by the pipeline and contains OFF API source URLs + EANs.

Idempotency Patterns

Operation	Pattern
Insert product	`INSERT ... ON CONFLICT (country, brand, product_name) DO UPDATE SET`
Insert nutrition	`LEFT JOIN nutrition_facts ... WHERE nf.product_id IS NULL`
Update scores	`CALL score_category('CategoryName');`
Schema change	`IF NOT EXISTS` / `ADD COLUMN IF NOT EXISTS`

Scoring Call

Always use score_category() — never inline the scoring steps:

-- After setting Nutri-Score (Step 2) and NOVA (Step 3):
CALL score_category('CategoryName');

This procedure handles Steps 0 (default concern score), 1 (compute unhealthiness), 4 (health-risk flags + dynamic data_completeness_pct via compute_data_completeness()), and 5 (confidence). See 20260213000800_dynamic_data_completeness.sql for the latest implementation.

prep_method Scoring

Value	Internal Score
`'air-popped'`	20
`'steamed'`	30
`'baked'`	40
`'not-applicable'`	50 (default)
`'none'`	50 (default)
`'grilled'`	60
`'smoked'`	65
`'fried'`	80
`'deep-fried'`	100

Additional valid values (scored as 50/default unless added to the scoring function): 'roasted', 'marinated', 'pasteurized', 'fermented', 'dried', 'raw'.

The pipeline's _detect_prep_method() infers these from OFF category tags and product names (both English and Polish keywords).

Data state: All active products have prep_method populated (0 NULLs). 14 categories use 'not-applicable'. 5 method-sensitive categories (Bread, Chips, Frozen & Prepared, Seafood & Fish, Snacks) use category-specific values ('baked', 'fried', 'smoked', 'marinated', 'not-applicable'). Żabka uses a mix of 'baked', 'fried', and 'none'.

7. Migrations

Location: supabase/migrations/ — managed by Supabase CLI. Currently 228 migrations.

Rules:

Append-only. Never modify an existing migration file.
No product data. Migrations define schema + seed metadata only.
Prefer IF NOT EXISTS / IF EXISTS guards for idempotency.
New changes → new file with next timestamp.

CHECK Constraints

24 CHECK constraints enforce domain values at the DB level:

Table	Constraint	Rule
`products`	`chk_products_country`	`country IN ('PL','DE')` — matches active country_ref entries
`products`	`chk_products_prep_method`	Valid method (NOT NULL): `air-popped`, `baked`, `fried`, `deep-fried`, `grilled`, `roasted`, `smoked`, `steamed`, `marinated`, `pasteurized`, `fermented`, `dried`, `raw`, `none`, `not-applicable`
`products`	`chk_products_controversies`	`IN ('none','minor','moderate','serious','palm oil')`
`products`	`chk_products_unhealthiness_range`	1–100 (unhealthiness_score)
`products`	`chk_products_nutri_score_label`	NULL or `IN ('A','B','C','D','E','UNKNOWN','NOT-APPLICABLE')`
`products`	`chk_products_nutri_score_source`	NULL or `IN ('official_label','off_computed','manual','unknown')`
`products`	`chk_products_confidence`	NULL or `IN ('verified','estimated','low')`
`products`	`chk_products_nova`	NULL or `IN ('1','2','3','4')`
`products`	4 × `chk_products_high_*_flag`	NULL or `IN ('YES','NO')`
`products`	`chk_products_completeness`	0–100 (data_completeness_pct)
`products`	`chk_products_source_type`	NULL or `IN ('off_api','manual','off_search','csv_import')`
`nutrition_facts`	`chk_nutrition_non_negative`	All 9 nutrition columns ≥ 0
`nutrition_facts`	`chk_nutrition_satfat_le_totalfat`	saturated_fat ≤ total_fat
`nutrition_facts`	`chk_nutrition_sugars_le_carbs`	sugars ≤ carbs
`ingredient_ref`	`chk_concern_tier_range`	concern_tier 0–3
`ingredient_ref`	`chk_palm_oil_values`	contains_palm_oil IN ('yes','no','maybe')
`ingredient_ref`	`chk_vegan_values`	vegan IN ('yes','no','maybe')
`ingredient_ref`	`chk_vegetarian_values`	vegetarian IN ('yes','no','maybe')
`product_allergen_info`	`product_allergen_info_type_check`	type IN ('contains','traces')
`product_ingredient`	`chk_percent_range`	percent BETWEEN 0 AND 100
`product_ingredient`	`chk_percent_estimate_range`	percent_estimate BETWEEN 0 AND 100
`product_ingredient`	`chk_sub_has_parent`	NOT is_sub OR parent_ingredient_id IS NOT NULL

Performance Indexes

Table	Index Name	Columns / Condition
`products`	`products_pkey`	`product_id` (PK)
`products`	`products_country_brand_name_uniq`	`(country, brand, product_name)` UNIQUE
`products`	`products_ean_uniq`	`ean` UNIQUE WHERE ean IS NOT NULL
`products`	`products_category_idx`	`category`
`products`	`products_active_idx`	`product_id` WHERE is_deprecated IS NOT TRUE

8. Testing & QA (NON-NEGOTIABLE)

A change is not done unless relevant tests were added/updated, every suite is green, and coverage/quality gates are not degraded. This applies to every code change — no exceptions.

8.1 Testing Stack & Architecture

Layer	Tool	Location	Runner
Unit tests	Vitest 4.x (jsdom, v8 coverage)	`frontend/src/*/.test.{ts,tsx}` co-located	`cd frontend && npx vitest run`
Component tests	Testing Library React + Vitest	`frontend/src/components/*/.test.tsx`	same as above
E2E smoke	Playwright 1.58 (Chromium)	`frontend/e2e/smoke.spec.ts`	`cd frontend && npx playwright test`
E2E auth	Playwright (requires `SUPABASE_SERVICE_ROLE_KEY`)	`frontend/e2e/authenticated.spec.ts`	same (CI auto-detects key)
DB QA (776 checks)	Raw SQL (zero rows = pass)	`db/qa/QA__*.sql` (49 suites)	`.\RUN_QA.ps1`
Negative validation	SQL injection/constraint tests	`db/qa/TEST__negative_checks.sql`	`.\RUN_NEGATIVE_TESTS.ps1`
DB sanity	Row-count + schema assertions	via `RUN_SANITY.ps1`	`.\RUN_SANITY.ps1 -Env local`
Pipeline structure	Python validator	`check_pipeline_structure.py`	`python check_pipeline_structure.py`
EAN checksum	Python validator	`validate_eans.py`	`python validate_eans.py`
Code quality	SonarCloud	`sonar-project.properties`	CI only (main-gate.yml)

Coverage is collected via npm run test:coverage (v8 provider, LCOV output at frontend/coverage/lcov.info), fed to SonarCloud. Coverage exclusions are declared in both vitest.config.ts and sonar-project.properties.

8.2 Always Discover Existing Patterns First

Before writing or changing any code:

Search the repo for existing tests covering the area you're touching and follow the established style.
Prefer extending existing test files over inventing new patterns.
Locate how tests run in CI (GitHub Actions workflows in .github/workflows/) and locally (scripts), and align with that.

8.3 Every Code Change Must Include Tests

For any functional change:

Add or update tests covering:
- Happy path — expected normal behavior.
- Edge cases — boundary values, empty inputs, unicode, null.
- Error/validation paths — invalid inputs, permission failures, missing data.
- Regression — for bug fixes, add a test that would have caught the bug.
If you add a feature flag, filter, profile option, or API parameter: test ON/OFF behavior.
If you touch database logic/migrations: add QA checks that validate schema + expected query behavior.

8.4 Test Conventions (must follow)

Vitest unit/component tests

Use describe() + it() blocks (not test()). Descriptions in plain English.
Import { describe, it, expect, vi, beforeEach } from "vitest".
Use @/ path alias for imports (e.g., @/lib/api, @/components/common/RouteGuard).
Mock modules with vi.mock("@/lib/module", () => ({ ... })). Provide ALL exports used by the component under test — not just the ones being asserted. Missing mock exports cause cryptic runtime errors (e.g., TypeError: X is not a function).
Clear mocks in beforeEach with vi.clearAllMocks().
Component tests: wrap in QueryClientProvider via a createWrapper() helper with { retry: false, staleTime: 0 }.
Assertions: expect(...).toEqual(), .toHaveBeenCalledWith(), .toBeTruthy(), .toBeVisible().
Use ASCII-art section dividers (// ─── Section ───) to group test blocks.
Never hardcode constants that mirror source code. Import or reference the source constant (e.g., MAX_PRODUCTS, MAX_SEARCHES). Hardcoded duplicates silently drift when the source value changes.
When adding a route, tab, or nav item, grep the test suite for count-based assertions (e.g., expect(links).toHaveLength(N)) and update them.
Setup file: frontend/src/__tests__/setup.ts (imports @testing-library/jest-dom/vitest).

Playwright E2E tests

Use test.describe() + test() (not it()).
Import { test, expect } from @playwright/test only.
No mocks — tests run against a live dev server at http://localhost:3000.
Locators: prefer page.locator("text=..."), page.getByRole(...), CSS selectors.
Strict mode: Playwright fails when a locator matches multiple elements. If a page has duplicate accessible names (e.g., two "Get started" CTAs), disambiguate with .first(), .nth(0), or a more specific parent locator. Never disable strict mode.
Assertions: expect(page).toHaveTitle(...), expect(locator).toBeVisible().
Auth-protected routes: assert redirect via page.waitForURL(/\/auth\/login/).
Smoke tests go in e2e/smoke.spec.ts; authenticated flows in e2e/authenticated.spec.ts.

Database QA SQL

Each check is a numbered SELECT returning violation rows. Zero rows = pass.
Include 'ISSUE LABEL' AS issue and a detail column for human-readable output.
Separate sections with -- ═══... ASCII dividers and numbered titles.
Header comment states total check count and purpose (e.g., -- 29 checks).
Add checks to existing suite files; only create a new QA__*.sql suite if the domain is genuinely new.

8.5 Coverage & Quality Gates Must Not Regress

Never reduce overall coverage or weaken assertions.
Enforced metric: Lines coverage is the primary governance metric (baseline ≥ 88 %). Statements, branches, and functions are tracked in CI output but are not individually gated.
SonarCloud Quality Gate enforces coverage on new code ≥ 80 % (lines). This is the merge-blocking gate; the repo-wide 88 % baseline is a ratchet — it must not decrease but is not a hard gate today.
Prefer strong assertions (specific outputs, types, error codes, DB row counts) over snapshot-only tests.
If coverage tooling exists (npm run test:coverage), ensure new code paths are covered.
If a change makes coverage impossible, refactor to make it testable (pure functions, dependency injection, smaller modules).
SonarCloud Quality Gate must pass. Do not lower thresholds, delete checks, or skip suites to make failures disappear.

8.6 Update QA Checks When Needed

If you add new constraints (validation rule, EAN rules, scoring rule, CHECK constraint):

Add/extend the corresponding QA check(s) in db/qa/ so the rule is enforced.
Update check count in header comments and this document only if the total changes.
Keep totals consistent across copilot-instructions.md, RUN_QA.ps1 output, and qa.yml job name.

8.7 Run Commands and Report Results

Before finalizing any change:

Run the full impacted suite locally (same entrypoint CI uses).
If the full suite is too heavy, run the impacted subset and explain why.
In your response, include:
- Commands executed
- Pass/fail status
- Key output summaries (counts, durations, suite names)

Minimum validation per change type:

Change type	Commands to run
Frontend component/lib	`cd frontend && npx tsc --noEmit && npx vitest run`
Frontend + UI flow	above + `npx playwright test --project=smoke`
Database schema/SQL	`python check_pipeline_structure.py`, then `.\RUN_QA.ps1`
Scoring/nutrition	`.\RUN_QA.ps1` (covers scoring regression tests)
Python pipeline code	`python -c "import py_compile; py_compile.compile('file.py', doraise=True)"` + `python check_pipeline_structure.py`
Full stack	all of the above

8.8 Test Placement Rules

Area	Test location	Level
`frontend/src/lib/*.ts`	Co-located `*.test.ts` in same dir	Unit
`frontend/src/components/*/.tsx`	Co-located `*.test.tsx` in same dir	Unit / Component
API routes / RPC wrappers	`frontend/src/lib/rpc.test.ts` or `api.test.ts`	Unit (mocked Supabase)
UI flows & navigation	`frontend/e2e/smoke.spec.ts`	E2E
Auth-gated flows	`frontend/e2e/authenticated.spec.ts`	E2E
DB schema & constraints	`db/qa/QA__*.sql` suites	DB QA
Scoring formula	`db/qa/QA__scoring_formula_tests.sql`	DB regression
Pipeline SQL structure	`check_pipeline_structure.py`	Python validator
Bug fixes	Add regression test in the appropriate location above	Same as area

8.9 Determinism (Flake Prevention)

All tests must be deterministic:

No live network calls. Mock external APIs (vi.mock() for OFF API, Supabase).
No time-dependent assertions without freezing time (vi.useFakeTimers()). Use >= (not >) for staleness comparisons — Date.now() - createdAt > 0 fails in same-millisecond operations.
No randomness without seeding.
No dependency on local machine state (port availability, file system, env vars).
If unavoidable, mock/stub and document why.
Use local test DB / fixtures as per repo conventions.

E2E tests are the only exception — they run against a live dev server but use Playwright's retry and timeout mechanisms.

8.10 CI Parity (Don't "Green Locally, Red in CI")

Use the same entrypoints CI uses (.github/workflows/pr-gate.yml, main-gate.yml, qa.yml).
If a test needs env vars, provide defaults in test setup (not in CI-only secrets).
If you add a new dependency/tool, ensure it's installed in CI (package.json or requirements.txt).
CI workflows (tiered architecture):
- pr-gate.yml: Static checks (typecheck + lint) → Unit tests + Build (parallel) → Playwright smoke E2E
- pr-title-lint.yml: PR title conventional-commit validation (all PRs)
- main-gate.yml: Typecheck → Lint → Build → Unit tests with coverage → Playwright smoke E2E → SonarCloud scan + BLOCKING Quality Gate → Sentry sourcemap upload
- nightly.yml: Full Playwright (all projects incl. visual regression) + Data Integrity Audit (parallel)
- qa.yml: Pipeline structure guard → Schema migrations → Schema drift detection → Pipelines → QA (776 checks) → Sanity (17 checks) → Confidence threshold
- deploy.yml: Manual trigger → Schema diff → Approval gate (production) → Pre-deploy backup → supabase db push → Post-deploy sanity
- sync-cloud-db.yml: Auto-sync migrations to production on merge to main
Required (merge-blocking) checks: Unit Tests, Playwright Smoke, Typecheck & Lint, Build. These four must pass before a PR can merge.
Non-blocking checks: quality_gate, DB Integrity, Lighthouse, verify, Vercel. Failures on these do not block merging but should be investigated.
ESLint consistent-type-imports: Use import type { X } for type-only imports — import { X } for a type will fail the lint gate.

8.11 Test Plan Required (Before Coding)

Before implementing a non-trivial change, write a short Test Plan:

What should be tested (bullet list)
Where the tests will live (file paths)
What level (unit / component / integration / e2e / DB QA) Then implement code + tests accordingly. Skip this for trivial one-line fixes.

8.12 Contract Tests for APIs & RPC

When modifying any API route or Supabase RPC wrapper:

Add tests that assert the API contract:
- Status codes / return types
- Response schema/fields
- Error shapes (type, message)
- Auth requirements (anon vs authenticated)
If a shared TypeScript type or Zod schema exists, assert against it.
Existing patterns: frontend/src/lib/rpc.test.ts (38 tests), frontend/src/lib/api.test.ts (8 tests).

8.13 Database Safety Rules

If adding/changing DB schema or SQL functions:

Append-only migrations. Never modify an existing supabase/migrations/ file.
Provide a migration plan and rollback note (comment in the migration file).
For rollback procedures, see DEPLOYMENT.md → Rollback Procedures (5 scenarios + emergency checklist).
Add a QA check that verifies the migration outcome (row counts, constraint behavior).
Ensure idempotency (IF NOT EXISTS, ON CONFLICT, DO UPDATE SET).
Run .\RUN_QA.ps1 to verify all 776 checks pass + .\RUN_NEGATIVE_TESTS.ps1 for 20 injection tests.

8.14 Snapshots Are Not Enough

Do not rely solely on snapshot tests for logic-heavy changes. Snapshots are only allowed for:

Stable UI rendering (component structure)
Large response payloads

But they must be paired with explicit assertions on key fields/values.

8.15 Refactors: Maintain Behavior and Prove It

For refactors:

Lock behavior — ensure existing tests pass before refactoring. If no tests exist, add characterization tests first.
Refactor — make the structural change.
Prove no regression — all tests must still pass with zero changes to assertions.
Validate with python -c "import py_compile; py_compile.compile('file.py', doraise=True)" for Python, npx tsc --noEmit for TypeScript.

8.16 Don't Weaken Gates to Fix Failures

Never "fix" a failure by:

Lowering coverage or quality thresholds
Deleting or skipping checks/suites
Widening assertion tolerances without justification
Removing ON_ERROR_STOP or set -euo pipefail

Only do this if explicitly requested and with a clear written justification.

8.17 Verification Output

At the end of every PR-like change, include a Verification section:

Commands run (with output)
Results summary (pass/fail, counts)
New/updated tests listed
QA check changes listed (if any)

8.18 DB QA Suites Reference

Suite	File	Checks	Blocking?
Data Integrity	`QA__null_checks.sql`	29	Yes
Scoring Formula	`QA__scoring_formula_tests.sql`	40	Yes
Source Coverage	`QA__source_coverage.sql`	8	No
EAN Validation	`validate_eans.py`	1	Yes
API Surfaces	`QA__api_surfaces.sql`	18	Yes
API Contract	`QA__api_contract.sql`	33	Yes
Confidence Scoring	`QA__confidence_scoring.sql`	14	Yes
Confidence Reporting	`QA__confidence_reporting.sql`	7	Yes
Data Quality	`QA__data_quality.sql`	29	Yes
Ref. Integrity	`QA__referential_integrity.sql`	18	Yes
View Consistency	`QA__view_consistency.sql`	16	Yes
Naming Conventions	`QA__naming_conventions.sql`	12	Yes
Nutrition Ranges	`QA__nutrition_ranges.sql`	20	Yes
Data Consistency	`QA__data_consistency.sql`	26	Yes
Allergen Integrity	`QA__allergen_integrity.sql`	15	Yes
Allergen Filtering	`QA__allergen_filtering.sql`	6	Yes
Serving & Source	`QA__serving_source_validation.sql`	16	Yes
Ingredient Quality	`QA__ingredient_quality.sql`	17	Yes
Security Posture	`QA__security_posture.sql`	41	Yes
Scale Guardrails	`QA__scale_guardrails.sql`	23	Yes
Country Isolation	`QA__country_isolation.sql`	11	Yes
Diet Filtering	`QA__diet_filtering.sql`	6	Yes
Barcode Lookup	`QA__barcode_lookup.sql`	9	Yes
Auth & Onboarding	`QA__auth_onboarding.sql`	8	Yes
Health Profiles	`QA__health_profiles.sql`	14	Yes
Lists & Comparisons	`QA__lists_comparisons.sql`	15	Yes
Scanner & Submissions	`QA__scanner_submissions.sql`	15	Yes
Index & Temporal	`QA__index_temporal.sql`	19	Yes
Attribute Contradictions	`QA__attribute_contradiction.sql`	5	Yes
Monitoring & Health	`QA__monitoring.sql`	14	Yes
Scoring Determinism	`QA__scoring_determinism.sql`	25	Yes
Multi-Country Consistency	`QA__multi_country_consistency.sql`	16	Yes
Performance Regression	`QA__performance_regression.sql`	6	No
Event Intelligence	`QA__event_intelligence.sql`	18	Yes
Store Architecture	`QA__store_integrity.sql`	12	Yes
Data Provenance	`QA__data_provenance.sql`	28	Yes
Scoring Engine	`QA__scoring_engine.sql`	27	Yes
Search Architecture	`QA__search_architecture.sql`	26	Yes
GDPR Compliance	`QA__gdpr_compliance.sql`	15	Yes
Push Notifications	`QA__push_notifications.sql`	17	Yes
Index Verification	`QA__index_verification.sql`	13	No
Slow Query Detection	`QA__slow_queries.sql`	12	No
Explain Analysis	`QA__explain_analysis.sql`	10	No
MV Refresh Cost	`QA__mv_refresh_cost.sql`	10	No
Governance Drift	`QA__governance_drift.sql`	8	Yes
RLS Audit	`QA__rls_audit.sql`	7	Yes
Function Security Audit	`QA__function_security_audit.sql`	6	Yes
Recipe Integrity	`QA__recipe_integrity.sql`	6	Yes
Scoring Band Distribution	`QA__scoring_distribution.sql`	12	No
Negative Validation	`TEST__negative_checks.sql`	23	Yes

Run: .\RUN_QA.ps1 — expects 776/776 checks passing (+ EAN validation). Run: .\RUN_NEGATIVE_TESTS.ps1 — expects 20/20 caught.

8.19 Key Regression Tests (Scoring Suite)

These are anchor products whose scores must remain stable. If a scoring change causes drift beyond ±2 points, investigate before committing:

PL Anchors:

Doritos Sweet Chili ≈ 41 (chips, 7 additives + concern 55, protein credit)
Piątnica Skyr Naturalny ≈ 5 (healthiest dairy, fermented, high protein bonus)
Melvit Płatki owsiane górskie ≈ 7 (healthiest cereal, protein + fibre bonus)
Auchan Tortilla Pszenno-Żytnia ≈ 29 (bread, 9 additives + concern 25, protein credit)
Tarczyński Kabanosy wieprzowe ≈ 27 (high-fat cured meat, high protein bonus)
BoboVita Kaszka Mleczna ≈ 28 (baby food, high sugars, protein credit)
Mestemacher Chleb wielozbożowy ≈ 13 (bread, baked, protein + fibre bonus)
Marinero Łosoś wędzony ≈ 25 (cold-smoked salmon, high protein bonus)
Dr. Oetker Pizza 4 sery ≈ 30 (frozen pizza, palm oil, 4 additives, protein credit)
Naleśniki z jabłkami ≈ 15 (frozen crepes, low penalty, fibre credit)
Pudliszki Ketchup Łagodny Premium ≈ 33 (condiments, sugar + salt + additives)
E. Wedel Czekolada Tiramisu ≈ 55 (sweets, palm oil + 44 ingredients + 6 additives + concern 30)
Indomie Noodles Chicken ≈ 54 (instant, palm oil + 29 ingredients + 10 additives + concern 75)

DE Anchors:

Coca-Cola Zero (DE) ≈ 13 (zero nutrition, 8 additives + concern 70, no protein/fibre)
Ritter Sport Edel-Vollmilch (DE) ≈ 48 (sweets, full milk chocolate, high sugar/fat)
Alpro Sojadrink Ungesüßt (DE) ≈ 6 (drinks, minimal fat/sugar, low penalty)
Chipsfrisch ungarisch (DE) ≈ 25 (chips, fried, moderate fat, protein credit)
Golden Seafood Wildlachsfilet (DE) ≈ 3 (seafood, high protein, minimal penalties)
Asia Green Garden Instant-Nudeln Beef (DE) ≈ 45 (instant noodles, no ingredient data, high sat fat + salt)

Run QA after every schema change, data update, or scoring formula adjustment.

8.20 pgTAP Tests for API Functions & Views

If you modify, rename, or add any SQL function or view used by the app API (any api_* function, v_* view, or mv_* materialized view), you must add or update the corresponding pgTAP test in supabase/tests/.

Current test files:

File	Covers
`scanner_functions.test.sql`	`api_record_scan`, `api_get_scan_history`
`product_functions.test.sql`	`api_product_detail_by_ean`, `api_product_detail`, `api_better_alternatives`, `api_score_explanation`, `api_data_confidence`
`category_functions.test.sql`	`api_category_overview`, `api_category_listing`
`search_functions.test.sql`	`api_search_products`, `api_search_autocomplete`, `api_get_filter_options`
`comparison_functions.test.sql`	`api_get_products_for_compare`, `api_save_comparison`, `api_get_shared_comparison`
`telemetry_functions.test.sql`	`api_track_event`, `api_admin_get_event_summary`, `api_admin_get_top_events`, `api_admin_get_funnel`
`dashboard_functions.test.sql`	`api_record_product_view`, `api_get_recently_viewed`, `api_get_dashboard_data`
`user_functions.test.sql`	Auth-error branches for all `authenticated`-only functions
`schema_contracts.test.sql`	Table/view/function existence checks
`recipe_functions.test.sql`	`api_get_recipes`, `api_get_recipe_detail`, `api_get_recipe_nutrition`
`achievement_functions.test.sql`	Achievement/gamification function tests
`admin_dashboard_functions.test.sql`	Admin dashboard function tests
`business_metrics_functions.test.sql`	Business metrics function tests
`localization_functions.test.sql`	Localization/i18n function tests
`push_notification_functions.test.sql`	Push notification function tests
`score_history_functions.test.sql`	Score history function tests
`scoring_bands.test.sql`	Scoring band tests

Rules:

Every new api_* function must have at least a lives_ok test and response-key assertions.
Auth-required functions: test the error branch (returns {api_version, error} when auth.uid() is NULL).
No-auth functions: test with fixture data — insert test rows, call the function, assert keys and values.
Schema contract tests: add has_table / has_view / has_function for any new schema object.
Run supabase test db to verify all pgTAP tests pass before committing.

9. Environment

Environment	DB URL	Studio
Local (default)	`postgresql://postgres:postgres@127.0.0.1:54322/postgres`	http://127.0.0.1:54323
Remote	Supabase project `uskvezwftkkudvksmken`	Supabase Dashboard

Database access (no local psql install needed):

echo "SELECT * FROM v_master LIMIT 5;" | docker exec -i supabase_db_tryvit psql -U postgres -d postgres

Python environment:

Virtual env: .venv in project root → .\.venv\Scripts\python.exe
Always set $env:PYTHONIOENCODING="utf-8" before running (Polish characters)
Dependencies in requirements.txt

10. Guardrails

❌ Modify existing files in supabase/migrations/
❌ Invent nutrition data or Nutri-Score values
❌ Add products from countries not in country_ref (currently PL and DE only)
❌ Use DELETE or TRUNCATE in pipeline files — deprecate instead
❌ Inline the scoring formula — always call compute_unhealthiness_v33()
❌ Run pipelines against remote without explicit user confirmation
❌ Drop or rename tables without a new migration
❌ Collapse categories — each gets its own pipeline folder

11. Adding a New Category

Define category in pipeline/categories.py (name constant, OFF tags, search terms).
Run pipeline: python -m pipeline.run --category "New Category" --max-products 28.
Execute all generated SQL files against local DB (01, 03, 04).
Run .\RUN_QA.ps1 — verify all checks pass.

Reference implementation: chips-pl/ pipeline. Copy its SQL patterns for manual work.

12. Naming Conventions

Item	Convention
Migration files	`YYYYMMDDHHMMSS_description.sql` (Supabase timestamps)
Pipeline files	`PIPELINE__<category>__<NN>_<action>.sql`
View files	`VIEW__<name>.sql`
QA files	`QA__<name>.sql`
Table names	`snake_case`, plural (`products`, `nutrition_facts`)
Column names	`snake_case` with unit suffix (`saturated_fat_g`, `salt_g`)

13. Git Workflow

Branch strategy: main = stable. Feature branches: feat/, fix/, docs/, chore/.

Commit convention: Conventional Commits — enforced on PR titles via pr-title-lint.yml. Full convention documented in CHANGELOG.md.

<type>(<scope>): <description>

Types (12, enforced at error level): feat · fix · schema · data · score · docs · test · ci · refactor · perf · security · chore

Scopes (24, enforced at warning level): frontend · api · scoring · search · pipeline · qa · migration · products · nutrition · rls · auth · config · deps · v32 · confidence · provenance · docs · ci · build · e2e · vitest · playwright · security · cleanup

Examples:

feat(dairy): add Piątnica product line
fix(scoring): correct salt ceiling from 1.5 to 3.0
schema(migration): add ean column to products
data(pipeline): normalize categories to 28 products
security(rls): lock down product_submissions to authenticated users

Breaking changes: Append ! after type/scope — schema(migration)!: rename products.source to source_type

Pre-commit checklist:

.\RUN_QA.ps1 — 759/759 pass
No credentials in committed files
No modifications to existing supabase/migrations/
Docs updated if schema or methodology changed
CHANGELOG.md updated (add entry under [Unreleased])

13.1 Conflict Resolution Rules (Learned Patterns)

When resolving merge conflicts, apply these deterministic rules:

Conflict Type	Resolution
CHANGELOG.md	Keep BOTH sides — never discard either side's entries. Main's entries go first, then HEAD's.
Pipeline SQL data (`db/pipelines/`)	Take main's version (`git checkout --theirs <file>`). Pipeline SQL is regenerated.
Import lists (TypeScript)	Use main's formatting style (4-space indent, alphabetical order). Add HEAD's unique additions in sorted position.
SQL test fixtures (pgTAP)	Keep BOTH sides. Watch for `product_id` collisions — renumber if needed.
`copilot-instructions.md` counts/tables	Take the higher/more complete version. Verify counts match reality.

13.2 Branch Update & Multi-PR Workflow

gh pr update-branch: Always use merge-based (no --rebase flag — it returns "rebase not prepared" error).
After gh pr update-branch: Must git pull origin <branch> --no-edit before any local work on that branch.
Auto-merge cascade: When merging multiple PRs, each merge makes remaining PRs behind main. Pattern: merge → gh pr update-branch on next → wait for CI → repeat.
Windows CRLF blocking checkout: If git checkout <branch> fails due to CRLF auto-changes, run git checkout -- . first to discard the line-ending diffs.
PowerShell conflict resolution: Never use multi-line regex to resolve git conflict markers in PowerShell — it silently fails. Use git checkout --theirs <file> or git checkout --ours <file> instead.

14. Scoring Quick Reference

penalty_sum (9 factors, weights sum to 1.00) =
  sat_fat(0.17) + sugars(0.17) + salt(0.17) + calories(0.10) +
  trans_fat(0.11) + additives(0.07) + prep_method(0.08) +
  controversies(0.08) + ingredient_concern(0.05)

nutrient_density_raw = protein_bonus + fibre_bonus   -- 0–100, tiered
unhealthiness_score  = GREATEST(1, LEAST(100, round(penalty_sum - nutrient_density_raw * 0.08)))

Ceilings (per 100g): sat fat 10g, sugars 27g, salt 3g, trans fat 2g, calories 600 kcal, additives 10, ingredient concern 100.

Nutrient density bonus (v3.3): protein tiers (0/15/30/40/50 at 5/10/15/20g) + fibre tiers (0/10/20/35/50 at 1/3/5/8g). Weight −0.08 → max 8 pt reduction.

Consumer display (TryVit Score): TryVit Score = 100 − unhealthiness_score (higher = healthier). This is a presentation-layer inversion only — the database, formula, and regression anchors (§8.19) all use unhealthiness values.

Band	Unhealthiness	TryVit Score	Consumer Label	Meaning
Green	1–20	80–100	Excellent	Low risk
Yellow	21–40	60–79	Good	Moderate risk
Orange	41–60	40–59	Moderate	Elevated risk
Red	61–80	20–39	Poor	High risk
Dark red	81–100	1–19	Bad	Very high risk

Full documentation: docs/SCORING_METHODOLOGY.md

15. Feature Implementation Standard (Mandatory for All Major Features)

Gold standard: Issue #184 — Automated Data Integrity Audits (Nightly)

Every significant issue must follow this structure. No exceptions. A "significant issue" is any change that introduces new tables, modifies API contracts, adds scoring dimensions, expands to new countries/languages, touches more than ~5 files, or introduces new CI/infrastructure workflows.

Issue #184 was selected as the exemplar across all 89+ issues in this repo because it:

Covers every required section with specific, actionable content (not boilerplate)

Spans multiple layers (SQL + Python + CI + docs) demonstrating the template works for cross-cutting work

Was implemented and closed — proving the template is practical, not aspirational

Represents a common issue shape (infra + data quality) that is easily adapted to features, bugs, or governance work

When creating any new issue, start from the template in §15.18 and adapt it to your scope.

15.1 Problem Statement

Every feature issue must open with:

What user problem does this solve? (concrete scenario, not abstract)
What current limitation exists? (link to specific code/schema gaps)
What measurable improvement does this introduce? (quantifiable: new rows, faster queries, fewer clicks, new coverage %)

Example (from #184): "No automated system to detect data quality issues — contradictions, impossible values, and orphans accumulate silently and erode trust in health recommendations."

15.2 Architectural Evaluation

Evaluate at least 2–3 approaches in a comparison table:

Approach	Verdict	Reason
A. ...	❌ Rejected	...
B. ...	❌ Rejected	...
C. ...	✅ Chosen	...

Rules:

Explicitly reject inferior approaches with rationale.
Reference prior art — how do Yuka, Open Food Facts, MyFitnessPal, or similar platforms solve this?
Consider scale — will this work at 10 countries, 10K products, 100K users?
Document assumptions that informed the choice.
Preference order: extend existing patterns > new pattern > new dependency > new extension.

15.3 Core Principles (Invariants)

Define rules that must never be violated during implementation:

Category	Examples from this project
Data integrity	Never invent nutrition data. `product_name` is the legal label — never modify it.
Backward compatibility	All API changes must be additive. No breaking changes to response shapes.
Idempotency	Every migration safe to run 1× or 100×. Use `IF NOT EXISTS`, `ON CONFLICT DO UPDATE`.
Test coverage	All new SQL functions must have pgTAP tests (§8.20). No exceptions.
Country isolation	All queries scoped by `country`. No cross-contamination between PL and DE data.
No runtime dependencies	No live translation APIs, no external calls at query time.
Append-only migrations	Never modify an existing `supabase/migrations/` file.
_ref table pattern	Extensible domain values use `_ref` tables + FK, not CHECK constraints (unless the domain is truly closed).

Every feature must restate which invariants apply and confirm they are preserved.

15.4 Phased Implementation Plan

Complex features must be broken into sequential phases. Each phase must document:

Field	Required
Phase title & scope	✅
Commit reference(s)	✅ (after shipping)
Migration filename	✅ (e.g., `20260216000800_localization_phase1.sql`)
Rationale	✅ Why this order? What does this phase unlock?
DB changes	✅ Tables, columns, functions, triggers, indexes
API changes	✅ Modified `api_*` functions, new/changed parameters
Frontend changes	✅ Components, hooks, stores, pages affected
Search/index implications	✅ `search_vector` trigger updates, new GIN indexes
Performance implications	✅ New indexes, materialized views, query plan impact
i18n implications	✅ New dictionary keys, translated category/product names

Phase ordering rules:

Foundation/schema first (no data dependency)
Data population second (depends on schema)
API surface third (depends on data)
Frontend/UX last (depends on API)

Each phase must be independently shippable — the system must work correctly after each phase, even if later phases haven't been implemented.

15.5 Database Changes

For every schema change:

Requirement	Details
Migration filename	Follows `YYYYMMDDHHMMSS_description.sql` (Supabase convention)
Idempotent design	`CREATE TABLE IF NOT EXISTS`, `ADD COLUMN IF NOT EXISTS`, `DO UPDATE SET`
Constraints defined explicitly	All domain constraints as CHECK or FK
FK over CHECK when extensible	Use `_ref` table + FK when the domain will grow (e.g., `language_ref` instead of `CHECK (language IN ('en','pl'))`)
Fallback behavior	DEFAULT values documented, NULL semantics explained
Index strategy	Justify every new index. GIN for JSONB/tsvector, B-tree for FK lookups, partial indexes where appropriate
Rollback note	Comment in migration file: "To roll back: DROP TABLE/COLUMN IF EXISTS ..."

SQL code patterns to follow (mandatory):

-- ✅ Correct: _ref table for extensible domains
CREATE TABLE IF NOT EXISTS public.language_ref (
    code text PRIMARY KEY,
    name_en text NOT NULL,
    is_enabled boolean NOT NULL DEFAULT true
);

-- ✅ Correct: FK instead of CHECK for growing domains
ALTER TABLE user_preferences
  ADD COLUMN preferred_language text NOT NULL DEFAULT 'en'
  REFERENCES language_ref(code);

-- ❌ Wrong: CHECK for domains that will grow
ALTER TABLE user_preferences
  ADD CONSTRAINT chk_language CHECK (preferred_language IN ('en','pl','de'));

15.6 API Contract Impact

For each modified api_* function, document:

Field	Required
Function name	✅
New/changed parameters	✅ (with defaults for backward compat)
Response shape changes	✅ (new keys are additive, never remove existing keys)
Backward compatibility confirmation	✅ Explicit "existing callers unaffected" statement
Fallback logic	✅ What happens if new param is NULL/omitted?
Auth requirement	✅ `anon` vs `authenticated` — unchanged?
Test coverage	✅ pgTAP test file + test names
`docs/API_CONTRACTS.md` updated	✅
`docs/FRONTEND_API_MAP.md` updated	✅ (if frontend wiring changes)

Golden rule: New parameters must have defaults. Existing response keys must not be removed or renamed. New keys are always safe to add.

15.7 Search & Indexing Impact

If the feature touches searchable fields:

Check	Details
`search_vector` trigger updated?	Include new columns with appropriate weights (A=names, B=brand, C=category, D=metadata)
Weight assignments justified?	A for primary identifiers, B for secondary, C for categorical, D for supplementary
Extensions required?	`unaccent` (diacritic folding), `pg_trgm` (fuzzy matching) — both already enabled
Synonym table updated?	New food terms → `search_synonyms` entries for cross-language search
Index type appropriate?	GIN for tsvector/JSONB, GiST for trigram similarity
Performance tested?	EXPLAIN ANALYZE on representative queries, especially with new indexes

15.8 UX Impact Metrics

PRs touching UX-visible frontend code must include:

Check	Reference
UX issues have Impact Metric section	`docs/UX_IMPACT_METRICS.md` §2 template
New event types documented in metric catalog	`docs/UX_IMPACT_METRICS.md` §4
Event names follow `{domain}_{object}_{action}`	`docs/UX_IMPACT_METRICS.md` §3
Measurement SQL template provided	`docs/UX_IMPACT_METRICS.md` §5
UI Performance Budget completed (new components)	`docs/UX_IMPACT_METRICS.md` §6

15.9 Fallback Logic Definition

Define explicit fallback chains. Never rely on implicit behavior.

Standard pattern (from Issue #184):

audit_severity_exit_code:
  If any critical findings exist → exit(1) → CI fails → alert triggers
  Else if warnings exist → exit(0) → CI passes → logged for review
  Else → exit(0) → clean run

resolve_language(p_language):
  If p_language is valid and enabled in language_ref → use it
  Else if user has preferred_language set → use that
  Else → 'en' (universal fallback)

Every feature with conditional logic must document its fallback chain in this format:

If A → use X
Else if B → use Y
Else → fallback Z (always safe, always returns a value)

15.10 Test Requirements (Mandatory)

No feature is complete without all of these:

pgTAP Tests (`supabase/tests/*.test.sql`)

Category	Examples
Schema existence	`has_table('language_ref')`, `has_column('products', 'product_name_en')`
Constraints	FK validation, CHECK constraint enforcement
Function behavior	Happy path: correct inputs → expected outputs
Edge cases	NULL inputs, empty strings, invalid codes, disabled flags
Fallback logic	`resolve_language('xx')` → falls back to `'en'`
Negative tests	Invalid inputs rejected, auth failures return error JSONB
Auth branches	Unauthenticated calls to `authenticated`-only functions return `{error}`

Schema Contract Tests (`supabase/tests/schema_contracts.test.sql`)

Add for every new schema object:

has_table('new_table') / has_view('new_view')
has_column('table', 'new_column')
has_function('new_function')

Database QA Checks (`db/qa/QA__*.sql`)

Add checks to appropriate existing suite (don't create new suites unless the domain is genuinely new).
Update check count in header comment and §8.18 table.
Run .\RUN_QA.ps1 — all checks must pass.

Frontend Tests

Check	Tool	Command
TypeScript compiles clean	`tsc`	`cd frontend && npx tsc --noEmit`
Dictionary parity	Vitest	Assert `Object.keys(en) ≡ Object.keys(pl)` for all i18n keys
State management	Vitest	Zustand store tests for new state
RPC contract	Vitest	Mock Supabase, assert request/response shapes
Component rendering	Testing Library	New UI components have co-located `.test.tsx`
E2E flows	Playwright	If new pages/routes, add to `smoke.spec.ts` or `authenticated.spec.ts`

15.11 Performance & Safety

Every feature must pass these checks:

Rule	Verification
No N+1 queries	`EXPLAIN ANALYZE` on new queries; use JOINs/LATERAL/CTEs, not loops
No unbounded loops	Defensive LIMIT caps on all expansion/aggregation logic
No unindexed JSONB lookups	GIN index required for any `jsonb @>` or `->>` filter in WHERE clauses
Defensive caps	Scale guardrails: `QA__scale_guardrails.sql` checks pass
Graceful degradation	Features with `is_enabled` flags must work correctly when disabled
MV refresh considered	If new MV added, include in `refresh_all_materialized_views()` and `mv_staleness_check()`
No expensive triggers	Triggers must be O(1) per row; bulk operations must not cascade pathologically

15.12 Decision Log

Every feature must include an Architectural Decisions Log table:

Decision	Choice	Rationale
Storage pattern for X	`_ref` table + FK	Follows established pattern, INSERT-only expansion
Translation approach	Curated + stored	No runtime API calls, searchable, consistent
...	...	...

Log all non-trivial choices. If you debated between two approaches for more than 30 seconds, it belongs in the decision log.

15.13 Constraints (Restated)

Every feature must explicitly restate:

What must not break — list affected API functions, views, QA suites
What must remain immutable — e.g., product_name is legal label text
What must remain backward compatible — existing callers, existing response shapes
What URLs must remain stable — no slug/route changes without redirect migration

15.14 File Impact Summary

At the end of every feature issue, include:

## File Impact

**N files changed, +X / -Y lines** across all phases:
- N new DB migrations (X lines)
- N new/modified pgTAP test files (X lines)
- N new/modified QA suites (checks: before → after)
- N new/modified frontend files (X lines)
- N i18n dictionary changes (X new keys)

This enables quick impact assessment during code review.

15.15 i18n & Localization Impact

If the feature adds user-visible strings or data:

Check	Action
New UI strings?	Add to `/messages/en.json` (source of truth) + `/messages/pl.json`
New category/domain labels?	Add to `category_translations` or equivalent `_translations` table
New product data columns?	Consider `_en` column + provenance metadata (`_source`, `_reviewed_at`)
New searchable text?	Update `search_vector` trigger + add `search_synonyms` entries
Structured data vs display text?	SQL returns structured flags/keys; frontend localizes via dictionary (never embed English in SQL responses)

15.16 Expansion Checklist (If Applicable)

If the feature scales across countries, languages, or markets, provide a step-by-step addition checklist:

Step	Action	Effort
1	`INSERT INTO country_ref (...)`	1 row
2	`INSERT INTO language_ref` (if new)	1 row
3	`INSERT INTO category_translations`	~20 rows
4	Create `/messages/{xx}.json`	~556 strings
...	...	...

Include estimated effort (hours/days) for each step.

15.17 CI & Deployment Verification

Before a feature is considered complete, verify against all CI gates:

Gate	Command	Expected
Pipeline structure	`python check_pipeline_structure.py`	0 errors
DB QA	`.\RUN_QA.ps1`	All checks pass (currently 759)
Negative tests	`.\RUN_NEGATIVE_TESTS.ps1`	All caught (currently 23)
pgTAP tests	`supabase test db`	All pass
TypeScript	`cd frontend && npx tsc --noEmit`	0 errors
Unit tests	`cd frontend && npx vitest run`	All pass
E2E smoke	`cd frontend && npx playwright test`	All pass
EAN validation	`python validate_eans.py`	0 failures
Enrichment identity	`python check_enrichment_identity.py`	0 violations
SonarCloud	CI pipeline (`main-gate.yml`)	Quality Gate pass

15.18 Enforcement Rule

No feature is considered complete unless ALL of the following are true:

15.19 Issue Template

Use .github/ISSUE_TEMPLATE/feature.md for all significant issues. Every section is required; mark a section N/A — [reason] if genuinely not applicable. Never silently omit a section.

Reference implementation: Issue #184 — Automated Data Integrity Audits (Nightly) — gold standard across all repo issues for structural completeness, actionable depth, and proven implementation.

16. Repo Hygiene (Enforcement)

Policy: docs/REPO_GOVERNANCE.md — full rules and allowed-files list. Automated enforcer: scripts/repo_verify.ps1 — 6 deterministic checks, exit 0/1. CI workflow: .github/workflows/repo-verify.yml — runs on push, PRs, and weekly cron.

When to Run

Trigger	Action
File/directory added, removed, renamed	`pwsh scripts/repo_verify.ps1`
Doc added or removed in `docs/`	Script verifies `docs/INDEX.md` parity
New migration added	Script verifies timestamp ordering
Before any commit	`git status` + `git diff --cached --name-only`

What the Script Checks

Root cleanliness — no tmp-*, qa_*.json, _func_dump.txt, __api_defs.txt, *.log
Docs index coverage — every docs/*.md listed in docs/INDEX.md
ADR naming — files in docs/decisions/ match NNN-*.md
Migration ordering — timestamps in supabase/migrations/ are monotonic
No tracked artifacts — coverage/, test-results/, playwright-report/, node_modules/, __pycache__/, .next/ not in git
No tracked temp files — no tmp-* or qa_*.json committed

Additional Manual Checks (Not Automated)

After structural changes:

copilot-instructions.md §3 project layout updated (if directory structure changed)
CHANGELOG.md updated under [Unreleased] (if user-visible)
CI workflow glob patterns verified (if paths changed)

After API changes:

docs/API_CONTRACTS.md + docs/FRONTEND_API_MAP.md + docs/api-registry.yaml updated
pgTAP contract test added/updated in supabase/tests/
No response keys removed (additive only); new parameters have defaults

PR Discipline

One concern per PR (single responsibility)
Separate structural moves from logic edits; separate data from schema changes
Include verification output (§8.17) and use conventional commit format (§13)
CI must be green before merge — never bypass by deleting tests or lowering thresholds

17. Agent Workflow Command System

Three keyword commands trigger fully-structured workflows. When the user types exactly audit, create issues, or next (case-insensitive), execute the corresponding protocol in full — do not abbreviate, summarize, or skip steps.

Authority chain: CURRENT_STATE.md → copilot-instructions.md §17–§20 → domain docs (§20.4)

17.1 `audit` — Full Project Audit Protocol

Trigger word: audit Purpose: Orient fully. Produce a structured, evidence-backed analysis of the entire project state before any implementation begins. This is the foundation for create issues and next.

17.1.1 Mandatory Reads Before Auditing

Execute in order — no skipping:

CURRENT_STATE.md — last SHA, open PRs, branch, active work
git status && git log --oneline -15 — confirm clean tree, understand recent history
docs/INDEX.md — navigation map for all 50+ docs
docs/REPO_GOVERNANCE.md — structural and governance rules
docs/GOVERNANCE_BLUEPRINT.md — master execution governance plan
docs/DRIFT_DETECTION.md — 8-check automated drift catalog
docs/CHANGELOG.md or CHANGELOG.md — recent releases and unreleased items
docs/ARCHITECTURE.md — system architecture overview
Open GitHub issues (use gh issue list --state open --limit 50)

17.1.2 Audit Execution Steps

Step	Action	Commands
A1	Git health & branch state	`git status`, `git log --oneline -15`, `git branch -a`, `git diff --stat HEAD~1`
A2	Open GitHub issues inventory	`gh issue list --state open --limit 50 --json number,title,labels,milestone,assignees`
A3	Open PRs inventory	`gh pr list --state open --json number,title,headRefName,isDraft,statusCheckRollup`
A4	CI health check	Read `.github/workflows/pr-gate.yml`, `main-gate.yml`, `qa.yml` status from last run
A5	QA suite health	`.\RUN_QA.ps1` (or inspect last CI output) — report total checks, any failures
A6	Database schema drift	`supabase db diff --linked` (if linked) or inspect `supabase/migrations/` count vs `copilot-instructions.md §7`
A7	Test coverage health	Inspect last `main-gate.yml` coverage run or `cd frontend && npx vitest run --coverage --reporter=json 2>/dev/null \| tail -5`
A8	Dependency health	`cd frontend && npm audit --audit-level=high 2>&1 \| tail -10`
A9	SonarCloud quality gate	Check `sonar-project.properties` thresholds; inspect last CI run quality gate result
A10	Documentation drift	`python scripts/check_doc_counts.py && python scripts/check_doc_drift.py`
A11	Repo hygiene	`pwsh scripts/repo_verify.ps1`
A12	Pipeline & structure integrity	`python check_pipeline_structure.py`
A13	Scoring formula drift	`SELECT * FROM governance_drift_check();` via psql or Supabase Studio
A14	EAN coverage	`python validate_eans.py 2>&1 \| tail -3`
A15	MV staleness	`SELECT * FROM mv_staleness_check();` — check if MVs are fresh

17.1.3 Mandatory Audit Output Template

Produce exactly this structure — fill every section with real data:

# Project Audit — TryVit

> **Audit date:** YYYY-MM-DD HH:MM UTC
> **Branch:** <branch-name> | **HEAD:** <sha-7> | **Auditor:** GitHub Copilot

---

## 1. Project Health Metrics

| Metric                    | Current Value   | Target / Baseline       | Status   |
| ------------------------- | --------------- | ----------------------- | -------- |
| Active products (PL+DE)   | ~X,XXX          | ≥2,602                  | ✅/⚠️/❌ |
| QA checks passing         | XXX/759         | 759/759                 | ✅/⚠️/❌ |
| Negative tests passing    | 23/23           | 23/23                   | ✅/⚠️/❌ |
| Migrations committed      | XXX             | ≥227                    | ✅/⚠️/❌ |
| Vitest coverage (lines)   | XX%             | ≥88%                    | ✅/⚠️/❌ |
| SonarCloud quality gate   | PASS/FAIL       | PASS                    | ✅/⚠️/❌ |
| EAN coverage              | XXXX/XXXX (XX%) | ≥99.8%                  | ✅/⚠️/❌ |
| Open PRs                  | X               | ≤2 active               | ✅/⚠️/❌ |
| Open issues               | XX              | tracked (no hard limit) | ✅       |
| npm audit (high+critical) | X vulns         | 0                       | ✅/⚠️/❌ |
| Docs count (docs/)        | XX              | 50                      | ✅/⚠️/❌ |

---

## 2. CI Pipeline Status

| Workflow      | Last Run Status | Triggered By | Notes                    |
| ------------- | --------------- | ------------ | ------------------------ |
| pr-gate.yml   | ✅/❌/⚠️        | PR #XXX      | <any failures>           |
| main-gate.yml | ✅/❌/⚠️        | SHA XXXXXXX  | <coverage, sonar result> |
| qa.yml        | ✅/❌/⚠️        | SHA XXXXXXX  | <QA count, any failures> |
| nightly.yml   | ✅/❌/⚠️        | Scheduled    | <playwright result>      |

---

## 3. Open GitHub Issues — Prioritized Inventory

| #   | Title | Priority             | Milestone | Labels       | Status |
| --- | ----- | -------------------- | --------- | ------------ | ------ |
| XX  | ...   | P0/P1/P2/P3/Deferred | M-X       | feat/fix/... | open   |

**Priority matrix applied:**

- P0 (Blocking): Security vulnerability, data corruption, CI broken
- P1 (Critical): Feature gap blocking user value, QA regression, major bug
- P2 (High): Significant feature, technical debt with user impact
- P3 (Low): Enhancement, docs, tooling, minor improvement
- Deferred: Valid but not current milestone

---

## 4. Gap Analysis

### 4a. Documentation Gaps

| Gap | File/Section | Impact | Priority |
| --- | ------------ | ------ | -------- |

### 4b. Schema / Function Gaps

| Gap | Evidence | Impact | Priority |
| --- | -------- | ------ | -------- |

### 4c. Test Coverage Gaps

| Gap | File/Function | Coverage% | Priority |
| --- | ------------- | --------- | -------- |

### 4d. CI / Infrastructure Gaps

| Gap | Workflow | Impact | Priority |
| --- | -------- | ------ | -------- |

### 4e. Technical Debt

| Item | Location | Severity | Effort | Priority |
| ---- | -------- | -------- | ------ | -------- |

---

## 5. Milestone Progress

| Milestone | Total Issues | Closed | Open | % Complete | Notes |
| --------- | ------------ | ------ | ---- | ---------- | ----- |

---

## 6. Recently Shipped (Last 10 Commits)

| SHA     | Type         | Summary       | Issue |
| ------- | ------------ | ------------- | ----- |
| XXXXXXX | feat/fix/... | <description> | #XXX  |

---

## 7. Drift & Staleness Alerts

| Check                 | Status | Detail                          |
| --------------------- | ------ | ------------------------------- |
| Scoring formula drift | ✅/❌  | <governance_drift_check result> |
| MV staleness          | ✅/❌  | <mv_staleness_check result>     |
| Doc drift             | ✅/❌  | <check_doc_drift.py result>     |
| Repo hygiene          | ✅/❌  | <repo_verify.ps1 result>        |

---

## 8. Recommendations (Ranked by Priority)

### P0 — Fix Immediately (Blocking)

1. **[Issue title]** — [Why P0, what breaks, what file/function, recommended fix]

### P1 — Critical (Next Sprint)

1. **[Issue title]** — [Impact, effort estimate, recommended approach]

### P2 — High Value (Current Milestone)

1. **[Issue title]** — [Impact, effort, file(s) affected]

### P3 — Nice-to-Have (Backlog)

1. **[Issue title]** — [Rational, trade-off]

---

## 9. Next Steps

**Immediate action:** <specific next step — issue number to implement or P0 to fix>

To create GitHub issues from this audit: type `create issues`
To begin implementing the highest-priority item: type `next`

17.2 `create issues` — Audit-to-Issue Conversion Protocol

Trigger word: create issues Pre-condition: A audit output (§17.1.3) must exist in the current session, or the most recent gh issue list output is available. Purpose: Transform every gap, recommendation, and debt item from the audit into fully-structured, §15-compliant GitHub issues — ready to implement with zero ambiguity.

17.2.1 Issue Triage Rules

Before creating issues, apply the following triage:

Condition	Action
Issue already exists with same scope	Do not duplicate — add a comment with new findings
Multiple small items in same domain (< 2h each)	Bundle into one issue with sub-tasks checklist
P0 item without existing issue	Create immediately before P1/P2/P3 items
Item is a documentation update only	Use `docs(scope):` commit type, may skip full issue
Item requires architectural decision	Add `decision-required` label; include ADR template
Uncertainty about scope/approach	Include at minimum 3 approaches in §15.2 table
Item touches >5 files or >3 domains	Mark as "significant" — use full §15 template

17.2.2 Issue Title Convention

Follow copilot-instructions.md §13 type/scope convention:

feat(domain): short imperative description (#issue-ref or close #XXX if known)
fix(domain): what is being corrected
schema(migration): what schema operation
test(qa): what test coverage is being added
docs(domain): what documentation is being created/updated
perf(domain): what performance improvement
security(rls): what security hardening
chore(scope): what housekeeping

Title rules:

≤72 characters
Present tense, imperative mood ("add", "fix", "implement" — not "added", "fixed")
Include scope in parentheses
No trailing period

17.2.3 Full Issue Body Template

Every significant issue (as defined in §15) must use this exact structure:

## Problem Statement

<!-- What user or system problem does this solve? Be concrete — cite specific data, file, or behavior. -->
<!-- What current limitation exists? Link to the code/schema gap. -->
<!-- What measurable improvement does this introduce? (rows, faster queries, test coverage %, error rate) -->

---

## Architectural Evaluation

| Approach | Verdict     | Rationale |
| -------- | ----------- | --------- |
| A. ...   | ❌ Rejected | ...       |
| B. ...   | ❌ Rejected | ...       |
| C. ...   | ✅ Chosen   | ...       |

**Prior art considered:** [Yuka / Open Food Facts / MyFitnessPal / comparable product]

---

## Core Principles (Invariants)

<!-- Which invariants from §15.3 apply? Explicitly confirm each. -->

- [ ] Data integrity — no invented nutrition values
- [ ] Backward compatibility — additive API changes only
- [ ] Idempotency — all migrations safe to run 1× or 100×
- [ ] Test coverage — pgTAP for every new function/table/view
- [ ] Append-only migrations — never modify existing migration files
- [ ] Additional invariants specific to this issue: ...

---

## Phased Implementation Plan

### Phase 1 — [Title] (Foundation)

**Migration:** `YYYYMMDDHHMMSS_description.sql`
**Rationale:** [Why this first? What does it unlock?]
**DB changes:** [Tables, columns, functions, triggers, indexes]
**API changes:** [Modified api_* functions, new/changed params]
**Frontend changes:** [Components, hooks, stores, pages affected]
**Performance:** [New indexes, MVs, query plan impact]
**Tests:** [pgTAP file, QA suite, schema contract changes]

### Phase 2 — [Title] (Surface)

[same sub-structure]

### Phase N — [Title] (Polish / Docs)

[same sub-structure]

---

## Database Changes

**Migration filename:** `YYYYMMDDHHMMSS_description.sql`

```sql
-- Migration: describe goal
-- Rollback: DROP TABLE/COLUMN IF EXISTS ...
-- Idempotency: all DDL guarded with IF NOT EXISTS

<SQL template here — not placeholder>
```

**Constraints defined:** [List all CHECK, FK, UNIQUE constraints]
**Indexes:** [Justify each — type, columns, partial condition]
**RLS policies:** [Which roles can read/write/delete?]

---

## API Contract Impact

| Function  | Change                                | Backward Compatible? | New Params (with defaults) |
| --------- | ------------------------------------- | -------------------- | -------------------------- |
| `api_*()` | Added `p_new_param text DEFAULT NULL` | ✅ Yes               | `p_new_param`: ...         |

**What happens if p_new_param is omitted:** [fallback behavior]

---

## Test Requirements

### pgTAP (supabase/tests/)

- [ ] `has_table('new_table')` in `schema_contracts.test.sql`
- [ ] Happy path: correct inputs → expected outputs
- [ ] Edge cases: NULL inputs, empty strings, invalid codes
- [ ] Auth branch: unauthenticated call returns `{error}` JSONB
- [ ] Fallback: resolve_language('xx') → 'en'

### DB QA (db/qa/)

- [ ] Suite: `QA__[domain].sql` — add N checks (total: before → after)
- [ ] Checks added: [list them]

### Frontend (frontend/src/)

- [ ] `cd frontend && npx tsc --noEmit` — 0 errors
- [ ] Vitest unit tests: [describe what is tested]
- [ ] Playwright E2E: [smoke or authenticated spec, what flow]

---

## Fallback Logic

```
If A → use X
Else if B → use Y
Else → fallback Z (always safe, always returns a value)
```

---

## Performance & Safety

- [ ] No N+1 queries (EXPLAIN ANALYZE provided)
- [ ] No unbounded loops (LIMIT caps applied)
- [ ] No unindexed JSONB lookups (GIN index added)
- [ ] Scale guardrails pass (`QA__scale_guardrails.sql`)
- [ ] MV refresh considered (added to `refresh_all_materialized_views()` if applicable)

---

## Architectural Decisions Log

| Decision        | Choice            | Rationale |
| --------------- | ----------------- | --------- |
| Storage pattern | `_ref` table + FK | [Reason]  |
| ...             | ...               | ...       |

---

## File Impact Summary

**Estimated: N files, +X / -Y lines**

- X new DB migrations (Y lines)
- X new/modified pgTAP test files (Y lines)
- X new/modified QA suites (checks: N → M)
- X new/modified frontend files (Y lines)

---

## Verification Checklist (Definition of Done)

- [ ] `python check_pipeline_structure.py` — 0 errors
- [ ] `.\RUN_QA.ps1` — all 776 checks pass
- [ ] `.\RUN_NEGATIVE_TESTS.ps1` — 20/20 caught
- [ ] `supabase test db` — all pgTAP pass
- [ ] `cd frontend && npx tsc --noEmit` — 0 errors
- [ ] `cd frontend && npx vitest run` — all pass, coverage ≥ baseline
- [ ] `cd frontend && npx playwright test --project=smoke` — all pass
- [ ] `python validate_eans.py` — 0 failures
- [ ] `copilot-instructions.md §4` updated (if DB objects changed)
- [ ] `docs/API_CONTRACTS.md` updated (if API changed)
- [ ] `docs/INDEX.md` updated (if new docs created)
- [ ] `CHANGELOG.md` updated under `[Unreleased]`

17.2.4 Batch Creation Rules

When creating multiple issues from an audit:

Create P0 issues first, tag priority:critical
Group related issues into the same milestone where possible
For cascading dependencies: use Depends on #XXX in the body
Label every issue with: type + domain + priority + milestone
After creating all issues, output a summary table:

## Issues Created

| #    | Title             | Priority | Milestone | Labels         |
| ---- | ----------------- | -------- | --------- | -------------- |
| #XXX | feat(domain): ... | P1       | M-X       | feat, data, P1 |

Conclude with: "Type next to begin implementing the highest-priority issue."

17.3 `next` — Canonical Execution Protocol v2

Trigger word: next Purpose: Select the highest-priority open issue and implement it completely using the Canonical Execution Discipline Protocol v2 (§19). Full audit readiness is assumed (or execute §17.1 Steps A1–A3 as a fast-track check before proceeding).

17.3.1 Issue Selection Algorithm

Priority queue (highest first):
  1. P0 — Any blocking issue (CI broken, data corruption, security CVE)
  2. P1 — Critical issues in the current milestone
  3. P1 — Critical issues without milestone (unblocked)
  4. P2 — High-value issues in the current milestone
  5. P2 — High-value issues without milestone
  6. P3 — Low-priority backlog items
  7. Deferred — Only if all higher priorities are clear

Tie-breaking (within same priority):
  - Smallest estimated effort that still delivers user value
  - Has the most dependencies waiting on it (unblocks the most items)
  - Oldest creation date

Skip if:
  - Issue is assigned to another person and in-progress
  - Issue has `blocked` or `waiting-on-design` label
  - Issue requires external API/service access not yet configured

17.3.2 Pre-Implementation Announcement

Before writing any code, output exactly this block:

## Implementing Issue #XXX — [Issue Title]

**Priority:** P0/P1/P2/P3
**Domain(s):** [database/api/frontend/security/etc]
**Estimated effort:** [S=<2h / M=2-8h / L=8-24h / XL=>24h]

### Docs loading (per §20.4)

- ✅ CURRENT_STATE.md — read
- ✅ [Domain doc 1] — read
- ✅ [Domain doc 2] — read
- [any gaps noted]

### Test plan

1. [Test case 1 — what, where, which level]
2. [Test case 2]
   ...

### Implementation order

1. [Step 1 — e.g., write migration]
2. [Step 2 — e.g., add pgTAP tests]
3. [Step 3 — e.g., update API function]
4. [Step 4 — e.g., update frontend hook]
5. [Step 5 — docs update, CHANGELOG]

Then execute §19 (Canonical Execution Discipline Protocol v2) in full.

17.4 Priority Definitions

Label	Meaning	Response Time	Examples
P0	Blocking — system broken or data corrupted	Immediate	CI red, security CVE, data integrity failure, scoring broken
P1	Critical — core user value blocked or significant regression	Current sprint	Missing QA suite, broken auth flow, API contract violation
P2	High value — significant improvement, no emergency	Next sprint or milestone	New feature, performance improvement, major coverage gap
P3	Enhancement — nice-to-have	Backlog	Minor UX polish, doc updates, tooling improvements
Deferred	Valid but not current scope	Future milestone	Post-MVP features, experimental ideas

18. Documentation Reference Catalog

Authority: docs/INDEX.md is the canonical navigation index. The table below is a quick-load reference organized by domain for use with §20.4. When in doubt about a doc's purpose, read docs/INDEX.md first.

18.1 Architecture & System Design

Document	Purpose	Load When
`docs/ARCHITECTURE.md`	Full system architecture — data flow, schema topology, scoring pipeline, API layer, security perimeter	Any schema, API, or infra work
`docs/DOMAIN_BOUNDARIES.md`	Domain ownership map — who owns what, cross-domain coupling rules	Adding new domains, touching multiple services
`docs/ENVIRONMENT_STRATEGY.md`	Local / staging / production environment strategy, secret management	Env config, deployment, secrets changes
`docs/STAGING_SETUP.md`	Staging environment setup guide	Setting up staging, CI config
`docs/DEPLOYMENT.md`	Deployment procedures, rollback playbook, emergency checklist	Any production deployment
`docs/DISASTER_DRILL_REPORT.md`	DR drill findings and follow-up actions	Incident prep, DR automation
`docs/PRODUCTION_DATA.md`	Production data management rules — no PII, retention policies	Any migration touching prod
`docs/HEALTH_GOAL_PERSONALIZATION.md`	Health-goal personalization design — goal taxonomy, personalization model, MVP scope, privacy, copy safety	Health profile, personalization, onboarding goals

18.2 API & Frontend

Document	Purpose	Load When
`docs/API_CONTRACTS.md`	Full API surface contracts — response shapes, hidden columns, auth requirements, error envelopes	Any API function change
`docs/API_CONVENTIONS.md`	RPC naming convention, breaking change definition, security standards	Adding/renaming API functions
`docs/API_VERSIONING.md`	API deprecation policy, version lifecycle	Deprecating or versioning endpoints
`docs/FRONTEND_API_MAP.md`	Frontend component ↔ API function mapping	Frontend work touching API calls
`docs/api-registry.yaml`	Structured registry of all 191 functions (YAML)	Auditing API coverage, documentation
`docs/CONTRACT_TESTING.md`	API contract testing strategy and pgTAP patterns	Adding pgTAP contract tests
`docs/UX_UI_DESIGN.md`	UI/UX guidelines, component standards, accessibility	Frontend component development
`docs/UX_IMPACT_METRICS.md`	UX measurement standard, metric catalog, SQL event templates	Any UX-visible frontend change
`docs/BRAND_GUIDELINES.md`	TryVit brand standards — color palette, typography, voice, usage rules	Brand-related UI work
`docs/SEARCH_ARCHITECTURE.md`	pg_trgm, tsvector, ranking algorithm, synonym table	Search functionality changes

18.3 Data Governance & Quality

Document	Purpose	Load When
`docs/DATA_SOURCES.md`	Source hierarchy, OFF API reliability tiers, validation workflow	Pipeline changes, sourcing decisions
`docs/DATA_PROVENANCE.md`	Data freshness governance, update cycles, source provenance tracking	Adding provenance columns, staleness logic
`docs/DATA_INTEGRITY_AUDITS.md`	Nightly audit framework, check catalog, alert thresholds	Adding data quality checks
`docs/EAN_VALIDATION_STATUS.md`	1,024/1,026 (99.8%) EAN coverage, known gaps, validation rules	EAN changes, barcode work
`docs/RESEARCH_WORKFLOW.md`	Data collection lifecycle — manual curation + automated OFF pipeline	Adding new products, categories, countries

18.4 Security & Privacy

Document	Purpose	Load When
`docs/SECURITY_AUDIT.md`	Full security audit — RLS gaps, SSRF vectors, injection risks, trust scoring	Any table, function, or RLS change
`docs/PRIVACY_CHECKLIST.md`	GDPR/RODO compliance checklist — data lifecycle, consent, deletion	User data, PII, consent flows
`docs/ACCESS_AUDIT.md`	Data access pattern audit — who reads what, quarterly review cadence	Access control design
`docs/RATE_LIMITING.md`	Rate limiting strategy, per-endpoint config, abuse prevention	Adding rate limits, API abuse

18.5 Performance & Observability

Document	Purpose	Load When
`docs/PERFORMANCE_GUARDRAILS.md`	Query budgets, index requirements, MV refresh policies, scale projections	Any query, index, or MV change
`docs/PERFORMANCE_REPORT.md`	Baseline performance audit, slow query catalog, query patterns	Performance investigation
`docs/SLO.md`	Service Level Objectives — availability 99.5%, latency p95 <400ms, error rate <0.1%	Reliability engineering, alert design
`docs/MONITORING.md`	Runtime monitoring strategy, Supabase metrics, alert channels	Adding monitoring, alerts
`docs/OBSERVABILITY.md`	Full observability strategy — logs, metrics, traces	Instrumentation work
`docs/METRICS.md`	Application, infrastructure, and business metrics catalog	Adding metrics, dashboards
`docs/LOG_SCHEMA.md`	Structured log format, error taxonomy, field definitions	Logging changes, error handling
`docs/ALERT_POLICY.md`	Alert escalation rules, SLA targets, on-call routing	Alert and notification work
`docs/ON_CALL_POLICY.md`	On-call schedule, ack targets, triage labels, escalation chain	Incident response, on-call setup

18.6 CI/CD & Governance

Document	Purpose	Load When
`docs/REPO_GOVERNANCE.md`	Root structure rules, allowed-files list, PR checklist, hygiene enforcement	Any structural change
`docs/GOVERNANCE_BLUEPRINT.md`	Master execution governance — development lifecycle, decision authority	Strategic planning, major features
`docs/DOCUMENTATION_GOVERNANCE.md`	Doc lifecycle — creation, ownership, review cadence, deprecation	Creating or retiring docs
`docs/DRIFT_DETECTION.md`	8-check automated drift catalog — scoring, search, naming, flags	Any scoring/search formula change
`docs/MIGRATION_CONVENTIONS.md`	Migration safety rules, trigger naming, lock risk, idempotency patterns	Any DB migration work
`docs/BACKFILL_STANDARD.md`	Backfill orchestration standard, batch templates, rollback patterns	Any bulk data operation
`docs/FEATURE_FLAGS.md`	Feature flag registry, activation criteria, toggle patterns	Feature flag work
`docs/FEATURE_SUNSETTING.md`	Feature retirement criteria, cleanup protocol	Deprecating features
`docs/LABELS.md`	GitHub label taxonomy — type, domain, priority, status	Issue and PR labeling

18.7 Scoring & Nutrition Science

Document	Purpose	Load When
`docs/SCORING_METHODOLOGY.md`	v3.3 algorithm — 9 penalty factors + nutrient density bonus, weights, ceilings, bands, scientific rationale	Any scoring change
`docs/SCORING_ENGINE.md`	Scoring engine architecture — versioning, drift detection, formula registry	Scoring formula work, version management
`copilot-instructions.md §14`	Scoring quick reference — formula, ceilings, band table	Quick coding reference
`copilot-instructions.md §8.19`	Regression anchors — 16 products with expected scores ±2	Regression testing after scoring changes

18.8 Country Expansion & Localization

Document	Purpose	Load When
`docs/COUNTRY_EXPANSION_GUIDE.md`	Multi-country protocol — adding countries, activating DE, language matrix	Country/language expansion
`copilot-instructions.md §5`	Active categories — 22 PL + 21 DE, pipeline folder mappings	Category additions, DE expansion
`copilot-instructions.md §15.15`	i18n impact checklist — dictionary keys, translations, search synonyms	Any translated string change

18.9 Architecture Decision Records

Decision	Status	Summary
`docs/decisions/001-postgresql-only-stack.md`	Accepted	No ORM, no Redis cluster, PostgreSQL as sole data store
`docs/decisions/002-weighted-scoring-formula.md`	Accepted	9-factor weighted model, science-backed, EFSA-aligned
`docs/decisions/003-country-scoped-isolation.md`	Accepted	All queries country-filtered, no cross-contamination
`docs/decisions/004-pipeline-generates-sql.md`	Accepted	Python pipeline generates idempotent SQL, no runtime inserts
`docs/decisions/005-api-function-name-versioning.md`	Accepted	Additive versioning via suffix (`_v2`), never rename
`docs/decisions/006-append-only-migrations.md`	Accepted	Never modify committed migrations; forward-only schema evolution
`docs/decisions/007-english-canonical-ingredients.md`	Accepted	`name_en` is canonical; translations stored in `ingredient_translations`
`docs/decisions/008-nutrient-density-bonus.md`	Accepted	Nutrient density bonus (protein + fibre) as subtracted 10th factor in v3.3
`docs/decisions/009-scoring-band-calibration.md`	Accepted	Scoring band calibration — catalog limited, formula correct, no changes needed
`docs/decisions/010-ingredient-language-model.md`	Accepted	English canonical + `ingredient_translations` fallback; no schema changes needed

19. Canonical Execution Discipline Protocol v2

This is the mandatory implementation protocol. Execute it for every non-trivial change. "Non-trivial" = any change that modifies DB schema, API contracts, scoring logic, frontend state, test coverage, or CI workflows. One-line typo fixes may skip to §19.3.

Reference gold standard: Issue #184 — Automated Data Integrity Audits Every section below has an implementation example in that issue.

Phase 1 — Pre-Implementation (must complete before writing any code)

19.1 Context Recovery

Execute in order:

# 1. Confirm branch state — never commit to main
git status
git log --oneline -10
git branch --show-current  # must NOT be 'main'

# 2. Load volatile status
cat CURRENT_STATE.md  # last SHA, open PRs, in-flight work

# 3. Identify the issue being implemented
gh issue view XXX  # read acceptance criteria, notes, constraints

# 4. Repo hygiene baseline
pwsh scripts/repo_verify.ps1  # must exit 0 before starting

# 5. Coverage baseline (for frontend work)
cd frontend && npx vitest run --coverage --reporter=json 2>/dev/null | tail -3

19.2 Impact Analysis (10-minute upfront investment that saves hours)

Answer each question explicitly before coding:

Question	Answer
What DB tables are affected?	<list or "none">
What API functions are affected?	<list or "none">
What frontend components are affected?	<list or "none">
Does this require a migration?	Yes/No — if yes, what's the filename?
Does this touch RLS policies?	Yes/No — if yes, read `docs/SECURITY_AUDIT.md` §X
Does this touch scoring logic?	Yes/No — if yes, run regression anchors §8.19 after
Does this change any API response shape?	Yes/No — if yes, confirm additive-only
Does this add user-visible strings?	Yes/No — if yes, update both `en.json` and `pl.json`
What QA suites are affected?	<list or "none">
What pgTAP tests are affected?	<list or "none">

Load domain docs from §20.4 for every "Yes" answer.

19.3 Test Plan (written before any code)

Write this explicitly. Do not skip.

### Test Plan — Issue #XXX

| #   | Test Case     | Level                       | File   | What It Verifies    |
| --- | ------------- | --------------------------- | ------ | ------------------- |
| 1   | [description] | unit/component/e2e/pgTAP/QA | [path] | [specific behavior] |
| 2   | ...           | ...                         | ...    | ...                 |

**Edge cases to cover:**

- NULL inputs to new functions
- Empty arrays, zero counts
- Unicode/diacritics (Polish characters: ą, ę, ó, ł, ź, ż, ń, ć, ś)
- Unauthenticated calls to authenticated-only functions
- Cross-country data contamination (PL ≠ DE)

**Regression tests to preserve:**

- QA suite N: [suite name] — [how many checks affected]
- Scoring anchor: [product name] ≈ [score] (from §8.19)

19.4 Implementation Plan (sequential, one concern per step)

Map out every file that will change:

### Implementation Plan — Issue #XXX

**Estimated effort:** S / M / L / XL

| Step | File                                         | Change Type                | Dependencies |
| ---- | -------------------------------------------- | -------------------------- | ------------ |
| 1    | `supabase/migrations/YYYYMMDDHHMMSS_*.sql`   | New migration              | None         |
| 2    | `supabase/tests/schema_contracts.test.sql`   | Add has_table/has_function | Step 1       |
| 3    | `supabase/tests/[domain]_functions.test.sql` | Add pgTAP function tests   | Step 1       |
| 4    | `db/qa/QA__[domain].sql`                     | Add QA checks              | Step 1       |
| 5    | `frontend/src/lib/api.ts`                    | New/modified RPC wrapper   | Step 1       |
| 6    | `frontend/src/hooks/use-[domain].ts`         | New TanStack Query hook    | Step 5       |
| 7    | `frontend/src/components/[domain]/*.tsx`     | UI component               | Step 6       |
| 8    | `frontend/src/app/app/[route]/page.tsx`      | Page integration           | Step 7       |
| 9    | `docs/API_CONTRACTS.md`                      | Document new endpoint      | Step 5       |
| 10   | `copilot-instructions.md §4`                 | Update function/table list | Step 1       |
| 11   | `CHANGELOG.md`                               | Add [Unreleased] entry     | Final        |

Phase 2 — Implementation (discipline during coding)

19.5 Implementation Rules (non-negotiable)

Rule 1 — One concern per commit. Each commit addresses exactly one logical change:

schema(migration): add ingredient_translations table and FK

Not: schema(migration): add table + api function + frontend component + docs

Rule 2 — Tests alongside code, not after. For every DB function written, write the pgTAP test in the same commit. For every React component, write the .test.tsx in the same commit. Never leave tests as // TODO.

Rule 3 — Verify before proceeding. After each step in the implementation plan, run the relevant validation:

# After DB migration:
supabase db reset --local  # confirm migration applies cleanly
.\RUN_QA.ps1               # confirm QA passes

# After TypeScript changes:
cd frontend && npx tsc --noEmit  # 0 errors before moving on

# After React component changes:
cd frontend && npx vitest run --reporter=verbose  # confirm tests pass

# After pgTAP test addition:
supabase test db  # confirm tests pass

Rule 4 — Track progress. After each step, mark it complete in the implementation plan. Do not batch multiple steps into a single commit unless completely trivial (e.g., updating two doc counts).

Rule 5 — Never weaken gates. If a test fails, fix the code — not the test. If coverage drops, add tests — do not exclude files. See §8.16.

19.6 SQL Code Standards (enforce rigorously)

-- ✅ Required patterns for all new functions
CREATE OR REPLACE FUNCTION public.api_example_function(
  p_param1  text,
  p_param2  integer DEFAULT NULL  -- always default new params for backward compat
)
RETURNS jsonb
LANGUAGE sql
STABLE              -- or VOLATILE if writes; never claim IMMUTABLE unless truly pure
SECURITY DEFINER    -- for authenticated functions; SECURITY INVOKER for anon-safe
SET search_path = public
AS $$
  -- Always fully-qualify table names in SECURITY DEFINER functions
  SELECT jsonb_build_object(
    'api_version', 'v1',  -- always include api_version in response
    'data', (
      SELECT jsonb_agg(row_to_json(t))
      FROM (...) AS t
    )
  );
$$;

-- Comment block: purpose, auth requirement, params, fallback chain
COMMENT ON FUNCTION public.api_example_function IS
  'Purpose: ...
   Auth: authenticated only (RLS enforced)
   Params: p_param1 (required), p_param2 (optional, defaults to NULL → behavior)
   Returns: JSONB {api_version, data: [...]}
   Fallback: if no data found → returns {api_version, data: []}';

19.7 TypeScript/React Code Standards (enforce rigorously)

// ✅ RPC wrapper pattern (frontend/src/lib/rpc.ts style)
export async function exampleFeature(params: ExampleParams): Promise<ExampleResult> {
  const { data, error } = await supabase.rpc("api_example_function", {
    p_param1: params.param1,
    p_param2: params.param2 ?? null,
  });

  if (error) throw new RpcError("api_example_function", error);
  return data as ExampleResult;
}

// ✅ TanStack Query hook pattern (frontend/src/hooks/)
export function useExampleFeature(param1: string) {
  return useQuery({
    queryKey: queryKeys.example(param1),
    queryFn: () => exampleFeature({ param1 }),
    staleTime: 5 * 60 * 1000, // 5 minutes
    retry: (count, err) => !(err instanceof RpcError && err.isAuth),
  });
}

// ✅ Component error boundary pattern
// Wrap all data-dependent components in <Suspense> + error boundary
// Never use unguarded `data!` non-null assertions

Phase 3 — Post-Implementation (verification discipline)

19.8 Full Verification Protocol

Execute every command. Record output. Do not skip.

# ── Database layer ───────────────────────────────────────────────────
supabase test db                               # All pgTAP tests pass
.\RUN_QA.ps1                                   # All 776+ QA checks pass
.\RUN_NEGATIVE_TESTS.ps1                       # All 20 negative tests caught
python check_pipeline_structure.py            # 0 errors
python validate_eans.py                       # 0 EAN failures
python check_enrichment_identity.py           # 0 violations

# ── Frontend layer ────────────────────────────────────────────────────
cd frontend
npx tsc --noEmit                              # 0 TypeScript errors
npx vitest run                               # All tests pass
npx vitest run --coverage                    # Coverage ≥ baseline (§8.5)
npx playwright test --project=smoke          # E2E smoke passes

# ── Governance ────────────────────────────────────────────────────────
pwsh scripts/repo_verify.ps1                 # 6 checks pass
python scripts/check_doc_counts.py           # Doc count consistent
python scripts/check_doc_drift.py            # No stale docs

# ── Scoring regression (if scoring touched) ───────────────────────────
# Run QA scoring suite — all 31 checks + §8.19 anchor products must pass

19.9 Documentation Update Requirements

After implementation, update ALL of these that apply (per §18.1):

Artifact	Update Required When	What to Update
`copilot-instructions.md §4`	Table, view, function, or Edge Function added/changed	Add row to appropriate table
`copilot-instructions.md §3`	Directory structure changed	Update project layout tree
`copilot-instructions.md §5`	New category added	Add row to category table
`copilot-instructions.md §8.18`	New QA suite added	Add row + update check count
`docs/API_CONTRACTS.md`	Any `api_*` function changed	Update contract definition
`docs/FRONTEND_API_MAP.md`	Frontend-API wiring changed	Update mapping row
`docs/api-registry.yaml`	Any function added/removed	Add/update YAML entry
`docs/SCORING_METHODOLOGY.md`	Scoring formula changed	Update formula, weights, bands
`docs/ARCHITECTURE.md`	Major structural change	Update architecture diagram/prose
`.env.example`	New env variable needed	Add entry with description + source
`docs/INDEX.md`	New `docs/*.md` file created	Add entry, increment count
`CHANGELOG.md`	Any user-visible change	Add entry under `[Unreleased]`

19.10 Verification Report (output in every PR description)

## Verification

### Commands Run

```powershell
supabase test db                  → XX/XX pgTAP tests pass
.\RUN_QA.ps1                      → 776/776 checks pass (0 failures)
.\RUN_NEGATIVE_TESTS.ps1          → 23/23 caught
npx tsc --noEmit                  → 0 errors
npx vitest run                    → XXX/XXX tests pass
npx vitest run --coverage         → Lines: XX% (baseline: 88%, delta: +X.X%)
npx playwright test --project=smoke → XX/XX pass
pwsh scripts/repo_verify.ps1      → 6/6 checks pass
```

New/Updated Tests

supabase/tests/[file].test.sql — N new pgTAP tests
db/qa/QA__[suite].sql — N checks added (total: XX → YY)
frontend/src/[file].test.ts — N new Vitest tests

Documentation Updated

copilot-instructions.md §4 — [what was updated]
docs/API_CONTRACTS.md — [what was updated]
CHANGELOG.md — added entry under [Unreleased]


---

### 19.11 Decision Framework — 10 Ambiguity Scenarios

When encountering ambiguity, apply the default action without pausing to ask unless the situation is genuinely novel:

| Scenario | Default Action |
| -------- | -------------- |
| **New domain value** (e.g., new prep method) | Use `_ref` table + FK (never a CHECK on the column itself) |
| **New API parameter** | Add with `DEFAULT NULL`; document fallback behavior; no breaking change |
| **Unclear test scope** | Test more rather than less — add edge cases, NULL paths, auth branches |
| **Migration or function?** | If it's a schema definition → migration. If it's business logic → function. Never mix. |
| **New env variable needed** | Add to `.env.example` with description; document in `CURRENT_STATE.md` |
| **Type or runtime assertion?** | TypeScript type check (compile-time) + runtime Zod/pgTAP (runtime) — do both |
| **Extend existing QA suite or new suite?** | Extend existing unless genuinely new domain. Never create a suite < 5 checks. |
| **Slug change needed** | Add redirect before removing old slug. Update `docs/API_CONTRACTS.md`. |
| **Coverage gap discovered** | Add characterization tests immediately, before touching the code |
| **Conflicting doc sources** | `copilot-instructions.md` is authoritative. Flag the conflict in the PR description. |

### 19.12 Anti-Patterns — 10 Forbidden Approaches

| ❌ Forbidden | ✅ Correct Approach |
| ------------ | ------------------- |
| Modifying an existing `supabase/migrations/` file | Write a new migration with next timestamp |
| Inventing nutrition data or Nutri-Score values | Fetch from OFF API or mark `nutri_score_source = 'unknown'` |
| Inlining the scoring formula | Always call `compute_unhealthiness_v33()` |
| Removing an API response key | Add a deprecation notice; keep the key returning `null` for 2 versions |
| Using `DELETE` in pipeline SQL | Set `is_deprecated = true` with deprecation reason |
| Writing tests after submitting PR | Tests are part of the same commit as the feature code |
| Lowering coverage/quality thresholds to fix failures | Fix the code or add the missing tests |
| Skipping `IF NOT EXISTS` in migrations | All DDL must be pre-guarded — no exceptions |
| Using `SELECT *` in production functions | Always enumerate columns explicitly |
| Running pipeline against remote without confirmation | Always use `--dry-run` first; remote requires explicit user opt-in |

---

## 20. Context Recovery Protocol

> **Every session begins here.** No exceptions. Do not write a single line of code
> until steps 20.1 and 20.2 are complete.

### 20.1 Mandatory Session Bootstrap (execute in order, every time)

```powershell
# Step 1 — Read volatile project state (30 seconds)
cat CURRENT_STATE.md

# Step 2 — Confirm git state
git status && git log --oneline -5 && git branch --show-current

# Step 3 — Confirm no untracked important files
git diff --stat HEAD

# Step 4 — Quick health check
pwsh scripts/repo_verify.ps1

20.2 Invariant Documents (load every session, regardless of domain)

Priority	Document	Purpose	Where
1	`CURRENT_STATE.md`	Live status — SHA, open PRs, in-flight work	repo root
2	`copilot-instructions.md §1–§4`	Role, architecture, project layout, schema	repo root
3	`copilot-instructions.md §8`	Testing requirements — NON-NEGOTIABLE	repo root
4	`copilot-instructions.md §13`	Git workflow, branch naming, conventional commits	repo root
5	`docs/INDEX.md`	Navigation — find the right doc fast	docs/

20.3 Domain Docs (load when that domain is touched)

→ See full matrix in §20.4 below.

20.4 Domain Reading Matrix

Domain	Load First	Also Relevant
Database schema	`docs/ARCHITECTURE.md`, `docs/MIGRATION_CONVENTIONS.md`	`copilot-instructions.md §4`, `docs/DATA_PROVENANCE.md`
Migrations	`docs/MIGRATION_CONVENTIONS.md`, `docs/BACKFILL_STANDARD.md`	`copilot-instructions.md §7`
Scoring formula	`docs/SCORING_METHODOLOGY.md`, `docs/SCORING_ENGINE.md`	`copilot-instructions.md §14`, `docs/DRIFT_DETECTION.md`
API / RPC functions	`docs/API_CONTRACTS.md`, `docs/API_CONVENTIONS.md`	`docs/API_VERSIONING.md`, `docs/FRONTEND_API_MAP.md`, `docs/api-registry.yaml`
Search / indexing	`docs/SEARCH_ARCHITECTURE.md`	`docs/PERFORMANCE_GUARDRAILS.md`
Frontend components	`docs/UX_UI_DESIGN.md`, `docs/BRAND_GUIDELINES.md`	`docs/UX_IMPACT_METRICS.md`
Security / RLS	`docs/SECURITY_AUDIT.md`, `docs/PRIVACY_CHECKLIST.md`	`docs/ACCESS_AUDIT.md`, `docs/RATE_LIMITING.md`
Performance	`docs/PERFORMANCE_GUARDRAILS.md`, `docs/PERFORMANCE_REPORT.md`	`docs/SEARCH_ARCHITECTURE.md`, `docs/SLO.md`
Multi-market expansion	`docs/COUNTRY_EXPANSION_GUIDE.md`	`copilot-instructions.md §5`, `docs/DATA_SOURCES.md`
Observability / monitoring	`docs/MONITORING.md`, `docs/LOG_SCHEMA.md`	`docs/OBSERVABILITY.md`, `docs/ALERT_POLICY.md`, `docs/METRICS.md`
CI / CD	`copilot-instructions.md §13`, `copilot-instructions.md §8.10`	`docs/ENVIRONMENT_STRATEGY.md`, `docs/DEPLOYMENT.md`
Feature flags	`docs/FEATURE_FLAGS.md`	`docs/FEATURE_SUNSETTING.md`, `docs/GOVERNANCE_BLUEPRINT.md`
Pipeline / ETL	`docs/DATA_SOURCES.md`, `docs/DATA_PROVENANCE.md`	`copilot-instructions.md §6`
EAN / barcode	`docs/EAN_VALIDATION_STATUS.md`	`copilot-instructions.md §4 Tables (product_submissions)`
i18n / localization	`copilot-instructions.md §15.15`, `docs/COUNTRY_EXPANSION_GUIDE.md`	`docs/DATA_SOURCES.md`
Brand / assets	`docs/BRAND_GUIDELINES.md`	`docs/assets/design-tokens.json`, `docs/UX_UI_DESIGN.md`
Governance / policy	`docs/GOVERNANCE_BLUEPRINT.md`, `docs/REPO_GOVERNANCE.md`	`docs/DOCUMENTATION_GOVERNANCE.md`, `docs/DOMAIN_BOUNDARIES.md`
SLOs / alerting	`docs/SLO.md`, `docs/ALERT_POLICY.md`	`docs/MONITORING.md`, `docs/INCIDENT_RESPONSE.md`, `docs/ON_CALL_POLICY.md`
Contract testing	`docs/CONTRACT_TESTING.md`, `docs/API_CONTRACTS.md`	`docs/API_VERSIONING.md`, `copilot-instructions.md §8.12`
Data integrity	`docs/DATA_INTEGRITY_AUDITS.md`	`docs/EAN_VALIDATION_STATUS.md`, `docs/PRODUCTION_DATA.md`, `docs/DATA_PROVENANCE.md`

20.5 Quick Health Commands

# Git state
git status && git log --oneline -5

# DB QA (fastest signal of schema health)
.\RUN_QA.ps1

# TypeScript (fastest signal of frontend health)
cd frontend && npx tsc --noEmit 2>&1 | tail -5

# Repo hygiene
pwsh scripts/repo_verify.ps1

# Open issues (current sprint)
gh issue list --state open --limit 20 --json number,title,labels

# Scoring drift
echo "SELECT * FROM governance_drift_check();" | docker exec -i supabase_db_tryvit psql -U postgres -d postgres

# MV staleness
echo "SELECT * FROM mv_staleness_check();" | docker exec -i supabase_db_tryvit psql -U postgres -d postgres

20.6 Session Handoff Protocol

At the end of every significant work session, before closing:

Commit or stash everything — no uncommitted changes left behind
Update CURRENT_STATE.md — new SHA, what was done, what's next
Update CHANGELOG.md — add entry under [Unreleased]
Push to remote branch — so the next session starts from a known state
Write a session summary comment on the active GitHub issue(s)

# Final pre-close checklist
git add -A
git commit -m "chore(state): update CURRENT_STATE.md — <brief description>"
git push origin <branch-name>
gh issue comment XXX --body "Session complete — <what was done, what's next>"

20.7 Using This Instruction File

This file is automatically loaded by GitHub Copilot in VS Code via .github/copilot-instructions.md (symlink or copy).
Sections are numbered sequentially so any agent can say "per §X.Y" unambiguously.
When this file is updated, the commit message should include the sections changed (e.g., chore(docs): copilot-instructions §4 — add ingredient_translations table).
When the instructions conflict with a specific issue's acceptance criteria, the issue wins for that specific change — but document the divergence in the PR description.

FilesExpand file tree

copilot-instructions.md

Latest commit

History