RecordCollector analysis spec — 479 lines on the first microfrontend's shape, data contract, and integration points

Why Care?

record-collector is the entry point of the workshop — everything downstream inherits the shape it produces. That makes it the most expensive place to get the data contract wrong, and the cheapest place to spend a few hours writing things down before coding.
The analysis spec is exactly that: 479 lines walking through what kinds of inputs the collector has to accept (files, APIs, CSVs, scraped pages, manual entry), what canonical-record shape it commits to producing for the rest of the pipeline, and what the integration surface looks like for the other five microfrontends that consume its output.
Two issues filed the day before (#2 Field Populator, #3 Highlight Collector specs; see the 2025-08-10 issue triage entry) carved space for parallel specs on the other microfrontends. RecordCollector landed first because it's the upstream constraint.

What's New?

  • specs/RecordCollector-Analysis.md — 479-line analysis covering:
    • Source types: files, REST APIs, CSVs, scrape targets, manual.
    • Canonical record shape: minimum required fields, provenance back to the upstream source.
    • Staging behavior: how records sit between "ingested" and "promoted to downstream review" so a human can intervene before any record reaches the prompt-template stage.
    • Federation contract: what record-collector exposes to the host shell and what shared types it consumes from packages/ui / future packages/shared-services.
This spec later migrated into content/projects/Augment-It/Specs/ as part of the documentation consolidation across the Lossless content tree.
ℹ️
Backfilled from git history on 2026-05-12. Commit: 9c67cb9.