5 Step 3: Record Selection
5.1 Objective
Select the best usable record for each analysis key after basic normalization and before final processing.
5.2 Important distinction
A “selected record” is not always a raw source row.
Sometimes the selected record is:
- a cleaned source row,
- a recoded value,
- a merged record assembled from multiple inputs, or
- a record retained specifically for a pop or historical-context layer.
That is fine, as long as the rule is explicit.
5.3 Do this exactly
- Normalize obvious naming, timing, and estimate-class issues.
- Define a deterministic selection hierarchy for the layer you are building.
- Apply the hierarchy consistently.
- Capture all overrides in the exception register and decision log.
5.4 Recommended hierarchy template
- preferred estimate class or estimate type,
- preferred survey method or platform,
- preferred verified source,
- approved manual override,
- escalate if a true tie still remains.
5.5 Mandatory checks
- no duplicate selected rows by analysis key,
- biological zero vs missing is handled consistently,
- year assignment rule is explicit,
- output-layer intent is preserved.
5.7 Typical analysis keys
Use the key that matches the layer you are building. For example:
CU_ID + Yearfor CU time series,Pop_ID + YearorCU_ID + Pop_Name + Yearfor pop outputs,- a species-specific stream/timing key for matching or pre-processing steps.