Module 18 Activity Worksheet
Module
Module 18: Data Cleaning and Preprocessing
Capability Target
Produce a reproducible preprocessing release that transforms raw or intermediate connectomics outputs into analysis-ready data, with explicit quality gates and full provenance.
Studio Activity Instructions
Evidence and Reasoning Notes
- Claim:
- Evidence:
- Limitation:
Rubric Check
- Minimum pass
- Cleaning decisions are explicit and reproducible.
- QC metrics include thresholds tied to actions.
- Release package includes provenance metadata.
- Strong performance
- Distinguishes low-risk cleanup from biologically sensitive transforms.
- Quantifies and explains pre/post changes clearly.
- Documents limitations and unresolved risks transparently.
- Common failure modes
- Silent ad-hoc edits with no transform log.
- Aggressive filtering that removes biologically meaningful variation.
- Metrics reported without operational thresholds.
Exit Prompt
Take one connectomics table (real or mock) and write:
- Three cleaning rules with rationale.
- Two QC thresholds and associated actions.
- One limitation that remains after preprocessing.