Module 13: Data Science for Connectomics
Apply core data science techniques to connectomics problems including feature extraction and unsupervised analysis.
📊 Feature Engineering
Neurons and synapses contain measurable features such as volume, length, branching, and connectivity. Feature selection affects analysis outcomes.
- Quantitative features from segmentation
- Normalizing and encoding categorical data
- Dealing with missing values
📊 Dimensionality Reduction and Clustering
Use PCA, UMAP, and t-SNE to compress high-dimensional data. Cluster to identify putative neuron classes or structural types.
- PCA and variance explained
- Clustering with K-means and DBSCAN
- Visualizing 2D projections
🌟 COMPASS Integration
- Knowledge: Feature representation, unsupervised learning
- Skills: Analysis, visualization, pattern detection
- Character: Comfort with ambiguity, exploration
- Meta-Learning: Iterative tuning and validation
📚 References & Resources
- Petersen et al., 2021. Cell types in mouse cortex revealed by unsupervised analysis of connectomics. Nature.
- McInnes et al., 2018. UMAP: Uniform Manifold Approximation and Projection. JOSS.
- Colab: "Neuron Embedding and Clustering Pipeline"
✅ Assessment
- Extract a feature matrix from a sample connectome
- Use UMAP or t-SNE to project into 2D space
- Label clusters and infer possible biological types