Scattergram | strataquest

View

Definition

A scattergram plots two measurements against each other for every cell — one on the X axis, one on the Y axis — with each dot representing a single cell. It reveals relationships that are invisible in histograms: two markers might each look normally distributed alone, but the scattergram shows that they're correlated, anticorrelated, or form distinct clusters. Scattergrams are the standard tool for setting gates, identifying cell populations, and understanding multi-marker expression patterns — the same approach used in flow cytometry for decades.

Bivariate Visualization

Two measurements, one plot

Gate Placement

Visual threshold setting

Population Identification

Find distinct cell groups

Correlation Assessment

How markers relate to each other

How It Works

The Scattergram engine generates interactive bivariate plots:

Axis selection — Choose any two measurements as X and Y axes (marker intensities, area, derived values).
Population display — Each cell appears as a point at its (X, Y) measurement coordinates. Color coding by phenotype, region, or other classification highlights population structure.
Density overlay — For large cell counts (50,000+), density contours replace individual points to prevent overplotting and reveal population density structure.
Gate drawing — Interactive rectangle, polygon, or quadrant gates can be drawn directly on the scattergram. Cells within the gate are classified accordingly.
Linked visualization — Selecting cells on the scattergram highlights their positions in the tissue image, and vice versa. This links statistical patterns to spatial locations.

Simplified

A scattergram plots every cell as a dot using two measurements as coordinates. Clusters of dots are cell populations; the gaps between clusters are natural classification boundaries. Draw gates around clusters to define phenotypes, and click on dots to see where those cells are in the tissue.

Science Behind It

Bivariate analysis (Dilbilir): A scattergram visualizes the joint distribution of two variables. While each variable's marginal distribution (histogram) shows univariate structure, the joint distribution reveals relationships: correlation, clustering, and non-linear associations. Two markers might both appear normally distributed in their individual histograms, but their scattergram reveals two distinct clusters — a bimodal joint distribution that is invisible in either marginal alone.

Colocalization metrics (Pawley): In confocal microscopy, scattergrams of two fluorescence channels are used to assess colocalization. Pearson's correlation coefficient R measures intensity correlation (-1 to +1). Manders' coefficients M1 and M2 measure the proportion of each channel that overlaps with the other. But Pawley warns: colocalization quantifies codistribution, not molecular interaction — the microscope's resolution (~200 nm) means molecules in the same voxel could be hundreds of nanometers apart. The same caution applies to tissue cytometry scattergrams: co-expression of two markers in the same cell doesn't necessarily mean the proteins interact.

The compensation analogy: In flow cytometry, spectral overlap between fluorophores produces false correlations in raw scattergrams — cells that are positive for only one marker appear to express both. Compensation (spectral unmixing) removes this artifact. In tissue cytometry, spectral unmixing serves the same purpose: the scattergram of unmixed signals shows true co-expression, while the raw scattergram may show artifactual correlation from spectral bleed-through.

Simplified

Scattergrams reveal what histograms hide — the relationship between two measurements. Two markers that each look like smooth distributions alone might form distinct clusters when plotted together. These clusters are cell populations, and the gaps between them are natural gates. Just remember: markers co-expressed in the same cell doesn't necessarily mean the proteins interact — it means both are present, not that they're in contact.

Practical Example

Identifying immune cell subtypes in a CD3-vs-CD8 scattergram:

Upper right cluster (CD3+CD8+): Cytotoxic T cells — high in both markers
Left cluster (CD3+CD8−): Helper T cells — high CD3, low CD8
Lower left cloud (CD3−CD8−): Non-T cells — negative for both
Lower right (CD3−CD8+): Rare/artifact — NK cells or spectral contamination

Drawing quadrant gates at the valleys between clusters classifies all cells simultaneously. The linked tissue view shows where each population resides spatially — helper T cells in the stroma, cytotoxic T cells at the tumor border.

Simplified

Plot CD3 vs. CD8: three clusters appear — cytotoxic T cells (CD3+CD8+), helper T cells (CD3+CD8−), and non-T cells (CD3−CD8−). Draw quadrant gates between the clusters to classify all cells. Click on any cluster to see where those cells are in the tissue.

Connected Terms

Gates Category Related Learning Path
Histogram Category Related Learning Path
Cutoffs Category Related Learning Path
Phenotypes Category Related Learning Path
Derived Measurements Related
Raw Data Related
Standard Measurements Related
Tissue Cytometry Related