Last updated: 2021-08-17

Example dataset: human brain DLPFC region

For the examples in this workshop, we use a human brain dataset from the 10x Genomics Visium platform, which was previously described in our publication Maynard and Collado-Torres et al. (2021).

The example dataset consists of a single sample from the human brain dorsolateral prefrontal cortex (DLPFC) region. This region contains six cortical layers and white matter, which can be distinguished by their molecular expression profiles, and have a layered spatial structure.

The full dataset contains 12 samples in total, from 3 individuals, with 2 pairs of “spatially adjacent” replicates (serial sections) per individual (4 samples per individual). The individuals and spatially adjacent replicates can be considered as blocking factors. Each sample spans the six layers of the cortex plus white matter in a perpendicular tissue section.

For the examples in this workshop, we use a single sample from this dataset (sample 151673).

For more details, see Maynard and Collado-Torres et al. (2021). The full dataset is publicly available through the spatialLIBD Bioconductor package, and the analysis code from our paper is provided in the HumanPilot GitHub repository. The dataset can also be explored interactively through the spatialLIBD Shiny web app.

Preprocessing steps

After performing the Visium experiment, there are several steps needed to process the data and load it into R for downstream analysis.

These steps are described in detail in the Preprocessing steps part in OSTA. Here, we provide a short overview and links.

Image segmentation and cell counting

The first step is to process the raw images, which includes splitting multiple capture areas into individual samples, segmenting cell nuclei, and identifying the number of cells per spot.

For more details, see the chapter Image segmentation (Visium) in OSTA, as well as the paper Tippani et al. (2021) describing the VistoSeg tool.

Note that part of this step also requires the Space Ranger outputs described next.

Loupe Browser and Space Ranger

The Loupe Browser and Space Ranger software provided by 10x Genomics are used to align sequencing reads, match the sequencing data to the spatial coordinates on the image, as well as an initial inspection of data quality and exploration of the data.

For more details, see chapters Loupe Browser (Visium) and Space Ranger (Visium) in OSTA.

SpatialExperiment

After running Loupe Browser and Space Ranger, we are ready to load the data into R.

For this, we use the SpatialExperiment data structure. For more details, see the SpatialExperiment chapter in OSTA, or our additional SpatialExperiment workshop.

The schematic below illustrates the SpatialExperiment structure, which extends SingleCellExperiment to include additional spatial and image information. The structure is flexible, and has been designed to work with data from 10x Genomics Visium as well as other spot-based and molecule-based ST platforms.

Schematic illustrating [SpatialExperiment](https://bioconductor.org/packages/SpatialExperiment) class structure.

Schematic illustrating SpatialExperiment class structure.