Light-Seq: light-directed in situ barcoding of biomolecules in fixed cells and tissues for spatially indexed sequencing

Kishi, Jocelyn Y.; Liu, Ninning; West, Emma R.; Sheng, Kuanwei; Jordanides, Jack J.; Serrata, Matthew; Cepko, Constance L.; Saka, Sinem K.; Yin, Peng

doi:10.1038/s41592-022-01604-1

Download PDF

Article
Open access
Published: 10 October 2022

Light-Seq: light-directed in situ barcoding of biomolecules in fixed cells and tissues for spatially indexed sequencing

Nature Methods volume 19, pages 1393–1402 (2022)Cite this article

60k Accesses
25 Citations
594 Altmetric
Metrics details

Subjects

Abstract

We present Light-Seq, an approach for multiplexed spatial indexing of intact biological samples using light-directed DNA barcoding in fixed cells and tissues followed by ex situ sequencing. Light-Seq combines spatially targeted, rapid photocrosslinking of DNA barcodes onto complementary DNAs in situ with a one-step DNA stitching reaction to create pooled, spatially indexed sequencing libraries. This light-directed barcoding enables in situ selection of multiple cell populations in intact fixed tissue samples for full-transcriptome sequencing based on location, morphology or protein stains, without cellular dissociation. Applying Light-Seq to mouse retinal sections, we recovered thousands of differentially enriched transcripts from three cellular layers and discovered biomarkers for a very rare neuronal subtype, dopaminergic amacrine cells, from only four to eight individual cells per section. Light-Seq provides an accessible workflow to combine in situ imaging and protein staining with next generation sequencing of the same cells, leaving the sample intact for further analysis post-sequencing.

Bridging structural and cell biology with cryo-electron microscopy

Article 03 April 2024

Eva Nogales & Julia Mahamid

Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis

Article Open access 25 March 2024

Wenpin Hou & Zhicheng Ji

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Srinivas Niranj Chandrasekaran, Beth A. Cimini, … Anne E. Carpenter

Main

To comprehensively understand the cellular states that drive biological function and pathology, it would be ideal to combine optical characterization and imaging-based screening of fixed cells and tissues with omics-level profiling by next generation sequencing (NGS).

Recent applications that focus on integrating imaging with transcriptomic measurements of the same individual cells rely largely on physical dissociation and sorting of cells using opto- or microfluidics before, after or during imaging^1,2,3,4. Alternative methods rely on photoconversion of fluorescent proteins in selected live cells for later sorting^5,6,7, or photoactivation of messenger RNA capture moieties followed by physical aspiration of target live cells for transcriptomic analysis⁸. These approaches typically require live imaging of cells and extraction of a limited number of visual features before pooling or single-cell sorting of dissociated cells for sequencing. In these cases, the spatial information or the original location of the cell is typically lost or is not relevant.

In an effort to preserve spatial information and morphology, particularly in tissues, recent spatial transcriptomic methods aim to profile cells in situ^9,10. Employing various modes of DNA barcoding, spatial omics approaches aim to spatially index either a two-dimensional capture surface (Slide-Seq^11,12, HDST¹³, Seq-Scope¹⁴, Stereo-Seq¹⁵, Sci-Space¹⁶) or the biomolecules directly. In the latter case, barcode sequences can be incorporated onto biomolecules through in situ hybridization (ISH) of combinatorially barcoded probes, or enzymatically by reverse transcription (RT) or ligation (for example, after microfluidic delivery of the barcodes into defined sample positions as in DBIT-Seq¹⁷). These barcodes can then be read out in situ by iterative imaging (for example, SeqFISH¹⁸ or MERFISH¹⁹, HybISS²⁰), by in situ sequencing (FISSEQ²¹, CARTANA²²) or by ex situ sequencing after they are retrieved from the sample (DBIT-Seq¹⁷, IGS²³). Alternative approaches have used spatially restricted (1) iterative photo-cleavage and collection of ISH probe barcodes from target regions of interest (ROIs) (DSP²⁴) or (2) collection of target cells themselves by selective immobilization²⁵, suction²⁶ or laser capture microdissection²⁷ for subsequent barcoding and sequencing.

Existing spatial profiling methods currently rely on one or several of expensive instrumentation (closed box systems costing $100,000–1,000,000, high-end custom microscopes or fluorescence sorters), complex multi-round optical deconvolution of barcoded arrays or barcode sequences in situ and custom microfluidics systems or arrays with rigid sample format restrictions. Many of these methods also use targeted ISH probes rather than whole transcriptome sequencing or are partially or completely destructive to the sample. This creates a high need for accessible and scalable visual selection methods that can directly link multi-dimensional and high-resolution cellular phenotypes (including morphology, protein markers, spatial organization) to transcriptomic profiles for diverse sample types. Recent methods in this direction have adopted an ultraviolet (UV)-uncaging approach to allow spatial barcoding of RNAs (PIC²⁸) or whole cells (via DNA-barcoded antibodies or lipids as in ZipSeq²⁹), but the connection to the sequencing output requires destruction of the samples.

Here, we present a different paradigm, named Light-Seq, for light-directed in situ spatial barcoding of target molecules in desired ROIs for ex situ NGS without sample destruction (Fig. 1). We achieve this by two innovations: (1) Building on our previous work of light-controlled rapid crosslinking of nucleotides³⁰, we utilize an ultrafast crosslinking chemistry³¹ and parallelized photolithography³² for light-controlled enzyme-free covalent attachment of pre-designed barcode sequences onto biomolecules in situ. (2) Developing a cross-junction synthesis reaction to integrate DNA barcodes onto biomolecules for ex situ sequencing. We combine this barcoding strategy with a nondestructive workflow to enable imaging and whole transcriptome sequencing of selected cells in fixed samples with the possibility to revisit the sample for further assays such as protein stainings. We benchmark and demonstrate the applicability of Light-Seq on mixed cell cultures and mouse retina tissue sections, and utilize the approach for rare cell transcriptomics, where we identify biomarkers for the very rare dopaminergic amacrine cells (DACs) in the mouse retina.

Results

Light-Seq overview and barcoding chemistry

Light-Seq employs a light-controlled DNA barcode attachment strategy to enable custom indexing of ROIs in imaged samples. To achieve this capability, we use barcode strands that contain the ultrafast photocrosslinker 3-cyanovinylcarbazole nucleoside (CNVK)³¹. Hybridized CNVK can form an interstrand crosslink upon short UV illumination (Fig. 2a). Our general strategy is to hybridize CNVK-containing barcodes to complementary docking sequences and then direct UV light to an ROI to photocrosslink the barcode strands only in that area, and then wash away noncrosslinked barcode strands. This process can be iteratively performed to label multiple ROIs with orthogonal barcode strands.

**Fig. 2: Light-controlled DNA photocrosslinking.**

To set up the Light-Seq platform, we optimized the crosslinking exposure time and light intensity and found that 1–10 s produced efficient crosslinking, similar to previous in vitro results³³. To create custom photomasks in a parallelized manner, we use a digital micromirror device (DMD)³⁴ attached to a standard widefield imaging setup. Using a ×10 objective, a single mirror in our DMD setup can yield a practical resolution <2 µm based on estimating the full-width at half-maximum on a dot array (Extended Data Fig. 1a,b). We first validated the barcoding chemistry in vitro on a glass surface coated with immobilized DNA strands. By adding fluorescently labeled barcode strands and using custom photomasks (for example, of a cat) to crosslink them to the surface, we were able to create patterns with a single barcode strand (Fig. 2b) or use sequential rounds of barcoding with unique strands to pattern multiple regions on the same slide, such as the three-color Penrose triangle (Fig. 2c). Although we primarily use 365-nm light-emitting diode (LED) epi-illumination with a DMD for the UV crosslinking step, we also demonstrated subcellular spatial labeling with a 405-nm laser³⁰ on a confocal point-scanning microscope (Extended Data Fig. 1c–g). The laser-scanning system offers higher resolution and contrast but is slower than the DMD and LED illumination.

Next, to read out the sequence of target DNAs with their corresponding crosslinked barcode sequences by NGS, the crosslinked bases must be addressed without loss of either the barcode identity or the barcoded sequence. To this end, we developed a cross-junction synthesis reaction to copy both the barcoded DNA sequence and barcode into a new single strand of DNA without a crosslink (Fig. 3a). For this we use a strategy similar to our previously developed Primer Exchange Reaction (PER)³⁵. We use a primer with a strand displacing polymerase that copies a new strand until it is halted at the crosslink point. We designed the sequences around the crosslink to have an identical domain so that the extended primer can reach across the junction and be templated on the opposing strand through a branch migration^36,37,38 competition between two identical domains. The single-stranded DNA product of this cross-junction reaction can then be amplified and read out with standard NGS pipelines.

**Fig. 3: Cross-junction synthesis and full in situ protocol with validation on cell mixtures.**

Utilizing Light-Seq for whole transcriptome sequencing

To use Light-Seq for whole transcriptome sequencing, we first perform in situ RT³⁹ on fixed and permeabilized cells or tissue sections to synthesize complementary DNA (cDNA) (Fig. 3b, step 1). To label RNAs regardless of polyadenylation, we use a degenerate primer with five N and three G bases on the 3′ end⁴⁰ and a barcode docking site on the 5′ end. After RT, the 3′ ends of the generated cDNAs are A-tailed to create a 3′ handle for ex situ primer binding. A CNVK- and unique molecular identifier (UMI)-containing barcode strand is then hybridized to the 5′ docking site on all cDNAs. We then direct UV light to the ROI to photocrosslink the barcode strands in that area and then wash away noncrosslinked barcode strands. This process can be iteratively performed to label multiple ROIs with orthogonal barcode strands (Fig. 3b, step 2).

After all ROIs have been barcoded, barcoded cDNAs are collected from the sample with a mild RNase H treatment and prepared for sequencing (Fig. 3b, step 3). We then apply our cross-junction synthesis reaction to stitch the cDNAs and barcode sequences together into a single readout strand with one enzymatic reaction step (Fig. 3b, step 4). This direct attachment of spatial barcodes onto transcriptomic sequences allows a straightforward transition to ex situ sequencing of the readout sequences after PCR amplification and NGS library preparation (Fig. 3b, steps 5–6). While developing the protocol, we made several critical design choices to minimize the potential artifacts of the in situ RT reaction that have been previously observed^41,42,43, as we discuss in detail in Supplementary Note 1.

To validate the capability to select and barcode cells based on phenotypic profiles, we performed a cell mixing experiment where mouse 3T3 cells were mixed with human HEK cells that stably express eGFP (Fig. 3c). We targeted each cell type with distinct barcode sequences (coupled to different fluorophores for visualization purposes) by manually selecting them based on eGFP expression and cellular morphology (Supplementary Figs. 1 and 2). Out of a total of ~4,500 cells in the same well, we barcoded ~25 human and ~25 mouse cells and confirmed the targeted cell type was barcoded correctly by a fluorescent scan (Fig. 3d–f). Sequencing reads were mapped to a merged human and mouse genome, and unique maps were further analyzed. We were able to validate the barcode integration by cross-junction synthesis by matching the reads to our expected sequence output (Supplementary Fig. 3a–c). We observed a good discrimination ratio of mouse and human maps to their respective barcode sequences (89.1 ± 0.7% of mouse reads, 87.3 ± 0.7% of human reads, n = 3 technical replicates, mean ± s.d.), and notably 93 ± 0.5% of eGFP reads were correctly attributed to the human-specific barcodes (Fig. 3g). After sequence extraction, we performed multiplexed immunofluorescence (IF) to validate the integrity of the sample for secondary assays (Fig. 3h).

Next, we estimated the abundance of transcripts that Light-Seq can capture. Because the barcoding area of Light-Seq can be arbitrarily set by the user, we chose to normalize the number of transcripts that can be captured with Light-Seq as the number of UMI sequences per ‘unit area’ that was roughly the size of a bead in Slide-Seq¹¹ or a barcoded square in DBIT-Seq¹⁷, which we define as 10 × 10 µm². We obtained the number of UMIs per unit area by estimating the total cell or tissue area that was subject to UV illumination for each barcode based on microscopy images of ROIs via segmentation (Extended Data Fig. 2a) and calculating the average UMI count (for correctly barcoded and uniquely mapped reads) for a 100-µm² barcoded area. At a sequencing depth of ~30 million reads per replicate and with only ~1/2 of the sample amplified for library preparation and sequencing, we observed an average of 1,959 ± 453 and 1,170 ± 207 UMIs per unit area for HEK and 3T3 cells, respectively (Supplementary Table 1, mean ± s.d.). We note that the read depth of ~30 million was subsaturating (Extended Data Fig. 2b), and a single ~200 million read dataset from half of the sample yielded 3,328 and 2,029 UMIs per unit area for HEK and 3T3 cells, respectively (Supplementary Table 1).

Normalized gene expression levels (expressed as log₂-transformed transcripts per kilobase per million reads (TPM)) displayed good correlation across technical replicates (Pearson correlation coefficient > 0.8). The top 200 expressed genes correlated highly across technical replicates (Pearson correlation coefficient > 0.9; Extended Data Fig. 2c,d) and comprised various protein coding genes, as well as short transfer RNAs and long noncoding RNAs (Supplementary Table 2 and source data), illustrating the range of transcripts that can be targeted with the barcoding strategy.

With Light-Seq, we were able to successfully recover species-specific transcriptomes of targeted cells, despite only selecting 1–2% of cells within the whole well. We hypothesized that the Light-Seq background, reflected here by the small portion of species-specific transcripts harboring the barcode corresponding to the other species (Fig. 3g), could arise from three potential sources: (1) diffusion of RNAs or cDNAs before photocrosslinking, (2) incomplete removal of uncrosslinked barcode strands during stringent washing and (3) light-scattering inducing out-of-ROI barcoding. To mitigate (1) and (2), we added blocking and crowding agents to our barcode hybridization step for all of the following experiments, which substantially improved signal-to-noise (Methods and Supplementary Note 2). To mitigate light-scattering effects, we seeded the cells below confluence. For higher density labeling (for example, in tissues), we suggest optimization of the optical setup and sample preparation and slight erosion of ROI boundaries (~1–3 µm) to account for scatter (Methods and Extended Data Fig. 1).

Spatial sequencing with Light-Seq in tissue sections

RNA sequencing of specific cell populations within tissue samples remains challenging, especially when target cells are rare or difficult to isolate. We therefore tested Light-Seq on fixed sections from the mouse retina to capture biomarkers from cell populations of interest based on in situ identification. First, in situ RT was performed in fixed 18 µm retinal cryosections to synthesize cDNAs for spatial barcoding. We then manually selected three cellular layers of the retina, each known to have unique function and cellular composition: the outer nuclear layer (ONL), containing rod and cone photoreceptors; the bipolar cell layer (BCL), containing bipolar cells, horizontal cells and Müller glia; and the ganglion cell layer (GCL), containing retinal ganglion cells and displaced amacrine cells (ACs) (Fig. 4a). After photocrosslinking fluorescently labeled barcodes to all cDNAs within each layer, the targeted ROIs were imaged using a confocal microscope to verify layer-specific labeling (Fig. 4b and Extended Data Fig. 3a–c). Barcoded cDNAs were then extracted, pooled and prepared for NGS as described above, leaving the sample intact.

**Fig. 4: Application of Light-Seq for spatial barcoding of three main retinal layers in fixed frozen mouse retina sections.**

Since the cellular composition of each retinal layer is well-established⁴⁴, we verified the specificity of spatial barcoding based on the levels of cell-type-specific markers associated with each barcode. We used DESeq2 (ref. ⁴⁵) to perform differential expression analysis on exon-mapped barcoded sequencing reads to look for layer-specific markers. Based on the size of barcoded areas, we labeled an estimated 1,112 ± 199, 298 ± 29 and 91 ± 14 (mean ± s.d., n = 4 technical replicates) cells for the ONL, BCL and GCL layers, respectively. UMI yields varied for different retinal layers depending on cell and RNA content of each layer, ranging from ~1,200 to 5,000 UMI’s per 10 × 10 µm² unit area (Extended Data Fig. 3d and Supplementary Table 3). As expected, we observe that cells with larger cytoplasmic volumes correlate with higher UMIs per unit area (Extended Data Fig. 3d).

Technical replicates showed consistent read filtering throughout the sequence-processing pipeline (Extended Data Fig. 4a,b) and were well correlated based on principal component analysis for each layer (Extended Data Fig. 4c), and we discovered >3,400 genes with significant differential expression between pairs of barcoded populations (3,430 genes for ONL versus BCL; 3,434 for BCL versus GCL; 6,165 for ONL versus GCL, P_adj < 0.05; Fig. 4c,d and accompanying source data), including many known markers of rod and cone photoreceptors in the ONL, of bipolar cells and Müller glia in the BCL and of retinal ganglion cells in the GCL.

To further benchmark Light-Seq data, we simulated pseudo-bulk RNA sequencing of the retinal layers using published single-cell Drop-Seq data of ONL and BCL cells⁴⁶ (Methods). We saw strong correlation for the genes enriched between the ONL and BCL (98.6% of genes significantly enriched in both assays were enriched in the same layer; Extended Data Fig. 4d). For all BCL-enriched genes, the ratios of Light-Seq to Drop-Seq reads show higher correlation and comparable sensitivity for longer genes and lower correlation and sensitivity for shorter genes (Extended Data Fig. 4e–g), consistent with Light-Seq’s internal priming strategy versus Drop-Seq’s polyA-targeted transcript capture.

Interestingly, more genes were significantly differential among the layers in Light-Seq data than Drop-Seq (3,430 compared with 1,524) despite similar total numbers of genes detected (24,460 compared with 24,904) across technical replicates, in line with previous studies that suggest targeted bulk transcriptome measurements can provide better statistics for discovery of moderate to lowly expressed biomarkers than single-cell sequencing⁴⁷.

To assess the sensitivity of Light-Seq for detecting mRNAs, we compared both Light-Seq and Drop-Seq directly with single molecule FISH (smFISH⁴⁸) data. Our previous work used quantitative multiplexed SABER-FISH⁴⁹ to co-detect 16 mRNA markers of bipolar interneuron subtypes in age-matched mouse retinas⁵⁰. From these data, we estimated the average number of transcripts per cell within the pooled bipolar cell population. Multiplying this by the number of cells within the barcoded Light-Seq BCL (ROI2) and pseudo-bulked Drop-Seq data, we estimated the expected number of detectable transcripts for each gene in the sequenced populations. Relative to smFISH, we find that the sensitivity of Light-Seq was 4.29 ± 3.39% (mean ± s.d., n = 16 genes, 4 replicates) and sensitivity of Drop-Seq was 3.97 ± 4.38% (mean ± s.d., n = 16 genes, 6 replicates) (Fig. 4e and Extended Data Fig. 4h–j).

After Light-Seq, the same sections were then stained with DAPI, wheat germ agglutinin (WGA) and antibodies targeting PAX6 and VSX2 proteins, demonstrating that cellular DNA, oligosaccharides on cell membranes and proteins remained detectable after extraction of barcoded cDNAs for sequencing (Fig. 4f and Extended Data Fig. 3c).

Due to internal priming during the RT step, Light-Seq shows read coverage spanning gene bodies (Extended Data Fig. 5a) and consistent reads per kilobase per million mapped reads (RPKM) across transcripts of different lengths (Extended Data Fig. 5b,c). This strategy, combined with membrane permeabilization in situ, enables Light-Seq to capture a wide variety of RNA species including nonpolyadenylated and mitochondrial transcripts (see source data for Fig. 4). When intronic sequences are included for transcriptome mapping, ~21–26% of reads map to introns, indicating capture of nuclear RNAs at ratios consistent with similar methods using internal RNA priming (Supplementary Note 3)⁵¹, suggesting that our workflow can detect both nuclear and cytoplasmic RNAs.

Rare cell transcriptomics with Light-Seq

For applications where only a small number of cells of interest are present and/or where spatial context is critical for their identification, capturing transcriptomes remains a major challenge. To test Light-Seq’s utility for imaging, barcoding and sequencing of rare cells, we targeted the extremely rare and difficult to isolate DACs in the mouse retina. DACs comprise ≤0.01% of retinal cells⁵² and are interspersed among diverse types of neurons. Previous works aimed at profiling DACs with single-cell RNA-seq⁵³ or microarray capture after dissociation⁵⁴ have seen limited success. To find subtype-specific biomarkers of DACs, we fixed and sectioned mouse retinas and first performed in situ RT (Fig. 5a, step 1). Then, IF was done to detect tyrosine hydroxylase (TH), a known marker of DACs, to locate them for barcoding (step 2). Next, two rounds of barcoding were guided by the TH IF signal (Fig. 5b and Extended Data Fig. 6a), to barcode cDNAs within TH⁻ DACs with FITC-labeled barcode strands (roughly 500 cells per section) and cell bodies of all TH⁺ DACs with Cy3-labeled barcode strands (4–8 cells per section, n = 5 section replicates) (step 3). Finally, barcoded cDNAs were displaced from the tissue and prepared for sequencing, with the sample remaining intact for post-sequencing stains (step 4).

**Fig. 5: Rare cell transcriptomics by Light-Seq.**

Light-directed barcoding permitted precise labeling of individual TH⁺ DACs within the dense tissue environment (for notes on optical system, see Methods and Supplementary Note 4). Although some light-scattering can induce out-of-ROI crosslinking, this effect was mitigated by drawing photomasks slightly smaller than the intended ROIs: for barcoding TH⁺ DACs, photomasks were drawn 1–3 µm inside the cellular boundary (Extended Data Fig. 6a–c). Use of a laser-based point-scanning microscope is slower but offers higher barcoding resolution and may be used in place of a DMD (Extended Data Fig. 1e).

Sequencing at a subsaturating sequencing depth of 3.5–5 million reads per replicate (one pooled MiSeq run for five successfully amplified section replicates; Extended Data Fig. 6d) yielded 6,000–10,000 UMIs per 10 × 10-µm² unit area for TH⁺ DACs, with an average of 7,800 UMIs per cell (Supplementary Table 4). We again observed good gene body coverage and consistent RPKM across replicates (Extended Data Fig. 7a–c). Differential expression analysis revealed 36 significantly enriched genes in the TH⁺ population (P_adj < 0.05, log₂(fold enrichment) > 1), including the known markers TH (P_adj = 1.32 × 10⁻²⁴) and CARTPT (P_adj = 4.98 × 10⁻⁶⁵)⁵⁴ (Fig. 5c and accompanying source data). To validate the top biomarker (Cartpt) after sequencing, we revisited the stored samples and performed IF for CARTPT on the same cells, revealing specific labeling (Fig. 5d,e).

For further validation, RNA-FISH was performed in new retinal sections to detect the top differentially expressed genes (P_adj < 0.05 and log₂(fold enrichment) > 3). In all cases, the RNA-FISH confirmed the Light-Seq results, showing marker enrichment in TH⁺ DACs relative to neighboring TH⁻ ACs (Fig. 5f). As controls, we detected Gad1 mRNA, with known expression in both TH⁺ DACs and TH⁻ ACs, and Vsx2 mRNA, which is not expressed in either population. Importantly, FISH data suggest that some markers had high expression levels (Cartpt, Slc18a2), while others were expressed more moderately (Gabre, Gabrq, Stra6), demonstrating that Light-Seq can accurately detect moderately expressed RNAs.

Several of the Light-Seq enriched markers have been previously reported in murine TH⁺ DACs: Cartpt⁵⁴, Slc18a2 (VMAT2)^55,56 and gamma-aminobutyric acid (GABA) A receptor subunits Gabre and Gabrq⁵⁷. Other transcripts, including Stra6, Gpx3, Arpp21, Scn5a and most biomarkers identified here, have, to our knowledge, not yet been reported for this subtype.

Discussion

Here, we present Light-Seq, a method to attach sequenceable spatial indices onto biomolecules in intact samples using light. By directly integrating two powerful domains, microscopy and NGS, Light-Seq enables linking of morphological and spatial parameters of target cells to their transcriptomic profiles. Capture of morphology, tissue context, transcriptome and protein expression in the same cells provides a more comprehensive measurement of the state of cells and their interactions.

We demonstrated that Light-Seq can be used for full-transcriptome profiling of populations of 4–1,000+ cells within fixed tissue sections, with sensitivity similar to existing methods. Light-Seq produced UMI yields of 1,000–10,000 per 10 × 10-µm² unit area depending on the target cell type, comparable to DBIT-Seq (~5,000 UMIs)¹⁷ and Slide-SeqV2 (500–1,000 UMIs)¹² for tissue areas of the same size (Supplementary Tables 3 and 4). Light-Seq sensitivity for transcript detection ranged between 1% and 10% for individual genes when compared with smFISH measurements, with a mean of 4.29 ± 3.39%, similar to single-cell RNA sequencing sensitivity for the same set of genes (Fig. 4e and Extended Data Fig. 4h–j). Even with our conservative pipeline which includes only reads that mapped uniquely to the genome and exonic features, the sensitivity of Light-Seq is in line with existing spatial sequencing methods (0.005–15.5% (refs. ^17,21,58,59), as reviewed before⁹). We note that both sensitivity and UMIs per unit area measurements are generally highly impacted by the particular cell and tissue type, genes assayed and sequence-processing pipelines, which makes direct comparisons across technologies and applications imperfect.

We expect that Light-Seq’s sensitivity can be further improved with optimization of the in situ RT and barcoding, such as by protease treatment, antigen retrieval or changes to fixation/permeabilization conditions^21,28,39,60, use of targeted ISH probes⁶¹ and targeted ribosomal RNA depletion. These improvements, combined with the flexibility of custom photomasks, could ultimately enable profiling of single cells or subcellular compartments with higher efficiency. We will continue to update detailed protocols for applying Light-Seq at lightseq.io, including suggestions for optimization in different types of tissues.

Despite using different methods for RNA detection, we observed strong agreement between differentially enriched genes in single-cell Drop-Seq and Light-Seq data from the same cell types, such that 98.6% of significantly differential genes were enriched in the same retinal cell population in both datasets (Extended Data Fig. 4d). While the Drop-Seq data originated from dissociated cells among many retinas, Light-Seq requires far less cellular input (selected cells from only four 18 µm sections) and is not subject to loss of cells upon dissociation and the selection biases that this can produce.

Many existing spatial transcriptomics methods are complex and expensive to implement, regardless of the biological question of interest. Using standard NGS, Light-Seq circumvents many challenges associated with in situ sequencing and FISH approaches, which are limited by tissue autofluorescence and image deconvolution. The sequencing output of Light-Seq enables detecting not only RNA presence, but precise sequence information such as single nucleotide polymorphisms and splice isoforms, offering a major advantage over most ISH approaches (Supplementary Note 3). This approach could be particularly useful for tracking mutations and clonality in cancers or resolving microbial species in tissue samples. The cost for RT, A-tailing, three rounds of barcoding, displacement, cross-junction synthesis and PCR, on each section, is only ~$34.50 (Supplementary Table 5) per section. Photocrosslinking is feasible to perform with any standard optical imaging setup that can focus UV light on specified areas (for example, a microscope with a 365-nm UV LED and a DMD attachment, or a 405-nm laser on a confocal scanning or even light-sheet microscope for three-dimensional applications).

The DSP platform offers similar flexibility for spatial targeting by using iterative UV-cleavage and microcapillary collection of released barcodes²⁴, but this platform requires expensive equipment and targeted hybridization-based barcoding of mRNAs, and currently offers lower sensitivity, requiring a minimum of 20–300 cells for sequencing. Another approach, PIC, similarly takes advantage of light-directed ROI targeting and shows higher sensitivity with single-cell barcoding, but is single-plex and destructive to the sample. These methods include differences in sample preparation and UMI recovery protocols which may be useful to consider for future variations of Light-Seq.

By spatially restricting the barcoding to targeted cDNAs and selectively amplifying them for library generation, Light-Seq allows the sequencing reads to be focused on cells of interest. Thus, unlike surface capture or microfluidic channel-based methods, sequencing depth and cost can be flexibly optimized, particularly for experiments targeting very few cells. This advantage is evident with our rare cell experiment (Fig. 5) where we used a single pooled MiSeq run (20 million reads) for all replicates. Experimentally decoupling the imaging and sequencing also makes the workflow highly flexible, and the light-directed barcoding allows addressing ROIs of different scales (from subcellular structures to large super-cellular regions) with the same reagents and experimental strategy.

We demonstrated how Light-Seq offers a simple and customizable workflow for studying very rare cell populations. We discovered and validated previously unknown biomarkers of TH⁺ DACs, which have not been captured by previous attempts to transcriptionally profile this population^53,54. Our data confirm previously known markers, such as Th, Cartpt and Slc18a2, but also provide new leads for understanding the biology of dopaminergic retinal neurons. Several of these, such as Arpp21 (ref. ⁶²), Vgff⁶³ and Gpx3 (ref. ⁶⁴), are suggested to play a role in dopaminergic neurons or related diseases elsewhere in the nervous system, but many remain unstudied. Among the novel markers is Stra6, encoding a transmembrane vitamin A transporter that bidirectionally traffics retinol. Interestingly, this gene is additionally expressed in the retinal pigment epithelium and has been implicated in retinal disease, but the role of TH⁺ DACs in disease phenotypes remains unexplored^65,66. The discovery of the tetrodotoxin-resistant voltage-gated sodium channel, Scn5a, is surprising and interesting. Scn5a encodes the main cardiac sodium channel, Nav1.5, and has been detected very rarely in neuronal populations^67,68. Here, we report that Scn5a is highly expressed in TH⁺ DACs, but its function in these cells remains to be shown.

The nondestructive nature of Light-Seq leaves the sample intact after sequencing, potentiating multi-omic measurements from the same cells. We also envision that Light-Seq could be adopted for landmark-based transcriptomics (such as APEX-Seq⁶⁹), without genetic intervention and sample destruction. While the combination of sensitivity, spatial resolution and ease of adoption of Light-Seq is collectively advantageous over existing technologies, the number of addressable regions is currently limited compared with other spatial methods. We expect that increasing the multiplexing would be highly feasible, as other published methods (for example, SABER^49,70, CycIF⁷¹ or CODEX⁷²) perform much longer (up to 60) serial cycles with repeated labeling, imaging and dehybridization/bleaching, and provide strong preceding evidence for good tissue preservation across more barcoding rounds. Scaling to combinatorial barcode construction and to labeling of other biomolecules (for example, proteome, epigenome) in future applications could support high-throughput barcoding and screening of hundreds to thousands of cells or regions with minimal increase in cost and is of great interest for further development.

Methods

Oligo design and preparation

CNVK-containing barcode sequences were screened with NUPACK⁷³ to have minimal secondary structure and were also checked against the mouse and human genomes using the BLAST⁷⁴ and BLAT⁷⁵ online tools. For in situ transcriptomic barcoding, an RT primer ending in five Ns and three Gs⁴⁰ was designed to contain a docking sequence complementary to a shared region in the barcode sequences (Supplementary Fig. 3). Barcode oligos were ordered from Gene Link, and all remaining oligos were ordered from Integrated DNA Technologies. Oligo stocks of 100 μM were stored at −20 °C, and working stocks of 10 μM in IDTE (Integrated DNA Technologies cat. no. 11-01-02-02) were prepared for most frequently used oligos. All sequence, purification and vendor information is listed in Supplementary Tables 5 and 6.

Cell culture

A stable HEK293-GFP cell line (SC001) that constitutively expresses eGFP under a CMV promoter was purchased from GenTarget and cultured in high-glucose D-MEM with GlutaMax supplemented with 10% fetal bovine serum, 0.1 mM MEM nonessential amino acids, 1% Pen-Strep (Thermo cat. no. 15140122) and 10 µg ml⁻¹ blasticidin (Thermo cat. no. J67216). A mouse 3T3 cell line was purchased from ATCC (CRL-1658) and cultured in D-MEM with GlutaMax (Thermo cat. no.10569-010) supplemented with 10% calf bovine serum (ATCC 30-2030) and 1% Pen-Strep. For the cell mixing experiment, cells were seeded overnight in D-MEM with GlutaMax supplemented with 10% calf bovine serum, 1% Pen-Strep and 0.1 mM MEM nonessential amino acids (Thermo cat. no 11140-050).

Tissues

All animal experiments were conducted in compliance with protocol IS00001679, approved by the Institutional Animal Care and Use Committee at Harvard University. Experiments were performed on tissue collected from postnatal day 18 wild-type CD1 IGS mice (Charles River, Strain Code 022).

Barcoding setup

Barcoding was performed on an inverted Nikon Eclipse Ti-E microscope with an attached Mightex Polygon 400 DP DMD. A Mightex BLS-series high-power liquid light guide-coupled LED source, 365 nm, 50-W emitter, was applied at 10% power for 10 s through a CFI plan fluor 10× objective per selected ROI per barcoding round (with the exception of Fig. 2b where at 10% power was applied for 5 s). For the glass surface or cell culture crosslinking experiments, focus was set to z-position corresponding to the top surface of the glass coverslip. For the retina layers experiment and barcoding of TH⁻ ACs, focus was set at 10 µm above the glass surface. For the TH⁺ DAC population, the focal plane of the TH-antibody stain was found for each cell, and barcoding was performed on the plane 5 µm above, to target the middle of the cells. Photocrosslinking was done on a single z-plane. Photomasks for the cell mixing and retina experiments were hand-drawn using the Bezier ROI tool and set as stimulation regions using the Nikon Elements (v.4.51) Polygon 400 module user interface. Refer to our protocols at lightseq.io and Supplementary Note 4 for detailed barcoding details, including calibrating and optimizing the optical setup.

Retina tissue barcoding and immunofluorescence

Neural retinas were dissected from postnatal day 18 mice in 1× PBS (Invitrogen AM9625, diluted in ultrapure water, Invitrogen cat. no. 10977) and immediately fixed for 25 min at room temperature in 1× PBS with 4% paraformaldehyde (diluted from 16% solution, Thermo Scientific cat. no. 28908) and 0.25% Triton X-100 (Sigma Aldrich T8787). Retinas were then washed 3 × 5 min in 1× PBS and once for 10 min in 7% sucrose in 1 × PBS (40-µm filter-sterilized), before getting embedded in a 1:1 solution (v:v) of OCT (Tissue-Tek 4583) and 30% sucrose in 1× PBS for freezing in cryomolds and subsequent storage at −80 °C. Cryosectioning was performed to cut 18 µm sections onto poly-l-lysine-coated 18-well ibidi chamber slides (ibidi custom order cat. no. 81814, #1.5 polymer). For comparing retinal layers (Fig. 4), four technical replicates were prepared by cutting four distinct retinal sections from the same animal into different wells. To promote tissue adhesion, chamber slides were coated before cryosectioning with an additional layer of poly-d-lysine (PDL) (Sigma P6407) dissolved at 0.3 mg ml⁻¹ in 2× Borate Buffer (diluted in water from Thermo Scientific PI28341, aliquoted and stored −20 °C). PDL coating was performed by covering the chambers with sterile PDL solution for 2 hours at 4 °C, removing the solution and drying completely and washing once with UltraPure water. After cryosectioning, retinas were dried briefly (~10 min) and washed three times in 1× PBS with 0.1% Tween-20 (vol/vol, Sigma Aldrich cat. no. P9416-50ML) (PBST).

The buffer was then replaced with an RT mix composed of 300 µM dNTPs (NEB N0447S), 0.5% Triton X-100 (vol/vol), 6 mM RNaseOUT (Invitrogen cat. no. 10777019), 1 µM RT primer and 8 U µl⁻¹ Maxima RT H Minus enzyme (Thermo Scientific cat. no. FEREP0753) in 1× RT buffer and incubated on a flat-top thermocycler (Mastercycler Nexus Flat, Eppendorf cat. no. 6335000020) with the following program: 30 min at 22 °C, followed by a 12-cycle ramp program of 8 °C for 30 s, 15 °C for 30 s, 25 °C for 30 s, 30 °C for 1 min, 37 °C for 1 min and 42 °C for 2 min. After a final 42 °C incubation for 30 min, samples were held at 4 °C temporarily. Following RT, samples were washed in PBST with 60% deionized formamide (vol/vol, Thermo Scientific cat. no. AM9342) for 3 × 5 min, in PBST with 1 M NaCl (Invitrogen AM9760G) for 2 × 2 min and in PBST for 2 × 2 min. Samples were kept in PBST until the buffer was exchanged with an A-tailing master mix consisting of 1× ThermoPol Buffer (NEB cat. no. B9004S), 1 mM dATP (NEB N0446S), 25 µM ddATP (Sigma GE27-2051-01) and 1,000 U ml⁻¹ terminal transferase enzyme (NEB M0315L) for a 45-min incubation at 37 °C. After A-tailing, samples were washed in PBST for 3 × 1 min and stored overnight at 4 °C in PBST. Next day, before barcoding, the PBST was removed and fresh PBST with 1 M NaCl was added. For barcode hybridization, a barcoding solution consisting of PBST with 2 mg ml⁻¹ sheared salmon sperm DNA (Invitrogen AM9680), 10% dextran sulfate (wt/vol, Sigma Aldrich cat. no. S4030), 250 nM Barcode 1 strand (GATE.D12.B1) and 500 mM NaCl was applied for 30 min. Samples were then washed with PBST with 1 M NaCl (3 × 1 min). Last wash buffer was replaced with fresh PBST with 1 M NaCl before proceeding to the barcoding. Slide was then transferred to the microscope for light-directed barcoding. Desired regions were visually identified in the brightfield images and hand-drawn masks were set as photostimulation regions (see the Barcoding setup section). After photostimulation in the regions of interest, the chamber was removed from the microscope for washing. Samples were then washed with PBST with 60% deionized formamide (vol/vol) 8 times (four cycles of two buffer exchanges with 5-min incubation in between) and with PBST with 1 M NaCl (2 × 2 min). Last wash buffer was replaced with fresh PBST with 1 M NaCl for the next barcoding round. The same tissue area was then manually found on the microscope, and two additional barcoding rounds were performed with Barcodes 2 (GATE.D12.B2) and 3 (GATE.D12.B3). After the last barcoding round and washes, buffer was replaced with PBST.

Barcoded retinas were stained with DAPI for 30 min (0.5 µg ml⁻¹ in PBST) and imaged (Fig. 4 and Extended Data Fig. 3). After imaging, samples were treated with 67.5 µl of displacement mix consisting of 1× ThermoPol buffer and 250 U ml⁻¹ RNase H (NEB cat. no. M0297L) and incubated at 37 °C for 45 min. To maximize yield of recovery, a low-retention pipette tip was first coated with primer by pipetting up and down in a 20 nM solution, and then the same empty tip was used to pipette the displacement mix up and down several times within the well before collection. Each eluate was then transferred to a tube containing 1.6 µl of 1 µM cross-junction synthesis primer (GATC.20T), mixed and heat inactivated at 75 °C for 20 min in a PCR machine (Eppendorf Mastercycler Nexus Gradient). Then, 10.9 µl of cross-junction synthesis mix containing 1.15 × ThermoPol buffer, 734 µM dNTPs and 5,872 U ml⁻¹ BST LF polymerase (NEB cat. no. M0275L) was added to each eluate. Samples were incubated in a PCR machine at 37 °C for 30 min followed by heat inactivation at 80 °C for 20 min.

After extraction, cryosections were washed twice in PBST and kept in fresh PBST at 4 °C until further analysis. Retinas were incubated with Tissue Blocking Solution (1× PBS, 0.3% Triton X-100 and 5% Normal Donkey Serum (Jackson ImmunoResearch cat. no. 017-000-121, RRID:AB_2337258) and filter-sterilized using a 40-µm syringe filter) for 1 hour at room temperature before antibody staining. The following primary antibodies were prepared in the Tissue Blocking Solution: sheep anti-CHX10 (Exalpha X1180P, RRID:AB_2314191, diluted 1:500) and rabbit anti-PAX6 (Abcam cat. no. ab195045, RRID:AB_2750924, diluted 1:300). Primary antibodies were incubated overnight at 4 °C and then washed 5 × 5 min with PBST. Secondary antibodies (donkey anti-sheep-Alexa647, Jackson ImmunoResearch cat. no. 713-605-147, RRID:AB_2340751; donkey anti-rabbit-Cy3, Jackson ImmunoResearch cat. no. 711-165-152, RRID:AB_2307443) were incubated overnight at 4 °C in Blocking Solution (both antibodies diluted 1:500 from 50% glycerol stock) and washed 5 × 5 min with PBST. Wheat germ agglutinin (WGA, Biotium cat. no. 29022-1) diluted 1:100 in PBST from a 1 mg ml⁻¹ stock solution was applied for 1 hour at room temperature, followed by 30 min of staining with DAPI in PBST (0.5 µg ml⁻¹).

After cross-junction synthesis, the extracted sequences were quantified and then bulk amplified as follows. First, 5-µl volumes of reactions were combined in a 1:1 ratio with a PCR mix consisting of Sybr Green I (1×, Invitrogen S7563), Kapa HiFi Buffer (2×, from Roche KK2502), forward and reverse primers (600 nM each, GATE and GATC sequences; see Supplementary Table 6), dNTPs (600 µM, from Roche KK2502) and Kapa HiFi Hot Start Polymerase (0.04 U µl⁻¹, from Roche KK2502). Samples were amplified on a quantitative PCR machine (Biorad CFX Connect Real-Time System) with the following program: 98 °C for 3 min, followed by 30 cycles of 98 °C for 20 s, 60 °C for 30 s and 72 °C for 2 min. Samples were then incubated for a final 5 min at 72 °C, and a melt curve measurement was performed. Based on the quantification, bulk amplification was then performed with the same conditions for 20 cycles using all of the remaining reaction material.

The extracted material for each of the four technical replicates was prepared separately, and samples were later pooled for sequencing as below (Library preparation and sequencing). Of the total tissue area of the sections, 7–11% (Supplementary Table 3) was estimated to be labeled with barcodes based on the size of the barcoded area over the total retina tissue area. Barcoded area was measured using the area of the photomask, and total retina tissue area was measured by the free-hand area tool in FIJI⁷⁶.

Barcoding TH⁺ DACs

Retinas were prepared and sectioned as described above for six section replicates. Light-Seq was performed as described above for the retinal layers in Fig. 4, with few changes: After in situ RT and A-tailing and before barcoding, sections were stained with an anti-TH antibody (Millipore cat. no. AB152, RRID:AB_390204) for 1 hour at room temperature. The antibody was diluted 1:500 in Blocking Solution, which was made of 1× PBS, 0.1% Tween-20 and 1% molecular grade BSA (GeminiBio, 700-106 P) and was filter-sterilized using a 20-µm syringe filter. Samples were washed 5 × 5 min with PBST. Donkey anti-rabbit-Alexa647 secondary antibody (Jackson ImmunoResearch cat. no. 711-605-152, RRID:AB_2492288) was added at 1:250 dilution in the same Blocking Solution for 30 min at room temperature and then washed 5 × 3 min with PBST.

Barcoding was then performed as described above with minor changes, listed here. The first round targeted the FITC-barcode strand (Barcode 3, GATE.D12.B3 sequence, annotated as Barcode 1 in Fig. 5a) to TH⁻ ACs, and the second round targeted the Cy3-barcode strand (Barcode 2, GATE.D12.B2) to TH⁺ DACs (see Supplementary Table 6 for barcode sequences). The z focal plane for barcoding TH⁺ DACs was guided by the anti-TH antibody signal with an additional 5-µm adjustment away from the well surface. Stringent washes after light-directed barcoding in PBST with 60% formamide were doubled in number (four cycles of 5-min incubations, where each cycle consisted of four buffer exchanges, rather than two buffer exchanges as done for the retinal layers). All TH⁺ DACs in the same section were addressed individually across multiple fields of view and were crosslinked with Barcode 2 after manual selection. Of the six section replicates prepared, five yielded enough material for sequencing during the PCR amplification of the extracted sequences. For all replicates, the number of TH⁺ DACs was between four and eight, and therefore the TH⁺ DAC transcriptomes came from a pool of four to eight cells per technical replicate.

Immunostaining after sequencing TH^+/− ACs

After sequencing, we revisited the same barcoded cells from two of the original sections to validate the enriched expression of the top hit (lowest P_adj), Cartpt, at the protein level. The samples were stored in PBST at 4 °C for >10 days while the sequencing was run and analyzed. For staining, samples were incubated with a goat anti-CARTPT antibody (Thermo Fisher Scientific cat. no. PA5-47170, RRID:AB_2607700) at 1:20 in Blocking Solution for 1.5 hours at room temperature and then washed 5 × 5 min with PBST. Then, a donkey anti-goat-Alexa488 secondary antibody (Jackson ImmunoResearch cat. no. 705-545-003, RRID:AB_2340428) was added at 1:250 in the same Blocking Solution for 30 min at room temperature and then washed 5 × 3 min with PBST. WGA-Rhodamine (Vector Labs cat. no. RL-1022-5, 1:100 from 1 mg ml⁻¹ stock) and DAPI were added to the secondary antibody mixture to label cell membranes and nuclei.

RNA-FISH validation of TH⁺ DAC markers

For the top ten markers (with P_adj < 0.05, log₂(fold change) > 3), we validated using RNA-FISH. We used serial SABER-FISH to detect the markers in new 25 µm retinal sections as previously described⁴⁹. FISH validations were done on four new sections from a mouse littermate of the source animal used for Light-Seq experiments. SABER-FISH probes were designed using the ‘RNA Probe Design’ feature of the PaintSHOP tool (https://oligo.shinyapps.io/paintshop/)⁷⁷. For each gene, all probes were appended with a common gene-specific SABER-FISH primer sequence for orthogonal detection of multiple genes in the same cells. See Supplementary Table 7 for probe sequences (with their attached primer) targeting each gene. Previously described probe sets were used for Gad1.26 and Vsx2.25 (ref. ⁴⁹). SABER-FISH probe preparation and RNA detection were performed as described before (by Kishi et al., 2019, ‘User-friendly protocol: Retina Tissue Sections RNA-FISH’ section of the Supplemental Protocols⁴⁹). The Sstr2.25 SABER-FISH probe set failed to extend during the probe synthesis reaction (before being applied to the tissue samples) and therefore was excluded from the validation experiments.

Probes were split into two groups for multiplexed detection, with three rounds of fluorescent detection done to capture six RNAs total in each sample (representative images shown as rows in Fig. 5f). Each round detected two different genes using ATTO565 and Alexa647 fluorescent oligos, as described before⁴⁹ (sequences included in Supplementary Table 7). The antibody stain for TH was performed after the first round of fluorescent detection, as described in the section above, and therefore was present during all rounds of sequential imaging. WGA-405S (Biotium cat. no. 29022-1) was added during the secondary antibody incubation at 1:100 for membrane staining.

Barcoding and immunofluorescence on cultured cells

Eighteen-well poly-l-lysine-coated ibidi chambers (ibidi custom order cat. no. 81814) were coated with sterile PDL (Sigma Aldrich cat. no. P6407) at 0.3 mg ml⁻¹ overnight at 4 °C. Afterwards, the chamber was dried for 1 hour, washed with UltraPure water (Invitrogen cat. no. 10977) and dried again before cell seeding. Chambers were then seeded with ~4,000 HEK293 and ~5,000 NIH/3T3 cells per well and placed in an incubator (37 °C with 5% CO₂) overnight. Samples were gently washed with DPBS (pre-warmed to 37 °C, Gibco cat. no. 14190-144) and fixed in 4% formaldehyde (wt/vol, Thermo cat. no. 28908) in 1× PBS (Invitrogen cat. no. AM9625) for 10 min at room temperature. Then, samples were washed twice with 1× PBS and permeabilized with 0.25% Triton X-100 (vol/vol) for 10 min. Samples were then washed twice with 1× PBS, and the in situ RT step was done following the same protocol described for retina samples above. Following RT, samples were washed in PBST and 60% deionized formamide for 3 × 2 min, in PBST with 1 M NaCl for 2 × 2 min and in PBST for 2 × 2 min. Samples were kept in PBST at 4 °C (as a pausing point in the protocol). To proceed further, the PBST buffer was exchanged with the A-tailing master mix (as described above for tissues) and incubated at 37 °C for 30 min.

After A-tailing, samples were washed in PBST for 3 × 1 min and left in PBST with 1 M NaCl until barcoding. For barcode hybridization, a barcoding solution consisting of PBST with 250 nM Barcode 1 strand (GATE.D12.B1) and 500 mM NaCl was applied onto the samples for 15 min and excess strands were washed with PBST with 1 M NaCl for 3 × 1 min. Last wash buffer was replaced with fresh PBST with 1 M NaCl, and slide was then transferred to the microscope for light-directed barcoding. Selected HEK cells were then photocrosslinked using hand-drawn photomasks (see Barcoding setup for more details). Samples were then washed with PBST with 60% deionized formamide (vol/vol) eight times (four cycles of two buffer exchanges with 2-min incubations in between) and followed by 2 × 2 min washes with PBST with 1 M NaCl. The last wash buffer was replaced with fresh PBST with 1 M NaCl before the next barcoding round with Barcode 2 (GATE.D12.B2, for 3T3 cells), which was performed with identical barcode incubation and washing protocol, except after all washes, the liquid in the wells was replaced with PBST. See Supplementary Note 2 for suggested protocol updates to reduce the nonspecific background, and see lightseq.io for the latest suggested protocols.

After barcoding, samples were treated with 67.5 µl of displacement mix consisting of 1× ThermoPol buffer and 250 U ml⁻¹ RNase H (NEB M0297L) and incubated at 37 °C for 30 min. cDNA collection, RNase H heat inactivation and cross-junction synthesis were done as described above for retina samples.

After extraction of cDNAs, cells were washed twice in PBST and then kept in fresh PBST at 4 °C until further analysis. For the cell mixing experiment, multiplexed IF was performed on one well. All antibodies were spun down at 10,000 g for 10 min at 4 °C before use. Cells were incubated with a primary antibody mix containing goat anti-Lamin-B (sc-6216, RRID:AB_648156), mouse anti-TFAM (MA5-16148, RRID:AB_11157422) and rat anti-alpha Tubulin (MA1-80017, RRID:AB_2210201) diluted 1:75 in 1× PBS, 0.3% Triton X-100 and 5% BSA (Jackson ImmunoResearch cat. no. 001-000-162) for 1 hour. After 2 × 1 min washes in 1 × PBS, cells were incubated with secondary antibodies anti-mouse-Alexa647 (Jackson ImmunoResearch 715-605-150, RRID:AB_2340862), anti-goat-Cy3 (VWR 102649-368) and anti-rat- Alexa488 (Jackson ImmunoResearch 712-545-150, RRID:AB_2340683) diluted 1:150 (from 50% glycerol stocks) in 1× PBS with 0.3% Triton X-100 and 5% BSA for 1 hour. Cells were washed in 1× PBS 2 × 1 min and then incubated with 4 µg ml⁻¹ DAPI (Invitrogen cat. no. D1306) in 1 × PBS for 5 min, and washed 2 × 1 min with 1× PBS, followed by imaging in fresh 1 × PBS.

After cross-junction synthesis, extracted sequences were quantified as in the retina tissue barcoding and immunofluorescence section above. Bulk amplification was then performed with the same conditions for 24 cycles but starting with about half (40 µl) of the starting material. The other half of the sample material was kept to test alternative PCR kits (all the results shown in figures in this work were obtained with the Kapa HiFi kit, which we chose as our standard amplification method).

For cell mixing experiments, on average, ~25 cells of each type were pooled from a population of ~4,500 cells per well for each of the three technical replicates. Extracts were separately prepared from each replicate and pooled for sequencing as below. The total cell number was estimated from a manual count of ~130 cells in a 1-mm² area of a single well and extrapolating to the total surface area.

Library preparation and sequencing

Library preparation and sequencing were performed the same for both the cell mixing and retina tissue experiments. After PCR, samples were stored at −20 °C then purified with a 1.2× ratio of Ampure XP Beads (Beckman A63881) and eluted in water. Next, tagmentation was performed with a Nextera XT Library Preparation Kit (Illumina cat. no. FC-131-1096), but using custom primers for the i5 end. Tagmentation was performed using the standard manufacturer protocols and reagents (TD buffer, ATM, NT buffer, NPM PCR master mix) on 2 ng of sample in 20 µl reactions containing 10 µl of TD buffer and 5 µl of ATM for 5 min and at 55 °C, and reactions were stopped with 5 µl of NT buffer and held on ice. To each tube, 6.5 µl of water, 1.75 µl of Nextera i7 primer, 1.75 µl of custom i5 primer (GATE*.P5* in Supplementary Table 6) and 15 µl of NPM PCR master mix were added. Reactions were incubated at 72 °C for 3 min, 95 °C for 15 s, then 12 cycles of: 95 °C for 15 s, 55 °C for 15 s and 72 °C for 40 s. After a final incubation at 72 °C for 1 min, samples were held at 10 °C. Reactions were then purified with 0.9× Ampure XP Beads and eluted in water. Samples were stored at −20 °C until sequencing.

Sequencing was performed either by GeneWiz on an Illumina HiSeq machine or by the Biopolymers Facility at Harvard Medical School on an Illumina NovaSeq machine using custom Read 1 and i5 primers and 30% Phi-X spike-in. For the cell mixing data, all replicate sequences were pooled on a single lane of a HiSeq 4000 flowcell, although one replicate was also sequenced on its own lane. For the retina layers experiment (Fig. 4), all replicate sequences were pooled together and sequenced in both lanes of a NovaSeq 6000 flowcell. For the retina amacrine experiment (Fig. 5), all replicates were pooled together and sequenced with a single Illumina MiSeq run.

In vitro surface barcoding

An eight-well ibidi ibiTreat chamber (ibidi cat. no. 80826) was functionalized with BSA-biotin and streptavidin for in vitro surface barcoding tests. BSA-biotin (Sigma A8549) and streptavidin (Invitrogen S-888) solutions were diluted to 1 mg ml⁻¹ and 0.5 mg ml⁻¹ in 1 × PBS, respectively. Then, 200 µl of the BSA-biotin solution was pipetted onto an empty ibidi well and incubated for 5 min at room temperature. Afterwards, the BSA-biotin solution was aspirated out of the well and washed twice with 1 × PBS. The streptavidin solution was then added to the well and also incubated for 5 min at room temperature, followed by three 1 × PBS washes.

The functionalized surface was then incubated with a biotinylated strand (Supplementary Table 6) at 1 μM in 1 × PBS for 5 min at room temperature, followed by three 1 × PBS washes. Hybridization solutions were made up of a 200 nM CNVK barcoding strand in 1 × PBS with 1 M NaCl.

Photomasks for the cat photo and Penrose triangle in Fig. 2b,c were generated as binary .tif files which were then uploaded and mapped onto the DMD chip. The binary image of the cat was generated using Adobe Photoshop (v.2021) diffusion dither function on a picture taken with a personal camera. Photomasks for the individual portions of the Penrose triangle were hand-drawn in Adobe illustrator (v.2021) and saved as an 8-bit grayscale .tif file.

Hybridization of the barcoding strands proceeded sequentially with each photomask corresponding to a unique barcoding strand and fluorophore (the cat photomask crosslinking was done as a single round (using the Barcode 2 strand) and with crosslinking). Each barcoding round introduced a barcoding strand in hybridization solution followed by a 5-min incubation at room temperature and UV photocrosslinking as described in the Barcoding setup section above. Afterwards, noncrosslinked strands were removed with a stringent wash with 40% formamide in 1 × PBS for 2 × 2 min. The formamide was then removed with 2 × 1-min washes in 1 × PBS with 1 M NaCl to prepare the chamber for the next round of barcoding.

Imaging setups

In control experiments to monitor the signal after RT, after barcoding and after cDNA extraction, we scanned the samples with an ImageXpress Micro-4 system (Molecular Devices) equipped with a custom 5-mm liquid light guide Gen III Spectra LED-based light engine (solid-state 377/54, 438/29, 475/28, 511/16, 555/28, 576/23, 635/22, 730/40), Semrock filters (Zero Pixel Shift Filter Cubes: for DAPI and 750/Cy7 LED-Da/Fi/Tr/Cy5.Cy/5x-A Penta Band; for 488/GFP LED-FITC-A Single Band; for 565/Cy3 LED-TRITC-A Single Band; for 647/Cy5 LED-Cy5-A Single Band) and an Andor Zyla 4.2 camera controlled with the MetaXpress software (v.6.5.3.427). Tiling was performed with 10% overlap.

A fully motorized Nikon Ti-2 inverted microscope was used to image fluorescent retina samples. This confocal microscope was equipped with a Yokogawa CSU-W1 single spinning disk (50-µm pinhole size) and a Nikon linear-encoded motorized stage with Mad City Labs 500-µm range Nano-Drive Z piezo insert, and an Andor Zyla 4.2 plus (6.5-mm photodiode size) sCMOS camera using a Nikon Apo λS LWD 40×/1.1 DICN2 water immersion objective lens with Zeiss Immersol W 2010. Fluorescence was acquired from 405 nm, 488 nm, 550 nm and 640 nm by sample illumination with directly modulated solid-state lasers 405-nm diode 100-mW (at the fiber tip) laser line, 488-nm diode 100-mW laser line, 561-nm 100-mW diode-pumped solid-state laser line and 640-nm diode 70-mW laser line, in a Toptica iChrome MLE laser combiner, respectively. For all channels, a hard-coated Semrock Di01-T405/488/568/647 multi-bandpass dichroic mirror was used. Images were captured with 16-bit Dual Gain (high dynamic range camera mode). Nikon Elements AR 5.02 software was used during acquisition. Z-stacks were acquired using a Piezo Z-device (shutter closed during axial movement). Data were saved and exported as ND2 files.

For Extended Data Fig. 1c–g, a Leica SP5 X MP inverted laser-scanning confocal microscope was used to create custom scan regions using the ‘FRAP’ module. Point-scanning was performed through an HCX PL APO CS ×63.0 1.20 water objective with a 405-nm diode laser set to the highest power. Scan speed was set to the slowest rate of 10 Hz and scanned twice across the ROIs.

Image processing and analysis

For images from the cell mixing experiment, multi-channel images from each round were registered and stitched using ASHLAR⁷⁸ (v.1.12.0). Brightfield images for the cell mixing and retina images were manually contrasted for best visibility. Hand-drawn ROIs for the cell mixing and retina experiments were saved as binary .tif files and converted to a vector image with Adobe Illustrator’s image trace function and then overlaid onto the brightfield images. Multi-channel fluorescent images for the cell mixing were scaled and overlaid based on the following: Each image channel’s pixel values were separated as foreground or background pixels with an Otsu threshold. A linear normalization from 0 to 1 was then applied to each image, with the maximum pixel value of 1 set to the 95th percentile of foreground pixel values. Images were then false-colored and blended into an overlay image with a custom Python script that is equivalent to Adobe Photoshop’s screen blend function. The cat and Penrose triangle fluorescent images were manually contrasted for best visibility. The stitched overlay image in Fig. 3d was prepared in OMERO⁷⁹ (v.5.4.6.21).

Retinal images in Fig. 4b,f are from a single Z-plane, extracted from composite multi-channel images in FIJI⁷⁶ (v.2.0.0-rc-69/1.52n). The minimum and maximum intensity levels were manually chosen to linearly scale the pixel intensities for optimal display and adjusted using the Image → Adjust → Brightness/Contrast window in ImageJ. Single-channel images in Extended Data Fig. 3b,c were adjusted using the ‘Auto’ setting under Image → Adjust → Brightness/Contrast, and the minimum intensity value was then set to zero.

Figures were assembled in Adobe Illustrator (v.2021 and 2022).

Sequencing data processing and differential gene expression analysis

Parsing and mapping of sequencing data were performed on the Harvard Medical School O2 cluster (Kernel 3.10.0) with Python (3.7.5), PyTables (3.6.1), samtools (1.9 and 1.12), pysam (0.17.0), numpy (1.21.4), pandas (1.3.4), Biopython (1.79) and scikit-bio (0.5.6). See the Code availability statement for the GitHub repository containing all code and virtual environment parameters. The pipeline for sequencing analysis is outlined in Extended Data Fig. 4a, with a breakdown of read distributions for each replicate depicted in Extended Data Fig. 4b. Barcode, UMI and cDNA mapping sequences (up to 40 nt) were extracted from Read 1 (R1) reads using the UMI-tools (v.1.1.1) package⁸⁰. Sequences were then mapped to the appropriate genome (Human v38 or Mouse vM27) or a merged genome using the STAR aligner⁸¹ (v.2.7.9a), with multimapped alignments discarded in all cases.

After mapping, the featureCounts⁸² tool was used to assign genomic mappings to genes, and then reads were deduplicated (per gene) with the UMI-tools dedup command. For mouse transcript mapping, reads corresponding to two genes (ENSMUSG00000119584.1 and ENSMUSG00000064337.1, a ribosomal RNA and a mitochondrial rRNA, respectively) were discarded as the number of mapped sequences came close to or exceeded the number of possible UMIs and caused the deduping process to stall. For the human and cell mixing experiment, reads that did not map were then compared with the eGFP transcript sequence with the Striped Smith Waterman algorithm⁸³ and were considered eGFP reads if they had a score of at least 40 and the UMI was unique. For the mouse and human cell mixing experiment, discrimination values were calculated based on mapping to a merged human and mouse genome, and UMI counts were estimated based on mapping separately the human-barcoded reads to the human genome and mouse-barcoded reads to the mouse genome.

After gene assignment and UMI deduplication, reads were parsed out by their DNA barcode sequence by exact matches only with custom Python scripts. Normalized expression levels for the cell mixing experiment were calculated as log₂-transformed TPM (log₂(TPM + 1)). Gene enrichment analysis for the retina tissue experiment was performed in R with the DESeq2 package^45,84. Genes were considered enriched if their adjusted P values (P_adj, with the Benjamini–Hochberg method) were under 0.05. For Fig. 4d, markers for each layer were the set of genes that were enriched relative to both other layers, plotted in R 3.6.1 with pheatmap function (v.1.0.12). For Fig. 4c, genes with adjusted P > 0.05 are plotted in black (at the bottom of the graphs), while genes that were significantly enriched in the pairwise comparisons are colored based on their layer enrichment (e.g., magenta, enriched in ONL, Barcode 1).

To compare with existing Drop-Seq data⁴⁶ for Extended Data Fig. 4e–g, pseudo-bulk RNA sequencing data were modeled based on single-cell counts by pooling the counts from all cells of each cell type that are known to localize within the targeted layers. Single cells from the Drop-Seq dataset were classified as cell types as outlined in the GitHub markdown (at https://github.com/broadinstitute/BipolarCell2016). The pseudo-ONL was constructed by pooling together the rod and cone photoreceptor cells, and the pseudo-BCL was constructed by pooling all bipolar subtype clusters and the Müller glia. Number of transcripts per cell (Supplementary Table 3) was estimated by dividing by the number of cells pooled for each layer.

For Light-Seq, the number of barcoded cells per region was estimated based on the area of the barcoded region (Extended Data Fig. 3) and manual counting of the number of DAPI-stained nuclei within three-dimensional confocal images of the barcoded regions. For each layer, the number of DAPI-stained nuclei was counted within a subset of the barcoded area (~5,000 μm²), and the total number was estimated by linearly scaling based on the precise size of the full area. Since the BCL contains both bipolar cells and Müller glia, the number of bipolar cells within the BCL was estimated as 72% of the total DAPI-stained nuclei within the layer⁸⁵. In Fig. 4c, markers that were in enriched in one of the ONL or BCL layers in either assay were compared, and in Fig. 4d, markers that were enriched relative to both other layers were plotted based on their DESeq2 z score⁴⁵. In Extended Data Fig. 4f, markers that were enriched in the BCL in both assays were plotted.

Scripts used for sequencing data processing, mapping and analysis have been posted to GitHub, along with cell count estimates for the pseudo-bulk comparisons and step-by-step instructions for running the code.

Sensitivity estimation

To estimate the sensitivity of Light-Seq, we compared sequencing read counts with published smFISH data for a set of 16 bipolar cell markers, which were captured in the BCL Light-Seq data⁵⁰. From this previous publication, we used the smFISH puncta-per-cell counts to estimate the expected number of total transcripts detected, based on the number of bipolar cells captured in the Light-Seq BCL area and the known average transcript per cell counts within the BCL. To estimate the number of bipolar cells captured within the Light-Seq BCL area, we overlaid the ROI outline with the DAPI-stained barcode image. First, the full arclength of the region, arc_total, was measured in FIJI using the Segmented Line tool followed by Analyze → Measure. Then, a small portion of the arclength, arc_small, was measured (~200 μm) in the same way, and the number of DAPI-stained nuclei within the ROI were counted for the arc_small area (cells_small). The total number of nuclei in the ROI was predicted by scaling: (no. of total cells) = (arc_total × cells_small)/arc_small (Supplementary Table 3).

With an estimate of the number of bipolar cells within the BCL, we then used the smFISH data for the 16 marker genes with single-cell expression counts published in West et al. 2022 (ref. ⁵⁰) to predict how many transcripts should be present within a bipolar cell population of the measured size. Since the published data were single-cell transcript (smFISH puncta) counts for each of the 16 marker genes for all bipolar subtypes (at their measured ratios), we chose to exclude all cells of type BC1B from the smFISH data and the Drop-Seq data because their cell bodies are located within the AC layer and would not be within the Light-Seq BCL (as shown in previous work, particularly in Fig. 3 of Shekhar et al. 2016 (ref. ⁴⁶)). To exclude BC1B from the smFISH single-cell gene expression matrix from West et al. 2022 (ref. ⁵⁰), we removed all rows from ‘Retina1.csv’ (available on https://github.com/ewest11/Bipolar-Serial-SABER-FISH-Analysis) with ‘Subtype’ = 2. For Drop-Seq data, the cells belonging to the BC1B cluster were similarly removed.

With the remaining gene expression counts across all other bipolar subtypes, we averaged across all cells to obtain an ‘average transcript per cell count’ for the BCL. This average was then scaled by our estimated cell numbers (listed in Supplementary Table 3 for each replicate) to obtain the expected number of transcripts per Light-Seq BCL replicate and per Drop-Seq replicate. Sensitivity was plotted with MATLAB 2018a’s boxplot function, with default settings.

We note that the mice in West et al. 2022 (ref. ⁵⁰) were injected with EdU and BrdU for cellular birth dating and the Drop-Seq cells were dissociated for sequencing, which should be considered as potential sources of variance.

Chimeric read analysis

R1 and R2 reads for each replicate from the TH⁺ DAC experiment were mapped and deduplicated separately. The R1 and R2 deduplicated read files were then merged, sorted by read name and iterated through to identify pairs where both R1 and R2 mapped to transcripts. The numbers of pairs that mapped to the same transcript versus different transcripts are reported in Supplementary Note 1. For this analysis, Python v.3.10.4 was used on a MacBook Pro (2021) with macOS Monterey (v.12.2.1).

Intron analysis

To analyze intronic reads, the -t gene flag of featureCounts⁸² was used to map to genes rather than just exons. Then, the RSeQC⁸⁶ (v.4.0.0) read_distribution.py program was used to profile the numbers of reads from UTR exonic, CDS exonic and 10-kb regions upstream and downstream for each replicate. Further details and counts can be found in Supplementary Note 3. For this analysis, Python v.3.10.4 was used on a MacBook Pro (2021) with macOS Monterey (v.12.2.1).

Gene length bias analysis

Gene lengths as reported by featureCounts⁸² (v.2.0.1) were used to profile read counts from transcripts of different lengths for each barcode for each replicate. Histograms were generated to show the distribution of transcript counts across different transcript lengths. Box plots were then generated to show transcript counts for transcripts within different length ranges, with bins chosen based on mouse embryonic stem cells full-length comparisons⁸⁷ in R (v.4.1.3). RPKM values were calculated for the same bins by dividing the read counts by the length (in kilobases) of the transcript and dividing again by a scaling factor calculated as the number of reads from the condition divided by 1 million.

Gene body coverage analysis

Files containing aligned, deduplicated reads in BAM file format were input to the RSeQC⁸⁶ geneBody_coverage.py program to generate the gene body coverage plots. The reference BED file input to RseQC was generated by converting the comprehensive gene annotation GFF3 file (vM27) to a 12-column BED file using conversion utilities hosted by the UCSC Genome Browser⁸⁸ (specifically, gff3ToGenePred and genePredToBed (v.1.04.00)). These data were visualized in IGV (v.2.12.3)⁸⁹.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Detailed protocols for barcoding experiments are accessible online on the protocols.io platform, and up-to-date protocols and resources can be found at lightseq.io. These protocols cover the following: (1) in situ reverse transcription and A-tailing, (2) in situ spatial barcoding, (3) displacement and extraction of barcoded cDNA sequences, (4) cross-junction synthesis, (5) PCR amplification and (6) library preparation via tagmentation. Raw sequencing data are available online in NCBI’s Gene Expression Omnibus and are accessible through GEO Series accession number GSE208650. The gene mappings and counts for human–mouse cell mixing experiment replicates are provided in the source data for Extended Data Fig. 2. The full lists of differentially enriched genes enriched between layers in the retina tissue experiment are provided in the source data for Fig. 4. The full list of differentially enriched genes enriched between TH⁺ and TH⁻ cells in the rare retinal AC tissue experiment is provided in the source data of Fig. 5. All SABER-FISH probe sequences are provided in Supplementary Table 7. Correspondence and requests for materials should be addressed to J.Y.K., S.K.S., P.Y. or C.L.C.

Code availability

Code is available on GitHub at https://github.com/Harvard-MolSys-Lab/Light-Seq-Nature-Methods-2022. This includes the code for image analysis, sequence analysis and differential gene expression analysis and plotting.

References

Altemose, N. et al. μDamID: a microfluidic approach for joint imaging and sequencing of protein-DNA interactions in single cells. Cell Syst. 11, 354–366.e9 (2020).
Article PubMed PubMed Central CAS Google Scholar
Zhang, J. Q. et al. Linked optical and gene expression profiling of single cells at high-throughput. Genome Biol. 21, 49 (2020).
Article PubMed PubMed Central Google Scholar
Nitta, N. et al. Intelligent image-activated cell sorting. Cell 175, 266–276.e13 (2018).
Article PubMed CAS Google Scholar
Schraivogel, D. et al. High-speed fluorescence image-enabled cell sorting. Science 375, 315–320 (2022).
Article PubMed PubMed Central CAS Google Scholar
Hasle, N. et al. High-throughput, microscope-based sorting to dissect cellular heterogeneity. Mol. Syst. Biol. 16, e9442 (2020).
Article PubMed PubMed Central CAS Google Scholar
You, L. et al. Functional single cell selection and annotated profiling of dynamically changing cancer cells. Preprint at bioRxiv https://doi.org/10.1101/2021.10.12.464054 (2021).
Medaglia, C. et al. Spatial reconstruction of immune niches by combining photoactivatable reporters and scRNA-seq. Science 358, 1622–1626 (2017).
Article PubMed PubMed Central CAS Google Scholar
Lovatt, D. et al. Transcriptome in vivo analysis (TIVA) of spatially defined single cells in live tissue. Nat. Methods 11, 190–196 (2014).
Article PubMed PubMed Central CAS Google Scholar
Asp, M., Bergenstråhle, J. & Lundeberg, J. Spatially resolved transcriptomes—next generation tools for tissue exploration. Bioessays 42, e1900221 (2020).
Article PubMed Google Scholar
Moses, L. & Pachter, L. Museum of spatial transcriptomics. Nat. Methods 19, 534–546 (2022).
Article PubMed CAS Google Scholar
Rodriques, S. G. et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).
Article PubMed PubMed Central CAS Google Scholar
Stickels, R. R. et al. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 39, 313–319 (2021).
Article PubMed CAS Google Scholar
Vickovic, S. et al. High-definition spatial transcriptomics for in situ tissue profiling. Nat. Methods 16, 987–990 (2019).
Article PubMed PubMed Central CAS Google Scholar
Cho, C.-S. et al. Microscopic examination of spatial transcriptome using Seq-Scope. Cell 184, 3559–3572.e22 (2021).
Article PubMed PubMed Central CAS Google Scholar
Chen, A. et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 185, 1777–1792.e21 (2022).
Article PubMed CAS Google Scholar
Srivatsan, S. R. et al. Embryo-scale, single-cell spatial transcriptomics. Science 373, 111–117 (2021).
Article PubMed PubMed Central CAS Google Scholar
Liu, Y. et al. High-spatial-resolution multi-omics sequencing via deterministic barcoding in tissue. Cell 183, 1665–1681.e18 (2020).
Article PubMed PubMed Central CAS Google Scholar
Eng, C.-H. L. et al. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH. Nature 568, 235–239 (2019).
Article PubMed PubMed Central CAS Google Scholar
Moffitt, J. R. et al. High-throughput single-cell gene-expression profiling with multiplexed error-robust fluorescence in situ hybridization. Proc. Natl Acad. Sci. USA 113, 11046–11051 (2016).
Article PubMed PubMed Central CAS Google Scholar
Gyllborg, D. et al. Hybridization-based in situ sequencing (HybISS) for spatially resolved transcriptomics in human and mouse brain tissue. Nucleic Acids Res. 48, e112 (2020).
Article PubMed PubMed Central CAS Google Scholar
Lee, J. H. et al. Highly multiplexed subcellular RNA sequencing in situ. Science 343, 1360–1363 (2014).
Article PubMed PubMed Central CAS Google Scholar
Ke, R. et al. In situ sequencing for RNA analysis in preserved tissue and cells. Nat. Methods 10, 857–860 (2013).
Article PubMed CAS Google Scholar
Payne, A. C. et al. In situ genome sequencing resolves DNA sequence and structure in intact biological samples. Science 371, eaay3446 (2021).
Article PubMed CAS Google Scholar
Merritt, C. R. et al. Multiplex digital spatial profiling of proteins and RNA in fixed tissue. Nat. Biotechnol. 38, 586–599 (2020).
Article PubMed CAS Google Scholar
Chien, M.-P., Werley, C. A., Farhi, S. L. & Cohen, A. E. Photostick: a method for selective isolation of target cells from culture. Chem. Sci. 6, 1701–1705 (2015).
Article PubMed PubMed Central CAS Google Scholar
Nirmal, A. J. et al. The spatial landscape of progression and immunoediting in primary melanoma at single-cell resolution. Cancer Discov. 12, 1518–1541 (2022).
Article PubMed PubMed Central CAS Google Scholar
Civita, P. et al. Laser capture microdissection and RNA-seq analysis: high sensitivity approaches to explain histopathological heterogeneity in human glioblastoma FFPE archived tissues. Front. Oncol. 9, 482 (2019).
Article PubMed PubMed Central Google Scholar
Honda, M. et al. High-depth spatial transcriptome analysis by photo-isolation chemistry. Nat. Commun. 12, 4416 (2021).
Article PubMed PubMed Central CAS Google Scholar
Hu, K. H. et al. ZipSeq: barcoding for real-time mapping of single cell transcriptomes. Nat. Methods 17, 833–843 (2020).
Article PubMed PubMed Central CAS Google Scholar
Liu, N., Dai, M., Saka, S. K. & Yin, P. Super-resolution labelling with Action-PAINT. Nat. Chem. 11, 1001–1008 (2019).
Article PubMed PubMed Central CAS Google Scholar
Nakamura, S., Kawabata, H. & Fujimoto, K. Double duplex invasion of DNA induced by ultrafast photo-cross-linking using 3-cyanovinylcarbazole for antigene methods. Chem. Commun. 53, 7616–7619 (2017).
Article CAS Google Scholar
Kim, S. H., Liu, Y., Hoelzel, C., Zhang, X. & Lee, T.-H. Super-resolution optical lithography with DNA. Nano Lett. 19, 6035–6042 (2019).
Article PubMed CAS Google Scholar
Vieregg, J. R., Nelson, H. M., Stoltz, B. M. & Pierce, N. A. Selective nucleic acid capture with shielded covalent probes. J. Am. Chem. Soc. 135, 9691–9699 (2013).
Article PubMed PubMed Central CAS Google Scholar
Singh-Gasson, S. et al. Maskless fabrication of light-directed oligonucleotide microarrays using a digital micromirror array. Nat. Biotechnol. 17, 974–978 (1999).
Article PubMed CAS Google Scholar
Kishi, J. Y., Schaus, T. E., Gopalkrishnan, N., Xuan, F. & Yin, P. Programmable autonomous synthesis of single-stranded DNA. Nat. Chem. 10, 155–164 (2018).
Article PubMed CAS Google Scholar
Lee, C. S., Davis, R. W. & Davidson, N. A physical study by electron microscopy of the terminally repetitious, circularly permuted DNA from the coliphage particles of Escherichia coli 15. J. Mol. Biol. 48, 1–22 (1970).
Article PubMed CAS Google Scholar
Yurke, B., Turberfield, A. J., Mills, A. P. Jr, Simmel, F. C. & Neumann, J. L. A DNA-fuelled molecular machine made of DNA. Nature 406, 605–608 (2000).
Article PubMed CAS Google Scholar
Panyutin, I. G. & Hsieh, P. The kinetics of spontaneous DNA branch migration. Proc. Natl Acad. Sci. USA 91, 2021–2025 (1994).
Article PubMed PubMed Central CAS Google Scholar
Tecott, L. H., Barchas, J. D. & Eberwine, J. H. In situ transcription: specific synthesis of complementary DNA in fixed tissue sections. Science 240, 1661–1664 (1988).
Article PubMed CAS Google Scholar
Sheng, K., Cao, W., Niu, Y., Deng, Q. & Zong, C. Effective detection of variation in single-cell transcriptomes using MATQ-seq. Nat. Methods 14, 267–270 (2017).
Article PubMed CAS Google Scholar
Cocquet, J., Chong, A., Zhang, G. & Veitia, R. A. Reverse transcriptase template switching and false alternative transcripts. Genomics 88, 127–131 (2006).
Article PubMed CAS Google Scholar
Zhu, Y. Y., Machleder, E. M., Chenchik, A., Li, R. & Siebert, P. D. Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction. Biotechniques 30, 892–897 (2001).
Article PubMed CAS Google Scholar
Wulf, M. G. et al. Non-templated addition and template switching by Moloney murine leukemia virus (MMLV)-based reverse transcriptases co-occur and compete with each other. J. Biol. Chem. 294, 18220–18231 (2019).
Article PubMed PubMed Central CAS Google Scholar
Dowling, J. E. The Retina: An Approachable Part of the Brain, Revised Edition (Harvard Univ. Press, 2012).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article PubMed PubMed Central Google Scholar
Shekhar, K. et al. Comprehensive classification of retinal bipolar neurons by single-cell transcriptomics. Cell 166, 1308–1323.e30 (2016).
Article PubMed PubMed Central CAS Google Scholar
Denninger, J. K. et al. Robust transcriptional profiling and identification of differentially expressed genes with low input RNA sequencing of adult hippocampal neural stem and progenitor populations. Front. Mol. Neurosci. 15, 810722 (2022).
Article PubMed PubMed Central CAS Google Scholar
Raj, A., van den Bogaard, P., Rifkin, S. A., van Oudenaarden, A. & Tyagi, S. Imaging individual mRNA molecules using multiple singly labeled probes. Nat. Methods 5, 877–879 (2008).
Article PubMed PubMed Central CAS Google Scholar
Kishi, J. Y. et al. SABER amplifies FISH: enhanced multiplexed imaging of RNA and DNA in cells and tissues. Nat. Methods 16, 533–544 (2019).
Article PubMed PubMed Central CAS Google Scholar
West, E. R. et al. Spatiotemporal patterns of neuronal subtype genesis suggest hierarchical development of retinal diversity. Cell Rep. 38, 110191 (2022).
Article PubMed CAS Google Scholar
Lee, S. et al. Covering all your bases: incorporating intron signal from RNA-seq data. NAR Genom. Bioinform. 2, lqaa073 (2020).
Article PubMed PubMed Central Google Scholar
Gustincich, S., Feigenspan, A., Wu, D. K., Koopman, L. J. & Raviola, E. Control of dopamine release in the retina: a transgenic approach to neural networks. Neuron 18, 723–736 (1997).
Article PubMed CAS Google Scholar
Yan, W. et al. Mouse retinal cell atlas: molecular identification of over sixty amacrine cell types. J. Neurosci. 40, 5177–5195 (2020).
Article PubMed PubMed Central CAS Google Scholar
Gustincich, S. et al. Gene discovery in genetically labeled single dopaminergic neurons of the retina. Proc. Natl Acad. Sci. USA 101, 5069–5074 (2004).
Article PubMed PubMed Central CAS Google Scholar
Puopolo, M., Hochstetler, S. E., Gustincich, S., Wightman, R. M. & Raviola, E. Extrasynaptic release of dopamine in a retinal neuron: activity dependence and transmitter modulation. Neuron 30, 211–225 (2001).
Article PubMed CAS Google Scholar
Witkovsky, P. et al. Activity-dependent phosphorylation of tyrosine hydroxylase in dopaminergic neurons of the rat retina. J. Neurosci. 24, 4242–4249 (2004).
Article PubMed PubMed Central CAS Google Scholar
Greferath, U. et al. GABAA receptor subunits have differential distributions in the rat retina: in situ hybridization and immunohistochemistry. J. Comp. Neurol. 353, 553–571 (1995).
Article PubMed CAS Google Scholar
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
Article PubMed Google Scholar
Borm, L. E. et al. Scalable in situ single-cell profiling by electrophoretic capture of mRNA. Preprint at bioRxiv https://doi.org/10.1101/2022.01.12.476082 (2022).
Nuovo, G. J. The foundations of successful RT in situ PCR. Front. Biosci. 1, c4–c15 (1996).
Article PubMed CAS Google Scholar
Marshall, J. L. et al. HyPR-seq: single-cell quantification of chosen RNAs via hybridization and sequencing of DNA probes. Proc. Natl Acad. Sci. USA 117, 33404–33413 (2020).
Article PubMed PubMed Central CAS Google Scholar
Ouimet, C. C., Hemmings, H. C. Jr & Greengard, P. ARPP-21, a cyclic AMP-regulated phosphoprotein enriched in dopamine-innervated brain regions. II. Immunocytochemical localization in rat brain. J. Neurosci. 9, 865–875 (1989).
Article PubMed PubMed Central CAS Google Scholar
Quinn, J. P., Kandigian, S. E., Trombetta, B. A., Arnold, S. E. & Carlyle, B. C. VGF as a biomarker and therapeutic target in neurodegenerative and psychiatric diseases. Brain Commun. 3, fcab261 (2021).
Article PubMed PubMed Central Google Scholar
Jiang, F., Wu, Q., Sun, S., Bi, G. & Guo, L. Identification of potential diagnostic biomarkers for Parkinson’s disease. FEBS Open Bio 9, 1460–1468 (2019).
Article PubMed PubMed Central CAS Google Scholar
Golzio, C. et al. Matthew-Wood syndrome is caused by truncating mutations in the retinol-binding protein receptor gene STRA6. Am. J. Hum. Genet. 80, 1179–1187 (2007).
Article PubMed PubMed Central CAS Google Scholar
Pasutto, F. et al. Mutations in STRA6 cause a broad spectrum of malformations including anophthalmia, congenital heart defects, diaphragmatic hernia, alveolar capillary dysplasia, lung hypoplasia, and mental retardation. Am. J. Hum. Genet. 80, 550–560 (2007).
Article PubMed PubMed Central CAS Google Scholar
Gellens, M. E. et al. Primary structure and functional expression of the human cardiac tetrodotoxin-insensitive voltage-dependent sodium channel. Proc. Natl Acad. Sci. USA 89, 554–558 (1992).
Article PubMed PubMed Central CAS Google Scholar
Veerman, C. C., Wilde, A. A. M. & Lodder, E. M. The cardiac sodium channel gene SCN5A and its gene product NaV1.5: role in physiology and pathophysiology. Gene 573, 177–187 (2015).
Article PubMed PubMed Central CAS Google Scholar
Fazal, F. M. et al. Atlas of subcellular RNA localization revealed by APEX-Seq. Cell 178, 473–490.e26 (2019).
Article PubMed PubMed Central CAS Google Scholar
Saka, S. K. et al. Immuno-SABER enables highly multiplexed and amplified protein imaging in tissues. Nat. Biotechnol. 37, 1080–1090 (2019).
Article PubMed PubMed Central CAS Google Scholar
Lin, J.-R. et al. Highly multiplexed immunofluorescence imaging of human tissues and tumors using t-CyCIF and conventional optical microscopes. eLife 7, e31657 (2018).
Article PubMed PubMed Central Google Scholar
Schürch, C. M. et al. Coordinated cellular neighborhoods orchestrate antitumoral immunity at the colorectal cancer invasive front. Cell 182, 1341–1359.e19 (2020).
Article PubMed PubMed Central Google Scholar
Zadeh, J. N. et al. NUPACK: analysis and design of nucleic acid systems. J. Comput. Chem. 32, 170–173 (2011).
Article PubMed CAS Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article PubMed CAS Google Scholar
Kent, W. J. BLAT—the BLAST-like alignment tool. Genome Res. 12, 656–664 (2002).
PubMed PubMed Central CAS Google Scholar
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Article PubMed CAS Google Scholar
Hershberg, E. A. et al. PaintSHOP enables the interactive design of transcriptome- and genome-scale oligonucleotide FISH experiments. Nat. Methods 18, 937–944 (2021).
Article PubMed PubMed Central CAS Google Scholar
Muhlich, J. L. et al. Stitching and registering highly multiplexed whole slide images of tissues and tumors using ASHLAR. Bioinformatics (2022) https://doi.org/10.1093/bioinformatics/btac544
Burel, J.-M. et al. Publishing and sharing multi-dimensional image data with OMERO. Mamm. Genome 26, 441–447 (2015).
Article PubMed PubMed Central Google Scholar
Smith, T., Heger, A. & Sudbery, I. UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Res. 27, 491–499 (2017).
Article PubMed PubMed Central CAS Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article PubMed CAS Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article PubMed CAS Google Scholar
Zhao, M., Lee, W.-P., Garrison, E. P. & Marth, G. T. SSW library: an SIMD Smith-Waterman C/C++ library for use in genomic applications. PLoS ONE 8, e82138 (2013).
Article PubMed PubMed Central Google Scholar
Amamoto, R. et al. Probe-Seq enables transcriptional profiling of specific cell types from heterogeneous tissue by RNA-based isolation. eLife 8, e51452 (2019).
Article PubMed PubMed Central CAS Google Scholar
Jeon, C. J., Strettoi, E. & Masland, R. H. The major cell populations of the mouse retina. J. Neurosci. 18, 8936–8946 (1998).
Article PubMed PubMed Central CAS Google Scholar
Wang, L., Wang, S. & Li, W. RSeQC: quality control of RNA-seq experiments. Bioinformatics 28, 2184–2185 (2012).
Article PubMed CAS Google Scholar
Phipson, B., Zappia, L. & Oshlack, A. Gene length and detection bias in single cell RNA sequencing protocols. F1000Res. 6, 595 (2017).
Article PubMed PubMed Central Google Scholar
Karolchik, D., Hinrichs, A. S. & Kent, W. J. The UCSC Genome Browser. Curr. Protoc. Hum. Genet. Chapter 18, Unit18.6 (2011).
PubMed Google Scholar
Thorvaldsdóttir, H., Robinson, J. T. & Mesirov, J. P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 14, 178–192 (2013).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors acknowledge funding from the Wyss Institute (through their Validation Project Program for J.Y.K., N.L., P.Y. and S.K.S.; the Molecular Robotics Initiative; and a Technology Development Fellowship to J.Y.K.), the National Institutes of Health (under grant nos. UG3HL145600, UH3CA255133, DP1GM133052, R01GM124401, RF1MH124606 and RF1MH128861 to P.Y.), the Office of Naval Research (under grant no. N00014–18–1–2549 to P.Y.), the Chan Zuckerberg Initiative (under grant no. 2019-02433 to P.Y.), the European Molecular Biology Laboratory (S.K.S.) and Howard Hughes Medical Institute (E.R.W., C.L.C.). The authors acknowledge the MicRoN Core at Harvard and P. Montero Llopis for imaging equipment and expertise. We thank Kylie the cat for being our feline model for photomask generation. We thank I. Goldaracena for assistance with cell experiments, J. Immen for help with graphic design and T. Ferrante, J. Rosenberg, J. McDonough and B. Bedell for discussion and feedback. Several figure panels (Figs. 1, 2a and 3b–c) were partially created with BioRender.com.

Author information

Sinem K. Saka
Present address: Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
These authors contributed equally: Jocelyn Y. Kishi, Ninning Liu, Emma R. West.

Authors and Affiliations

Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA
Jocelyn Y. Kishi, Ninning Liu, Kuanwei Sheng, Jack J. Jordanides, Matthew Serrata, Sinem K. Saka & Peng Yin
Department of Systems Biology, Harvard Medical School, Boston, MA, USA
Jocelyn Y. Kishi, Sinem K. Saka & Peng Yin
Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
Emma R. West & Constance L. Cepko
Howard Hughes Medical Institute, Chevy Chase, MD, USA
Emma R. West & Constance L. Cepko
Department of Ophthalmology, Harvard Medical School, Boston, MA, USA
Constance L. Cepko

Authors

Jocelyn Y. Kishi
View author publications
You can also search for this author in PubMed Google Scholar
Ninning Liu
View author publications
You can also search for this author in PubMed Google Scholar
Emma R. West
View author publications
You can also search for this author in PubMed Google Scholar
Kuanwei Sheng
View author publications
You can also search for this author in PubMed Google Scholar
Jack J. Jordanides
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Serrata
View author publications
You can also search for this author in PubMed Google Scholar
Constance L. Cepko
View author publications
You can also search for this author in PubMed Google Scholar
Sinem K. Saka
View author publications
You can also search for this author in PubMed Google Scholar
Peng Yin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Y.K. conceived and led the study, designed and performed experiments, wrote software, analyzed data and wrote the manuscript. N.L. conceived the study, designed and performed experiments, wrote software, analyzed data and wrote the manuscript. E.R.W. designed and performed experiments, wrote software, analyzed data and wrote the manuscript. K.S. contributed to experimental design, method optimization and data analysis. J.J.J. performed experiments and contributed to optimization of the method and writing of the manuscript. M.S. provided experimental assistance. C.L.C. contributed expertise and supervision, experimental design and data analysis. S.K.S. conceived the study, provided scientific and technical guidance, designed and performed experiments, analyzed the results and wrote the manuscript. P.Y. conceived and supervised the study, designed the experiments and wrote the manuscript. All authors edited and approved the manuscript.

Corresponding authors

Correspondence to Jocelyn Y. Kishi, Constance L. Cepko, Sinem K. Saka or Peng Yin.

Ethics declarations

Competing interests

J.Y.K., N.L., P.Y. and S.K.S. are inventors on patent applications covering the method. Multiple authors are involved in commercialization of the technique and engage with Digital Biology, Inc. (J.Y.K. and E.R.W. are co-founders and employees; P.Y. is co-founder, equity holder, director and consultant; S.K.S. is anticipated to be a consulting scientific co-founder; N.L. is a consulting founding scientist; J.J.J. is an employee.) P.Y. is also a co-founder, equity holder, director and consultant of Ultivue, Inc. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks Yang Liu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Rita Strack, in collaboration with the Nature Methods team. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 High resolution light-directed DNA barcoding.

(a) Fluorescent image of a dot array printed onto a glass slide functionalized with DNA docking sequences. Dots were printed through targeted photocrosslinking of fluorescent DNA barcode strands to complementary docking sites (see also Fig. 2a). Five dots were chosen for a profile scan of gray values (magenta dashes), pixel contrast set to 450–800. (b) Linescans from panel a (dotted colored lines) were averaged into a single linescan (black dots with dashes). Averaged linescan was fit to a Gaussian curve (blue). A single dot corresponds to a single activated DMD mirror, estimated to illuminate a 0.76 µm diameter area. FWHM from the fit was ~1.56 µm. (c) Subcellular labeling of 3T3 cells with a 405 nm laser on a point-scanning confocal microscope (n = 4 cells from a single field of view). The photomask used for crosslinking was scaled to the size of the fluorescent image and manually overlaid (magenta) to aid in visualization. A profile scan was performed on the rectangular area between the magenta dashed lines. (d) Intensity profile of the dotted box from panel c, data was fit to a Gaussian curve (green) with a measured FWHM of 4.4 µm. A second exponential decay was fitted to one-half of the profile scan (blue) to calculate a 84–16% criterion, the distance across which the signal drops from 84 to 16% of the maximum value. Distance of the 84–16% drop was calculated from the exponential fit to be 2.67 µm. Vertical dashed lines indicate the estimated ROI boundary from the photomask. Width of the photomask was estimated to be 4 µm. (e-g) The single field of view in panel c imaged on a confocal scanning microscope. Nuclear signal (cyan) with the ROI selection (white lines) overlaid (e), fluorescent Cy3 barcode after stringent washes of non-crosslinked strands (f), overlay with lower contrast display of nuclear signal to enable visualization of the overlapping Cy3 signal (g).

Extended Data Fig. 2 Cell segmentation and read counts of 3T3 and HEK cells.

(a) Representative segmentation results from the fluorescent signal of the mouse (top row) and human (bottom row) barcoded cells (representative from n = 3 technical replicates). Masks were used to calculate barcoded area (also see Methods and Supplementary Table 1). (b) A single ~200 million read dataset from the cell mixing experiment (magenta) was mapped to a merged genome and subsampled by fraction of reads without replacement and processed with the UMI deduplication pipeline. Average number of UMIs from 5 simulated datasets are shown (cyan). (c-d) Scatterplots and histograms of normalized expression level (log₂(TPM + 1)) between the three technical replicates for cells. We only considered the genes detected across all replicates (log₂(TPM + 1) cutoff of ≥1). Highlighted data points (orange) indicate top 200 genes, remaining genes are colored blue. Pearson correlation for all genes (black) and top 200 genes (orange) reported for (c) human cells and (d) mouse cells. Histograms of log₂(TPM + 1) distributions (excluding top 200 genes) for each replicate are plotted on the diagonals. Full list of gene mappings and counts is provided as Source Data Table 1.

Source data

Extended Data Fig. 3 Barcoding of retinal layers.

(a) Brightfield images of a mouse retina cryosection with the barcoded area overlaid. From left to right, Outer Nuclear Layer (ONL), Bipolar Cell Layer (BCL), Ganglion Cell Layer (GCL). Scale bar is 100 µm. Binary images show the selected ROIs that were used as barcoding photomasks. Pixel size is 1.6 µm/pixel. (b) Single Z-plane spinning disc confocal images taken after barcode crosslinking for DAPI and the fluorescent barcodes 1–3 (labeled with Cy5, Cy3 and Fluorescein, respectively). (c) Single Z-plane images of DAPI, WGA, and immunofluorescence for PAX6 and VSX2 proteins in the same barcoded cells after recovery of barcoded cDNAs for sequencing. Images in (b) and (c) are displayed with auto scaling (with minimum set to zero). Scale bars are 50 µm. (d) Single Z-plane spinning disk confocal images of barcode fluorescence (white) overlaid with DAPI (blue) within each barcoded layer, displaying the different cellular morphologies with differences in cell size, cytoplasmic area, RNA density for each cellular layer that is comprised of different cell types. Scale bars are 10 µm. UMI counts per unit area (10 µm x 10 µm) are listed for each barcoded layer. Panels a-d are representative images from n = 4 technical replicates.

Extended Data Fig. 4 Sequencing metrics and sensitivity of Light-Seq for retinal layer experiment.

(a) Sequence processing pipeline. (b) Sunburst plots depicting fractions of reads filtered at each processing step. (c) PCA plot of Light-Seq replicates (n = 4 technical replicates per layer). (d) Correlation matrix of genes enriched in ONL versus BCL in Light-Seq and Drop-Seq data⁴⁶ (p_adj < 0.05, two-sided Wald test with Benjamini-Hochberg adjustment for multiple hypothesis testing). (e) Subtraction-based enrichment of approximate difference in Light-Seq transcripts per cell between ONL and BCL to simulated difference Drop-Seq transcripts per cell between the ONL and BCL. Genes significantly enriched in either the ONL or BCL in both assays are plotted (p_adj < 0.05, two-sided Wald test with Benjamini-Hochberg adjustment for multiple hypothesis testing). Zoom of the plot shown on right. (f) Estimated reads per cell in Drop-Seq (simulated BCL) versus Light-Seq BCL data (barcode 2) for genes enriched in the BCL in both assays (p_adj < 0.05). (g) Boxplots of mean Drop-Seq (n = 6 sample replicates) vs Light-Seq (n = 4 section replicates) counts per gene per for different transcript lengths. Pearson R and median ratio shown. Median line and quartiles bound the box, with whiskers marking 1.5× the interquartile range. (h) Sensitivity of Light-Seq relative to smFISH for 16 BCL marker genes with published single-cell smFISH data⁵⁰. Based on the number of cells within the BCL, sensitivity calculated as [# expected transcripts by smFISH]/[# observed Light-Seq reads]. Dots represent the sensitivity of a single replicate (n = 4 replicates). Error bars show standard deviation, centered at the mean. (i) Sensitivity of Drop-Seq relative to smFISH. Based on the number of cells in the pooled bipolar clusters in Shekhar et al., 2016⁴⁶, sensitivity calculated as [# of expected transcripts by smFISH]/[number of observed Drop-Seq reads] (see Methods). Dots reflect sensitivity of a single replicate/gene (n = 6 sample replicates). Error bars reflect standard deviation centered around the mean. (j) Difference between mean Light-Seq and Drop-Seq sensitivity per gene from (h) and (i). Error bars show standard error for the difference of means. For panels h-j, genes are arranged and colored by ascending gene length.

Extended Data Fig. 5 Gene body coverage and gene length read distributions for retinal layers.

(a) Gene body coverage for transcripts 100 nt and up (top), 1000 nt and up (middle), 10000 nt and up (bottom). (b-c) Reads Per Kilobase of transcript, per Million mapped reads (RPKM) (panel b) and read counts (panel c) for all barcode-replicate conditions across different transcript length bins (bins based on Phipson et al., 2017⁸⁷). All box plots show median line and quartiles bounding the box, with whiskers marking 1.5× the interquartile range. n = 4 technical replicates.

Extended Data Fig. 6 TH⁺ AC ROI selection and signal.

(a) TH IF of a TH⁺ AC imaged on a confocal microscope with a 40× objective with barcoding ROI overlaid (magenta, left). Fluorescent image of the barcoded region (right). Pixel value set to 0–400. (b) Profile scan performed on the 50 × 5 µm rectangle across the fluorescent barcode signal (dotted box in panel a). Dashed vertical lines indicate ROI boundary. (c) Selected images of single TH⁺ amacrine cells stained with IF (top), with the ROIs overlaid (magenta, bottom). ROIs were drawn slightly inside the cell bodies to account for light-scattering at the ROI boundary. Scale bars are 10 µm. Panels (a) and (c) are representative images from n = 5 technical replicates. (d) A single sequencing run depth at 3.7 million read depth from a representative experimental condition was selected for subsampling without replacement illustrating UMI scaling by fraction of reads for the genes Th, Cartpt, and total UMIs. Mean + /- standard deviation of 5 simulations (cyan) are plotted, with full dataset represented as a single point (magenta).

Extended Data Fig. 7 Gene body coverage and gene length read distributions for amacrine cell experiment.

(a) Gene body coverage for transcripts 100 nt and up (left), 1000 nt and up (middle), 10000 nt and up (right). (b-c) Reads Per Kilobase of transcript, per Million mapped reads (RPKM) (in b) and read counts (in c) for all barcode-replicate conditions across different transcript length bins. All box plots show median line and quartiles bounding the box, with whiskers marking 1.5× the interquartile range. n = 5 technical replicates.

Supplementary information

Supplementary Information

Supplementary Information contains Supplementary Figs. 1–3, Tables 1–6, Notes 1–4 and references.

Reporting Summary

Peer Review File

Supplementary Table 7

List of SABER-FISH probe sets used for RNA-FISH validation experiments.

Source data

Source Data Fig. 4

The full lists of differentially enriched genes enriched between layers in the retina tissue experiment.

Source Data Fig. 5

The full list of differentially enriched genes enriched between TH⁺ and TH⁻ cells in the rare retinal amacrine cell tissue experiment.

Source Data Extended Data Fig. 2

The gene mappings and counts for human–mouse cell mixing experiment replicates.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kishi, J.Y., Liu, N., West, E.R. et al. Light-Seq: light-directed in situ barcoding of biomolecules in fixed cells and tissues for spatially indexed sequencing. Nat Methods 19, 1393–1402 (2022). https://doi.org/10.1038/s41592-022-01604-1

Download citation

Received: 21 December 2021
Accepted: 10 August 2022
Published: 10 October 2022
Issue Date: November 2022
DOI: https://doi.org/10.1038/s41592-022-01604-1

This article is cited by

Spatial multi-omics: novel tools to study the complexity of cardiovascular diseases
- Paul Kiessling
- Christoph Kuppe
Genome Medicine (2024)
Multiplex protein imaging in tumour biology
- Natalie de Souza
- Shan Zhao
- Bernd Bodenmiller
Nature Reviews Cancer (2024)
Profiling joint tissues at single-cell resolution: advances and insights
- Akshay Pandey
- Nidhi Bhutani
Nature Reviews Rheumatology (2024)
Spatial transcriptomics in development and disease
- Ran Zhou
- Gaoxia Yang
- Yuan Wang
Molecular Biomedicine (2023)
Barcoded multiple displacement amplification for high coverage sequencing in spatial genomics
- Jinhyun Kim
- Sungsik Kim
- Sunghoon Kwon
Nature Communications (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Light-Seq overview and barcoding chemistry

Utilizing Light-Seq for whole transcriptome sequencing

Spatial sequencing with Light-Seq in tissue sections

Rare cell transcriptomics with Light-Seq

Discussion

Methods

Oligo design and preparation

Cell culture

Tissues

Barcoding setup

Retina tissue barcoding and immunofluorescence

Barcoding TH+ DACs

Immunostaining after sequencing TH+/− ACs

RNA-FISH validation of TH+ DAC markers

Barcoding and immunofluorescence on cultured cells

Library preparation and sequencing

In vitro surface barcoding

Imaging setups

Image processing and analysis

Sequencing data processing and differential gene expression analysis

Sensitivity estimation

Chimeric read analysis

Intron analysis

Gene length bias analysis

Gene body coverage analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links

Barcoding TH⁺ DACs

Immunostaining after sequencing TH^+/− ACs

RNA-FISH validation of TH⁺ DAC markers