High-throughput muscle fiber typing from RNA sequencing data

Oskolkov, Nikolay; Santel, Malgorzata; Parikh, Hemang M.; Ekström, Ola; Camp, Gray J.; Miyamoto-Mikami, Eri; Ström, Kristoffer; Mir, Bilal Ahmad; Kryvokhyzha, Dmytro; Lehtovirta, Mikko; Kobayashi, Hiroyuki; Kakigi, Ryo; Naito, Hisashi; Eriksson, Karl-Fredrik; Nystedt, Björn; Fuku, Noriyuki; Treutlein, Barbara; Pääbo, Svante; Hansson, Ola

doi:10.1186/s13395-022-00299-4

Research
Open access
Published: 02 July 2022

High-throughput muscle fiber typing from RNA sequencing data

Nikolay Oskolkov^1,2,
Malgorzata Santel³,
Hemang M. Parikh⁴,
Ola Ekström¹,
Gray J. Camp³,
Eri Miyamoto-Mikami⁵,
Kristoffer Ström^1,6,
Bilal Ahmad Mir¹,
Dmytro Kryvokhyzha¹,
Mikko Lehtovirta^1,7,
Hiroyuki Kobayashi⁸,
Ryo Kakigi⁹,
Hisashi Naito⁵,
Karl-Fredrik Eriksson¹,
Björn Nystedt¹⁰,
Noriyuki Fuku⁵,
Barbara Treutlein³,
Svante Pääbo^3,11 &
…
Ola Hansson^1,7

Skeletal Muscle volume 12, Article number: 16 (2022) Cite this article

7408 Accesses
5 Citations
35 Altmetric
Metrics details

Abstract

Background

Skeletal muscle fiber type distribution has implications for human health, muscle function, and performance. This knowledge has been gathered using labor-intensive and costly methodology that limited these studies. Here, we present a method based on muscle tissue RNA sequencing data (totRNAseq) to estimate the distribution of skeletal muscle fiber types from frozen human samples, allowing for a larger number of individuals to be tested.

Methods

By using single-nuclei RNA sequencing (snRNAseq) data as a reference, cluster expression signatures were produced by averaging gene expression of cluster gene markers and then applying these to totRNAseq data and inferring muscle fiber nuclei type via linear matrix decomposition. This estimate was then compared with fiber type distribution measured by ATPase staining or myosin heavy chain protein isoform distribution of 62 muscle samples in two independent cohorts (n = 39 and 22).

Results

The correlation between the sequencing-based method and the other two were r_ATPas = 0.44 [0.13–0.67], [95% CI], and r_myosin = 0.83 [0.61–0.93], with p = 5.70 × 10^–3 and 2.00 × 10^–6, respectively. The deconvolution inference of fiber type composition was accurate even for very low totRNAseq sequencing depths, i.e., down to an average of ~ 10,000 paired-end reads.

Conclusions

This new method (https://github.com/OlaHanssonLab/PredictFiberType) consequently allows for measurement of fiber type distribution of a larger number of samples using totRNAseq in a cost and labor-efficient way. It is now feasible to study the association between fiber type distribution and e.g. health outcomes in large well-powered studies.

Introduction

Our bodies constitute to ~ 30–40% of the skeletal muscle, and it is the most abundant form of the three types of muscle, the others being smooth and cardiac. The skeletal muscle is composed of different fiber types (i.e., muscle cell types), and the relative proportions of these types vary among the muscles, locations within the muscles, individuals, and the sex of individuals [1,2,3,4]. The oxidative and glycolytic potential and the contractile properties differ considerably between fiber types, with the mitochondria-rich slow-twitch fibers (type I) having higher oxidative capacity, and fast-twitch fibers (type IIa and type IIx) having higher glycolytic capacity [1]. The proportions also change as people age, with type II fibers being preferentially affected by sarcopenia [5]. Exercising the skeletal muscle is a major site for catabolic metabolism of the blood glucose and lipids and the metabolic characteristics of this tissue influence both the performance of elite athletes [6, 7] and an individual’s predisposition to disease, e.g., impairments in glucose and lipid transportation into myocytes can promote diabetes and atherosclerotic vascular disease [8,9,10]. Although environmental factors, such a training or sedentary behavior and aging, lead to adaptive alteration in capillary density and fiber composition (an increase in type IIa vs type IIx) in the skeletal muscle, these features are partially under genetic control [11, 12]. However, the extent to which transition of type I to type II fibers and vice-versa occurs remains uncertain [6]. The muscle tissues are usually classified according to the predominant myosin heavy chain (MyHC) isoforms, but this is a simplified classification that disregards the large number of other proteins expressed in the skeletal muscle [2, 13]. It is thus not straightforward to estimate and compare muscle fiber types in humans, e.g., owing to large heterogeneity and limitations of the methodology, and it is even more challenging in frozen, postmortem samples.

In addition to cells from differing types of muscle fibers, a biopsy sample also contain cells such as fibroblasts, endothelial cells, adipocytes, smooth muscle cells, and neuron-associated Schwann cells. Gene expression analysis of the skeletal muscle is thus rendered difficult by the complexity of this tissue. Although sequencing of transcriptomes from muscle biopsies may still provide a perspective on functional differences between individuals, targeted RNA sequencing from isolated cells provides the opportunity to reveal differences specific to the different cell populations [14]. Previous studies investigating the skeletal muscle have applied single-cell RNA sequencing (scRNAseq) to extracted mono-nucleated cells [15,16,17,18,19,20]. These studies have for example described a complex landscape of different cell types, e.g., two different populations of muscle progenitor cells [15], provided detailed knowledge concerning muscle regeneration [19] and muscle disease etiology [20]. A challenge for single-cell genomic studies in the muscle and other solid tissues is to capture cell types that are difficult to isolate in suspension [21], e.g., muscle fibers have not been sequenced in the abovementioned scRNAseq studies. However, a few studies have isolated and sequenced single nuclei including the nuclei from poly-nucleated primary muscle fibers in mice [22,23,24] and e.g. found that myofiber types predominantly express either slow or one of the fast isoforms of MyHC proteins, while only a small proportion of hybrid fibers can express more than one MyHC [22]. However, no study has investigated poly-nucleated primary muscle fibers from humans.

Results

Here, we present a method to estimate the proportion of skeletal muscle fiber types using only muscle tissue RNA sequencing (totRNAseq) data that could be used on frozen samples. The method is based on snRNAseq information of one human individual and then evaluated in two independent larger totRNAseq data sets of the human skeletal muscle (the Muscle SATellite cell study, MSAT, n = 39 and the Juntendo Muscle Study, JMS, n = 23).

A novel muscle fiber type prediction model derived from single-nuclei RNA sequencing

After filtering and quality control (see the “Material and methods” for details), the snRNAseq data set consisted of 2699 nuclei, with 22,084 expressed genes. On average, ~ 500 genes were found per nuclei (Supplementary Fig. 1). Five major clusters of nuclei were identified using graph-based clustering built on Louvain modularity optimization [25]. For visualization of the nuclei populations, t-distributed stochastic neighbor embedding (tSNE) non-linear dimension reduction was applied (Fig. 1a). A separation of type I (cluster B) and type II (cluster A) fiber nuclei is clearly observed, i.e., gene markers of type I and type II fibers (e.g., ATP2A2, TPM3, MYH7B versus ATP2A1, MYBPC2, MYH2) display a distinct expression pattern in different clusters (Fig. 1b). Cluster D is likely representing endothelial cells, i.e., enriched for gene markers like LDB2, VWF, BTNL9, and FLT1. The identity of clusters C and E is less clear but could possibly represent fibroblasts respectively pericytes. The complete list of 48 gene markers for the five clusters are shown in Fig. 1c.

Muscle fiber typing using total RNA sequencing from frozen samples

To evaluate the efficacy of estimating proportions of type I versus type II fiber nuclei in muscle samples from totRNAseq data, a deconvolution analysis [26] was performed in a data set consisting of 39 human subjects, i.e., the MSAT study (Table 1). Briefly, by using the snRNAseq data as a reference, cluster expression signatures were produced by averaging gene expression of cluster gene markers and then applying these to the totRNA MSAT data set by inferring muscle fiber nuclei type via linear matrix decomposition [26] and then finally compare this estimate with the fiber type proportions measured by ATPase staining of the same muscle samples. The correlation between the estimated proportions of muscle fiber nuclei types from totRNAseq data and muscle fiber types from ATPase staining was r = 0.44 [0.13–0.67], [95% CI] at p_spearman = 5.70 × 10^–3, n = 39 (Fig. 2a). The same deconvolution analysis was performed on a second dataset consisting of 23 human subjects from Japan, i.e., the JMS (Table 2). The correlation between the estimated proportions of muscle fiber nuclei types from totRNAseq data and muscle fiber types measured by MyHC protein isoform distribution in the JMS was r = 0.83 [0.61–0.93], [95% CI] at p_spearman = 2.00 × 10^–6, n = 22 (Fig. 2b). One individual was excluded due to low sequencing quality. To further validate the method skeletal muscle totRNAseq data was downloaded from the Genotype-Tissue Expression project (n = 569) and tested for differences in fiber type distribution between men and women. As expected, we observed a larger proportion of type I fibers in women compared to men (68% [54–76%] versus 56% [39–71%], median [95% CI], p_Mann-Whitney = 3.1 × 10^–7, n_women = 171, and n_men = 398, Fig. 2c).

Table 1 Description of the Muscle SATellite cell study (MSAT)

Full size table

Table 2 Description of the Juntendo Muscle Study (JMS)

Full size table

To test the possibility of implementing this method for fiber typing of a large number of samples, we estimated the needed minimal sequencing depth of totRNAseq data that accurately would infer skeletal muscle fiber type composition. An increasing number of randomly selected reads were removed from the totRNAseq data of the 39 samples in the MSAT study. The initial average sequencing depth of 35 million paired-end (PE) reads were “down-sampled” by this method, and Spearman correlations (Fig. 2d) and mean square errors (Supplementary Fig. 2) between the ATPase and totRNAseq predicted fractions of type I fibers were calculated. The accuracy of deconvolution inference of fiber type composition was similar to the 35 million PE reads level even at very low sequencing depths, i.e., down to an of average ~ 10,000 PE reads (Fig. 2d and Supplementary Fig. 2).

Discussion

Here, we present a method to estimate the proportion of skeletal muscle fiber types from frozen samples allowing for a larger number of samples to be measured in a standardized, cost, and labor-efficient way. Skeletal muscle fiber type distribution is a determinant of physical performance [27,28,29,30] and overall health [31,32,33,34,35] and is highly heritable in humans [11, 12]. For example, a reduced proportion of oxidative slow-twitch type I fibers is associated with lower insulin sensitivity in the diabetic muscle [31,32,33,34] and muscle atrophy, e.g., age-related sarcopenia is progressing in a fiber type-specific manner [35]. Recently, it has also been shown that skeletal muscle response and recovery from exercise training is dependent on fiber type composition and is thus an important factor to consider in the development of individualized training advice [36,37,38].

However, many of the abovementioned associations with fiber type distribution are based on low-powered studies. One of the limiting factors is that the methodology for determining fiber type distribution is labor-intensive and thus relatively expensive. The method presented here allows for a larger number of samples to be analyzed using totRNAseq in a standardized and relatively fast way. With this new method, the distribution of type I versus type II fibers can be estimated, but it cannot distinguish between type IIa and type IIx fibers. However, this is achievable with only a small amount of muscle tissue (~ 5–10 mg), e.g., from sampling with the minimally invasive microbiopsy technique [39]. The deconvolution inference of fiber type composition was accurate even for very low sequencing depths, i.e., down to an average of ~ 10,000 PE reads. This means that a shallow-coverage totRNAseq experiment (or targeted RNAseq) will be sufficient to accurately estimate skeletal muscle fiber type composition at a low cost per sample (< 1 dollar/sample). This new method consequently allows for the measurement of fiber type distribution of a larger number of samples and makes it feasible to study the connection between fiber type distribution and health with well-powered studies. It can also be used for estimating fiber type distribution in public repositories of totRNAseq data (e.g., Genotype-Tissue Expression project) and perform in silico analyses of fiber type associations. In conclusion, totRNAseq can efficiently be used to estimate skeletal muscle fiber type distributions of frozen samples.

Materials and methods

snRNAseq data generation

For nuclei isolation from frozen tissue, all following steps were performed on ice with precooled buffers and centrifugation steps were performed at 4 °C. The tissue was disrupted and nuclei liberated through dounce homogenization in ice-cold homogenization buffer (0.32 M sucrose, 3 mM CaCl2, 2 mM magnesium acetate, 0.1 mM EDTA, 1 mM DTT, 10 mg/ml BSA, 10 mM Tris–HCL, protease inhibitors (Sigma-Aldrich)) in the presence of 0.1% NP40. The nuclei suspensions were sequentially passed through 40-, 30-, and 20-μm cell filters (Miltenyi Biotec) and centrifuged at 1000 g for 10 min. The pellet was resuspended in 1% RNase-free BSA in PBS and stained using DAPI (1:1000, BD Pharmingen). Intact single nuclei were sorted in bulk using the DAPI-positive event population, at single-cell sort precision and using a 100-µm nozzle (BD FACS AriaIII) into PBS/1% BSA. Nuclei were counted and loaded on a 10 × chromium microfluidic chip, aiming for the maximum possible number of nuclei to be targeted obtained from the sorting. Single-nucleus experiments were performed using the 10 × genomics single cell 3′ kit v.2 to encapsulate nuclei along with barcode tagged beads, generate, and amplify cDNA and to generate sequencing libraries. Each pooled library was barcoded using i7 barcodes provided by 10 × Genomics. cDNA and sequencing library quality and quantity were determined using Agilent’s High Sensitivity DNA Assay. Libraries were pooled and sequenced in 150-bp paired-end mode on Illumina’s HiSeq platform.

snRNAseq data processing and analysis

Post-processing pipeline Cell Ranger (https://support.10xgenomics.com/single-cell-gene-expression/software/) provided by 10X Genomics was used for demultiplexing, alignment, filtering, barcode, and unique molecular identifier (UMI) counting. The pipeline produced files in FASTQ and BAM formats, as well as the matrix of UMI counts. We used Seurat workflow for further quality control and downstream analysis of the snRNAseq gene expression data. The initial data set contained 2937 nuclei and 22,336 expressed genes. Nuclei with a high fraction of their counts coming from mitochondrial and ribosomal genes were removed. Next, genes with at least one UMI count present in at least one nucleus were selected. After these steps, 2699 nuclei and 22,084 genes were included in the downstream analysis. We normalized the gene expression data with the LogNormalize method of Seurat and standardized the count values prior to performing the Principal Component Analysis (Supplementary Fig. 3). JackStraw procedure [40] was applied as a denoising step in order to select an optimal number of Principal Components (PCs), indicating that 3 PCs to keep for further downstream analysis, which can be viewed as identifying the true intrinsic dimensionality of the snRNAseq data. Further, the number of cells predicted to be proliferative was investigated using a list of the genes annotated as functioning in the cell cycle according to [41]. The vast majority of cells were not detected to be proliferating and the cycling cells did not form any separate cluster (Supplementary Fig. 4a-b). Graph-based clustering based on Louvain modularity optimization [25] with resolution parameter equal to 0.1 was used for detecting boundaries between different populations of nuclei, and tSNE non-linear dimension reduction with perplexity 50, that was chosen as a square root of the number of nuclei according to the k-nearest neighbors rule of thumb, was applied for visualization of the nuclei populations (Fig. 1a). The snRNAseq data was uploaded to Gene Expression Omnibus (GEO) and is available under accession number GSE190489.

Deconvolution analysis

Deconvolution analysis [26] was performed using the snRNAseq data as a reference. For this purpose, the snRNAseq data was used to produce cluster signatures, which are marker gene expressions averaged across cells from each cluster. After that, the gene expression of each type of the skeletal muscle fibers in a bulk RNAseq sample was inferred via linear matrix decomposition [42]. The deconvolution analysis was performed using DeconRNASeq R/Bioconductor package, and the results are presented in Fig. 2a, b.

Study subjects

The MSAT cohort: 39 Swedish male subjects (Table 1) were enrolled in the study by advertising on social media and through local cycling clubs. Inclusion criteria were as follows: (1) male, (2) healthy, not on any medications, and (3) age range between 20 and 55. Subjects were given both oral and written information about the experimental procedures before giving their written informed consent. Each participant went through three visits at different time points. All subjects completed all three visits. The first visit involved a regular doctor’s examination with blood samples and measuring anthropometric characteristics. The second visit consisted, after an overnight fast, of a Wingate test followed by muscle biopsy and VO_2max was measured during the third and last visit. The study was approved by the local Ethics committee, Lund University (Dnr 2015/593). For determination of peak anaerobic power (Wingate) and VO_2max, subjects were instructed to perform only easy training 48 h prior to each test. To determine peak anaerobic power, a 30-s all-out Wingate test [43] was conducted on a cycle ergometer (Monark Peak power). Before the test, a 5-min low intensity ~ 150w warm-up, with instructions to perform a 5 s high cadence drill each minute was performed. The test started with the subject pedaling as fast as possible. When a cadence of 120 rpm was reached, a braking resistance equivalent to 0.7 N × m × kg⁻¹ was applied to the freewheel and remained constant during the 30 s. Subjects were instructed to sit down throughout the test. Strong verbal encouragement was given throughout to ensure a maximal effort was provided. An incremental test to exhaustion was performed to determine VO_2max. The test started with 3 min of cycling at 3 W × kg⁻¹ (rounded down to nearest 10 W) and then increased by 35 W every 2 min until voluntary exhaustion or failure to maintain ≥ 60 rpm. Strong verbal encouragement was given throughout. VO₂ was measured using an Oxycon Pro (Jaeger GmbH, Germany) with a mixing chamber and a 30 s sampling time. Gas sensors were calibrated according to instructions by the vendor before every test. Maximal oxygen uptake was determined as the mean of the two highest values attained during exercise from any 30-s period.

The Juntendo cohort: 23 Japanese subjects (Table 2) were recruited to examine the associations between RNA expression profiles and muscle fiber composition in the Japanese population. All subjects gave their signed informed consent before inclusion in the study. The study protocols were approved by the Ethics Committees of Juntendo University and were performed in accordance with the Declaration of Helsinki. No additional tests were performed in this cohort.

Muscle biopsies and histology

In the MSAT cohort (Table 1), muscle biopsies were taken from the vastus lateralis muscle under sterile conditions and local anesthesia (1% lidocaine) by using a 5-mm Bergström needle and frozen in liquid nitrogen. The biopsies were taken within 5 min after the Wingate test. Serial Sects. (10 μm) were cut using a cryostat at − 20 °C. Myofibrillar ATPase histochemistry was performed by preincubation at pH 4.4, 4.6, and 10.3 to identify fiber types [44]; the proportion of fiber types (i.e., type I, IIa, or IIx) were calculated as the number of each fiber type, divided by the total number of fibers in the section. Computer image analysis was performed using image analysis equipment (BioPix IQ 2.0.16 software, BioPix AB, Sweden).

In the Juntendo cohort (Table 2), muscle biopsies were taken from the vastus lateralis muscle under sterile conditions and local anesthesia (1% lidocaine) by using a disposal needle biopsy instrument (Max Core; C. R. Bard, Covington, GA). The biopsies were collected from approximately 15 cm above the patella in both legs of each subject under ultrasound imaging (Noblus; Aloka, Tokyo, Japan) and avoided the inclusion of subcutaneous fat and the subfascial and myotendinous parts as far as possible. In addition, any visible non-muscle tissues (e.g., adipose tissue) were removed from the biopsy samples. Samples were frozen immediately in liquid nitrogen and stored at − 80℃ until further analysis. Myosin heavy chain (MyHC) protein isoforms were assessed as markers of muscle fiber composition. Frozen muscle samples were homogenized in ice-cold lysis buffer [50 mM HEPES (pH 7.4), 10 mM EDTA, 4 mM EGTA, 50 mM β-glycerophosphate, 25 mM NaF, 5 mM Na₃VO₄], containing a phosphatase inhibitor (PhosSTOP tablet; Roche Diagnostics, Indianapolis, IN), and a protease inhibitor (Complete tablet; Roche Diagnostics). The lysates obtained were centrifuged at 10,000 g for 10 min at 4℃. An insoluble pellet, obtained after centrifugation, was suspended in a sufficient volume of SDS sample buffer [30% glycerol, 5% β-mercaptoethanol, 2.3% SDS, 0.05% bromophenol blue, and 62.5 mM Tris–HCl (pH 6.8)] and boiled at 95℃ for 5 min. MyHC composition was determined by glycerol SDS-PAGE, according to Kakigi et al. [45]. Briefly, protein samples were resolved by performing glycerol SDS-PAGE [stacking gel: 4% acrylamide, 34.7% glycerol, and 125 mM Tris–HCl (pH 6.8); separating gel: 8% acrylamide, 33.3% glycerol, and 375 mM Tris–HCl (pH 8.3)]. Electrophoresis was started at 60 V with stacking gel at 8℃. The voltage was set to 150 V and run for 18 h at 8℃ when the tracking dye had entered the separating gel completely. After separation, the gels were stained with Coomassie brilliant blue (Biosafe G250; Bio-Rad Laboratories, Hercules, CA) and rinsed repeatedly with water. Each gel was scanned using a calibrated densitometer (ChemiDoc Touch Imaging System; Bio-Rad Laboratories), and the relative proportion of MyHC-I, MyHC-IIa, and MyHC-IIx were determined using the calibrated densitometer (ChemiDoc Touch Imaging System) and analytical software (Image Laboratory software version 5.2.1; Bio-Rad Laboratories).

RNA extraction and totRNAseq analysis

In the MSAT cohort, RNA was extracted from 25 to 30 mg of muscle biopsies using a TissueLyser II (Qiagen) and the miRneasy Mini Kit (Qiagen). The RNA concentration was determined using a NanoDrop ND-1000 spectrophotometer (A260/A280 > 1.8 and A260/A230 > 1.0 (NanoDrop Technologies, Wilmington, DE, USA). RNA integrity was verified using the 2200 TapeStation instrument (Agilent Technologies, CA, USA), where all samples had an average RNA integrity number (RIN) above 8. In the Juntendo cohort, frozen muscle samples were crushed with 5.0-mm zirconia beads using a Micro Smash MS-100R (Tomy Seiko, Japan) at 3000 rpm twice for 15 s at 2 °C. The total RNA was extracted from muscle samples using TRIzol® Reagent (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer’s protocol. RNA concentration and purity were checked using a NanoDrop 8000 UV–Vis Spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA). RNA integrity was verified using the 2200 TapeStation instrument (Agilent Technologies, CA, US), where all samples had an average RNA integrity number (RIN) above 8.

All samples from both the MSAT and Juntendo cohorts were sequenced at Lund University using 800 ng input RNA. Library preparation was made using the TruSeq Stranded Total RNA Library Prep Kit with Ribo-Zero Human/Mouse/Rat Set A (Illumina), and the 75 bp paired-end sequencing was performed on a NextSeq instrument using the NextSeq® 500/550 High Output Kit v2 (150 cycles) (Illumina). The sequencing quality was checked with fastQC v0.11.9 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc) and multiQC [46] v1.9. Gene expression was assessed using Salmon [47] v1.2.1. Exon expression was obtained by mapping reads with STAR [48] v2.7.6 and counting with featureCounts [49] v2.0.1 (options: -p -f -C -O). We used the GRCh38 Ensembl v77 [50] as a reference genome. We used the Seqtk tool (https://github.com/lh3/seqtk) for gradual random downsampling of the MSAT data and applied Salmon for quantifying gene expression of the downsampled RNAseq data. The quantified gene expression for each downsampling iteration was normalized with TMM [51], and deconvolution analysis was performed using the gene markers identified for slow- and fast-twitch human clusters in the snRNAseq data as described above. We computed the Spearman correlation coefficient and mean square deviation between predicted and true fiber type composition for each downsampling iteration.

Availability of data and materials

All scripts needed to utilize this new methodology is openly available at https://github.com/OlaHanssonLab/PredictFiberType. The datasets supporting the conclusions of this article are available in the Gene Expression Omnibus repository, GSE190489 at https://www.ncbi.nlm.nih.gov/geo. The totRNAseq data are available in the European Genome-phenome Archive, EGAS00001006362 - Juntendo Muscle Study (JMS), EGAS00001006363 - Muscle SATellite cell study (MSAT) at https://ega-archive.org.

Abbreviations

totRNAseq:: Muscle tissue RNA sequencing data
scRNAseq:: Single-cell RNA sequencing
snRNAseq:: Single-nuclei RNA sequencing
Type I:: Slow-twitch fibers
Type II:: Fast-twitch fibers
MyHC:: Myosin heavy chain
MSAT:: The muscle SATellite cell study
JMS:: Juntendo Muscle Study
tSNE:: T-Distributed stochastic neighbor embedding
ATP2A2 :: ATPase sarcoplasmic/endoplasmic reticulum Ca2 + transporting 2
TPM3 :: Tropomyosin 3
MYH7B :: Myosin heavy chain 7B
ATP2A1 :: ATPase sarcoplasmic/endoplasmic reticulum Ca2 + transporting 1
TNNT3 :: Troponin T3, fast skeletal type
MYH2 :: Myosin heavy chain 2
TTN :: Titin
MEG3 :: Maternally expressed 3
PE:: Paired-end
UMI:: Unique molecular identifier
PCs:: Principal components
PCA:: Principal Components Analysis
GEO:: Gene Expression Omnibus
RIN:: RNA integrity number
EGA:: European Genome-phenome Archive

References

Schiaffino S, Reggiani C. Myosin isoforms in mammalian skeletal muscle. J Appl Physiol. 1994;1985(77):493–501.
Article Google Scholar
Schiaffino S, Reggiani C. Fiber types in mammalian skeletal muscles. Physiol Rev. 2011;91:1447–531. https://doi.org/10.1152/physrev.00031.2010.
Article CAS PubMed Google Scholar
Saltin B, Henriksson J, Nygaard E, Andersen P, Jansson E. Fiber types and metabolic potentials of skeletal muscles in sedentary man and endurance runners. Ann N Y Acad Sci. 1977;301:3–29.
Article CAS Google Scholar
Simoneau JA, Bouchard C. Human variation in skeletal muscle fiber-type proportion and enzyme activities. Am J Physiol. 1989;257:E567–572.
CAS PubMed Google Scholar
Snijders T, Verdijk LB, van Loon LJ. The impact of sarcopenia and exercise training on skeletal muscle satellite cells. Ageing Res Rev. 2009;8:328–38. https://doi.org/10.1016/j.arr.2009.05.003.
Article PubMed Google Scholar
Wilson JM, et al. The effects of endurance, strength, and power training on muscle fiber type shifting. J Strength Cond Res. 2012;26:1724–9. https://doi.org/10.1519/JSC.0b013e318234eb6f.
Article PubMed Google Scholar
Harridge SD, et al. Whole-muscle and single-fibre contractile properties and myosin heavy chain isoforms in humans. Pflugers Arch. 1996;432:913–20.
Article CAS Google Scholar
Tanner CJ, et al. Muscle fiber type is associated with obesity and weight loss. Am J Physiol Endocrinol Metab. 2002;282:E1191–1196. https://doi.org/10.1152/ajpendo.00416.2001.
Article CAS PubMed Google Scholar
Mogensen M, et al. Mitochondrial respiration is decreased in skeletal muscle of patients with type 2 diabetes. Diabetes. 2007;56:1592–9. https://doi.org/10.2337/db06-0981.
Article CAS PubMed Google Scholar
Simoneau JA, Colberg SR, Thaete FL, Kelley DE. Skeletal muscle glycolytic and oxidative enzyme capacities are determinants of insulin sensitivity and muscle composition in obese women. FASEB J. 1995;9:273–8.
Article CAS Google Scholar
Komi PV, et al. Skeletal muscle fibres and muscle enzyme activities in monozygous and dizygous twins of both sexes. Acta Physiol Scand. 1977;100:385–92.
Article CAS Google Scholar
Simoneau JA, Bouchard C. Genetic determinism of fiber type proportion in human skeletal muscle. FASEB J. 1995;9:1091–5.
Article CAS Google Scholar
Schiaffino S, Reggiani C, Murgia M. Fiber type diversity in skeletal muscle explored by mass spectrometry-based single fiber proteomics. Histol Histopathol. 2020;35:239–46. https://doi.org/10.14670/HH-18-170.
Article CAS PubMed Google Scholar
Hedlund E, Deng Q. Single-cell RNA sequencing: technical advancements and biological applications. Mol Aspects Med. 2018;59:36–46. https://doi.org/10.1016/j.mam.2017.07.003.
Article CAS PubMed Google Scholar
De Micheli AJ, Spector JA, Elemento O, Cosgrove BD. A reference single-cell transcriptomic atlas of human skeletal muscle tissue reveals bifurcated muscle stem cell populations. Skelet Muscle. 2020;10:19. https://doi.org/10.1186/s13395-020-00236-3.
Article CAS PubMed PubMed Central Google Scholar
Jensen JB, et al. Isolation and characterization of muscle stem cells, fibro-adipogenic progenitors and macrophages from human skeletal muscle biopsies. Am J Physiol Cell Physiol. 2021. https://doi.org/10.1152/ajpcell.00127.2021.
Article PubMed Google Scholar
van den Heuvel A, et al. Single-cell RNA sequencing in facioscapulohumeral muscular dystrophy disease etiology and development. Hum Mol Genet. 2019;28:1064–75. https://doi.org/10.1093/hmg/ddy400.
Article CAS PubMed Google Scholar
Barruet E, et al. Functionally heterogeneous human satellite cells identified by single cell RNA sequencing. Elife. 2020;9. https://doi.org/10.7554/eLife.51576.
Xi H, et al. A human skeletal muscle atlas identifies the trajectories of stem and progenitor cells across development and from human pluripotent stem cells. Cell Stem Cell. 2020;27:158–176.e110. https://doi.org/10.1016/j.stem.2020.04.017.
Article CAS PubMed PubMed Central Google Scholar
Camps J, et al. Interstitial cell remodeling promotes aberrant adipogenesis in dystrophic muscles. Cell Rep. 2020;31:107597. https://doi.org/10.1016/j.celrep.2020.107597.
Article CAS PubMed Google Scholar
Wu H, Kirita Y, Donnelly EL, Humphreys BD. Advantages of single-nucleus over single-cell RNA sequencing of adult kidney: rare cell types and novel cell states revealed in fibrosis. J Am Soc Nephrol. 2019;30:23–32. https://doi.org/10.1681/ASN.2018090912.
Article CAS PubMed Google Scholar
Dos Santos M, et al. Single-nucleus RNA-seq and FISH identify coordinated transcriptional activity in mammalian myofibers. Nat Commun. 2020;11:5102. https://doi.org/10.1038/s41467-020-18789-8.
Article CAS PubMed PubMed Central Google Scholar
Kim M, et al. Single-nucleus transcriptomics reveals functional compartmentalization in syncytial skeletal muscle cells. Nat Commun. 2020;11:6375. https://doi.org/10.1038/s41467-020-20064-9.
Article CAS PubMed PubMed Central Google Scholar
Petrany MJ, et al. Single-nucleus RNA-seq identifies transcriptional heterogeneity in multinucleated skeletal myofibers. Nat Commun. 2020;11:6374. https://doi.org/10.1038/s41467-020-20063-w.
Article CAS PubMed PubMed Central Google Scholar
Blondel VD, Guillaume JL, Hendrickx JM, de Kerchove C, Lambiotte R. Local leaders in random networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2008;77:036114. https://doi.org/10.1103/PhysRevE.77.036114.
Article CAS PubMed Google Scholar
Gong T, Szustakowski JD. DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data. Bioinformatics. 2013;29:1083–5. https://doi.org/10.1093/bioinformatics/btt090.
Article CAS PubMed Google Scholar
Costill DL, Fink WJ, Pollock ML. Muscle fiber composition and enzyme activities of elite distance runners. Med Sci Sports. 1976;8:96–100.
CAS PubMed Google Scholar
Harber M, Trappe S. Single muscle fiber contractile properties of young competitive distance runners. J Appl Physiol. 2008;1985(105):629–36. https://doi.org/10.1152/japplphysiol.00995.2007.
Article Google Scholar
Widrick JJ, Trappe SW, Costill DL, Fitts RH. Force-velocity and force-power properties of single muscle fibers from elite master runners and sedentary men. Am J Physiol. 1996;271:C676–683. https://doi.org/10.1152/ajpcell.1996.271.2.C676.
Article CAS PubMed Google Scholar
Bellinger P, et al. Determinants of last lap speed in paced and maximal 1500-m time trials. Eur J Appl Physiol. 2020. https://doi.org/10.1007/s00421-020-04543-x.
Article PubMed Google Scholar
Oberbach A, et al. Altered fiber distribution and fiber-specific glycolytic and oxidative enzyme activity in skeletal muscle of patients with type 2 diabetes. Diabetes Care. 2006;29:895–900.
Article CAS Google Scholar
Lillioja S, et al. Skeletal muscle capillary density and fiber type are possible determinants of in vivo insulin resistance in man. J Clin Invest. 1987;80:415–24. https://doi.org/10.1172/JCI113088.
Article CAS PubMed PubMed Central Google Scholar
Henriksen EJ, et al. Glucose transporter protein content and glucose transport capacity in rat skeletal muscles. Am J Physiol. 1990;259:E593–598. https://doi.org/10.1152/ajpendo.1990.259.4.E593.
Article CAS PubMed Google Scholar
Daugaard JR, et al. Fiber type-specific expression of GLUT4 in human skeletal muscle: influence of exercise training. Diabetes. 2000;49:1092–5. https://doi.org/10.2337/diabetes.49.7.1092.
Article CAS PubMed Google Scholar
Ciciliot S, Rossi AC, Dyar KA, Blaauw B, Schiaffino S. Muscle type and fiber type specificity in muscle wasting. Int J Biochem Cell Biol. 2013;45:2191–9. https://doi.org/10.1016/j.biocel.2013.05.016.
Article CAS PubMed Google Scholar
Deshmukh AS, et al. Deep muscle-proteomic analysis of freeze-dried human muscle biopsies reveals fiber type-specific adaptations to exercise training. Nat Commun. 2021;12:304. https://doi.org/10.1038/s41467-020-20556-8.
Article CAS PubMed PubMed Central Google Scholar
Bellinger P, et al. Muscle fiber typology is associated with the incidence of overreaching in response to overload training. J Appl Physiol. 2020;1985(129):823–36. https://doi.org/10.1152/japplphysiol.00314.2020.
Article CAS Google Scholar
Lievens E, Klass M, Bex T, Derave W. Muscle fiber typology substantially influences time to recover from high-intensity exercise. J Appl Physiol. 2020;1985(128):648–59. https://doi.org/10.1152/japplphysiol.00636.2019.
Article CAS Google Scholar
Hayot M, et al. Skeletal muscle microbiopsy: a validation study of a minimally invasive technique. Eur Respir J. 2005;25:431–40. https://doi.org/10.1183/09031936.05.00053404.
Article CAS PubMed Google Scholar
Chung NC, Storey JD. Statistical significance of variables driving systematic variation in high-dimensional data. Bioinformatics. 2015;31:545–54. https://doi.org/10.1093/bioinformatics/btu674.
Article CAS PubMed Google Scholar
Kowalczyk MS, et al. Single-cell RNA-seq reveals changes in cell cycle and differentiation programs upon aging of hematopoietic stem cells. Genome Res. 2015;25:1860–72. https://doi.org/10.1101/gr.192237.115.
Article CAS PubMed PubMed Central Google Scholar
the multifaceted role of decorin in cancer. Sofeu Feugaing, D. D., Götte, M. & Viola, M. More than matrix. Eur J Cell Biol. 2013;92:1–11. https://doi.org/10.1016/j.ejcb.2012.08.004.
Article CAS Google Scholar
Bar-Or O. The Wingate anaerobic test. An update on methodology, reliability and validity. Sports Med. 1987;4:381–94. https://doi.org/10.2165/00007256-198704060-00001.
Brooke MH, Kaiser KK. Three, “myosin adenosine triphosphatase” systems: the nature of their pH lability and sulfhydryl dependence. J Histochem Cytochem. 1970;18:670–2.
Article CAS Google Scholar
Kakigi R, et al. Heat stress enhances mTOR signaling after resistance exercise in human skeletal muscle. J Physiol Sci. 2011;61:131–40. https://doi.org/10.1007/s12576-010-0130-y.
Article CAS PubMed Google Scholar
Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016;32:3047–8. https://doi.org/10.1093/bioinformatics/btw354.
Article CAS PubMed PubMed Central Google Scholar
Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14:417–9. https://doi.org/10.1038/nmeth.4197.
Article CAS PubMed PubMed Central Google Scholar
Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. https://doi.org/10.1093/bioinformatics/bts635.
Article CAS PubMed Google Scholar
Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–30. https://doi.org/10.1093/bioinformatics/btt656.
Article CAS PubMed Google Scholar
Yates AD, et al. Ensembl 2020. Nucleic Acids Res. 2020;48:D682–8. https://doi.org/10.1093/nar/gkz966.
Article CAS PubMed Google Scholar
Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11:R25. https://doi.org/10.1186/gb-2010-11-3-r25.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge support from the Science for Life Laboratory, the National Genomics Infrastructure, NGI, and the Swedish National Infrastructure for Computing (SNIC) at UPPMAX, partially funded by the Swedish Research Council through grant agreement no. 2018-05973 for providing assistance in massive parallel sequencing and computational infrastructure.

Funding

Open access funding provided by Lund University. This work was financially supported by the following: the Knut and Alice Wallenberg Foundation for equipment, Swedish Research Council project grant 2018–02635, Crafoord Foundation, Novo Nordisk Foundation, Påhlsson Foundation, Diabetes Wellness, the Swedish Diabetes foundation, the Hjelt Foundation, JSPS KAKENHI, Dnr 16KK0188, and by the Institute of Health and Sports Science & Medicine, Juntendo University. LUDC-IRC: Swedish Foundation for Strategic Research, Dnr IRC15-0067, EXODIAB: Swedish Research Council, Strategic Research Area, Dnr 2009–1039. NO and BN are financially supported by the Knut and Alice Wallenberg Foundation as part of the National Bioinformatics Infrastructure Sweden at SciLifeLab. The funding bodies had no influence or were involved in any other way in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Department of Clinical Sciences, Lund University, Malmö, Sweden
Nikolay Oskolkov, Ola Ekström, Kristoffer Ström, Bilal Ahmad Mir, Dmytro Kryvokhyzha, Mikko Lehtovirta, Karl-Fredrik Eriksson & Ola Hansson
Department of Biology, Science for Life Laboratory, National Bioinformatics Infrastructure Sweden, Lund University, Lund, Sweden
Nikolay Oskolkov
Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Malgorzata Santel, Gray J. Camp, Barbara Treutlein & Svante Pääbo
Health Informatics Institute, Morsani College of Medicine, University of South Florida, Gainesville, USA
Hemang M. Parikh
Graduate School of Health and Sports Science, Juntendo University, Chiba, Japan
Eri Miyamoto-Mikami, Hisashi Naito & Noriyuki Fuku
Swedish Winter Sports Research Centre, Mid Sweden University, Östersund, Sweden
Kristoffer Ström
Institute for Molecular Medicine Finland (FIMM), Helsinki University, Helsinki, Finland
Mikko Lehtovirta & Ola Hansson
Mito Medical Center, Tsukuba University Hospital, Ibaraki, Japan
Hiroyuki Kobayashi
Faculty of Management & Information Science, Josai International University, Chiba, Japan
Ryo Kakigi
Department of Cell and Molecular Biology, Science for Life Laboratory, National Bioinformatics Infrastructure Sweden, Uppsala University, Uppsala, Sweden
Björn Nystedt
Okinawa Institute of Science and Technology, Onna-son, Japan
Svante Pääbo

Authors

Nikolay Oskolkov
View author publications
You can also search for this author in PubMed Google Scholar
Malgorzata Santel
View author publications
You can also search for this author in PubMed Google Scholar
Hemang M. Parikh
View author publications
You can also search for this author in PubMed Google Scholar
Ola Ekström
View author publications
You can also search for this author in PubMed Google Scholar
Gray J. Camp
View author publications
You can also search for this author in PubMed Google Scholar
Eri Miyamoto-Mikami
View author publications
You can also search for this author in PubMed Google Scholar
Kristoffer Ström
View author publications
You can also search for this author in PubMed Google Scholar
Bilal Ahmad Mir
View author publications
You can also search for this author in PubMed Google Scholar
Dmytro Kryvokhyzha
View author publications
You can also search for this author in PubMed Google Scholar
Mikko Lehtovirta
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Ryo Kakigi
View author publications
You can also search for this author in PubMed Google Scholar
Hisashi Naito
View author publications
You can also search for this author in PubMed Google Scholar
Karl-Fredrik Eriksson
View author publications
You can also search for this author in PubMed Google Scholar
Björn Nystedt
View author publications
You can also search for this author in PubMed Google Scholar
Noriyuki Fuku
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Treutlein
View author publications
You can also search for this author in PubMed Google Scholar
Svante Pääbo
View author publications
You can also search for this author in PubMed Google Scholar
Ola Hansson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

NO, SP, BN, and OH wrote the main manuscript text, and NO, GC, BT, DK, and OH prepared the figures. OE, EMM, NF, and OH made the tables. MS, KS, and BM performed experiments. NO, GC, HP, DK, BT, and OH made the bioinformatic analyses. OE, EMM, ML, HK, RK, HN, KFE, NF, SP, and OH performed the human studies and/or collected samples. The authors reviewed and approved the manuscript.

Corresponding author

Correspondence to Ola Hansson.

Ethics declarations

Ethics approval and consent to participate

The MSAT study was approved by the local Ethics committee at Lund University (Dnr 2015/593), and the JMS study was approved by the local Ethics committee at Juntendo University. All participants gave informed consent prior to participating in the studies.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Seurat snRNAseq workflow quality control statistics. From left to right, first figure reports the number of expressed genes per cell, second figure reports the number of UMIs (library size) per cell, third figure shows the fraction of reads mapped to mitochondrial genes per cell. Figure S2. The initial average sequencing depth of 35 million paired-end (PE) reads were ´down-sampled´ and mean square errors between the ATPase and totRNAseq predicted fractions of Type I fibers were calculated. Figure S3. Principal Component Analysis (PCA) plot of gene exppression from human skeletal muscle snRNAseq colored by the cluster annotation, see Figure 1A, obtained from the Louvain clustering on the number of significant principal components (according to Seurat workflow). Figure S4. (a) Principal Component Analysis (PCA) plot, and (b) tSNE plot of gene expression from human skeletal muscle snRNAseq colored by the cell cycle annotation obtained from the CellCycleScoring function from Seurat workflow. No obvious cluster formation based on the cell cycle can be observed from neither the PCA nor the tSNE plot.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Oskolkov, N., Santel, M., Parikh, H.M. et al. High-throughput muscle fiber typing from RNA sequencing data. Skeletal Muscle 12, 16 (2022). https://doi.org/10.1186/s13395-022-00299-4

Download citation

Received: 09 December 2021
Accepted: 07 June 2022
Published: 02 July 2022
DOI: https://doi.org/10.1186/s13395-022-00299-4

High-throughput muscle fiber typing from RNA sequencing data

Abstract

Background

Methods

Results

Conclusions

Introduction

Results

A novel muscle fiber type prediction model derived from single-nuclei RNA sequencing

Muscle fiber typing using total RNA sequencing from frozen samples

Discussion

Materials and methods

snRNAseq data generation

snRNAseq data processing and analysis

Deconvolution analysis

Study subjects

Muscle biopsies and histology

RNA extraction and totRNAseq analysis

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Figure S1.

Rights and permissions

About this article

Cite this article

Share this article

Skeletal Muscle

Contact us