Skip to main content

A novel atlas of gene expression in human skeletal muscle reveals molecular changes associated with aging



Although high-throughput studies of gene expression have generated large amounts of data, most of which is freely available in public archives, the use of this valuable resource is limited by computational complications and non-homogenous annotation. To address these issues, we have performed a complete re-annotation of public microarray data from human skeletal muscle biopsies and constructed a muscle expression compendium consisting of nearly 3000 samples. The created muscle compendium is a publicly available resource including all curated annotation. Using this data set, we aimed to elucidate the molecular mechanism of muscle aging and to describe how physical exercise may alleviate negative physiological effects.


We find 957 genes to be significantly associated with aging (p < 0.05, FDR = 5 %, n = 361). Aging was associated with perturbation of many central metabolic pathways like mitochondrial function including reduced expression of genes in the ATP synthase, NADH dehydrogenase, cytochrome C reductase and oxidase complexes, as well as in glucose and pyruvate processing. Among the genes with the strongest association with aging were H3 histone, family 3B (H3F3B, p = 3.4 × 10−13), AHNAK nucleoprotein, desmoyokin (AHNAK, p = 6.9 × 10−12), and histone deacetylase 4 (HDAC4, p = 4.0 × 10−9). We also discover genes previously not linked to muscle aging and metabolism, such as fasciculation and elongation protein zeta 2 (FEZ2, p = 2.8 × 10−8). Out of the 957 genes associated with aging, 21 (p < 0.001, false discovery rate = 5 %, n = 116) were also associated with maximal oxygen consumption (VO2MAX). Strikingly, 20 out of those 21 genes are regulated in opposite direction when comparing increasing age with increasing VO2MAX.


These results support that mitochondrial dysfunction is a major age-related factor and also highlight the beneficial effects of maintaining a high physical capacity for prevention of age-related sarcopenia.


Aging profoundly affects skeletal muscle, including loss of muscle mass and strength and increasing the levels of fat and connective tissue [1]. This condition, often termed age-related sarcopenia, leads to a variety of physical conditions that reduce life quality and overall health in aging individuals [2, 3]. As we age, we lose approximately 1 % of leg lean mass per year and approximately 2.5–4 % in leg strength, men to a higher extent than women [4]. This indicates that sarcopenia is not only a matter of loss of muscle mass but rather a concomitant loss of muscle mass and a decline of muscle quality. In order to efficiently delay the onset and severity of sarcopenia, it is crucial to more in detail describe the molecular mechanisms causing this physiological deterioration of muscle function.

In one of the largest previous studies on gene expression in aging muscle [5], muscle biopsies from 81 individuals were investigated. Zahn et al. described a 250-gene signature for muscle aging, compared this to age-associated gene regulation in other tissues and found increased expression of pathways regulating cell growth, complement activation, and ribosomal and extracellular matrix genes and decreased expression of genes for chloride transport and mitochondrial oxidative phosphorylation (OXPHOS). De Magalhaes and colleagues [6] conducted a meta-analysis of microarray experiments on aging in mice, rats, and humans across a variety of tissues. In this cross-species, cross-platform analysis, gene orthologues were meta-analyzed for approximately 400 samples, 42 of which were from human skeletal muscle, comparing old to young individuals. They found 73 genes with altered expression, with increased expression of genes involved in inflammation and immune response, and consistent with Zahn et al. reduced expression of genes associated with energy metabolism, particularly mitochondrial genes (accessible through the GenAge database, It has also been shown that aging individuals have increasing levels of mitochondrial DNA damage leading to reduced expression of genes in the OXPHOS pathway [7]. Taken together, a general finding is that mitochondrial dysfunction is partly responsible for reduced muscle function with aging [8]. Reduced expression of genes in the OXPHOS pathway, including the regulator peroxisome proliferator-activated receptor gamma coactivator alpha (PGC1α), has also been found to be reduced in skeletal muscle from type 2 diabetic patients [9, 10], a strongly age-related metabolic disorder. Another central pathway previously associated with muscle aging is the mammalian target of rapamycin (mTOR), including the mTOR complex I (mTORC1) which plays a crucial role in the regulation of translation in skeletal muscle [11]. A metabolic link between mTOR and glycolysis has also been described where low glycolytic flux leads to binding of glyceraldehyde-3-phosphate dehydrogenase (GAPDH) to the mTORC1-regulator Rheb thereby inhibiting mTORC1 signaling and suppression of protein synthesis [12].

To understand more the molecular mechanisms of aging in detail, larger sets of samples are required to provide more power to detect regulatory patterns on the gene level. We and others have previously combined data for studies of global transcriptomic patterns across thousands of samples [1316], but in this study, we address specific phenotype-related questions for skeletal muscle with a collected compendium of 2852 samples. Reuse of public data is however hampered by the use of different experimental platforms and sample annotation, and analysis is not straightforward when combining such data [17].

Based on this muscle expression compendium, we present the largest study to date of gene expression in human skeletal muscle related to aging. We address the concerns of data heterogeneity by an extensive manual re-annotation of all samples and a variety of computational methods described below. In our meta-analysis, we find 957 genes significantly associated with aging. The data provides substantially more detail to gene-specific effects of the transcriptome and shows more widespread regulation of gene expression associated with aging than previously reported. We further study the pleiotropic associations of the 957 genes associated with aging and show for example that 20 out of the 21 aging genes are also associated with physical capacity but regulated in the opposite direction with increased physical capacity as compared to increased age.

The skeletal muscle expression compendium is publicly available at ArrayExpress ( with accession number E-MTAB-1788.


Data collection and annotation

Experiments stored in the ArrayExpress archive [18] were identified by keyword searching aimed at identifying experiments that contained microarrays done on skeletal muscle tissue from living human individuals and with interventions limited to training and glucose/insulin regulation, excluding for example drug treatments. Samples were annotated using the categories and factor values in Table 1.

Table 1 Defined terms and value ranges used to annotate the compendium

Preprocessing and quality control

Data normalization was done on the raw .cel files for HG-U133A and HG-U133 + 2 using Robust Probabilistic Averaging (RPA) [19, 20]. Custom array definition files were created using the customCDF R/Bioconductor package (v16), removing probes mapping to known SNPs, and summarizing probes for each gene with an ENSG identifier. Quality control was carried out using the R/Bioconductor package array QualityMetrics [21] removing detected outlier arrays.

Statistical analysis

Linear regression analysis for the effect of age and physical capacity and removal of study effects was carried out using the limma R/Bioconductor package and the eBayes function. A linear model

$$ x\sim \mathrm{age}+\mathrm{sex}+\mathrm{study} $$

was fitted to the RPA-normalized, gene-summarized data for all genes on each of the two platforms. The “study” parameter was represented by the original ArrayExpress accession number. Model coefficients and p values were estimated using the eBayes function in limma. For genes present on both arrays, the minimum p value and the maximum of the β values for the regression slope of the age parameter were calculated. p value correction for false discovery rate (FDR) < 0.05 was done using the Benjamini-Hochberg method using N = 31,523 (the total number of p values calculated for the two arrays).

To calculate profiles of gene expression as a function of age across the studies, we adjusted the original data for study effects by subtracting the effect quantified in the regression model. For each gene i and sample j, we calculated

$$ xij={c}_i+{\beta}_{\mathrm{age},i}\times {\mathrm{age}}_j+{\beta}_{\mathrm{sex},i}\times \kern0.5em {\mathrm{sex}}_j+{\varepsilon}_i $$

where c i is the intercept of the linear regression above, β age and β sex are the slopes for the age and sex factors, and ε i is the gene-specific residual from the previous regression.

In the analysis of physical capacity, we used 116 samples from the HG-U133 + 2 array with harmonized annotation for physical capacity, measured as VO2MAX in liters per minute to kilogram (L/(min × kg)). For these, we fitted a linear model

$$ x\sim \mathrm{physical}\kern0.5em \mathrm{capacity}+\kern0.5em \mathrm{Saturday} $$

to the RPA-normalized, gene-summarized expression data. We estimated p values and regression coefficients for the model using the limma package with the eBayes function, as above.

We tested the significance of the overlap between subsets of the 957 genes with database lists using the hypergeometric test and a background of N = 19,597 genes.

Differential expression between subjects with type 2 diabetes (T2D) and normal glucose tolerance (NGT) individuals for the 957 age-associated genes was estimated by meta-analysis of three datasets with full annotation for these groups: E-GEOD-18732, E-GEOD-19420, and E-GEOD-25462, including 102 T2D and 87 NGT samples. The datasets were individually normalized with RPA and meta-analyzed using the geneMeta R/Bioconductor package ( Association to body mass index (BMI) was calculated by retrieving all samples within the seven selected datasets with annotation for BMI from the HG-U133 + 2 arrays and normalized as a single dataset using RPA, followed by a linear regression for BMI adjusted for sex and study, as identified by ArrayExpress accession number and analogously as described for age and physical capacity.

Comparison with public RNA sequencing data

RNA sequencing expression data on human skeletal muscles from n = 157 donors from Genotype-Tissue Expression (GTEx) project ( were used [22]. Across-samples normalization was performed using the TMM normalization method [23]. Association of gene expression for each gene with age was calculated with linear regression using an additive model adjusted for gender. The obtained p values were FDR corrected for multiple testing (FDR < 0.05, Benjamini-Hochberg). All calculations were done using R language for statistical computations.


Building the skeletal muscle data compendium

From ArrayExpress [18], microarray datasets from human skeletal muscle biopsies were selected and manually curated based on the original publications, including available supplemental data (see “Methods” section). The selected experiments contain data from 2852 microarrays from 20 different array platforms (Figure S1 in Additional file 1). Affymetrix-manufactured arrays dominate, represented by 11 different array types and in total 2475 arrays. Using a controlled vocabulary, sample and experimental parameters selected for re-annotation were defined. We retrieved the original papers along with supplemental material to re-annotate each microarray using our newly defined parameters and their value ranges (Table 1). To define a generic control set, representing a normal, healthy population, a set of 1188 “super controls” were selected. In this group, samples were excluded if the individual had any kind of disease, was obese (BMI > 30), or was subjected to any severe intervention.

To avoid the strong bias introduced by differences in individual probe sequences when combining data from different array platforms [24], we restricted this study to data from each platform independently. We used a subset of the compendium based on the two most common platforms: 568 arrays from the Affymetrix HG-U133A, and 1174 arrays from the HG-U133 + 2 platform. The probe effects were addressed by normalizing each dataset with RPA [19, 20], which models the affinity of each individual probe, assuming it to be a stochastic variable with a normal distribution with probe set-specific mean and variance rather than a constant, as in many other normalization methods including RMA and MAS5. To avoid biases introduced by genetic diversity in the studied individuals, we removed all probes mapping to known human SNPs with a minor allele frequency higher than 5 % in a Western European population. Out of 604,258 probes on the HG-U133 + 2 array, 4840 probes were removed; on the HG-133A array, 2157 out of 247,965 probes were removed. Oligonucleotide probes were summarized to gene level probe sets rather than transcript specific ones, also to minimize biases introduced by probe sequences and their representation on different arrays. After quality control [21], 1236 arrays from the two platforms remained: 758 from HG-U133 + 2 and 485 from HG-U133A. The two resulting data matrices contain data for 19,597 genes tested on the HG-U133 + 2 array and a subset of 11,926 of these on the HG-U133A array. The two resulting cross-study data matrices are also available from ArrayExpress, accession number E-MTAB-1788.

These comprehensive data sets represent comparable human skeletal muscle expression data over a vast array of different experimental conditions. In order to identify constitutively expressed genes, we analyzed the variance of expression only removing the study effect. The genes with most stable expression are presented in Table S1 (see Additional file 1) and are not likely to be influenced by the experimental conditions. The most stable genes found were myoglobin (MB), GAPDH, and alpha 1 actin (ACTA1) and could serve as candidates for “housekeeping” endogenous control genes in quantitative real-time PCR experiments.

Expression levels of 957 genes are associated with age

We selected a subset of 361 arrays from the compendium to study the effect of aging on gene expression, i.e., 211 arrays from the HG-133A array and 150 from the HG-U133 + 2 array that had specific annotation of age and gender, ranging from <1 year up to 83-year-old individuals (Figure S2 in Additional file 1). Using a linear model with sex and study ID as covariates, 957 genes were significantly associated with age (p < 0.05, Benjamini-Hochberg correction for multiple testing) (top-50 genes are shown in Table 2). Of these, 484 were upregulated and 473 downregulated with increasing age. We verify the removal of study effects by principal component analysis (PCA) before and after study adjustment. Whereas samples from the same study primarily cluster together in the PCA of the unadjusted dataset, this effect is removed in the adjusted one (Fig. 1). Similarly, we use PCA to verify the absence of gender biases in the dataset after the adjustment for study effects (Figure S3 in Additional file 1). A significant overlap (N = 13, p = 1.0 × 10−5) and complete concordance for the direction of the expression for all 13 genes found in our data set out of the 73 genes detected in the multispecies de Magalhaes study [6] were observed. Twenty-five of the 957 genes are reported in the GenAge database of 288 genes linked to aging, an overlap of the two lists which is significant at p = 0.0020 using a hypergeometric test. The GenAge database has also collected and curated a list of genes in loci detected in genome-wide association studies for longevity. They report 886 genes, 353 of which were significantly associated in the original studies. Out of this list, we detect an overlap of 25 out of 957 genes (p = 0.024). Seventeen of the 957 aging genes have been previously reported in the top 250 genes by the Zahn study [5] (overlap p = 0.065). Using publicly available RNA sequencing data (n = 157) from the GTEx project (, 91 genes out of the 957 were found associated with age, a significant overlap at p = 2.2 × 10−16.

Table 2 Top 50 genes significantly associated with age across 361 samples
Fig. 1
figure 1

Principal component analysis of the 211 HG-U133A arrays, before (a) and after (b) removal of study effects and corresponding plots for the 150 HG-U133 + 2 arrays (c and d). Different studies (identified by database accession numbers) are represented with different colors, and increasing age of the individual from whom the biopsy was taken is indicated by the increasing dot size. The relatively strong study effect seen even after normalization is removed after the adjustment

The genes with the strongest association to aging in the current study are H3F3B (p = 3.4 × 10−13) and AHNAK (p = 6.9 × 10−12), both showing increased expression with increased age. AHNAK is a large protein localized to the sarcomere of skeletal muscle and increased expression of AHNAK with aging is in line with the study by de Magalhaes et al. [6]. Increased expression of AHNAK has previously also been associated with a low VO2MAX and poor muscle fitness [25]. The two genes most strongly showing reduced expression with increasing age are homeobox B2 (HOXB2, p = 1.0 × 10−11) and deleted in lymphocytic leukemia 1 (DLEU1, p = 8.6 × 10−10). The full list of 957 genes is available in Table S2 (see Additional file 2).

Aging significantly alters genes involved in inflammation and mitochondrial metabolism

We analyzed the lists of 484 genes upregulated and 473 downregulated with aging for enrichment of specific pathways and processes using gene set enrichment analysis (GSEA) (Table 3) [9]. We find enrichment of genes with increased expression in six pathways, connected to the complement system, indicating an inflammatory response with aging (p < 0.05, Bonferroni adjusted). This is in line with Zahn et al. that also reported increased expression of genes in the complement system with aging [5]. Thirteen pathways were enriched in genes with decreased expression associated with increasing age (p < 0.05, Bonferroni adjusted). Eight of these represent mitochondrial components, supporting a perturbation of mitochondrial function with aging. Four groups connected to metabolism were also found, indicating reduced expression of genes in the electron transport chain (ETC)/OXPHOS pathway, in the citric acid/tricarboxylic acid cycle (TCA), and in pyruvate metabolism. We observe significant downregulation with increasing age of all four major complexes of the ETC located in the inner mitochondrial membrane (Figure S4 in Additional file 1). Subunits of NADH dehydrogenase (NDUFAF5, p = 1.0 × 10−4; NDUFS3, p = 8.8 × 10−5), cytochrome c reductase (UQCR10, p = 2.6 × 10−4; UQCR11, p = 7.7 × 10−4) and oxidase (COX10, p = 4.3 × 10−5; COX4I1, p = 1.7 × 10−4; COX7B, p = 3.1 × 10−4; COX7C, p = 6.1 × 10−4; COX5B, p = 1.1 × 10−3), and ATP synthase (ATP5G3, p = 1.1 × 10−6; ATPAF1, p = 3.6 × 10−5; ATP5G1, p = 8.0 × 10−5; ATP5C1, p = 2.0 × 10−4; COX10, p = 4.3 × 10−5) are all downregulated with increasing age. We also find that the expression of C-x(9)-C motif containing 2 (CMC2) is decreased with aging (p = 1.0 × 10−7). CMC2 is required for respiratory growth, and mutants with CMC2 deletion are unable to assemble the cytochrome c oxidase complex [26]. In the pyruvate dehydrogenase complex (PDC), proteins A1, B, and X (p = 4.1 × 10−6; 4.2 × 10−5; 1.3 × 10−3) show reduced expression with aging. In connection to carbohydrate metabolism, the expression of 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 1 (PFKFB1) is increased with aging (p = 9.4 × 10−5). Furthermore, glucose uptake through GLUT4 in skeletal muscle is known to be dependent on TBC1 Domain Family Member 1 (TBC1D1) [27, 28], the expression of which is decreased with aging (p = 3.6 × 10−4).

Table 3 Gene set enrichment analysis for the 484 genes upregulated with age and the 473 downregulated

High physical capacity counteracts age-related changes in muscle expression

Decline in muscle mass and performance with aging can be prevented by exercise. Therefore, we explored whether expressions of any of the genes associated with aging also were influenced by physical capacity assessed as VO2MAX.

In a subset of 116 samples with information on VO2MAX, we find 39 genes associated with physical capacity (FDR < 0.05, Benjamini-Hochberg, Table 4), but given the relatively low number of samples included in the VO2MAX analysis, we restricted our analysis to the 957 genes associated with age. Of them, 21 were also associated with VO2MAX (Table 5). It is striking, but not unexpected, that aging and increased physical capacity affects gene expression in opposite directions for 20 of the 21 genes. Two of these, the suppressor of cytokine signaling 2 (SOCS2) and the fasciculation and elongation protein zeta 2 (FEZ2) are also associated with BMI (FEZ2: p BMI = 5.9 × 10−4, SOCS2: p BMI = 7.3 × 10−4); increasing BMI affects gene expression in the same direction as increasing age (FEZ2: p age = 2.8 × 10−8, SOCS2: p age = 5.5 × 10−7) and in the opposite direction with increased physical capacity (FEZ2: p VO2max = 3.5 × 10−5, SOCS2: p VO2max = 6.3 × 10−8) (Fig. 2).

Table 4 Thirty-nine genes genome-wide significantly associated with physical capacity across 116 samples
Table 5 Of the 957 aging genes, 21 were significantly associated with physical capacity across 116 samples
Fig. 2
figure 2

Expression levels of SOCS2 (a), FEZ2 (b), MPC1 (c), and NT5C2 (d) in relation to age. Batch effect-adjusted expression levels are shown

Effect of body mass index and type 2 diabetes on the expression of age-associated genes

We find that three of the age-associated genes also were associated with T2D, CD163 (p age = 2.2 × 10−4, p T2D = 2.0 × 10−4), ZNF415 (p age = 8.9 × 10−5, p T2D = 8.5 × 10−5), and GADD45A (p age = 5.4 × 10−4, p T2D = 1.1 × 10−4) (Table 6). Of these, GADD45A and CD163 have previously been shown to be associated with T2D in GWAS [29, 30]. Interestingly, GADD45A has also been shown to reduce energy production and to stimulate pro-atrophy mechanisms in skeletal muscle [31]. ZNF415 is known to inhibit AP1 and p53 transcriptional activity [32], whereas increased concentration of serum sCD163 is a risk factor for developing T2D [33]. Two of the 957 age-associated genes were also associated with BMI, i.e., EIF4EBP1 (p BMI = 1.8 × 10−6) and AKR1C3 (p BMI = 4.7 × 10−5).

Table 6 Of the 957 aging genes, three were associated with type 2 diabetes


Because of differences in annotation standards, analysis of public gene expression data is often hampered by an inability to combine large sets of arrays for new studies. A major benefit of the data set presented here is that it has been manually annotated using a harmonized vocabulary, enabling a more comprehensive and detailed analysis to be performed. We show that through a strong manual curation effort, we could increase the combinability and utility of public data, deriving the until now largest study on aging in human skeletal muscle, and from the same compendium address additional questions regarding physical capacity, BMI, and T2D. This skeletal muscle compendium is publicly available to allow further studies on gene expression in skeletal muscle.

In the current analysis, we present a detailed description of age-related differences in gene expression in human skeletal muscle and identify 957 genes significantly associated with age. In line with Zahn et al., we find that genes in the complement system show increased expression and mitochondrial genes show decreased expression with aging [5]. Expression of genes in all the major complexes in the ETC, as well as several genes in the PDH complex decreased with aging (Fig. 3). These results together with those of others [34, 35] support the view that elderly subjects have a nearly 50 % lower oxidative capacity per volume of muscle than younger subjects [36]. At the cellular level, this decrease has been ascribed to a reduction in mitochondrial content and lower oxidative capacity of the mitochondria [36], i.e., this decrease of mitochondrial constituents could either reflect defective mitochondria or decreased number of mitochondria or both. Several potential regulators of mitochondrial mass and function were identified among the 957 age-associated genes in the current study. For example, ENDOG is a protein regulated by PGC1A, shown to interact with the mitochondrial genome to regulate mitochondrial mass [37]. TOMM40 is a crucial subunit of the translocase responsible for import of nuclear-encoded mitochondrial precursor proteins [38], which has previously been associated with aging and with exercise-induced mitochondrial biogenesis [39]. MRPL4 and MRPL48 are components of the mitochondrial ribosome, responsible for the production of essential oxidative phosphorylation proteins and CMC2 is required for mitochondrial cytochrome c oxidase assembly [26]. HCCS and TFRC are other proteins associated with aging that are required for proper functioning of the ETC [40, 41]. Among other potential regulators associated with aging are two forkhead transcription factors, i.e., FOXO1 and FOXO3. Both of these have also previously been implicated for roles in aging, longevity, and muscle atrophy [42, 43]. In summary, deterioration in skeletal muscle mitochondrial function is already well recognized as a major factor contributing to age-related muscle degeneration [34, 35], and our findings support this claim on a broad molecular level, identifying a large number of potential regulators.

Fig. 3
figure 3

Schematic illustration of major metabolic effects of aging in human skeletal muscle. A subjectively selected set of effector and regulatory genes from the 957 age-associated genes are shown with their direction of regulation shown with respect to increasing age

We find that genes that have a function in glucose uptake and energy sensing are strongly affected by aging. For example, we see reduced expression of the γ1 regulatory unit of AMPK with increased age. AMPK is a major energy sensor in skeletal muscle, controlling crucial steps of both glucose and lipid metabolism through the ability to sense AMP levels [44]. Reduced AMPK expression is known to result in lower ability of the muscle to utilize glucose through the GLUT4 transporter and to reduce the effectiveness of exercise as a stimulant of glucose uptake and ATP generation through glycolysis, with negative effects on glycemic control and regeneration of muscle mass. Induction of NT5C2 expression with increased age is a possible explanation to the age-associated reduction in AMPK activity, which in turn could be an important contributing factor to reduced mitochondrial function associated with aging [45]. Silencing of NT5C2 expression in cultured human myotubes increased the AMP/ATP ratio and AMPK activity and promoted palmitate oxidation and glucose transport [46], and endogenous expression of NT5C2 is known to inhibit basal lipid oxidation and glucose transport in skeletal muscle. AMPK, in turn, appears to regulate GLUT4 expression via the HDAC4/5-MEF2 axis [47], and in this study, we detected an increased expression of HDAC4 with increased age. The importance of the regulation of GLUT4 levels by AMPK in skeletal muscle is supported by this study showing regulated levels of NT5C2 with age, which can be reversed by physical capacity. Together, the interactions between age-associated changes in gene expression in these key pathways may explain the reduced ability to both generate energy for muscle contraction during exercise and to utilize circulating glucose in the aging muscle.

Muscle aging and physical capacity

Strikingly, we find that genes that are associated with both aging and physical capacity are largely counteracting. The presented data thereby support efforts to maintain high physical fitness in an aging population to counteract negative effects on mitochondrial function [48]. In particular, we hypothesize that SOCS2 and FEZ2, which show significant associations with age, BMI, and physical capacity and acting in the same direction for BMI and age but in the opposite direction for increasing physical capacity, have key regulatory functions in processes that link these three factors. SOCS2 interacts strongly with the activated IGF1R and may play a regulatory role in IGF1 receptor signaling [49]. Age-associated difference in the mRNA level of SOCS2 has previously been demonstrated in muscle from rat, where it was suggested to reflect resistance to the effect of growth hormone [50]. Also, an acute bout of resistance exercise is capable of upregulating SOCS2 in human skeletal muscle [51]. FEZ2 is to our knowledge a novel age-associated gene, the expression of which was altered in the opposite direction with physical capacity.


We show that through a strong manual curation effort, we could increase the combinability and utility of public data, deriving the until now largest study on aging in human skeletal muscle. This skeletal muscle compendium is publicly available, with applications for further studies on transcriptional regulation in skeletal muscle for a number of physiological and biological questions. Overall, our results paint a convoluted picture with many age-related pathways affecting a wide range of fundamental cellular processes. These results support that mitochondrial dysfunction is a major age-related factor and also highlight the beneficial effects of maintaining a high physical capacity for prevention of age-related sarcopenia.



alpha 1 actin


AHNAK nucleoprotein, desmoyokin


C-x(9)-C motif containing 2


deleted in lymphocytic leukemia 1


electron transport chain


false discovery rate


fasciculation and elongation protein zeta 2


glyceraldehyde-3-phosphate dehydrogenase


gene set enrichment analysis


H3 histone, family 3B


homeobox B2




mammalian target of rapamycin


mTOR complex I


normal glucose tolerance


mitochondrial oxidative phosphorylation


principal component analysis


pyruvate dehydrogenase complex


6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 1


peroxisome proliferator-activated receptor gamma coactivator alpha


robust probabilistic averaging


suppressor of cytokine signaling 2


type 2 diabetes


TBC1 domain family member 1


tricarboxylic acid cycle


maximal oxygen consumption


  1. Delmonico MJ, Harris TB, Visser M, Park SW, Conroy MB, Velasquez-Mieyer P, et al. Longitudinal study of muscle strength, quality, and adipose tissue infiltration. Am J Clin Nutr. 2009;90(6):1579–85. doi:10.3945/ajcn.2009.28047.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  2. Dirks AJ, Hofer T, Marzetti E, Pahor M, Leeuwenburgh C. Mitochondrial DNA mutations, energy metabolism and apoptosis in aging muscle. Ageing Res Rev. 2006;5(2):179–95. doi:10.1016/j.arr.2006.03.002.

    Article  CAS  PubMed  Google Scholar 

  3. Kim TN, Choi KM. Sarcopenia: definition, epidemiology, and pathophysiology. J Bone Metab. 2013;20(1):1–10. doi:10.11005/jbm.2013.20.1.1.

    Article  PubMed Central  PubMed  Google Scholar 

  4. Goodpaster BH, Park SW, Harris TB, Kritchevsky SB, Nevitt M, Schwartz AV, et al. The loss of skeletal muscle strength, mass, and quality in older adults: the health, aging and body composition study. J Gerontol A Biol Sci Med Sci. 2006;61(10):1059–64.

    Article  PubMed  Google Scholar 

  5. Zahn JM, Sonu R, Vogel H, Crane E, Mazan-Mamczarz K, Rabkin R, et al. Transcriptional profiling of aging in human muscle reveals a common aging signature. PLoS Genet. 2006;2(7):e115. doi:10.1371/journal.pgen.0020115.eor.

    Article  PubMed Central  PubMed  Google Scholar 

  6. de Magalhães JP, Curado J, Church GM. Meta-analysis of age-related gene expression profiles identifies common signatures of aging. Bioinformatics. 2009;25(7):875–81. doi:10.1093/bioinformatics/btp073.

    Article  PubMed Central  PubMed  Google Scholar 

  7. Bua E, Johnson J, Herbst A, Delong B, McKenzie D, Salamat S, et al. Mitochondrial DNA-deletion mutations accumulate intracellularly to detrimental levels in aged human skeletal muscle fibers. Am J Hum Genet. 2006;79(3):469–80. doi:10.1086/507132.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  8. Austin S, St-Pierre J. PGC1α and mitochondrial metabolism--emerging concepts and relevance in ageing and neurodegenerative disorders. J Cell Sci. 2012;125(Pt 21):4963–71. doi:10.1242/jcs.113662.

    Article  CAS  PubMed  Google Scholar 

  9. Mootha VK, Lindgren CM, Eriksson KF, Subramanian A, Sihag S, Lehar J, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267–73.

    Article  CAS  PubMed  Google Scholar 

  10. Patti ME, Butte AJ, Crunkhorn S, Cusi K, Berria R, Kashyap S, et al. Coordinated reduction of genes of oxidative metabolism in humans with insulin resistance and diabetes: Potential role of PGC1 and NRF1. Proc Natl Acad Sci U S A. 2003;100(14):8466–71. doi:10.1073/pnas.1032913100.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  11. Bodine SC, Stitt TN, Gonzalez M, Kline WO, Stover GL, Bauerlein R, et al. Akt/mTOR pathway is a crucial regulator of skeletal muscle hypertrophy and can prevent muscle atrophy in vivo. Nat Cell Biol. 2001;3(11):1014–9. doi:10.1038/ncb1101-1014.

    Article  CAS  PubMed  Google Scholar 

  12. Lee MN, Ha SH, Kim J, Koh A, Lee CS, Kim JH, et al. Glycolytic flux signals to mTOR through glyceraldehyde-3-phosphate dehydrogenase-mediated regulation of Rheb. Mol Cell Biol. 2009;29(14):3991–4001. doi:10.1128/MCB.00165-09.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. Lukk M, Kapushesky M, Nikkilä J, Parkinson H, Goncalves A, Huber W, et al. A global map of human gene expression. Nat Biotechnol. 2010;28(4):322–4. doi:10.1038/nbt0410-322.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  14. Zheng-Bradley X, Rung J, Parkinson H, Brazma A. Large scale comparison of global gene expression patterns in human and mouse. Genome Biol. 2010;11(12):R124. doi:10.1186/gb-2010-11-12-r124.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  15. Ojala KA, Kilpinen SK, Kallioniemi OP. Classification of unknown primary tumors with a data-driven method based on a large microarray reference database. Genome Med. 2011;3(9):63. doi:10.1186/gm279.

    Article  PubMed Central  PubMed  Google Scholar 

  16. Kilpinen S, Autio R, Ojala K, Iljin K, Bucher E, Sara H, et al. Systematic bioinformatic analysis of expression levels of 17,330 human genes across 9,783 samples from 175 types of healthy and pathological tissues. Genome Biol. 2008;9(9):R139. doi:10.1186/gb-2008-9-9-r139.

    Article  PubMed Central  PubMed  Google Scholar 

  17. Rung J, Brazma A. Reuse of public genome-wide gene expression data. Nat Rev Genet. 2013;14(2):89–99. doi:10.1038/nrg3394.

    Article  CAS  PubMed  Google Scholar 

  18. Rustici G, Kolesnikov N, Brandizi M, Burdett T, Dylag M, Emam I, et al. ArrayExpress update--trends in database growth and links to data analysis tools. Nucleic Acids Res. 2013;41(Database issue):D987–90. doi:10.1093/nar/gks1174.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  19. Lahti L, Elo LL, Aittokallio T, Kaski S. Probabilistic analysis of probe reliability in differential gene expression studies with short oligonucleotide arrays. IEEE/ACM Trans Comput Biol Bioinform. 2011;8(1):217–25. doi:10.1109/TCBB.2009.38.

    Article  PubMed  Google Scholar 

  20. Lahti L, Torrente A, Elo LL, Brazma A, Rung J. A fully scalable online pre-processing algorithm for short oligonucleotide microarray atlases. Nucleic Acids Res. 2013;41(10), e110. doi:10.1093/nar/gkt229.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  21. Kauffmann A, Gentleman R, Huber W. arrayQualityMetrics—a bioconductor package for quality assessment of microarray data. Bioinformatics. 2009;25(3):415–6. doi:10.1093/bioinformatics/btn647.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  22. GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45(6):580–5. 10.1038/ng.2653.

    Article  Google Scholar 

  23. Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11(3):R25. doi:10.1186/gb-2010-11-3-r25.

    Article  PubMed Central  PubMed  Google Scholar 

  24. Rudy J, Valafar F. Empirical comparison of cross-platform normalization methods for gene expression data. BMC Bioinformatics. 2011;12:467. doi:10.1186/1471-2105-12-467.

    Article  PubMed Central  PubMed  Google Scholar 

  25. Parikh H, Nilsson E, Ling C, Poulsen P, Almgren P, Nittby H, et al. Molecular correlates for maximal oxygen uptake and type 1 fibers. Am J Physiol Endocrinol Metab. 2008;294(6):E1152–9. doi:10.1152/ajpendo.90255.2008.

    Article  CAS  PubMed  Google Scholar 

  26. Horn D, Zhou W, Trevisson E, Al-Ali H, Harris TK, Salviati L, et al. The conserved mitochondrial twin Cx9C protein Cmc2 Is a Cmc1 homologue essential for cytochrome c oxidase biogenesis. J Biol Chem. 2010;285(20):15088–99. doi:10.1074/jbc.M110.104786.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  27. Szekeres F, Chadt A, Tom RZ, Deshmukh AS, Chibalin AV, Björnholm M, et al. The Rab-GTPase-activating protein TBC1D1 regulates skeletal muscle glucose metabolism. Am J Physiol Endocrinol Metab. 2012;303(4):E524–33. doi:10.1152/ajpendo.00605.2011.

    Article  CAS  PubMed  Google Scholar 

  28. An D, Toyoda T, Taylor EB, Yu H, Fujii N, Hirshman MF, et al. TBC1D1 regulates insulin- and contraction-induced glucose transport in mouse skeletal muscle. Diabetes. 2010;59(6):1358–65. doi:10.2337/db09-1266.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  29. Wang N, Yang C, Xie F, Sun L, Su X, Wang Y, et al. Gadd45α: a novel diabetes-associated gene potentially linking diabetic cardiomyopathy and baroreflex dysfunction. PLoS ONE. 2012;7(12), e49077. doi:10.1371/journal.pone.0049077.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  30. Sporrer D, Weber M, Wanninger J, Weigert J, Neumeier M, Stögbauer F, et al. Adiponectin downregulates CD163 whose cellular and soluble forms are elevated in obesity. Eur J Clin Invest. 2009;39(8):671–9. doi:10.1111/j.1365-2362.2009.02170.x.

    Article  CAS  PubMed  Google Scholar 

  31. Bongers KS, Fox DK, Ebert SM, Kunkel SD, Dyle MC, Bullard SA, et al. Skeletal muscle denervation causes skeletal muscle atrophy through a pathway that involves both Gadd45a and HDAC4. Am J Physiol Endocrinol Metab. 2013;305(7):E907–15. doi:10.1152/ajpendo.00380.2013.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  32. Cheng Y, Wang Y, Li Y, Deng Y, Hu J, Mo X, et al. A novel human gene ZNF415 with five isoforms inhibits AP-1- and p53-mediated transcriptional activity. Biochem Biophys Res Commun. 2006;351(1):33–9. doi:10.1016/j.bbrc.2006.09.161.

    Article  CAS  PubMed  Google Scholar 

  33. Møller HJ, Frikke-Schmidt R, Moestrup SK, Nordestgaard BG, Tybjærg-Hansen A. Serum soluble CD163 predicts risk of type 2 diabetes in the general population. Clin Chem. 2011;57(2):291–7. doi:10.1373/clinchem.2010.154724.

    Article  PubMed  Google Scholar 

  34. Short KR, Bigelow ML, Kahl J, Singh R, Coenen-Schimke J, Raghavakaimal S, et al. Decline in skeletal muscle mitochondrial function with aging in humans. Proc Natl Acad Sci U S A. 2005;102(15):5618–23. doi:10.1073/pnas.0501559102.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  35. Johnson ML, Robinson MM, Nair KS. Skeletal muscle aging and the mitochondrion. Trends Endocrinol Metab. 2013;24(5):247–56. doi:10.1016/j.tem.2012.12.003.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  36. Conley KE, Jubrias SA, Esselman PC. Oxidative capacity and ageing in human muscle. J Physiol. 2000;526(Pt 1):203–10.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  37. McDermott-Roe C, Ye J, Ahmed R, Sun XM, Serafín A, Ware J, et al. Endonuclease G is a novel determinant of cardiac hypertrophy and mitochondrial function. Nature. 2011;478(7367):114–8. doi:10.1038/nature10490.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  38. Humphries AD, Streimann IC, Stojanovski D, Johnston AJ, Yano M, Hoogenraad NJ, et al. Dissection of the mitochondrial import and assembly pathway for human Tom40. J Biol Chem. 2005;280(12):11535–43. doi:10.1074/jbc.M413816200.

    Article  CAS  PubMed  Google Scholar 

  39. Joseph AM, Ljubicic V, Adhihetty PJ, Hood DA. Biogenesis of the mitochondrial Tom40 channel in skeletal muscle from aged animals and its adaptability to chronic contractile activity. Am J Physiol Cell Physiol. 2010;298(6):C1308–14. doi:10.1152/ajpcell.00644.2008.

    Article  CAS  PubMed  Google Scholar 

  40. Schaefer L, Ballabio A, Zoghbi HY. Cloning and characterization of a putative human holocytochrome c-type synthetase gene (HCCS) isolated from the critical region for microphthalmia with linear skin defects (MLS). Genomics. 1996;34(2):166–72. doi:10.1006/geno.1996.0261.

    Article  CAS  PubMed  Google Scholar 

  41. Ishii KA, Fumoto T, Iwai K, Takeshita S, Ito M, Shimohata N, et al. Coordination of PGC-1beta and iron uptake in mitochondrial biogenesis and osteoclast activation. Nat Med. 2009;15(3):259–66. doi:10.1038/nm.1910.

    Article  CAS  PubMed  Google Scholar 

  42. Willcox BJ, Donlon TA, He Q, Chen R, Grove JS, Yano K, et al. FOXO3A genotype is strongly associated with human longevity. Proc Natl Acad Sci U S A. 2008;105(37):13987–92. doi:10.1073/pnas.0801030105.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  43. Sandri M, Sandri C, Gilbert A, Skurk C, Calabria E, Picard A, et al. Foxo transcription factors induce the atrophy-related ubiquitin ligase atrogin-1 and cause skeletal muscle atrophy. Cell. 2004;117(3):399–412.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  44. Hamilton SR, Stapleton D, O’Donnell JB, Kung JT, Dalal SR, Kemp BE, et al. An activating mutation in the gamma1 subunit of the AMP-activated protein kinase. FEBS Lett. 2001;500(3):163–8.

    Article  CAS  PubMed  Google Scholar 

  45. Reznick RM, Zong H, Li J, Morino K, Moore IK, Yu HJ, et al. Aging-associated reductions in AMP-activated protein kinase activity and mitochondrial biogenesis. Cell Metab. 2007;5(2):151–6. doi:10.1016/j.cmet.2007.01.008.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  46. Kulkarni SS, Karlsson HK, Szekeres F, Chibalin AV, Krook A, Zierath JR. Suppression of 5′-nucleotidase enzymes promotes AMP-activated protein kinase (AMPK) phosphorylation and metabolism in human and mouse skeletal muscle. J Biol Chem. 2011;286(40):34567–74. doi:10.1074/jbc.M111.268292.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  47. Richter EA, Hargreaves M. Exercise, GLUT4, and skeletal muscle glucose uptake. Physiol Rev. 2013;93(3):993–1017. doi:10.1152/physrev.00038.2012.

    Article  CAS  PubMed  Google Scholar 

  48. Broskey NT, Greggio C, Boss A, Boutant M, Dwyer A, Schlueter L, et al. Skeletal muscle mitochondria in the elderly: effects of physical fitness and exercise training. J Clin Endocrinol Metab. 2014;99(5):1852–61. doi:10.1210/jc.2013-3983.

    Article  CAS  PubMed  Google Scholar 

  49. Dey BR, Spence SL, Nissley P, Furlanetto RW. Interaction of human suppressor of cytokine signaling (SOCS)-2 with the insulin-like growth factor-I receptor. J Biol Chem. 1998;273(37):24095–101.

    Article  CAS  PubMed  Google Scholar 

  50. Haddad F, Adams GR. Aging-sensitive cellular and molecular mechanisms associated with skeletal muscle hypertrophy. J Appl Physiol (1985). 2006;100(4):1188–203. doi:10.1152/japplphysiol.01227.2005.

    Article  CAS  Google Scholar 

  51. Buford TW, Cooke MB, Willoughby DS. Resistance exercise-induced changes of inflammatory gene expression within human skeletal muscle. Eur J Appl Physiol. 2009;107(4):463–71. doi:10.1007/s00421-009-1145-z.

    Article  CAS  PubMed  Google Scholar 

Download references


This work has been funded by the Swedish Research Council Linnaeus grant: Lund University Diabetes Centre (Dnr 349-2006-237) and SFO Exodiab (Dnr 2009–1039), European Research Council (ERC)—Advanced Researcher Grant (GA 269045), NuGo, ALF, the Crafoord Foundation, SV Skånes diabetes förening, the Wallenberg Foundation, the Novo Nordisk Foundation, EXGENESIS, UMAS Fonder, Magn. Bergvalls Stiftelse, Syskonen Svenssons Fond, the Swedish Diabetes Research Foundation (2009–060), Swedish research council, the European Community’s Seventh Framework Programme (FP7/2007-2013), and ENGAGE (HEALTH-F4-2007-201413). The funding agencies had no influence on the design of the study, the collection or analysis of the data, or any other aspect of the presented work.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Ola Hansson.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

JS and CE acquired the data, performed the analysis, and interpreted the data. NO, KS, LL, and AB performed the analysis and interpreted the data. LG designed the study and interpreted the data. JR and OH designed the study, performed the analysis, and interpreted the data. All authors have been involved in drafting the manuscript and/or revising it critically. All authors read and approved the final manuscript.

Jing Su and Carl Ekman contributed equally to this work.

Additional files

Additional file 1: Supplementary material

FigureS S1-4 and Table S1 (DOCX 385 kb)

Additional file 2: Table S2

957 genes significantly associated with age across 340 samples (XLSX 257 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Su, J., Ekman, C., Oskolkov, N. et al. A novel atlas of gene expression in human skeletal muscle reveals molecular changes associated with aging. Skeletal Muscle 5, 35 (2015).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: