MicroRNA Predictors of Longevity in

Download PDF České info

Neither genetic nor environmental factors fully account for variability in individual longevity:
genetically identical invertebrates in homogenous environments often experience no less variability in lifespan than outbred human populations. Such variability is often assumed to result from stochasticity in damage accumulation over time; however, the identification of early-life gene expression states that predict future longevity would suggest that lifespan is least in part epigenetically determined. Such “biomarkers of aging,” genetic or otherwise, nevertheless remain rare. In this work, we sought early-life differences in organismal robustness in unperturbed individuals and examined the utility of microRNAs, known regulators of lifespan, development, and robustness, as aging biomarkers. We quantitatively examined Caenorhabditis elegans reared individually in a novel apparatus and observed throughout their lives. Early-to-mid–adulthood measures of homeostatic ability jointly predict 62% of longevity variability. Though correlated, markers of growth/muscle maintenance and of metabolic by-products (“age pigments”) report independently on lifespan, suggesting that graceful aging is not a single process. We further identified three microRNAs in which early-adulthood expression patterns individually predict up to 47% of lifespan differences. Though expression of each increases throughout this time, mir-71 and mir-246 correlate with lifespan, while mir-239 anti-correlates. Two of these three microRNA “biomarkers of aging” act upstream in insulin/IGF-1–like signaling (IIS) and other known longevity pathways, thus we infer that these microRNAs not only report on but also likely determine longevity. Thus, fluctuations in early-life IIS, due to variation in these microRNAs and from other causes, may determine individual lifespan.

Published in the journal: . PLoS Genet 7(9): e32767. doi:10.1371/journal.pgen.1002306
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1002306

Summary

Introduction

Inter-individual variation in human longevity has not been found to be under substantial genetic control, with heritability generally between 15% and 30% [1],[2]. At the same time, shared environmental factors contribute little in these human studies, and can be completely controlled in large-scale experiments on inbred invertebrates without abrogating lifespan variability [3]–[5]. Indeed, rearing Caenorhabditis elegans in a homogenous, chemically defined liquid medium more than doubles the coefficient of variability in lifespan compared to feeding the animals live bacteria on solid agar (a less-homogenous environment) [6]. As the external environment of C. elegans can be easily controlled, the genetics of its lifespan are well understood [7], and its developmental plan is famously invariant, this nematode is an ideal organism in which to investigate how and when individuality arises, and how these differences produce a phenotype as variable as lifespan [8].

The identification of “biomarkers of longevity” –⁠ measurable parameters that predict individual longevity better than chronological age [9] –⁠ will help pinpoint genetic and physiological processes that promote or defer senescent decline. Further, such biomarkers may help clarify whether lifespan differences are simply the result of variable accumulation of damage over time, or whether they may also result from gene-regulatory states, potentially set early in life, that determine individual robustness [10]. To date, most identified and proposed biomarkers in C. elegans have largely been phenomenological, downstream indicators of homeostatic maintenance. One important class of such markers is locomotory function. Herndon and colleagues showed that a qualitative evaluation of individual locomotory ability correlates with remaining lifespan of same-aged animals, and, moreover, that these movement classes correlate with the degree of sarcopenia (decline in muscle mass and function) in those individuals [11]. Later work showed quantitative correlations between the rate of decrease in body movement and lifespan [12], as well as between the span of functional pharyngeal pumping or body movement and lifespan [13], [14]. In addition to muscle decline, general decreases in macromolecular homeostasis have long been observed in aging C. elegans via increases in non-hydrolysable, autofluorescent “age pigments” such as lipofuscin [15] in intestinal lysosomes [16]–[18]. Lipofuscin accumulation correlates with the qualitative movement classes defined in Herndon et al. [19], though such accumulation has not been directly shown to predict an individual's future longevity, and in one recent work was specifically not found be predictive of longevity [20]. (This last observation was made of green-wavelength autofluorescence, which is more specific for flavin compounds, while lipofuscin per se fluoresces most strongly in blue wavelengths [19], [21].) Lastly, animals that reach their final adult size more rapidly have shorter lifespans [14]. As part of this work, we systematically validated adult growth and movement rates, tissue homeostasis, and age pigment accumulation as phenomenological biomarkers of longevity in nematodes, and further, by measuring multiple biomarkers per individual, deduced the relationships among these markers.

While they may suggest clinically relevant markers of human aging, such measurements do little to elucidate the genetic mechanisms underlying lifespan variability. Transcriptional profiling of aging C. elegans has suggested sets of genes that change expression during aging and may thus report an animal's “physiological age” [22]–[24]. The definitive test remains demonstrating that a particular gene's expression level predicts future longevity on an individual basis. The first genetic predictor of individual lifespan was identified by Rea and colleagues, who demonstrated that the ability to upregulate a reporter for expression of the heat-shock protein hsp-16.2 after a mild heat stress correlates with post-stress longevity [25]. However, such stress also induces a protective effect [26]. Thus it is not clear whether the measured effect reflects innate differences in “heat-shock response capacity”, which in un-stressed animals might also correlate with future longevity, or whether the degree of heat-shock response is determined stochastically at the time of the stress. Recently, mid-life expression variation in several additional genes in C. elegans, including daf-16 and its well-known target sod-3, have been shown to predict future longevity in un-perturbed individuals [20]. Therefore, we sought regulatory factors further upstream that might have constitutive activities that determine robustness to damage and/or longevity in unperturbed animals.

MicroRNAs (miRNAs) –⁠ short non-coding RNAs that bind to and regulate the expression of target mRNAs –⁠ have been proposed as determinants of organismal robustness to environmental variation [27], a prediction that has been borne out experimentally [28], [29]. Similarly, miRNAs may regulate longevity by determining individual capacities to respond to damage [30]. lin-4 was the first miRNA to be shown to regulate lifespan and stress-resistance, through its action on the insulin/IGF-1-like signaling (IIS) pathway [31], which is well known for its role in longevity determination [32]–[34]. Many miRNAs change expression levels during aging in C. elegans [35], and recently mir-71, mir-239, and mir-246, all of which increase in expression over time, have been shown to promote (mir-71, mir-246) and antagonize (mir-239) longevity and stress-resistance, through IIS (mir-71, mir-239) and the DNA damage response pathway (mir-71) [36]. Further, miRNAs in other contexts have proven to be able biomarkers of various human pathologies [37]–[40] and perhaps also aging [41]. Here we report that mir-71, mir-239, and mir-246 expression profiles, measured by promoter::GFP reporter constructs, predict individual longevity in C. elegans.

Results

Lifelong Observation of Individual C. elegans

To determine early-life correlates of eventual longevity, we developed a minimally invasive individual-nematode culture system (Figure 1A) that allows in situ imaging of freely moving, unanesthetized animals. Briefly, single eggs at the pre-hatch “pretzel” stage and a bacterial food source are deposited atop PEG-1000-methacrylate hydrogel pads embedded in and crosslinked to a glass slide (see Materials and Methods). The top of the slide is covered with liquid polydimethylsiloxane (PDMS), which polymerizes in approximately 12 hours at 23°C to yield a thin, transparent, and gas-permeable membrane that reduces desiccation and prevents contamination. (All ages reported in this work refer to time after slide preparation; as approximately 98% of viable eggs hatched within 5 hours, we simply report this as “age post-hatch.”) All strains were crossed into the temperature-sensitive fertility-defective strain spe-9(hc88) and all assays were conducted at 23°C to prevent reproduction [42]. We obtained good developmental synchrony with this method; after 40 hours, most animals are near the middle of the 4^th-larval stage, based on vulval morphology (not shown). The mean lifespan of 10.7 days at 23°C in this apparatus is similar to that in standard culture conditions, according to previous reports and our own controls (see Materials and Methods).

**Fig. 1. Single-animal vermiculture and measurement.**

At each timepoint, brightfield/fluorescence image pairs were acquired for each animal, and movement rates and health status evaluated by examining motion after stimulation with 0.25 seconds of green light; animals that did not respond were deemed to have perished (Figure S1). Figure 1B illustrates the distribution of lifespans of 463 individuals cultured in this apparatus and the corresponding survival curve.

In this fashion, measurements can be made on individual animals throughout their lives and correlated with eventual longevity. In particular, the position of each animal was identified in brightfield images via custom semi-automated software, allowing quantification of various morphological and image-based features. As an example, Figure 1C illustrates the length of one particular animal measured at daily intervals from hatching until death. In attempting to determine correlates of future longevity, we focused on measurements made during days 3–7 post-hatch, stretching from the attainment of adulthood (beginning of the reproductive period) to the onset of mortality. Less than 3% of the animals die before day 7; while measurements made later often correlate better with remaining longevity, they are of more limited utility as more of the study population has died before the measurements could be made. The day 3–7 range is illustrated in Figure 1C along with the two measures we employed to summarize data in this range: the mean level of a particular measurement over that span, and the slope of a least-squares linear fit of the data in that span.

As noted previously [14], we observed that adult C. elegans tend to shrink over time (Figure 1D). This shortening is not observed in the length distributions of heat-killed animals of different ages [43]; we also anecdotally observed that animals often “relax” and lengthen after death. Thus, the shortening appears to reflect a physiological process and not an actual change in the size of the cuticle. As such, we wished, as an illustrative test case, to examine whether size and/or size maintenance over time had any relevance for eventual lifespan. Figure 1E illustrates a retrospective analysis: the average length-versus-time profile is shown for animals grouped according to the number of days lived. Quite clearly, longer-lived animals are both markedly larger than their short-lived siblings and better able to maintain their length over time. This analysis can be made prospectively as well: the slope of the length vs. time curve between days 3 and 7 (as per Figure 1C) correlates well with each animal's future longevity (Figure 1F). Specifically, 27% of overall lifespan variability is accounted for by the days 3–7 length slope alone. (This correlation, and all others shown, is the aggregate of several trials, described in Table S1; per-trial results are given in Figure S3.) The mean length over that time range also correlates positively with lifespan; including both in the regression analysis increases the R² measure of lifespan-predictive ability to 32%. We found similar correlations with volume and surface area; however, length is the most robust.

Note that the R² value is often an over-optimistic estimate of how well a model will predict values from future data, due to “over-fitting” of particular features of the original dataset, particularly with least-squares models, small or outlier-prone datasets, and/or multiple independent parameters. We therefore also estimated future predictive ability via leave-one-out (l.o.o.) cross-validation, in which the prediction for each data point is generated from a model constructed using all other data points. For the length measurements the l.o.o. R² is 31%.

Finally, regression models predict lifespan quantitatively; we can simplify this to a categorical measure to ask how well above -⁠ or below-average predicted longevity translates to actual longevity. Figure S2A shows the distributions of observed lifespans for animals with above-average and below-average predicted longevity based the two length measurements (days 3–7 slope and mean); Figure S2B illustrates the corresponding survival curves. We find that above-average length-predicted lifespan is 71% sensitive and specific for above-average longevity. (Defining the test about the average predicted and measured values yields balanced sensitivity and specificity; other thresholds trade off between the two.) The above-average-predicted-lifespan cohort has a 17% increase in mean lifespan compared to the below-average cohort.

Phenomenological Predictors of Longevity

It had been previously speculated that that age pigments, known to correlate with the current health state, will be predictive of future longevity [19], though this was not borne out in a recent study [20]. We therefore tested accumulation of autofluorescent age pigments (imaged through a red filter set and apparent in gut granules and in aged gonads; see Materials and Methods and Figure S4). Figure 2A shows the patterns in autofluorescent age pigment accumulation in two individuals, computationally straightened and fit to the average day-5 shape and size for visualization, between days 3 and 7 post-hatch; Figure 2B shows the pigment accumulation trends for cohorts with different lifespans. (These measurements are of the 95^th percentile of pixel intensity within the defined “worm region” of the original images; here and in all subsequent cases, other measures such as mean or median yield similar results.) Clearly, the longer-lived animals have lower absolute levels of pigmentation, even early in life, and also lower rates of increase in pigment levels. Prospectively, animals with higher levels of autofluorescence, and also those with higher rates of autofluorescence accumulation, in days 3–7 post hatch are likely to experience shorter lifespans (Figure 2C).

**Fig. 2. Phenomenological predictors of nematode longevity.**

Various measures of movement rates have been shown to predict future longevity [11]–[14]; we attempted to replicate this finding by calculating a daily movement score from pixel intensity differences in sequential images (see Materials and Methods); higher scores indicate more movement (Figure 2D). We found that the mean and slope of the motion score, days 3–7, positively correlate with eventual longevity (Figure 2E). That is, high movement rates and maintenance of these rates through mid-adulthood are markers of longer lifespan, strengthening the conclusions from previous studies. Finally, aged C. elegans have a very typical “decrepit” appearance in brightfield images [11], [18]. As quantitative measurements of image texture have previously been used as proxies of age-related tissue deterioration (in particular, sarcopenia) in nematodes [44], [45], we defined a daily measure of whole-animal textural decrepitude (see Materials and Methods and Figure 2F) which, examined between days 3 and 7, predicts a sizable fraction of longevity variation (Figure 2G): more deteriorated-appearing animals (high mean decrepitude days 3–7) and those that more rapidly become so (positive slope) are shorter-lived.

Overall, each of the individual measurements shown in Figure 1 and Figure 2 consists of a phenomenological evaluation of one or more aspects of nematode health states, encompassing sarcopenia (motion, image texture, size), tissue maintenance (image texture, size), and autophagocytic ability (autofluorescence accumulation). Because these measurements of tissue and cellular homeostasis are integrative and relatively “downstream”, they provide powerful mid-life predictors of eventual longevity; however, they yield few clues regarding the origin of individual differences in longevity.

mir-71 Expression Levels and Spatial Patterns Predict Lifespan

The miRNA mir-71 increases rapidly in expression during larval development, peaks at early to mid-adulthood, and then gradually declines (Figure 3A, 3B and [36]). Beyond differences in lifespan and stress-resistance [36], mir-71 mutant animals appear phenotypically wild-type [46]. Further, mir-71 genetically interacts with IIS downstream of the insulin receptor homolog daf-2 but upstream of daf-16, the FOXO transcription factor that is a major IIS effector [36]. miR-71 is predicted to target several genes in the IIS pathway; of these, pdk-1 levels are greatly increased in aged animals lacking mir-71 [36]. Additionally, mir-71 appears to be both a downstream target of and a regulator of DNA damage responses via CDC-25.1 [36]. A transgenic reporter, mir-71::GFP, containing the promoter of mir-71 driving GFP expression, was previously characterized [36], [47]. Though ubiquitously expressed during adulthood, mir-71::GFP expression is most prominent in the hypodermis, pharynx, vulva, intestinal, and tail cells [36], [47], [48].

<i>mir-71</i>::GFP levels and expression patterns predict longevity. — **Fig. 3. *mir-71*::GFP levels and expression patterns predict longevity.**

Longer-lived cohorts have distinctly different temporal patterns of mir-71::GFP expression compared to shorter-lived siblings (Figure 3B). Expression levels are here quantified as the 95^th percentile of pixel intensities in the worm's head region; other measurements (mean, median, etc.), and/or aggregating across the whole body, produce similar results though we find this to be the most robust. Specifically, retention of “youthful” mir-71 states, both in terms of high levels of mir-71::GFP expression and of maintenance of these levels through mid-adulthood, correlates with longevity. Prospectively, both the mean mir-71::GFP expression levels and the change in these levels between days 3 and 7 both correlate with future lifespan variability (Figure 3C and 3D); together these two parameters predict 35% of lifespan variation (the l.o.o. value is 32%). Animals with higher or longer-lasting mir-71::GFP expression tend to live longer, consistent with the known role of miR-71 in promoting lifespan [36].

To further examine and quantify trends in mir-71::GFP expression patterns, we used principal components analysis (PCA). This procedure is conceptually similar to one described recently [49], which employed hierarchical clustering instead of PCA. We used 979 fluorescent images from 146 individual animals, controlled for individual differences in size, shape, internal compression due to locomotion, and overall mir-71::GFP expression level (see Materials and Methods) so that the analysis captured trends only in the spatial pattern of expression. The first principal component, which by definition explains the single largest correlated trend in the dataset (18% of total expression-pattern variability in this case), captures a transition from a highly specific head/vulva/tail expression pattern to more diffuse whole-body expression (Figure 3E). As the individuals shown in Figure 3A illustrate, there is both low-level whole-body background expression of mir-71::GFP that remains relatively constant over time, and strong head/vulva/tail-specific expression that peaks and declines. The position of the mir-71::GFP expression pattern along this principal component (“PC score”), in terms of standard deviations above or below the mean expression pattern (Figure 3E), therefore quantifies the degree of strong, tissue-specific expression at a given day. Figure 3F clearly shows that longer-lived cohorts have more positive and increasing scores, corresponding to high (and increasing) degrees tissue-specificity of expression, while short-lived cohorts have more negative and decreasing scores. Quantitatively, maintenance of head/vulva/tail expression (measured here by the slope of days 3–7 PC scores, though in general other approaches could be employed), and the average overall degree of head/vulva/tail expression (mean PC score day 3–7), are highly correlated with longevity (R² = 29% for slope and 39% for mean); jointly they predict 47% of individual longevity variation (Figure 3G; l.o.o. 45%). Animals with above-average predicted longevities based on these two measurements have substantially different observed lifespans than those with below-average predictions, illustrating the utility of these measures as a diagnostic test of longevity (Figure S2D; the difference in mean lifespan between these two cohorts is 20%).

We confirmed that GFP expression alone does not predict lifespan by examining animals bearing the mIs10 transgene, which contains a myo-2 promoter driving GFP expression in the larval and adult head, a gut enhancer driving intestinal GFP expression in the adult, and pes-10::GFP, which is expressed embryonically. We found no whole-body or head-only summary of GFP (mean/median/95^th percentile/etc.) that, measured in terms of mean or slope over days 3–7 (or various other ranges), predicts longevity to any significant or substantial degree in this dataset (not shown). In addition, other miRNA promoter::GFP fusions correlate (and anti-correlate) with longevity to different degrees (below).

If the primary lifespan-determining target of miR-71 regulation is IIS, then mir-71::GFP expression patterns should no longer correlate with lifespan absent IIS. We tested this by examining mir-71::GFP fluorescence in a daf-16(mu86) background, which lacks this primary IIS effector. Because daf-16 lies extremely close to the spe-9 genomic locus, it was impractical to construct a mir-71::GFP; daf-16; spe-9 strain. Thus, we modified our experimental protocol to use and to allow for the placement of synchronized young-adult animals onto gel pads treated with the drug 5-fluoro-2′-deoxyuridine (FUDR) to prevent reproduction in the culture apparatus (see Materials and Methods). In this regime, lifespan was somewhat extended, along with a concomitant “stretching out” of the rise-peak-fall temporal expression pattern of mir-71::GFP intensity (Figure S5). This may be at least partially due to the FUDR, which has been shown to extend lifespan in an environment-dependent fashion [50]. Nevertheless, mir-71::GFP levels remain predictive of longevity after the day 3–7 measurement window is adjusted to account for the lifespan extension (Figure S5A and Figure 4B).

<i>mir-71</i>::GFP levels do not positively correlate with longevity absent DAF-16. — **Fig. 4. *mir-71*::GFP levels do not positively correlate with longevity absent DAF-16.**

However, in the daf-16(mu86) background, mir-71::GFP expression differences were no longer apparent between cohorts with different lifespans (Figure 4A). Because daf-16(mu86) animals are short-lived, we adjusted the “early adulthood” window in which measurements of GFP expression were made to match (Figure S5A); in this window, neither the increase in (Figure 4B), nor the mean level of (not shown), mir-71::GFP positively correlated with lifespan. If no accounting for shortened lifespan is made (and, indeed, we observe no compression of the rise-peak-fall temporal expression pattern of mir-71::GFP in the daf-16 background; Figure S5B), we found that GFP expression changes anti-correlate with lifespan (Figure 4B). This anti-correlation appears to be driven by a small subpopulation of the most short-lived animals which have very high mir-71::GFP expression levels. Thus, in a daf-16 null background, the predictive power of mir-71::GFP expression is either suppressed, or, potentially, reversed.

Finally, we note that compared to matched mir-71::GFP controls, mir-71::GFP; daf-16(mu86) animals had approximately double the peak GFP expression levels (Figure S5B), though the shape of the temporal pattern (Figure S5B) and spatial distribution (Figure S5C) of expression remained extremely similar. This may suggest negative-feedback regulation of miR-71 by DAF-16 or one of its targets.

Rates of mir-246/mir-239 Increase Promote/Antagonize Individual Longevity

Much like mir-71, mir-246 mutants appear phenotypically wild-type except for decreased longevity and stress-resistance [36], [46]. The expression of miR-246 increases over time, and a mir-246::GFP construct shows that the gene is expressed in the gonadal sheath [36], [47]. Our detailed analysis of mir-246::GFP in individual animals shows a gradual plateauing of mir-246 expression in late adulthood (Figure 5A and 5B), but, unlike mir-71, no concomitant loss of tissue specificity. We find that animals in which mir-246::GFP levels plateau more slowly (measured by the slope of mir-246::GFP 95^th-percentile fluorescence intensity between days 3 and 7) are relatively longer-lived: change in mir-246 expression in this time range predicts 20% of total longevity variation (l.o.o. 18%; Figure 5B and 5C). The mean level of mir-246::GFP between days 3 and 7, however, does not clearly predict longevity. Additionally, we observed that while the distributions of lifespans for animals with slow -⁠ vs. fast-increasing mir-246::GFP expression are significantly different (Figure S2E), they have nearly identical modal values; however, a subset with particularly early mortality appears to be associated with low mir-246::GFP slopes. None of the principal components of spatial expression variability correlate with longevity: mir-246::GFP expression is highly tissue-specific, and so the principal components predominantly capture uninteresting variations in the internal position of the gonad sheath.

Changes in <i>mir-246</i>::GFP and <i>mir-239</i>::GFP expression over time predict longevity. — **Fig. 5. Changes in *mir-246*::GFP and *mir-239*::GFP expression over time predict longevity.**

Unlike mir-71 and mir-246, mir-239 antagonizes longevity: mutants lacking the identical mir-239a and mir-239b sequences have increased lifespan and stress-resistance (though, again, no other clear phenotypes) [36], [46]. Genetic experiments suggest that mir-239 is downstream of daf-2 and upstream of daf-16; miR-239 may promote IIS (which is anti-longevity) via indirectly increasing levels of the cytoplasmic IIS transduction components AGE-1 (the catalytic subunit of phosphatidylinositol 3-kinase) and PDK-1. mir-239::GFP expression is predominantly in several head and tail neurons, with lower levels in pharyngeal and gut tissues [36]. While mir-71 and mir-246 levels peak or plateau, respectively, at mid-life (Figure 3A, 3B and Figure 5A, 5B), we observe that mir-239::GFP expression levels drift upward over time (Figure 5D and 5E). Consistent with its role as a lifespan antagonist, higher mir-239::GFP levels correlate with shorter lifespans. As there is relatively little variability in mir-239::GFP expression at post-hatch day 3, the days 3–7 increase and the day-7 magnitude of expression capture similar information, and both predict approximately 10% of longevity variability. For the slope measure, the l.o.o. R² is 7% (Figure 5F). Though this value is somewhat low compared to the others reported, the lifespans of high and low mir-239-expressing animals remain significantly and substantially different, with a difference in means of approximately one day (Figure S2F), or ∼10% of the average lifespan; moreover, as a diagnostic test for above-average longevity, above-average mir-239::GFP slope performs nearly as well as mir-246::GFP or the motion -⁠ or texture-based measures (Figure S2). As with mir-246::GFP, PCA applied to mir-239::GFP images did not yield any expression-pattern trends predictive of future longevity.

Relationships among Longevity Biomarkers

We measured many of the reported biomarkers in the same animals, enabling us to construct a multivariate predictor of nematode longevity, which we term the “survival prediction index”. This model, incorporating the length, motion, texture, and autofluorescence-accumulation measurements described in Figure 1 and Figure 2, predicts 62% of all lifespan variability across all datasets for which these parameters were measured (l.o.o. 57%; Figure 6A), and above-average “survival prediction indices” divide the animals into well-delineated long-lived and short-lived subgroups (Figure S2G; mean lifespan is 22% increased in the high-survival-index cohort compared to the low-index cohort). The relative importance of each biomarker can be inferred by examination of the relative regression weights: autofluorescence (slope): −0.302; length (slope): 0.266; length (mean): 0.174; motion (mean): 0.149; motion (slope): 0.137; texture (mean): 0.121; texture (slope): 0.077. (To render regression weights directly comparable, all input parameter values were expressed in unit-free terms of standard deviations from their mean; including mean autofluorescence values did not improve lifespan-predictive ability.) Overall, the measures of size and age pigments dominate (for these measures alone, lifespan-prediction R² = 55%, l.o.o. 53%).

**Fig. 6. Multivariate lifespan predictions and relationships between biomarkers.**

Further, we find that adding mir-71 or mir-239 measurements to those above do not improve the lifespan predictions of the model (not shown), indicating that these measurements provide information that is also captured by the downstream, phenomenological markers (see also below). However, adding mir-246 slope measurements adds at least 5% to the R² value attained from the phenomenological markers, which suggests that miR-246 promotes longevity via more than just the mechanisms reported on by length, texture, motion, and autofluorescence.

We formalized this analysis by inferring the conditional independencies of various parameters from partial correlations. That is, while all of the biomarkers correlate with one another, it is possible to statistically infer whether these relationships are direct or indirect. For example: mir-71 levels correlate with longevity, but this correlation is largely abrogated when length maintenance is controlled for; conversely, controlling for mir-71 reduces the correlation between length and lifespan to a much lesser degree (see Table S2, which lists the correlation of each marker with longevity after various controls). Therefore either mir-71 influences length, which then influences longevity, or length variability is an upstream cause of both mir-71 variability and lifespan variability. To systematically evaluate these interactions, we constructed a partial correlation network (also known as a graphical Gaussian model) from our data (Figure 6B; see Materials and Methods) [51]. We have not yet directly evaluated the relationship between the different miRNAs, which requires measuring the promoter activity of these genes in the same animals. We can, however, indirectly infer the relationship between these miRNAs via intervening factors measured in all datasets: the network in Figure 6B reflects the consensus of networks calculated from each dataset (Figure S6). While the relatively weak relationship between mir-239 and lifespan prevents its accurate placement into the network, other trends are clear. Age-pigment accumulation, motion, and texture/mir-71/length provide relatively independent information about lifespan from one another, though there may be some relation between the motion and texture scores (Figure S6). Including mir-246 decreases the link between length and lifespan (Figure S6 and Table S2), suggesting that, while its precise position is indeterminate, mir-246 is either an upstream cause of both length and lifespan variation, or mir-246 levels are determined by physiological processes that regulate length maintenance, and these levels more directly control lifespan.

Finally, we examined the timing of each measure's lifespan-predictive ability: do some measurements provide earlier hints at longevity than others? The accuracy of lifespan prediction for each measure (measured as the R² value) is plotted versus time in Figure 6C. For this analysis, we did not manually determine one or two summaries of the time-course of biomarker measurements (e.g. “mean length between days 3 and 7”), but instead predicted longevity using all of the raw time-course data up to a given age (e.g. to evaluate the predictive ability of length up to day 4, we regressed lifespan against the lengths measured at days 2, 3, and 4). We used ridge-regression with an automatically-determined penalty in order to prevent over-fitting due to the increased number of parameters at later time-points (see Materials and Methods); note however that the R² values are in some cases more optimistic compared to the simpler measurements defined earlier.

Based on this analysis, we observe that the various predictors of longevity differ markedly in their timing: the mir-71 PCA scores provide the earliest, and best, predictors of lifespan, while mir-246 and motion provide the latest-onset information about future lifespan.

Discussion

We used a novel culture system that allowed us to quantitatively examine individual nematodes throughout time and directly correlate inter-individual variability in early life with variability in eventual lifespan. Through this observational study, we identified phenotypic markers that, measured while >97% of the population remains alive, predict over 60% of future individual longevity variability. In addition to confirming markers that previously had been shown to correlate with lifespan (movement rates [12]–[14]) and widely suspected to do so (visual decrepitude [11], [18], [45], age pigment accumulation [17]–[19], [43]) we have also found a novel biomarker of longevity in maintenance of adult size through mid-life (Figure 1E and 1F). This dovetails with the recent finding that juvenile “growth span” (time to reach adult size) also predicts lifespan [14].

Our results regarding age pigmentation are inconsistent with others recently reported [20], an observation which may be explained by the fact that we measured age pigmentation in the red spectral range (see Materials and Methods), while Sánchez-Blanco and Kim examined green autofluorescence. Note that lipofuscin, per se (as opposed to other autofluorescent “age pigments”), fluoresces most strongly in blue ranges [21]. This indicates the potential heterogeneity of age pigmentation across the color spectrum.

Seeking a more detailed understanding of the genetics of individual longevity determination, we found that fluctuations in the levels of miR-71, 239, and 246 (imputed via promoter::GFP reporters) predict a substantial portion of longevity. To date, only one other group has reported any genes, in any species, in which endogenous, un-perturbed fluctuations in expression level have been reported to be predictive of future longevity [20]. The ability of expression states to predict later longevity quite early in life further suggests that some fraction of longevity variance is indeed the result of developmentally determined epigenetic states of “robustness” or “frailty”. This hypothesis has been raised theoretically [52], and the degree of post-heat-shock longevity-extension (correlated with HSP-16.2 expression [25]) has been shown to fit this model well [53], [54].

In particular, the degree of strong, tissue-specific mir-71 expression in the head, vulva, and tail is the single most powerful biomarker identified, explaining 47% of future longevity (Figure 3F and 3G), and also the earliest indicator of future longevity (Figure 6C). Combining these findings, which are based on observation of essentially unperturbed individuals, with earlier knockout studies that demonstrate that abrogation of these miRNAs alters mean lifespan [36], we infer that individual, wild-type variability in the expression of these regulators not only reports on longevity but may likely determine it as well. (In all cases, the correlation or anti-correlation of miRNA levels with longevity is in the same direction as suggested by previous knockout and overexpression studies [36].) Note that it is not necessarily the case that genes that alter lifespan when removed or artificially overexpressed will determine individual lifespan in wild-type conditions: core health-determining genes may be tightly regulated to have low inter-individual variation, or that variability may be buffered downstream and not translate into altered longevity.

In the case of mir-71 and mir-239, the mechanism of individual lifespan determination may be via the well-known insulin/IGF-1-like signaling pathway (IIS) [36], the absence of which promotes longevity via stress-response and macromolecular homeostatic mechanisms, among others [55]–[57]. In particular, mir-71 knockout led to increased expression of components of the IIS transduction machinery [36], and we here found that daf-16 null animals have increased levels of mir-71::GFP expression. These observations suggest that DAF-16 and miR-71 levels are held in homeostatic balance via a mutually regulatory feedback loop. Thus, it is possible that inter-individual fluctuations in miR-71 levels may directly determine inter-individual levels of tonic IIS activity, and hence, individual lifespan. This conclusion is strengthened by our finding that, absent DAF-16, mir-71::GFP is no longer predictive of longevity. One final piece of evidence for the role of IIS in lifespan determination comes in the work of Sánchez-Blanco and Kim [20], which identifies daf-16 and sod-3, a canonical DAF-16 target, as among the best predictors of individual longevity. (In fact, sod-3 reporter expression is their strongest predictor reported, with an R²-value of 0.32.)

Further, mir-71 may also act by downregulating cell-cycle checkpoint proteins [36], the absence of which also promotes stress-response factors, even in postmitotic cells [58]. The early predictive ability of mir-71::GFP suggests that mir-71, and thus either or both of the signaling pathways it regulates, may play an early-life role in determining organismal robustness and longevity.

According to our network analysis, image texture, mir-71::GFP expression, and size are all closely related, with size as the most downstream measure of what we take to be the age-regulated tissue disorganization and sarcopenia reported previously [11], [18]. Overall, and especially given mir-71's role in regulating pathways that determine levels of stress responses, we suspect that this nexus of predictors reflects processes of somatic growth and maintenance. However, while both mir-71 and mir-246 levels are correlated with length maintenance, mir-246, unlike mir-71, provides additional information about longevity not captured by length or other “somatic maintenance” features. Given this, the relatively late (i.e. post-reproductive) timing of its predictive ability, the sharp upregulation of mir-246::GFP expression at reproductive maturity (Figure 5A, 5B and [47]), and its localization literally at the interface between the gonad and other somatic tissues, it is tempting to speculate that mir-246 is involved in balancing reproduction and somatic maintenance [32], [59]. Further, we find that age pigment accumulation also provides a degree of information about lifespan not captured by the “somatic maintenance” nexus, suggesting that it, too, reflects an inter-related but parallel lifespan-determining process.

It is also worth examining these results in light of those of our previous work, which compared miRNA expression over time in wild-type and long-lived daf-2 IIS mutant animals [36]. Often, daf-2 animals are taken as a paradigm case of “long-lived” animals, and the presence or absence of a physiological feature in these mutants assumed indicative of a role as a biomarker of successful aging. While in many cases this is undoubtedly so (for example, decreases in rates of lipofuscin accumulation in these animals [19]), our results yield the unsurprising finding that the relationship between physiological events in daf-2 mutants and long-lived wild-type animals are inexact. Specifically, we here find clear evidence that elevated levels of mir-71 and mir-246 expression are associated with extended longevity, yet in daf-2 animals these miRNAs are not upregulated, likely reflecting their upstream, negative-regulatory roles in insulin signaling [36]. Similarly, we find that daf-16 animals are physiologically quite dissimilar from short-lived wild-type individuals in terms of mir-71::GFP expression. Intact individuals which are short-lived typically have low mir-71::GFP levels, while daf-16 animals show a dramatic elevation in mir-71::GFP. Mechanistically, this may be due to disrupted negative feedback from DAF-16 (or a target) on mir-71; pragmatically it suggests the limitations of inference about the physiology of intact animals based on findings in particular mutants. In this case, based only on the average difference in mir-71::GFP in daf-16 vs. wild-type, one might incorrectly conclude that high miR-71 levels are a marker of short lifespan. We thus believe there is great utility to the quantitative observation of individual wild-type (or nearly so) animals.

Lastly, in all cases observed, it is the retention of young-adult-like trends (high and/or increasing length, mir-71 and 246 expression; low or slowly-increasing autofluorescence and mir-239) into mid -⁠ and late-adulthood that predicts longer lifespan. Classic antagonistic pleiotropy theories of aging posit that some age-related degeneration may be due to alleles which are beneficial in early life but become damaging over time (“live fast, die young” effects) [60]; however, in the examples here presented we find that the loss of youthful biometrics, not their continuation, proves most harmful.

Thus, from this and previous studies [36], these miRNAs appear to be relatively upstream regulators of lifespan-determining pathways that are relevant to the determination of inter-individual variation in nematode lifespans. The miRNAs we have identified are not well conserved in higher animals so these particular mechanisms of lifespan determination are likely nematode-specific. However, given the conserved nature of aging pathways across phylogeny, our work does suggest that fluctuations in other regulators of these pathways (including other miRNAs) may predict or determine individual aging rates in more complex organisms, perhaps foreshadowing or even controlling the timing of age-related decline in humans.

Materials and Methods

Single-Animal Vermiculture

Waterjet-cut borosilicate glass slides were obtained from Advanced Waterjet and Engraving (Anaheim, CA). Before and after use, slides were soaked in a base bath (2 : 5 isopropanol:H₂O, 1.5M KOH) to remove all organic material. Prior to use, the slides were treated to functionalize the glass surface with reactive methacryl groups as follows: slides were rinsed in distilled H₂O (dH₂O) and submerged in 5% HCl (aq.) for 10 min to protonate surface hydroxyls, rinsed again in dH₂O and then submerged with agitation for 2 min. in methacryl silane solution (2% ^v/_v 3-methacryloxypropyltrimethoxysilane [Z-6030, Dow-Corning; Midland MI] in 95% ethanol with 0.02% ^v/_v glacial acetic acid, made fresh and stirred vigorously for 10 min. immediately prior to use). The slides were then rinsed in 95% ethanol, heated at 110°C to effect the condensation of the silane reagent to the glass surface, and stored with desiccant. Prior to use, one side of each slide was sealed with Scotch Premium Performance packing tape (3M; St. Paul, MN).

We use a methacryl-difunctional polyethylene glycol to create a crosslinked hydrogel [61] that, when polymerized in the methacryl-derivatized glass wells, crosslinks also to the sides of the wells. This prevents the high rate of escape down the sides of the wells that we observe when using agar gels. Agar-free but otherwise standard nematode growth media [62] was supplemented with 4% ^w/_v dimethacryl PEG-1000 (Polysciences; Warrington, PA), 4% ^w/_v monomethacryl PEG-1100 (Sigma-Aldrich; St. Louis, MO) as a plasticizer, and 0.1% ^w/_v 1-[4-(2-hydroxyethoxy)-phenyl]-2-hydroxy-2-methyl-1-propane-1-one (Irgacure 2959, BASF; Ludwigshafen, Germany), a water-soluble, photo-activatable crosslinking initiator. This PEG-NGM was filter-sterilized and pipetted to fill the wells in the glass slides level to the top. The slides were then placed in a sealed chamber with a UV-transparent borosilicate glass lid, which was purged with nitrogen and exposed to 1.5 J of shortwave UV (λ_max = 350nm) radiation to initiate crosslinking.

1 µL 12.5% ^w/_v E. coli OP50 (resuspended in M9) was pipetted onto each NGM-PEG pad, and individual eggs at the pre-hatch “pretzel” stage of development were transferred with an eyelash pick. Liquid polydimethylsiloxane (PDMS; Sylgard 184, Dow-Corning; Midland MI) was mixed 1∶10 with its crosslinking agent, de-gassed for 20 min. under vacuum, and pipetted atop the slide assemblies, which were then placed in 10 cm diameter polystyrene Petri dishes alongside small dH₂O-saturated cotton strips (to prevent desiccation), sealed with parafilm, and stored at 23°C. PDMS polymerizes after approximately 12 hours in these conditions.

Most eggs hatch within 5 hours of slide preparation and reach their full adult size approximately 50 hours later. Ages reported are hours and days post slide preparation. These values are within the described range for this temperature [63], suggesting that our culture apparatus is substantially similar to standard conditions. Further, our observed mean lifespan of 10.7 days at 23°C (Figure 1B) is similar to our own measurements of spe-9(hc88) animals picked as pretzel-stage eggs onto standard NGM plates seeded with OP50 (mean lifespan = 9.5 days at 24°C, n = 350), and measurements of wild-type animals on solid media reported by others [6], [17]. However, we observe a somewhat smaller standard deviation in lifespan of ≈1.9 days vs. the 3–4 in those previous studies, suggesting that this culture apparatus provides an extremely uniform environment.

For the daf-16 epistasis analysis, we modified the above protocol to allow for chemical sterilization of young adult animals by 5-fluoro-2′-deoxyuridine (FUDR; Sigma-Aldrich; St. Louis, MO). Specifically, FUDR from a 10 mg/mL aqueous stock was added at 1∶100 to PEG-NGM prior to filter-sterilization, which was polymerized in the glass slides as above. FUDR causes growth arrest of animals prior to the 4^th larval stage, so synchronized young adult animals were produced by hypochlorite treatment of gravid adults to isolate eggs [42] followed by overnight starvation in M9 buffer to synchronize animals as L1s, which were then plated on standard NGM-agar plates with OP50 food and allowed to grow to young adulthood at 23°C. These animals were transferred individually to PEG-NGM-FUDR slides supplemented with concentrated OP50 as above. Moving animals crawl into polymerizing PDMS, so the slides were sealed with 0.5mm-thick strips of PDMS that had been pre-cured on a glass plate at 100°C for one hour and cut to size. We observed an increase in longevity under these conditions relative to non-FUDR treated animals plated as embryos (Figure S5A).

Strains

The following C. elegans strains provided by Caenorhabditis Genetics Center (CGC) were used in our studies: VT2084 (mir-71::GFP), VT1607 (mir-246::GFP), and PD4793 (mIs10: myo-2::GFP; pes-10::GFP; F22B7.9::GFP). mir-239::GFP was generated previously [36]. All strains were crossed into BA671, a spe-9(hc88) temperature-sensitive fertilization-deficient mutant, and assays were conducted at the restrictive temperature of 23°C. This strain has normal longevity at this temperature [42]. The small fraction of animals that reproduced in the culture apparatus and could not be clearly distinguished from their offspring was excluded from further analysis.

We determined that VT2084 actually contains the complete precursor miRNA sequence for miR-71 inside the “promoter” region driving transgenic GFP expression; therefore we wished to determine whether this strain overexpresses miR-71, which was previously shown to increase longevity [36]. Overexpression of miR-71 was not likely because the transgene in strain VT2084 was integrated via low-copy bombardment; however we confirmed via quantitative RT-PCR analysis (as performed previously [36]) that mature miR-71 levels in mir-71::GFP; spe-9(hc88) animals reared at 23°C (as similar as possible to our own culture conditions) showed minimal overexpression. Specifically, synchronized populations of spe-9(hc88) animals and mir-71::GFP; spe-9(hc88) animals were prepared by hypochlorite treatment and overnight starvation in M9, reared on NGM plates seeded with OP50 at 23°C, and harvested at 5 days post-plating, corresponding to the time of peak mir-71::GFP expression at young adulthood. Levels of miR-71 in mir-71::GFP; spe-9(hc88) animals were 102% or 115% that of spe-9(hc88) animals, depending on whether the U18 RNA or miR-66 (which are not temporally regulated), respectively, was used as a loading control. Mature miR-71 levels were also measured in homozygotic mir-71 null (n4115); mir-71::GFP animals prepared similarly; here, miR-71 expression was approximately 58% or 25% that of wild-type (using the U18 or miR-66 control, respectively). (The n4115 strain without the mir-71::GFP transgene had undetectable miR-71 expression.) Further, we see no phenotypic consequence of the extra copies of the miR-71 sequence in VT2084: the lifespan of mir-71::GFP; spe-9 animals in our apparatus is approximately 1.05 times that of the mean lifespan of the other strains analyzed, which is well within the range of inter-replicate variability. Thus, we conclude that for the purposes of this work, mir-71::GFP acts as a phenotypically wild-type reporter of miR-71 expression.

Image Acquisition

We calibrated our microscope daily to control for spatial and temporal variation in light-source intensity, as described previously [36]. At the desired sample interval (typically daily), each slide was briefly removed from its humid chamber and placed in an upright microscope (Axioplan 2i; Carl Zeiss; Oberkochen, Germany), driven by custom software, for acquisition of brightfield and fluorescent images at 10× magnification. Per-slide acquisition time was typically under 20 minutes. Each animal was manually located and brought into focus, and a series of 10ms-exposure brightfield images were acquired, interleaved with fluorescence exposures of 1, 10, and 100 ms, in order to ensure that a properly exposed image was obtained. This sequence was performed for each filter-set of interest; in this case a GFP-bandpass filter to measure transgene expression (41017; Chroma; Bellows Falls, VT) and a TRITC filter to measure autofluorescence (41002c; Chroma).

While the peak autofluorescence of lipofuscin, a chief age pigment, is in the blue range [19], blue light evokes a strong escape response [64], which is problematic as the animals typically leave the field of view rapidly thereafter. As the green range was used for GFP measurements, we compromised and measured age pigment species that autofluoresce in the red range. Like lipofuscin, we observed these species in gut granules [16] and also in larger gonadal inclusions (Figure S4). Further, we established that our measurements of GFP intensity were not biased by light emitted from the autofluorescent species that also fluoresce in the same wavelengths. We found that the relative degree of autofluorescence was quite low (<10% relative intensity) compared to the GFP signals measured, even in aged animals; correcting for this bleed-through did not alter any findings. Further, the 95^th-percentile measurements of image intensity we used are less sensitive to low-intensity autofluorescence signals as compared to measures of mean image intensity (for example).

After fluorescence image acquisition, each animal was further stimulated with an 0.25-second pulse of green light, which stimulates a robust escape response in healthy animals and causes head and/or tail retraction in more decrepit individuals. After a 1-second delay, three images were subsequently recorded at 1-second intervals to measure post-stimulation movement.

Image Analysis

Post-stimulation motion image sequences were visually scrutinized to determine if voluntary “twitching” occurred. Animals with no detectable motion were determined to have died in the interval between the current and previous image acquisition. (Assuming a Bayesian null hypothesis of a uniform prior distribution over time-of-death within the sample interval, the expectation value for the actual, unknown, time of death is halfway between the current and previous acquisition time. We took the number of hours between slide preparation and this time-of-death estimate as the lifespan.) For each timepoint, the non-overexposed fluorescent image with the longest exposure time was selected, subject to manual review to ensure that the animal's locomotion did not cause unacceptable blur. As each fluorescent image was acquired with flanking brightfield images, the brightfield image in which the animal's position is closest to that in the fluorescent image was selected automatically as that with maximum mutual information with the fluorescent image [65], again subject to manual review.

Once the best brightfield/fluorescence image pairs were defined, the outline of the animal in each brightfield image was determined using custom semi-automated software; the position was assumed to be the same in the fluorescent image. The nematode-finding procedure was as follows: based on a training set of labeled nematode/non-nematode regions of brightfield images, a logistic regression classifier was trained to estimate the probability that a given patch of pixels is inside of an animal [66]. The classifier was applied to each brightfield image to create a rough mask, which was distance-transformed to produce “valleys” of low values along the midlines of masked regions. Based on user-input head and tail points, a least-cost path through this distance-transformed mask was calculated using Dijkstra's algorithm. This centerline was manually modified as necessary, and the left and right flanks of each animal determined based on average size for its age, with manual modifications.

Given each animal's centerline and outline, it is trivial to straighten the image and to warp the size and shape of any given animal to a standardized “unit worm” defined by the average size and shape of all animals of a given age. The PCA fluorescence measurements described below were made on standardized, warped images; all other measurements were made from the original images, within the animal's boundaries as defined. Further, given the head-to-tail centerline, we defined the “head” region as the initial 20% of the animal.

Image Measurements

Fluorescence measurements were made on images corrected for background camera noise (dark field), spatial illumination inhomogeneities (flat field) and temporal variation in illumination via reference fluorescent beads [36]; after correction, intensity values were divided by the exposure time to render all images comparable. Pixel values within the defined whole-animal or head regions were extracted and summary statistics (such as 95^th percentile of intensity) made. These raw measurements are available as Table S3, while per-animal summary statistics (days 3–7 slope and mean, etc.) are provided in Table S4.

Four measures of motion were made within the defined animal region: the fraction of pixels changing relative intensity by more than 18% between the different brightfield images acquired, and the average pixel-wise coefficient of variation across these images, both before and after green-light stimulation. ν-support-vector regression (SVR) [67] using an RBF kernel was then used to map these four parameters to the number of days of life remaining, using LIBSVM [68]. Parameters were selected using 10-fold cross-validation on a subset of the input data: C = 10, ν = 0.8, γ = 0.3, though performance was roughly equivalent across several decades of C and γ values, 0.2<ν<0.9. Thereafter we used 100-fold cross-validation on our dataset of 4318 motion-statistics/days-remaining data points and made predictions for each data point without “peeking” by training the SVR on the days-remaining figure for that data point. The predicted “days of life remaining” based on the four measures of motion was used as our aggregate motion score.

For simplicity, image texture features were calculated directly from pixel intensity patterns [69], though “filter-bank” methods have also been employed on nematode images [44], [45]. First, age-specific texture patterns (“textons”) were determined. Brightfield images (acquired through the GFP filterset only) were grouped by age: 3 -⁠ and 4-day-old, 5 -⁠ and 6-day-old, up to 15 -⁠ and 16-day-old. For each group of images, 500000 17×17-pixel patches (within the defined animal outlines) were randomly sampled, after which k-means classification was performed to yield 30 representative textons for each age group (210 overall). Next, the texture of each animal was characterized as follows: for each 17×17-pixel patch falling within the defined brightfield image region, the closest texton (in terms of Euclidian distance) of the 210 overall was determined. The “texture signature” of a given image was defined as the 210-element histogram containing the number of closest-matching 17×17-pixel patches for each texton, divided by the total number of patches in that image. These signatures were then used as input to a support vector regression procedure precisely as described above (best parameters: C = 5, ν = 0.6, γ = 0.004; again performance was relatively insensitive to parameter setting). Texture-based predictions of “days of life remaining” were used as texture-decrepitude scores.

Principal components analysis was performed on fluorescent images, from day 3 onward, warped to unit size and shape. However, as C. elegans stretch and compress as they move, and due to inter-individual anatomical variation as well as variation in animal-outline-finding, warping images based on the outline alone does not cause anatomical features to come into precise register across every animal. We therefore manually defined the position of the vulva on each brightfield image and used that position to initialize a mild nonlinear warping procedure, which longitudinally stretches and compresses the image using five evenly spaced control points. Given an image and a reference, hill-climbing optimization was then used to find the position of the control points that maximized the correlation coefficient between the image and reference pixels, with a penalty for large deformations. The mir-71::GFP images were mutually aligned using the expectation-maximization algorithm as follows: the mean image across the population was calculated (expectation step), then each image was warped to match the mean (maximization step). These steps were alternated until convergence; typically three iterations sufficed. Results were then manually inspected to ensure face validity. Finally, the mean pixel intensity of each image was subtracted away so that inter-image variability was due only to the distribution of pixel intensities, and not to overall changes in mean brightness. After this procedure, the principal components analysis was performed on the images, and PCA scores along each component were calculated for each animal at each timepoint.

Statistical Analysis

Estimates of the underlying distribution of sampled data (length, lifespans) were performed with Gaussian kernel density estimation, using the Scott's rule-of-thumb to choose the kernel variance (i.e. bandwidth): σ²n^−0.2 where σ² is the sample variance and n is the sample size [70]. Pairs of lifespan distributions were tested for equality using the two-tailed Kolmogorov–Smirnov test.

Single and multivariate regression of biomarkers versus lifespan was conducted with ordinary least-squares regression, with the coefficient of determination (R²) calculated according to the standard formula. Note that in the univariate case, this is equivalent to the squared Pearson product-moment correlation coefficient (r) between the biomarker and lifespan. Significance of correlations was measured with an F-test of the R² value: the statistic σ²_model/σ²_error has an F distribution with (df_model, df_error) degrees of freedom, where σ²_model and σ²_error are the model and error components of the overall variance, respectively, df_model = p, df_error = n-p-1, p is the number of fit parameters, and n is the number of observations. As σ²_model is the sum of squared distances between the predicted values and the mean value, divided by df_model, and σ²_error is the sum of squared residuals divided by df_error, simple algebra on the definition of the R² value yields F = R² df_error/([1-R²] df_model).

Leave-one-out R² values were calculated as follows: given one or more biomarker values for a set of individuals, an ordinary least-squares regression model to predict lifespan from these values was estimated based on the data from each individual save one; then the lifespan of that individual was predicted using that model. This was repeated for each individual. The R² value was calculated from the residuals of the leave-one-out predictions according to the standard formula.

Partial correlation networks (Figure 6B and Figure S6) were computed using TETRAD IV [71], using the “PC” search algorithm with the multiple-regression independence test and the α threshold for the test set to 0.001. Arrows of direction of the influences were discarded, as these inferences were not robust across independence tests or α values; however the basic network structure was robust.

Lifespan-predictive values versus age (Figure 6C) were calculated as follows. For each age n, from 2–7 days, all levels measured for a particular marker from day 2 to n were considered. (That is, for n = 2, only the day-2 value was used; for n = 4, the values at days 2, 3, and 4 were used.) The values under consideration, and all of their pairwise multiples (“interaction terms”, so that rates of change can be incorporated) were then used to construct a multivariate regression model to measure how well eventual longevity can be predicted using only data up to age n. In order to prevent the changing number of parameters over time from affecting the R² values, and in particular to avoid over-fitting later timepoints due to large number of parameters and interaction terms, we used ridge regression, with a penalty term automatically chosen to minimize the generalized cross-validation error [72].

Supporting Information

Zdroje

1. GögeleMPattaroCFuchsbergerCMinelliCPramstallerPP 2010 Heritability Analysis of Life Span in a Semi-isolated Population Followed Across Four Centuries Reveals the Presence of Pleiotropy Between Life Span and Reproduction. J Gerontol A Biol Sci Med Sci 705 1 12

2. HerskindAMMcGueMHolmNVSørensenTIHarvaldB 1996 The heritability of human longevity: a population-based study of 2872 Danish twin pairs born 1870–1900. Hum Genet 97 319 323

3. HoriuchiS 2003 Interspecies Differences in the Life Span Distribution: Humans versus Invertebrates. Pop Dev Review 29 127 151

4. VaupelJWCareyJRChristensenKJohnsonTEYashinAI 1998 Biodemographic trajectories of longevity. Science 280 855 860

5. BrooksALithgowGJJohnsonTE 1994 Mortality rates in a genetically heterogeneous population of Caenorhabditis elegans. Science 263 668 671

6. SzewczykNJUdranszkyIAKozakESungaJKimSK 2006 Delayed development and lifespan extension as features of metabolic lifestyle alteration in C. elegans under dietary restriction. J Exp Biol 209 4129 4139

7. KenyonCJ 2010 The genetics of ageing. Nature 464 504 512

8. KirkwoodTBLFederMFinchCEFranceschiCGlobersonA 2005 What accounts for the wide variation in life span of genetically identical organisms reared in a constant environment? Mech Ageing Dev 126 439 443

9. BakerGTSprottRL 1988 Biomarkers of aging. Exp Gerontol 23 223 239

10. PincusZSlackFJ 2010 Developmental biomarkers of aging in Caenorhabditis elegans. Dev Dyn 239 1306 1314

11. HerndonLASchmeissnerPJDudaronekJMBrownPAListnerKM 2002 Stochastic and genetic factors influence tissue-specific decline in ageing C. elegans. Nature 419 808 814

12. HsuA-LFengZHsiehM-YXuXZS 2009 Identification by machine vision of the rate of motor activity decline as a lifespan predictor in C. elegans. Neurobiol Aging 30 1498 1503

13. HuangCXiongCKornfeldK 2004 Measurements of age-related changes of physiological processes that predict lifespan of Caenorhabditis elegans. Proc Natl Acad Sci USA 101 8084 8089

14. HulmeSEShevkoplyasSSMcGuiganAPApfeldJFontanaW 2010 Lifespan-on-a-chip: microfluidic chambers for performing lifelong observation of C. elegans. Lab on a Chip 10 589 597

15. TermanABrunkUT 2006 Oxidative stress, accumulation of biological “garbage”, and aging. Antioxidants & Redox Signaling 8 197 204

16. ClokeyGVJacobsonLA 1986 The autofluorescent “lipofuscin granules” in the intestinal cells of Caenorhabditis elegans are secondary lysosomes. Mech Ageing Dev 35 79 94

17. KlassMR 1977 Aging in the nematode Caenorhabditis elegans: major biological and environmental factors influencing life span. Mech Ageing Dev 6 413 429

18. GariganDHsuA-LFraserAGKamathRSAhringerJ 2002 Genetic analysis of tissue aging in Caenorhabditis elegans: a role for heat-shock factor and bacterial proliferation. Genetics 161 1101 1112

19. GerstbreinBStamatasGKolliasNDriscollM 2005 In vivo spectrofluorimetry reveals endogenous biomarkers that report healthspan and dietary restriction in Caenorhabditis elegans. Aging Cell 4 127 137

20. Sánchez-BlancoAKimSK 2011 Variable Pathogenicity Determines Individual Lifespan in Caenorhabditis elegans. PLoS Genet 7 e1002047 doi:10.1371/journal.pgen.1002047

21. HouthoofdKBraeckmanBPLenaertsIBrysKMatthijssensF 2005 DAF-2 pathway mutations and food restriction in aging Caenorhabditis elegans differentially affect metabolism. Neurobiol Aging 26 689 696

22. LundJTedescoPDukeKWangJKimSK 2002 Transcriptional profile of aging in C. elegans. Curr Biol 12 1566 1573

23. GoldenTRHubbardADandoCHerrenMAMelovS 2008 Age-related behaviors have distinct transcriptional profiles in Caenorhabditis elegans. Aging Cell 7 850 865

24. FortneyKKotlyarMJurisicaI 2010 Inferring the functions of longevity genes with modular subnetwork biomarkers of Caenorhabditis elegans aging. Genome Biol 11 R13

25. ReaSLWuDCypserJRVaupelJWJohnsonTE 2005 A stress-sensitive reporter predicts longevity in isogenic populations of Caenorhabditis elegans. Nat Genet 37 894 898

26. CypserJRJohnsonTE 2002 Multiple stressors in Caenorhabditis elegans induce stress hormesis and extended longevity. J Gerontol A Biol Sci Med Sci 57 B109 114

27. HornsteinEShomronN 2006 Canalization of development by microRNAs. Nat Genet 38 S20 24

28. LiXCassidyJJReinkeCAFischboeckSCarthewRW 2009 A microRNA imparts robustness against environmental fluctuation during development. Cell 137 273 282

29. StarkABrenneckeJBushatiNRussellRBCohenSM 2005 Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3′ UTR evolution. Cell 123 1133 1146

30. Ibáñez-VentosoCDriscollM 2009 MicroRNAs in C. elegans Aging: Molecular Insurance for Robustness? Curr Genomics 10 144 153

31. BoehmMSlackFJ 2005 A developmental timing microRNA and its target regulate life span in C. elegans. Science 310 1954 1957

32. CrawfordDLibinaNKenyonC 2007 Caenorhabditis elegans integrates food and reproductive signals in lifespan determination. Aging Cell 6 715 721

33. DillinACrawfordDKKenyonC 2002 Timing requirements for insulin/IGF-1 signaling in C. elegans. Science 298 830 834

34. LinKDormanJBRodanAKenyonC 1997 daf-16: An HNF-3/forkhead family member that can function to double the life-span of Caenorhabditis elegans. Science 278 1319 1322

35. Ibáñez-VentosoCYangMGuoSRobinsHPadgettRW 2006 Modulated microRNA expression during adult lifespan in Caenorhabditis elegans. Aging Cell 5 235 246

36. de LencastreAPincusZZhouKKatoMLeeSS 2010 MicroRNAs both promote and antagonize longevity in C. elegans. Curr Biol 20 2159 2168

37. YanaiharaNCaplenNBowmanESeikeMKumamotoK 2006 Unique microRNA molecular profiles in lung cancer diagnosis and prognosis. Cancer Cell 9 189 198

38. VoliniaSCalinGALiuC-GAmbsSCimminoA 2006 A microRNA expression signature of human solid tumors defines cancer gene targets. Proc Natl Acad Sci USA 103 2257 2261

39. SchetterAJLeungSYSohnJJZanettiKABowmanED 2008 MicroRNA expression profiles associated with prognosis and therapeutic outcome in colon adenocarcinoma. JAMA 299 425 436

40. WangYLiangYLuQ 2008 MicroRNA epigenetic alterations: predicting biomarkers and therapeutic targets in human diseases. Clin Genet 74 307 315

41. Noren HootenNAbdelmohsenKGorospeMEjioguNZondermanAB 2010 microRNA expression patterns reveal differential expression of target genes with age. PLoS ONE 5 e10724 doi:10.1371/journal.pone.0010724

42. FabianTJJohnsonTE 1994 Production of age-synchronous mass cultures of Caenorhabditis elegans. J Gerontol A Biol Sci Med Sci 49 B145 156

43. DavisBOAndersonGLDusenberyDB 1982 Total luminescence spectroscopy of fluorescence changes during aging in Caenorhabditis elegans. Biochemistry 21 4089 4095

44. ShamirLWolkowCAGoldbergIG 2009 Quantitative measurement of aging using image texture entropy. Bioinformatics 25 3060 3063

45. JohnstonJIserWBChowDKGoldbergIGWolkowCA 2008 Quantitative image analysis reveals distinct structural transitions during aging in Caenorhabditis elegans tissues. PLoS ONE 3 e2821 doi:10.1371/journal.pone.0002821

46. MiskaEAAlvarez-SaavedraEAbbottALLauNCHellmanAB 2007 Most Caenorhabditis elegans microRNAs are individually not essential for development or viability. PLoS Genet 3 e215 doi:10.1371/journal.pgen.0030215

47. MartinezNJOwMCReece-HoyesJSBarrasaMIAmbrosVR 2008 Genome-scale spatiotemporal analysis of Caenorhabditis elegans microRNA promoter activity. Genome Res 18 2005 2015

48. IsikMKorswagenHCBerezikovE 2010 Expression patterns of intronic microRNAs in Caenorhabditis elegans. Silence 1 5

49. SeewaldAKCypserJMendenhallAJohnsonT 2010 Quantifying phenotypic variation in isogenic Caenorhabditis elegans expressing Phsp-16.2::gfp by clustering 2D expression patterns. PLoS ONE 5 e11426 doi:10.1371/journal.pone.0011426

50. MitchellDHStilesJWSantelliJSanadiDR 1979 Synchronous growth and aging of Caenorhabditis elegans in the presence of fluorodeoxyuridine. J Gerontol 34 28 36

51. SpirtesPGlymourC 1991 An algorithm for fast recovery of sparse causal graphs. Social Science Computer Review 9 62 72

52. YashinAIVaupelJWIachineIA 1994 A duality in aging: the equivalence of mortality models based on radically different concepts. Mech Ageing Dev 74 1 14

53. YashinAICypserJWJohnsonTEMichalskiAIBoykoSI 2002 Heat shock changes the heterogeneity distribution in populations of Caenorhabditis elegans: does it tell us anything about the biological mechanism of stress response? J Gerontol A Biol Sci Med Sci 57 B83 92

54. WuDReaSLYashinAIJohnsonTE 2006 Visualizing hidden heterogeneity in isogenic populations of C. elegans. Exp Gerontol 41 261 270

55. PiperMDWSelmanCMcElweeJJPartridgeL 2008 Separating cause from effect: how does insulin/IGF signalling control lifespan in worms, flies and mice? J Intern Med 263 179 191

56. MurphyCTMcCarrollSABargmannCIFraserAKamathRS 2003 Genes that act downstream of DAF-16 to influence the lifespan of Caenorhabditis elegans. Nature 424 277 283

57. SamuelsonAVCarrCERuvkunG 2007 Gene activities that mediate increased life span of C. elegans insulin-like signaling mutants. Genes Dev 21 2976 2994

58. OlsenAVantipalliMCLithgowGJ 2006 Checkpoint proteins control survival of the postmitotic cells in Caenorhabditis elegans. Science 312 1381 1385

59. YamawakiTMArantes-OliveiraNBermanJRZhangPKenyonC 2008 Distinct activities of the germline and somatic reproductive tissues in the regulation of Caenorhabditis elegans' longevity. Genetics 178 513 526

60. ClarkWR 2004 Reflections on an unsolved problem of biology: the evolution of senescence and death. Adv Gerontol 14 7 20

61. PfisterPMWendlandtMNeuenschwanderPSuterUW 2007 Surface-textured PEG-based hydrogels with adjustable elasticity: Synthesis and characterization. Biomaterials 28 567 575

62. BrennerS 1974 The genetics of Caenorhabditis elegans. Genetics 77 71 94

63. ByerlyLCassadaRCRussellRL 1976 The life cycle of the nematode Caenorhabditis elegans. I. Wild-type growth and reproduction. Dev Biol 51 23 33

64. EdwardsSCharlieNMilfortMBrownBGravlinC 2008 A Novel Molecular Solution for Ultraviolet Light Detection in Caenorhabditis elegans. PLoS Biol 6 e198 doi:10.1371/journal.pbio.0060198

65. WellsWMViolaPAtsumiHNakajimaSKikinisR 1996 Multi-modal volume registration by maximization of mutual information. Med Image Analysis 1 35 51

66. KimKIJungKParkSHKimHJ 2002 Support vector machines for texture classification. IEEE Trans PAMI 24 1542 1550

67. SchölkopfBSmolaAWilliamsonRCBartlettPL 2000 New support vector algorithms. Neural Computation 12 1207 1245

68. ChangC-CLinC-J 2010 LIBSVM: a library for support vector machines. http://www.csie.ntu.edu.tw/~cjlin/libsvm

69. VarmaMZissermanA 2003 Texture classification: are filter banks necessary? Proc IEEE Conf Computer Vision and Pattern Recognition 2 691 698

70. ScottD 1992 Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley & Sons, New York, Chichester

71. ScheinesRSpirtesPGlymourCMeekC 2010 The TETRAD Project: Causal Models and Statistical Data. http://www.phil.cmu.edu/projects/tetrad/

72. OrrM 1995 Regularization in the selection of radial basis function centers. Neural Computation 7 606 623