TL;DR: Most content libraries do not decay because the articles got worse. They decay because the query intent shifted, a SERP feature ate the click, a better-resourced competitor published a deeper version, or an algorithm update reweighted the signals the page was originally optimised for. Refresh decisions made on calendar age are noisy and waste editorial capacity on pages that would have recovered anyway and on pages that will not recover regardless. The operating discipline that works is a marginal-lift framework: score each candidate by the expected post-refresh delta against the realistic cost of editing it, and stack-rank.
A note on tools and brands. Backlinko, Animalz, SimilarWeb, Sistrix, Ahrefs, Semrush, and Search Engine Land appear throughout this essay as the available public-research sources. Brian Dean, Tom Critchlow, John Mueller, Lily Ray, and Aleyda Solis appear as named practitioners whose public work informs the discussion. Quantitative claims framed as advisory-engagement observation come from anonymized partner operators, not from the named publications or analysts. Public claims are attributed inline.
What People Mean by Content Decay
The phrase "content decay" carries more freight than it should. In the loose practitioner usage, it describes any organic traffic decline on a page that was previously performing well. In the stricter usage that this essay will use throughout, it describes a structured pattern: a page reaches a traffic plateau, performs steadily for some period, and then enters a sustained downtrend that is not explained by overall site traffic change, seasonality, or a tracking break.
The pattern matters because the right response depends on the cause. Animalz's 2020 study of their own content portfolio, one of the most-cited public datasets on the subject, classified decay into traffic-driven (the page itself lost rankings on its core terms), feature-driven (the page kept its rankings but the SERP added features that took the click), and intent-driven (the underlying query population shifted to a different content shape). The categories are not mutually exclusive, but they imply different remediation strategies. A page suffering pure feature-driven decay does not need to be rewritten. A page suffering intent-driven decay almost certainly does, sometimes from scratch.
The published research and our advisory experience converge on a few baseline facts. The median content piece on an established editorial site peaks somewhere between months 6 and 18, plateaus for some period, and then enters a slow decline that compounds at a few percent per quarter. The Backlinko studies place the typical "honeymoon period" for new content at roughly three weeks of accelerated growth followed by a slower trajectory, with year-over-year decay rates clustering in the 5 to 30 percent range for the average mid-tail page after the third year. SimilarWeb's content-aging analyses report compatible patterns across the editorial-publisher segment, with sharper decline on news-shaped content and slower decline on evergreen explanatory content.
The Four Mechanisms of Decay
Identifying the mechanism is the precondition for any sensible refresh decision. The four mechanisms below cover the large majority of cases in our partner data.
The first mechanism is algorithm-update reweighting. Google's core updates and the helpful-content updates of 2022 to 2025 have repeatedly reweighted the signals that determine which pages rank for which queries. Lily Ray's documentation of HCU-affected sites in 2023 to 2024 catalogued patterns where pages that previously ranked at positions 1 to 3 on broad informational queries dropped to positions 12 to 30 essentially overnight, on no detectable content change. The cause was algorithm-side: Google decided that the signals the page had accumulated no longer carried the weight they once did, often because the page's site-wide signals (topical concentration, perceived expertise, link-economy patterns) had been reassessed. Pages affected by this mechanism rarely recover from edits to the page itself. They recover, if they recover, from site-wide signal rebuilds.
The second mechanism is query intent drift. The query population that drove traffic to the page in 2022 may not be the query population searching the same string in 2025. Google's intent-classification model has become more confident over time at recognising commercial, transactional, informational, navigational, and visual intents, and it has become more willing to serve different page shapes for the same query string as the modal intent shifts. A page written as a long explanatory essay loses traffic when the modal intent for the query shifts toward a tool, a calculator, a comparison table, or a step-by-step procedure. The string in the URL is the same; the click-through population has changed.
The third mechanism is SERP feature expansion. Google has steadily added AI Overviews, expanded People Also Ask, surfaced more video carousels, given local packs and shopping panels more prominent placement, and packed the above-the-fold real estate with non-organic destinations. The same organic position now sits further down the visible page and is more often pre-empted by a feature that satisfies the query. SparkToro's 2024 zero-click study with Datos found that in the United States, only about 360 out of every 1,000 Google searches resulted in a click to the open web, with the remainder ending in zero clicks or in clicks to Google's own properties. The feature mechanism affects pages without affecting their content quality.
The fourth mechanism is competitor publishing. The number of competent operators publishing on any given topic has gone up. A page that earned position 2 in 2021 against four mediocre competitors faces, in 2025, eight to twelve competent competitors, several of whom have invested more in depth, format, or author authority. Decay caused by competitor publishing is structural: the page did not get worse, but the market got better. The refresh question becomes whether incremental investment can re-establish a competitive advantage at the new bar, and the answer depends on whether the operator has the resources to clear that bar.
The Four Decay Mechanisms and Their Refresh Implications
| Mechanism | Diagnostic Signal | What Refresh Does | When It Works |
|---|---|---|---|
| Algorithm reweighting (HCU, core updates) | Sharp position drop on known update date, site-wide pattern across many URLs | Minimal direct lift from per-page edits; site-wide signals dominate | Rarely from refresh alone; almost always requires broader site-wide work |
| Query intent drift | Sustained gradual position decline; SERP composition for the query has shifted toward a different content shape | Format rewrite (essay to tool, prose to comparison table) can recover; same-format refresh rarely does | When intent shift is captured by changing format, not just updating numbers |
| SERP feature expansion | Position holds or improves but clicks decline; impression-to-click ratio drops on Search Console | Body edits do not recover the click; only structural answer-block work helps reclaim snippet share | Only when the page can win a SERP feature back from the destination that took the click |
| Competitor publishing | Position decline with no algorithm event; top-3 SERP increasingly held by higher-investment pages | Substantive expansion, depth, original data, or new format can recover | When operator has the editorial budget and topical authority to clear the new bar |
The diagnostic distinction is what separates refresh strategies that compound from refresh strategies that consume editorial capacity for no return. The framework that follows treats mechanism identification as the upstream decision.
What the Public Research Actually Shows
The most-cited public studies on content decay agree on the directional pattern and disagree on the magnitude. The Animalz 2020 study, conducted across their own portfolio and a sample of client sites, reported that the median article in their dataset lost about half of its monthly organic traffic over the two years following its peak, with a substantial subset losing 80 percent or more. The Backlinko hub on content refresh has historically reported peak-to-trough declines in similar ranges across the case studies Brian Dean has published. The SimilarWeb 2023 publisher report identified content aging as a structural drag on traffic across the major publishing categories, with news content decaying fastest and reference content slowest.
The Sistrix studies on visibility decay (drawing on the Sistrix Visibility Index, the toolkit's domain-level estimated visibility metric) have repeatedly documented the long-tail of decay: most pages lose traffic gradually, a small fraction lose traffic catastrophically, and an even smaller fraction sustain or grow. The shape of the distribution is fat-tailed, which means that aggregate-level statistics understate how much variation exists at the individual-URL level.
The published case studies on refresh impact are more cautiously stated, because the counterfactual is hard to construct. Brian Dean's classic Backlinko refresh case studies (the Skyscraper Technique posts and the various "increased traffic by X percent" anecdotes) reported large lifts on individual pieces but did not control for what the same pages would have done absent the refresh. The methodological honest answer is that any single-page case study is anecdote, not evidence. The systematic evidence is weaker, and the systematic evidence suggests that refresh works on a subset of pages and produces noise on the rest.
In our advisory partner data, refresh interventions produce a measurable post-refresh lift (operationalised as the four-week-average post versus the four-week-average immediately pre-refresh, controlling for site-wide traffic trend) on roughly 40 to 55 percent of attempted refreshes, with the remainder producing flat or negative results. The win rate is conditional on the diagnostic step: among refreshes triggered by an identified content gap or freshness issue, the win rate rises to 60 to 75 percent. Among refreshes triggered by calendar age alone (older than 18 months, regardless of decay diagnosis), the win rate falls to 25 to 35 percent. Diagnosis matters more than effort.
The methodological gap in the published refresh literature is that almost no study constructs a proper counterfactual. The standard public case study reports a percentage lift after a refresh without an attempt to estimate what the page would have done in the absence of the refresh. In a content category with high baseline volatility (year-over-year traffic swings of plus or minus 20 percent are common at the page level), an uncontrolled before-after comparison is essentially a noise reading. The few studies that have attempted controls (Animalz's matched-pair analyses among them) report tighter lift estimates than the headline anecdotes suggest, with median lifts often in the 15 to 35 percent range rather than the 100 to 300 percent range that single-page case studies tend to report.
The other systematic issue in the public literature is survivorship bias. Operators who write up refresh case studies almost always write up the refreshes that worked. The refreshes that produced no lift or a negative lift do not become blog posts. The published distribution of refresh outcomes is therefore right-skewed in a way that overstates the typical practitioner experience. The candid in-house data we see across partner engagements looks much more like a flat distribution with a positive mean than like a heavy right tail of dramatic wins.
The chart understates the operating point. The win rate is one number; the magnitude of the win matters more. Pages refreshed for the right reason tend to recover 30 to 80 percent of their lost traffic; pages refreshed for the wrong reason tend to recover nothing and to consume editorial hours that could have been spent on net-new publishing or on better-targeted refreshes.
The Marginal-Lift Framework
The diagnostic distinction translates into an operating framework. Rather than triaging refresh candidates by calendar age or by absolute traffic decline, the framework triages them by expected marginal lift per editorial hour invested. The four inputs are: the realistic upper bound of recoverable traffic, the probability that a refresh will recover a meaningful fraction of it, the editorial cost in hours, and the strategic value of the page beyond its raw traffic.
Step one is the upper-bound estimate. A page that was earning 4,000 monthly sessions at peak and is now earning 1,800 has a recoverable upper bound of approximately 2,200 sessions per month, assuming a successful refresh restores it to the prior peak. The upper bound is rarely fully realised; partner-data refreshes that succeed typically recover 30 to 80 percent of the gap, with the median around 50 percent.
Step two is the recovery probability. This is the diagnostic step. A page where the decay is consistent with an identified content gap (a missing section, an outdated statistic, an under-developed sub-topic that competitors now address) gets a high probability of recovery. A page where the decay is consistent with algorithm reweighting affecting the entire site gets a very low probability. A page where the decay is feature-driven (the SERP added an AI Overview that answers the query directly) gets a recovery probability conditional on whether the page can win the snippet back.
Step three is the editorial cost. The cost ranges from one to two hours for a light freshness pass (updating dates, statistics, a few sentences of new context) up to twenty to forty hours for a substantive rewrite (new research, new sections, new examples, new structural format). A refresh that requires the higher end of that range is competing against publishing a wholly new piece on a different topic for the same editorial budget.
Step four is the strategic value adjustment. A page that ranks for a money-keyword (a term that converts directly or that drives qualified pipeline) is worth more than a page that ranks for an informational keyword regardless of raw traffic. A page that is the entry point for a topical cluster (the hub page that 12 other articles link to and that anchors the cluster's internal link graph) is worth more than a leaf page of equivalent traffic. Strategic value is a multiplier on the raw lift calculation.
The composite score is roughly: expected lift = (upper bound) times (recovery probability) times (strategic multiplier), divided by editorial hours. Stack-rank candidates by composite score, take the top quartile, leave the bottom half alone.
The scatter shape that emerges across partner libraries is consistent: a tight cluster of quick wins in the low-hours, moderate-traffic quadrant; a smaller cluster of strategic refreshes in the medium-hours, high-traffic quadrant; and a noisy distribution of low-ROI candidates that the framework filters out before any editorial work begins.
Diagnosis Before Action
The diagnostic step is the place where most refresh programmes go wrong. The default workflow at most operators is to sort a content list by some combination of "is more than 18 months old" and "lost X percent of traffic" and to send the top of the list to a writer. This produces a refresh queue that is roughly uncorrelated with refresh success.
The diagnostic workflow that we have found to work in advisory engagements has four parallel checks. The first is a Search Console query-level diff: pull the queries the page ranked for during its peak quarter and during its current quarter, identify which queries were lost, and ask whether the lost queries are still relevant or whether the query population has shifted away from the page's topic entirely. If the page used to rank for "best CRM software 2022" and the term has been replaced in user behaviour by more specific cuts ("best CRM for small SaaS teams"), the page is fighting a query population that no longer exists at meaningful volume.
The second check is the SERP composition diff: compare the current top-10 SERP for the page's primary query to the SERP composition at the time the page was at peak. If the modern SERP has an AI Overview where there was none, two video results where there were none, a People Also Ask box that satisfies most of the query, and a comparison shopping module, the page is losing clicks to feature expansion that it cannot reverse from the body text. The relevant intervention, if any, is to compete for the feature itself (answer-block restructuring for the snippet, video creation for the carousel) rather than to refresh prose that the user will not see.
The third check is the competitor-content diff: pull the current top-3 results for the page's primary query and compare them substantively to the page. Are the competitors using a different format (calculator, table, structured comparison) that better matches the query intent? Have they invested in original data, expert quotes, or visual elements that the page does not have? Are they longer, more comprehensive, more recent? If the competitor pages have moved meaningfully ahead on objective dimensions, the refresh has to match or exceed that bar to recover ranking. If matching the bar is infeasible given editorial resources, the refresh is unlikely to succeed regardless of effort.
The fourth check is the algorithm-history diff: overlay known core update and helpful-content update dates on the page's traffic curve. If the decay onset aligns sharply with an update date and the same pattern appears across many other URLs on the site, the page is suffering from algorithm reweighting and per-page refresh is a low-probability intervention. The remediation is site-wide and editorial-strategic, not URL-level.
Pre-refresh diagnostic pipeline
The pipeline takes 20 to 40 minutes per URL when run by an analyst familiar with the site. It saves 5 to 30 hours per URL that would otherwise have been spent on refreshes with low probability of success. The arithmetic favours the diagnostic step by an order of magnitude or more.
The tooling for the diagnostic is mostly free. Search Console covers the query-level diff and the impression-to-click diff. The SERP composition diff requires either a manual sampling of current SERPs against archived screenshots or a paid feature in one of the rank-tracking tools. The competitor-content diff is manual or semi-manual; the tools that promise automated competitor content analysis are useful for surface comparison but not for the substantive depth comparison that the decision actually requires. The algorithm-overlay step is supported by the published timelines from Search Engine Land, Search Engine Journal, the Sistrix update tracker, and the various aggregator dashboards that catalog known and suspected Google updates.
A practical implementation pattern that works in partner engagements is a quarterly diagnostic batch: take 60 to 100 candidate URLs (filtered by traffic-loss threshold and editorial relevance), run the four-step diagnostic on each in a structured spreadsheet, and use the spreadsheet to produce the refresh queue for the following quarter. The diagnostic spreadsheet becomes an institutional artefact that gets richer over time as the operator learns which mechanisms recur, which competitors set the bar in which clusters, and which interventions tend to work in the operator's specific competitive context.
The Refresh Playbook
Once a page is identified as a good refresh candidate, the playbook depends on which of the four mechanisms is in play. The interventions below correspond to the mechanisms identified by the diagnostic.
For pages suffering from content gap (specific sub-topics the competitors now cover that the page does not), the intervention is targeted section addition. Identify the missing sub-topics from the competitor diff, write the missing sections (typically two to four H2 blocks of 300 to 600 words each), and integrate them into the page's existing structure without disrupting the parts that are still ranking well. The editorial cost is moderate (6 to 12 hours), and the success rate in partner data is in the 60 to 75 percent range.
For pages suffering from freshness staleness (outdated statistics, deprecated tools, old screenshots, references to obsolete versions), the intervention is a freshness pass. Update the dates, the statistics, the example references, and the publication metadata; rewrite the introduction to reflect the current state of the topic; and refresh any time-stamped elements. The cost is low (1 to 3 hours), and the success rate is lower in absolute terms but higher relative to cost; the marginal lift per hour invested is often the highest of any refresh category.
For pages suffering from format mismatch (the modal intent for the query has shifted to a format the page does not adopt), the intervention is substantive restructuring. This is the most expensive category. A page written as a 3,000-word essay that needs to become an interactive comparison table, a calculator, or a step-by-step procedure is essentially a new build, with the cost ranging from 20 to 40 hours including any tooling work. The success rate is high when the new format actually matches the modal intent; the failure mode is investing in a format that the operator's design or engineering team cannot ship cleanly.
For pages suffering from SERP-feature loss (the snippet, the People Also Ask answer, or the video carousel that used to attribute to the page is now attributed elsewhere), the intervention is answer-block restructuring. Reformat the answer to fit the snippet's expected shape (40 to 60 words for paragraph snippets, clean numbered lists for list snippets, well-structured comparison tables for table snippets) and resubmit by waiting for the next crawl. The cost is low (2 to 5 hours), the success rate is in the 35 to 55 percent range, and the upside is a meaningful traffic recovery if the snippet is reclaimed.
Refresh Playbook by Mechanism (Practitioner Reference)
| Mechanism | Intervention | Editorial Hours | Success Rate | Median Recovery |
|---|---|---|---|---|
| Content gap | Targeted section addition | 6-12 | 60-75% | 40-60% of lost traffic |
| Freshness staleness | Date/stat/intro freshness pass | 1-3 | 50-65% | 20-35% of lost traffic |
| Format mismatch to intent | Substantive restructure or new build | 20-40 | 45-60% | 60-90% of lost traffic |
| SERP feature loss | Answer-block restructure for snippet recovery | 2-5 | 35-55% | 30-50% of lost traffic |
| Competitor outranks | Match-or-exceed depth and original data | 15-30 | 40-55% | 35-65% of lost traffic |
| Algorithm reweighting | Per-page refresh is low-probability; site-wide work required | n/a | 15-25% | 10-25% of lost traffic |
The playbook is intentionally specific. Generic "refresh the post" instructions produce generic outcomes. The mechanism-matched intervention is what makes the success rates in the table achievable rather than aspirational.
Retiring Content That Will Not Recover
The harder editorial decision than what to refresh is what to retire. Retirement (unpublishing, redirecting to a more relevant URL, or merging into a stronger sibling page) is unintuitive for operators who have been trained to think of every published page as an asset. In practice, content libraries accumulate liabilities as well as assets, and the liabilities consume crawl budget, dilute topical authority, and reduce average page quality in ways that affect site-wide signals.
The retirement decision applies to three categories of page. The first is pages that have lost the bulk of their traffic, have been identified through the diagnostic as belonging to a mechanism with low refresh-success probability, and have low strategic value (no conversion path, no inbound links of quality, no role in the topical cluster). The right operational move is to redirect them to the most relevant remaining page on the site, consolidating any residual link equity rather than leaving the URL to slowly decay further.
The second category is pages that compete with stronger siblings for the same query. Topic cannibalisation, where two or three pages on a site target overlapping queries and split the ranking signal, is a recurring pattern. The remediation is to identify the strongest page (by current rank, by link profile, by content quality), redirect the weaker pages to it, and consolidate the topical authority into one URL.
The third category is pages that violate current quality standards in ways that may be dragging down site-wide signals. The helpful-content update of 2022 and the subsequent core updates of 2023 to 2025 have placed substantial weight on site-wide perceived quality and topical concentration. Sites with large libraries of low-quality thin pages, AI-generated pages from earlier experiments, or off-topic legacy content have in many cases benefited from substantial pruning. The mechanism is plausibly that the site-wide quality and topical signals carry weight in ranking decisions for all pages on the site, and removing low-quality content lifts the average that gets attributed to every URL.
Operating the Refresh Programme
The refresh programme as an organisational practice needs a few standing properties to compound. The first is that it is a discipline rather than a project. A quarterly batch of 30 refresh candidates analysed, diagnosed, and shipped is more valuable than an annual project of 200 candidates that gets bogged down in mid-flight. Cadence beats volume.
The second is that refresh capacity competes for the same editorial budget as net-new publishing, and the allocation between the two should be explicit. In partner data, the most successful editorial programmes allocate roughly 30 to 50 percent of total editorial hours to refresh, retirement, and content-system maintenance, with the remaining 50 to 70 percent on net-new publishing. Sites where refresh consistently loses to net-new in the allocation argument tend to accumulate decay debt that eventually requires a much more expensive intervention. Sites where refresh consistently wins over net-new tend to slow their topical expansion and lose ground to competitors who keep publishing.
The third is that the success metric for the refresh programme is not "number of pages refreshed" or even "average traffic lift per refresh." The right metric is the marginal-lift programme: the total traffic recovered per editorial hour invested in refresh and retirement work, tracked over rolling six-month windows. This metric punishes refresh work that produces no lift and rewards diagnostic discipline that filters out the bad candidates upstream.
The fourth standing property is a tracking discipline for refresh outcomes. Every refresh should be tagged in the analytics layer (or in a separate tracker) with the date, the mechanism diagnosed, the intervention chosen, the editorial hours invested, and the four-week-before and twelve-week-after traffic figures. Without this tracking, the programme cannot answer whether it is improving over time, which mechanisms have the best success rates in the operator's specific context, and which interventions are under- or over-resourced. The tracking is cheap (a spreadsheet or a Notion table is sufficient) and the cumulative learning value is large.
Diagnosis is the bottleneck. The refresh playbook is well-understood; the work of identifying which pages will respond to which intervention is where editorial judgement has to live.
The fifth property is that the programme should be willing to retire pages without ceremony. Editorial cultures that treat every page as sacred accumulate decay debt and lose the ability to discriminate between pages that matter and pages that do not. The retirement queue is part of the programme, not a failure of it.
Refresh as a Compounding Practice
Treated well, the refresh programme compounds. A site that diagnoses each candidate, ships the right intervention, and tracks outcomes accumulates institutional knowledge about which interventions work in its specific topical and competitive context. That knowledge compounds into a refresh queue that gets better-calibrated each cycle, an editorial team that develops a sense for which pages are worth investing in, and a content library whose average quality rises over time rather than drifts downward.
Treated badly, the refresh programme produces motion without progress. Operators who skip the diagnostic step, refresh on calendar age, and measure success by output volume tend to spend large editorial budgets on a queue that is roughly uncorrelated with refresh success. They report ambiguous results, dismiss the practice as low-ROI, and reallocate to net-new publishing, which compounds the original decay problem.
The marginal-lift framework is operationally a forcing function for the diagnostic step. By scoring candidates on expected lift per hour rather than on calendar age, the framework refuses to allow effort to substitute for thought. The pages that get refreshed are the pages where refresh has a high probability of moving the metric; the rest are either retired or left alone. Editorial capacity is spent where it has the best chance of compounding.
Key Takeaways
- Content decays for four distinct mechanisms (algorithm reweighting, query intent drift, SERP feature expansion, competitor publishing), and the right refresh intervention depends on which mechanism is in play. Refresh decisions made on calendar age or absolute traffic decline alone are roughly uncorrelated with refresh success.
- The diagnostic step (Search Console query diff, SERP composition diff, competitor content diff, algorithm-update overlay) takes 20 to 40 minutes per URL and shifts refresh success rates from 30 to 35 percent up to 55 to 70 percent without changing the editorial budget.
- The marginal-lift framework triages candidates by expected lift (recoverable traffic times recovery probability) divided by editorial hours, with a strategic-value multiplier for pages that matter beyond their raw traffic. Stack-rank candidates by composite score; take the top quartile; leave the bottom half alone.
- Different mechanisms call for different interventions: content gap pages respond to targeted section addition; freshness-stale pages to a freshness pass; format-mismatch pages to substantive restructuring; SERP-feature-loss pages to answer-block restructuring; competitor-outrank pages to match-or-exceed depth.
- Retirement (redirect, merge, or unpublish) is part of the programme, not a failure of it. Pages with low refresh probability and low strategic value should be consolidated into stronger siblings rather than left to decay further; site-wide signals reward this discipline under helpful-content and core-update algorithms.
- The right success metric for the refresh programme is traffic recovered per editorial hour invested, tracked over rolling six-month windows. This metric punishes refresh on calendar age and rewards diagnostic-driven prioritisation.
- Treat refresh as a compounding practice. A site that runs the discipline accumulates institutional knowledge about which interventions work in its specific context; a site that runs it as project work loses that knowledge and produces motion without progress.
Concepts defined
Read Next
- SEO
Schema Markup ROI: Which Types Actually Move Rankings
A field-evidence audit of which schema.org types reliably move rankings or SERP feature acquisition, and which are tag-soup with no measurable impact.
- SEO
Backlink Quality Scoring Beyond DR and UR
A multi-dimensional framework for scoring backlinks beyond Ahrefs DR and Moz DA, drawing on the graph-theoretic literature, Google spam policy, and operating case studies.
- SEO
Featured Snippet Acquisition: Reverse-Engineering the SERP Feature Market
How to win featured snippets, People Also Ask, knowledge panels, and video carousels, and the click-through cost of snippet ownership that the zero-click question understates.
The Conversation
Be the first to weigh in
Join the conversation
Disagree, share a counter-example from your own work, or point at research that changes the picture. Comments are moderated, no account required.