You think you are not filtering well enough. In fact, you are doing far too much filtering.
The average knowledge worker consumes the equivalent of 174 newspapers' worth of information per day. That figure comes from a 2011 USC study, and it has only grown since. Your instinct in response is to get better at filtering — skim faster, bookmark more aggressively, install another "read later" app. But this instinct is backwards. The problem is not that you lack filtering skill. The problem is that you need to filter at all.
Herbert Simon identified this in 1971, half a century before the attention economy had a name: "A wealth of information creates a poverty of attention and a need to allocate that attention efficiently among the overabundance of information sources that might consume it." Simon's insight was that most system designers treated the problem as information scarcity when the real constraint was attention scarcity. They built systems to deliver more information when they should have built systems to deliver less, better information.
This lesson makes a single structural claim: choosing credible sources upstream is orders of magnitude more efficient than filtering noise downstream. The difference between a well-curated information environment and a noisy one is not a matter of discipline or willpower. It is an engineering decision about where you place the filter.
The upstream principle: filter at the source, not at the consumer
In water treatment, the cheapest intervention is protecting the watershed — keeping contaminants out of the water supply in the first place. Once pollutants enter the system, every downstream treatment stage adds cost, complexity, and failure modes. The same principle applies to oil and gas filtration, data engineering pipelines, and manufacturing quality control: upstream prevention is always cheaper than downstream correction.
Dan Heath formalized this as "upstream thinking" in his 2020 book Upstream: The Quest to Solve Problems Before They Happen. Heath documents how organizations that shift from reactive problem-solving to preventive system design achieve dramatically better outcomes. An online travel company prevented twenty million customer service calls per year by making simple changes to its booking interface. A school district cut its dropout rate in half by identifying at-risk students in ninth grade rather than intervening after they stopped showing up.
The information equivalent is straightforward: every low-quality source you subscribe to generates noise you must then spend cognitive resources to identify, evaluate, and discard. Every high-quality source you subscribe to generates signal you can act on directly. The former taxes your attention. The latter compounds it.
In computing, this principle has a name that dates to the 1950s: garbage in, garbage out. William D. Mellin, a US Army mathematician working with early computers, explained in 1957 that computers cannot think for themselves — "sloppily programmed" inputs inevitably lead to incorrect outputs. The same is true of your cognitive system. If you feed it unreliable information, your conclusions will be unreliable, regardless of how carefully you reason about them.
Source quality predicts decision quality better than information volume
The research here is unambiguous. Keller and Staelin published a landmark study in the Journal of Consumer Research (1987) that separated information into two components — quality and quantity — and measured their independent effects on decision effectiveness. The results were striking: when information quality was held constant, increases in information quantity had a strong negative effect on decision accuracy. More information, absent quality control, made people worse at deciding.
This finding has been replicated and extended across disciplines. Eppler and Mengis (2004), in a comprehensive review spanning organization science, accounting, marketing, and information systems, concluded that information overload degrades processing capacity, reduces the ability to integrate new information, and impairs time allocation for analysis. The overload is not merely uncomfortable — it is cognitively destructive.
The mechanism is not mysterious. Your working memory holds roughly three to five items at a time (Cowan, 2001). Every piece of low-quality information that enters your processing queue displaces a potential piece of high-quality information. When you follow 200 sources instead of 20, you are not 10x more informed. You are forcing your limited cognitive workspace to perform triage on an order of magnitude more material, most of which will be discarded. The triage itself consumes the attention you need for actual thinking.
This is why the most effective researchers, writers, and decision-makers are ruthless about source selection. They do not consume more. They consume better.
Source credibility is real and measurable
The scientific study of source credibility begins with Hovland and Weiss (1951), who conducted what became one of the most cited experiments in persuasion research. Participants received identical messages attributed to either high-credibility or low-credibility sources. Messages from credible sources produced significantly greater immediate attitude change. But the study revealed something more subtle: over time, the persuasive effect of the credible source decayed, while the effect of the non-credible source increased slightly — a phenomenon they called the "sleeper effect."
The mechanism is dissociation. Over weeks, people remember the message but forget the source. The quality signal that initially helped them correctly weigh the information degrades, leaving them with unsourced beliefs of uncertain reliability. This is exactly what happens when you consume information from a mix of credible and non-credible sources without tracking provenance — you end up with a head full of claims you cannot evaluate because you have lost the metadata about where they came from.
Modern source evaluation frameworks have operationalized this insight. The CRAAP test, developed by Sarah Blakeslee and librarians at California State University, Chico, evaluates sources across five dimensions: Currency (is it timely?), Relevance (does it fit your need?), Authority (is the author qualified?), Accuracy (is it supported by evidence?), and Purpose (why was it created?). Mike Caulfield's SIFT method offers a faster heuristic for digital contexts: Stop (before you engage), Investigate the source, Find better coverage, and Trace claims to their origin.
Both frameworks share a core insight: evaluating credibility is a skill, and applying it before you invest attention in a source is far cheaper than applying it after you have already processed the content. The five seconds it takes to check whether an author has relevant credentials or whether a claim traces back to primary research can save you thirty minutes of engaging with sophisticated-sounding noise.
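As a concrete illustration, the five CRAAP dimensions can be treated as a quick pre-reading checklist. This sketch is my own encoding, not part of Blakeslee's published test, and the passing threshold is an assumption you should tune:

```python
from dataclasses import dataclass, fields

@dataclass
class CraapCheck:
    """One yes/no judgment per CRAAP dimension for a candidate source."""
    currency: bool   # Is the information timely for my question?
    relevance: bool  # Does it actually fit my need?
    authority: bool  # Is the author qualified on this topic?
    accuracy: bool   # Are claims supported by evidence I can trace?
    purpose: bool    # Was it created to inform, not to sell or persuade?

    def worth_reading(self, threshold: int = 4) -> bool:
        """Pass if at least `threshold` of the five dimensions check out."""
        score = sum(getattr(self, f.name) for f in fields(self))
        return score >= threshold

check = CraapCheck(currency=True, relevance=True, authority=True,
                   accuracy=False, purpose=True)
print(check.worth_reading())  # → True (4 of 5 dimensions pass)
```

The point of writing it down this way is the cost asymmetry: these five judgments take seconds, while reading the source takes minutes or hours.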
The Lindy heuristic: time as a credibility filter
Nassim Nicholas Taleb introduced the Lindy effect in Antifragile (2012) as a heuristic for non-perishable things: the longer something has survived, the longer its remaining life expectancy. A book that has been in print for forty years can be expected to remain in print for another forty. A book published last month has no such track record.
Applied to information sources, the Lindy effect provides a powerful filter. Ideas that have survived centuries of scrutiny — Aristotle's logic, Euclid's geometry, Adam Smith's analysis of incentives — have demonstrated their robustness against time and criticism. A blog post published yesterday has not. This does not mean new sources are worthless. It means the burden of proof is asymmetric: old sources have earned credibility through survival, while new sources must earn it through evidence and track record.
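The asymmetry can be made concrete. As a rough sketch (the proportionality constant and the scrutiny formula are my own assumptions, not anything Taleb specifies), the heuristic looks like this:

```python
def lindy_remaining(age_years: float) -> float:
    """Lindy heuristic for non-perishable ideas: expected remaining
    life is proportional to the life already demonstrated.
    Proportionality constant of 1 assumed for simplicity."""
    return age_years

def scrutiny_needed(age_years: float) -> float:
    """Inverse heuristic: the younger the source, the more scrutiny
    it must earn through evidence rather than survival."""
    return 1.0 / (1.0 + age_years)

print(lindy_remaining(40.0))   # a 40-year-old book: expect roughly 40 more
print(scrutiny_needed(1 / 12)) # a month-old post: near-maximal scrutiny
```

The exact numbers do not matter; the monotonic relationship does. Survival time buys down the scrutiny you owe a source, and nothing else does.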
Taleb puts it directly: "The only effective judge of things is time." This is not an argument for reading only ancient texts. It is an argument for weighting your information diet toward sources that have demonstrated durability. When you must read something new, apply higher scrutiny. When you read something old that is still being cited, you can allocate less scrutiny — time has already done much of the filtering for you.
The practical implication: if you are building a reading list on any topic, start with the sources that have survived the longest. Read the textbook before the blog post. Read the primary research before the summary thread. Read the book that has been in print for twenty years before the one that came out last week. You will understand the newcomers better for having the foundation, and you will waste less time on ideas that do not survive contact with established knowledge.
Trusted curators as signal amplifiers
You do not have to do all the source evaluation yourself. One of the most efficient strategies for reducing noise is to identify people who are already excellent filters and let their curation do the upstream work for you.
Gwern Branwen, the anonymous researcher behind gwern.net, exemplifies this pattern. His particular method involves following obscure topics for long periods, then collating and sorting through massive amounts of information to surface what matters. His essays are dense, citation-heavy, and rigorously argued — which means that when Gwern writes about a topic, he has already done hundreds of hours of source evaluation that you do not need to repeat. Following his work is not outsourcing your thinking. It is leveraging his filtering.
Tyler Cowen, economist and author of the Marginal Revolution blog, performs a similar function at a different frequency — daily links, book recommendations, and short analyses that compress his prodigious reading into actionable pointers. Patrick Collison, CEO of Stripe, maintains public reading lists and question pages that function as curated windows into domains you might not otherwise encounter.
The pattern is consistent: trusted curators do not just share information. They distill entire domains into a high-signal form you can consume efficiently. Following five excellent curators in your field often produces more actionable insight than following fifty primary sources, because the curators have already done the filtering, evaluation, and synthesis that would otherwise consume your attention budget.
The key word is trusted. A curator earns trust the same way a source earns credibility: through track record, transparency about methods, willingness to update when wrong, and demonstrated expertise in the relevant domain. A curator who never cites sources, never acknowledges errors, and covers every trending topic with equal confidence is not a curator — they are a content producer optimizing for engagement, and following them adds noise rather than removing it.
Your AI as a credibility co-processor
AI tools are increasingly capable of assisting with source evaluation, though they require informed oversight. By one market estimate, the AI fact-checking market reached $1.52 billion in 2024, and approximately 30% of professional fact-checkers have integrated AI into their workflows. Tools like Originality.ai report accuracy rates above 85% on claim verification tasks.
But the most valuable application of AI for source quality is not automated fact-checking — it is structured source evaluation. You can use an LLM to perform a CRAAP analysis on a source before you invest time reading it. You can ask it to trace a claim to its primary research. You can have it compare how a finding is reported across multiple outlets to identify editorialization versus factual reporting. You can use it to identify the citation graph around a paper — who cites it, who criticizes it, whether the findings have been replicated.
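A minimal sketch of that workflow, assuming you pipe the result to whatever LLM you use (the prompt wording and function name here are illustrative, not any specific tool's API):

```python
def build_source_evaluation_prompt(title: str, author: str, url: str,
                                   claim: str) -> str:
    """Assemble a structured, CRAAP-style evaluation request for an LLM.
    The prompt text is illustrative; adapt it to your model and domain."""
    return (
        f"Evaluate the following source before I invest time reading it.\n"
        f"Title: {title}\nAuthor: {author}\nURL: {url}\n"
        f"Key claim: {claim}\n\n"
        "1. Currency: when was this published, and does timing matter here?\n"
        "2. Authority: what are the author's relevant credentials?\n"
        "3. Accuracy: trace the key claim to primary research if possible.\n"
        "4. Purpose: who benefits if I believe this?\n"
        "Answer each point separately and flag anything you cannot verify."
    )

prompt = build_source_evaluation_prompt(
    "Example post", "J. Doe", "https://example.com/post",
    "More information always improves decisions")
```

The structure matters more than the wording: forcing the model to answer dimension by dimension, and to flag what it cannot verify, keeps the output useful as triage rather than as a substitute for reading.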
This is where personal knowledge management systems amplify the effect. In a Zettelkasten, every note is atomic and linked. If you tag each note with its source and your assessment of that source's credibility, you build a compounding record of which sources have proven reliable over time. In Tiago Forte's PARA system, the Resources category becomes a curated library of trusted sources organized by domain — not a graveyard of bookmarks you will never revisit, but an active inventory of where to look first when you need information on a specific topic.
The combination of AI evaluation and structured knowledge management creates a feedback loop: your PKM system tracks which sources have historically produced signal, your AI tools help evaluate new sources against that track record, and the results feed back into your system as updated credibility metadata. Over months, your information environment becomes self-refining — not because you are spending more time on curation, but because the system accumulates evidence about what works.
Protocol: build your upstream filter
This is not a theoretical exercise. Here is how you restructure your information environment to filter upstream rather than downstream.
Step 1: Audit ruthlessly. List every information source you currently consume — subscriptions, follows, feeds, newsletters, podcasts, bookmarks. For each one, answer honestly: has this source changed my thinking or informed a real decision in the last 30 days? If no, remove it. Do not negotiate with yourself about "maybe later" or "just in case." Sources you do not use actively are noise generators sitting in your attention field.
Step 2: Apply the Lindy test. For the sources that remain, ask: how long has this source been producing reliable content? Prioritize sources with multi-year track records over those with months. This does not mean ignoring new voices — it means requiring new sources to demonstrate value before earning a permanent slot in your attention budget.
Step 3: Identify your curators. Find 3-5 people in your primary domains who are known for rigorous, well-sourced thinking. Follow them closely. Use their recommendations as a pre-filtered entry point into new material. When a trusted curator recommends a source, that recommendation carries the weight of their track record.
Step 4: Build source tiers. Create three tiers. Tier 1: 5-10 sources you check daily, high trust, consistently valuable. Tier 2: 10-20 sources you check weekly, good but not essential. Tier 3: everything else, checked only when you have a specific question. Review monthly. Demote sources that have not changed your thinking. Promote sources that consistently surprise you with quality.
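The tier system in this step can be sketched as a small data structure with a mechanical monthly review. The field names and thresholds here are assumptions to adapt, not a prescribed standard:

```python
from dataclasses import dataclass

@dataclass
class Source:
    name: str
    tier: int                 # 1 = daily, 2 = weekly, 3 = on-demand
    insights_this_month: int  # times it changed your thinking or a decision

def monthly_review(sources: list[Source]) -> list[Source]:
    """Demote sources that produced nothing; promote ones that kept surprising."""
    for s in sources:
        if s.insights_this_month == 0 and s.tier < 3:
            s.tier += 1            # demote toward on-demand
        elif s.insights_this_month >= 3 and s.tier > 1:
            s.tier -= 1            # promote toward daily
        s.insights_this_month = 0  # reset the counter for next month
    return sources

inventory = [Source("quiet-newsletter", 1, 0), Source("sharp-blog", 3, 4)]
monthly_review(inventory)
print([(s.name, s.tier) for s in inventory])
# → [('quiet-newsletter', 2), ('sharp-blog', 2)]
```

Even if you never write this as code, the exercise clarifies the rule: tier membership is earned from recent evidence, not from how a source made you feel when you subscribed.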
Step 5: Use AI for evaluation, not consumption. Before adding a new source, run it through a structured evaluation — check the author's credentials, trace a few claims to primary sources, compare coverage across outlets. Use AI tools to accelerate this process, but make the final judgment yourself. The goal is not to automate your reading. It is to automate the triage that decides what is worth reading.
Step 6: Track provenance. In your notes, always record where an idea came from. When you discover that a claim you believed turns out to be wrong, trace it back to its source and downgrade that source's credibility tier. When a source consistently produces claims that hold up under scrutiny, upgrade it. Your credibility assessments should be living data, not static opinions.
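Steps 5 and 6 together amount to keeping a running tally per source. A minimal sketch (class and field names are my own):

```python
from collections import defaultdict

class CredibilityLedger:
    """Track, per source, how its claims have held up under later scrutiny."""

    def __init__(self):
        self.records = defaultdict(lambda: {"held_up": 0, "failed": 0})

    def record(self, source: str, held_up: bool) -> None:
        key = "held_up" if held_up else "failed"
        self.records[source][key] += 1

    def reliability(self, source: str) -> float:
        r = self.records[source]
        total = r["held_up"] + r["failed"]
        return r["held_up"] / total if total else 0.5  # no data: stay neutral

ledger = CredibilityLedger()
ledger.record("solid-journal", True)
ledger.record("solid-journal", True)
ledger.record("hot-take-feed", False)
print(ledger.reliability("solid-journal"))  # → 1.0
print(ledger.reliability("hot-take-feed"))  # → 0.0
```

The neutral 0.5 prior for unseen sources reflects the Lindy asymmetry from earlier: a source with no track record is neither trusted nor distrusted until evidence accumulates.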
The compounding effect of better inputs
Every improvement in source quality compounds over time. When your inputs are reliable, your notes are reliable. When your notes are reliable, your decisions are reliable. When your decisions are reliable, the feedback you generate about what works is itself reliable — creating a virtuous cycle that makes your entire epistemic system more accurate with each iteration.
The inverse is also true, and more common. When your inputs are noisy, every downstream process inherits that noise. Your notes contain a mix of signal and misinformation you cannot distinguish without re-evaluating the source. Your decisions incorporate unreliable assumptions you do not know are unreliable. Your feedback loops reinforce errors rather than correcting them.
This is why source curation is not a nice-to-have productivity hack. It is structural infrastructure for your thinking. It belongs in the same category as having a capture system, maintaining a calendar, or keeping a decision log — foundational practices that determine the quality ceiling of everything built on top of them.
In the previous lesson, you learned that urgency is usually noise — a signal that feels important but rarely is. Source quality is the structural complement: most sources feel informative but rarely improve your thinking. The discipline is the same in both cases. Resist the feeling. Examine the evidence. Keep what proves its value. Discard the rest.
The next lesson takes this further. If curating better sources is about upgrading what enters your information environment, the concept of an information diet is about designing that environment as a deliberate, bounded system — choosing not just what you consume, but how much, when, and why.
Sources
- Hovland, C.I. & Weiss, W. (1951). "The Influence of Source Credibility on Communication Effectiveness." Public Opinion Quarterly, 15(4), 635-650.
- Keller, K.L. & Staelin, R. (1987). "Effects of Quality and Quantity of Information on Decision Effectiveness." Journal of Consumer Research, 14(2), 200-213.
- Eppler, M.J. & Mengis, J. (2004). "The Concept of Information Overload: A Review of Literature from Organization Science, Accounting, Marketing, MIS, and Related Disciplines." The Information Society, 20(5), 325-344.
- Simon, H.A. (1971). "Designing Organizations for an Information-Rich World." In M. Greenberger (Ed.), Computers, Communications, and the Public Interest. Johns Hopkins Press.
- Taleb, N.N. (2012). Antifragile: Things That Gain from Disorder. Random House.
- Heath, D. (2020). Upstream: The Quest to Solve Problems Before They Happen. Avid Reader Press.
- Caulfield, M. (2019). "SIFT (The Four Moves)." Hapgood.
- Cowan, N. (2001). "The Magical Number 4 in Short-Term Memory: A Reconsideration of Mental Storage Capacity." Behavioral and Brain Sciences, 24(1), 87-114.