Your hierarchy is confessing something
Open your file system, your note app, your mental model of your own life. Look at the shape of it. Not the content — the shape. One branch goes eight levels deep. Three others are barely two levels. You have 47 notes about productivity systems and 2 about your health. Your "Work" folder has nested subfolders for every client, every project, every quarter going back three years. Your "Relationships" folder — if it exists at all — is a single file called "people.md."
That shape is not neutral. It is a structural confession about where you pay attention and where you don't. And the imbalance isn't just an aesthetic problem. It degrades the hierarchy's core function: enabling you to find, use, and build on what you know.
A balanced hierarchy is one where depth is proportional to complexity across branches. Lopsided hierarchies — those with very deep branches alongside very shallow ones — indicate structural problems that compound over time.
What computer science proved about balance
Computer scientists solved the balance problem formally in the 1960s, and their solution tells you exactly what goes wrong when hierarchies tilt.
A binary search tree stores data so that every lookup follows a path from root to target. In a balanced tree, that path's length is proportional to the logarithm of the number of items — O(log n) in formal notation. Search 1,000 items in about 10 steps. Search 1,000,000 items in about 20 steps. That's the power of balanced structure.
But when a tree becomes unbalanced — when all the new data lands on one side — the structure degrades toward a linked list. The worst case: O(n), meaning you might have to traverse every single item to find what you need. A tree with 1,000,000 entries that should take 20 steps to search now takes 1,000,000 steps. Same data. Same hierarchy. Catastrophically different performance. The University of Wisconsin's CS curriculum puts it plainly: "Although the average-case times for lookup, insert, and delete are all O(log N), the worst-case time is O(N)" for unbalanced trees.
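The degradation is easy to reproduce. Here is a minimal sketch of a naive, non-balancing binary search tree (illustrative only, not production code): sorted input produces a 1,000-step search path, while balanced insertion order of the same keys yields 10.

```python
# A naive, non-balancing binary search tree: a demonstration that
# insertion order alone decides search performance.
import math
from collections import deque

class Node:
    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None

def insert(root, key):
    # Iterative insert (avoids recursion limits on degenerate trees).
    if root is None:
        return Node(key)
    cur = root
    while True:
        if key < cur.key:
            if cur.left is None:
                cur.left = Node(key)
                return root
            cur = cur.left
        else:
            if cur.right is None:
                cur.right = Node(key)
                return root
            cur = cur.right

def height(root):
    # Breadth-first height, safe for very deep trees.
    if root is None:
        return 0
    depth, queue = 0, deque([root])
    while queue:
        depth += 1
        for _ in range(len(queue)):
            node = queue.popleft()
            if node.left:
                queue.append(node.left)
            if node.right:
                queue.append(node.right)
    return depth

def build(keys):
    root = None
    for k in keys:
        root = insert(root, k)
    return root

def balanced_order(lo, hi):
    # Yield the middle element first, recursively: the preorder
    # of a perfectly balanced BST over lo..hi.
    if lo > hi:
        return
    mid = (lo + hi) // 2
    yield mid
    yield from balanced_order(lo, mid - 1)
    yield from balanced_order(mid + 1, hi)

n = 1000
degenerate = build(range(n))       # sorted input: every node chains right
balanced = build(list(balanced_order(0, n - 1)))
print(height(degenerate))                              # 1000: O(n) paths
print(height(balanced), math.ceil(math.log2(n + 1)))   # 10 10: O(log n)
```

Same 1,000 keys in both trees; only the shape differs, and with it the length of every search path.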
This is why Adelson-Velsky and Landis invented AVL trees in 1962 — the first self-balancing binary search tree. The rule is strict: for every node, the heights of the left and right subtrees differ by at most one. If an insertion violates this, the tree rotates to restore balance. The result: guaranteed O(log n) lookup. Always. The worst-case height of an AVL tree is bounded by approximately 1.44 log n — meaning even the longest possible search path stays short.
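The invariant and its repair fit in a few lines. This sketch shows only the left-left-heavy case and the single right rotation that fixes it; a full AVL implementation also handles the mirror and double-rotation cases.

```python
# Sketch of the AVL balance invariant and one rebalancing rotation.
# Only the left-left case is shown; a complete AVL tree needs four cases.
class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right

def height(n):
    return 0 if n is None else 1 + max(height(n.left), height(n.right))

def balance_factor(n):
    # AVL invariant: this must stay in {-1, 0, +1} for every node.
    return height(n.left) - height(n.right)

def rotate_right(y):
    # Lift the left child x; y becomes x's right child.
    x = y.left
    y.left, x.right = x.right, y
    return x

# Inserting 3, 2, 1 in order makes a left-leaning chain: factor +2.
unbalanced = Node(3, left=Node(2, left=Node(1)))
assert balance_factor(unbalanced) == 2      # invariant violated
root = rotate_right(unbalanced)
assert root.key == 2 and balance_factor(root) == 0   # restored
```

The rotation is the key move: nothing is deleted and nothing is re-sorted; the same nodes are simply re-parented so that depth is redistributed.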
Red-black trees relax the constraint slightly (worst-case height bounded by 2 log n) in exchange for faster insertions. B-trees, used in every modern database, generalize the concept so a single node can hold multiple data elements, keeping the tree "as short as possible" to minimize disk access — the most expensive operation in data retrieval.
The lesson from all three structures is identical: unbalanced hierarchies destroy retrieval performance. The data is still there. The structure makes it unreachable in practical time.
The human version: navigation and cognitive load
What's true for algorithms is true for human minds — with tighter constraints.
In 1956, George Miller published his landmark finding that human short-term memory holds approximately seven items, plus or minus two. In 2001, Nelson Cowan refined this to a true capacity of about four items when rehearsal strategies are stripped away. Either number tells you something about hierarchy design: at every level of your hierarchy, the number of choices you present to yourself must fit within your cognitive capacity.
Dwight Miller's 1981 study at Indiana University directly tested this for hierarchical menus. He varied menu depth from one to six levels and breadth from two to sixty-four choices per level. The result was a U-shaped function: too shallow (too many choices per level) overwhelmed users; too deep (too many levels to click through) exhausted their patience and memory. The optimum for his task was two levels with eight choices per level.
Information architect Cody Burleson synthesizes the practical guideline: static navigation systems should not exceed three levels of depth, and a breadth-to-depth ratio of roughly 7:3 provides a "healthy set of parameters" for hierarchy design. Geoff Hart, writing on technical communication, adds that the balance point depends on the complexity and familiarity of the content — expert users tolerate more depth, while novices need more breadth.
Steve Krug's "Don't Make Me Think" encodes the same principle from a usability perspective: every additional level of hierarchy is a decision point, and every decision point burns cognitive resources. A hierarchy that is deep in one branch and shallow in another forces users to switch between two different cognitive strategies — patient drilling and broad scanning — which increases confusion and retrieval failure.
The research converges: human hierarchies perform best when depth is roughly consistent across branches, stays within three to four levels, and offers five to eight choices at each level. Deviations from this aren't style preferences. They're performance degradations.
Organizational structures tell the same story
Management theory quantified this decades before UX research existed.
In 1933, French management consultant V.A. Graicunas published a mathematical analysis of supervisory relationships. He identified three types of interactions a manager must track: direct single relationships (manager to each subordinate), cross-relationships (between subordinates), and direct group relationships (manager to combinations of subordinates). His formula, r = n(2^(n-1) + n - 1) for n subordinates, shows that adding subordinates increases the total number of relationships exponentially: 1 subordinate means 1 relationship, 2 means 6, 3 means 18, 4 means 44, 5 means 100, 6 means 222.
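Graicunas' count, r = n(2^(n-1) + n - 1), can be checked directly against those figures:

```python
# Graicunas' total-relationship count for n direct reports:
# n direct relationships, plus cross- and group-relationship terms.
def graicunas(n):
    return n * (2 ** (n - 1) + n - 1)

print([graicunas(n) for n in range(1, 7)])  # [1, 6, 18, 44, 100, 222]
```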
Graicunas concluded that a top-level manager should have no more than five or six direct reports. Lyndall Urwick, building on this in the 1950s, advocated a universal span of five, emphasizing clear lines of authority and communication.
Now look at what happens in lopsided organizations. One division has a VP, three directors, twelve managers, and sixty individual contributors — four clean levels. Another division has a VP and eighty individual contributors — one level. The deep side has clear escalation paths but slow decision-making. The flat side has speed but no intermediate structure for coaching, context-sharing, or conflict resolution.
Neither extreme works. The organizational research since the 1980s shows that effective spans vary by level (3-8 at senior levels, up to 15-30 for routine supervisory roles), but the overall structure must be proportionally balanced. Lopsided org charts produce the same failure as lopsided file trees: some paths are impossibly deep while others lack the structure to navigate at all.
Lopsided knowledge systems in practice
Personal knowledge management surfaces this pattern constantly, and the symptoms are predictable.
The first symptom is the mega-folder. One top-level category absorbs everything. "Work" grows to 14 nested levels while "Personal" stays flat. The PARA method (Projects, Areas, Resources, Archive) was designed partly to prevent this — by forcing four top-level categories with different purposes, it distributes depth across functional domains rather than letting one domain dominate.
The second symptom is orphan notes. When one branch grows disproportionately deep, notes that don't fit its structure end up homeless — sitting in an "Inbox" or "Unsorted" folder indefinitely. The Obsidian community identifies orphan note detection as a core maintenance task precisely because unbalanced hierarchies generate orphans at a predictable rate. Every note that resists classification in your deep branch is evidence that your hierarchy's shape doesn't match your knowledge's shape.
The third symptom is retrieval failure. You know the note exists. You wrote it. You can almost see it. But the 14-level-deep branch has so many intermediate folders that you can't reconstruct the path. You resort to full-text search — which means your hierarchy has failed its primary job. As Anthony TD writes in his PKM best practices: notes are only as useful as the ability to retrieve and act on them.
Sönke Ahrens' Zettelkasten approach addresses balance structurally. By using a flat numbering system with branching sequences (1, 1a, 1a1, 1b, 2, 2a...), the Zettelkasten naturally limits depth because each branch grows only when genuine conceptual differentiation demands it. There are no empty placeholder folders. No category exists unless a note needs it. The result is organic balance — depth proportional to actual complexity, not anticipated complexity.
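One consequence of that numbering scheme is that depth is readable straight off a note's ID, with no folder traversal at all. A small sketch (the `zk_depth` helper is illustrative, not part of any standard Zettelkasten tool):

```python
# Zettelkasten IDs alternate number and letter runs ("1", "1a", "1a1"),
# so a note's depth is simply the count of runs in its ID.
import re

def zk_depth(note_id):
    # "1a1" splits into runs ["1", "a", "1"] -> depth 3
    return len(re.findall(r"\d+|[a-z]+", note_id))

notes = ["1", "1a", "1a1", "1b", "2", "2a"]
print({n: zk_depth(n) for n in notes})
# {'1': 1, '1a': 2, '1a1': 3, '1b': 2, '2': 1, '2a': 2}
```

Because a deeper ID can only exist once a note is filed behind it, depth here is always earned, never pre-allocated.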
AI systems fail the same way
The balance problem extends directly into artificial intelligence, and the consequences are measurable.
When training data is imbalanced — 90% from one category and 10% from another — machine learning models optimize for the majority class. The model shows high accuracy overall while systematically failing on underrepresented categories. IBM's research on AI bias describes this directly: a healthcare AI trained predominantly on data from one demographic "may not perform as well for the other," despite appearing successful by aggregate metrics.
This is structurally identical to the lopsided knowledge hierarchy. Your "Work" branch has rich, nuanced, deeply organized information. Your "Health" branch has two flat notes. When you try to make a decision about health, you're operating from a model trained on insufficient data — and you might not notice because the overall system feels comprehensive. The deep branch creates a false sense of completeness.
RAG (Retrieval-Augmented Generation) systems — the architecture behind most AI knowledge assistants — demonstrate the retrieval version of this problem. When the underlying knowledge base has uneven depth, the AI retrieves rich context for well-represented topics and thin, generic context for underrepresented ones. The answers feel confident in both cases. But the quality diverges dramatically based on the balance of the source hierarchy.
MIT researchers working on debiasing AI models found that the solution isn't adding more data to the shallow branches after the fact — it's structuring the data collection to be proportionally representative from the start. The parallel to personal knowledge management is direct: rebalancing a hierarchy after it has grown lopsided is harder than maintaining balance as you build.
How to diagnose and correct imbalance
Balance is not symmetry. A hierarchy where every branch is exactly four levels deep is not balanced — it's forced. Balance means proportional depth: branches that cover more complex domains go deeper, while simpler domains stay shallower, and the ratio between the deepest and shallowest branches stays reasonable.
Diagnostic: Map your depths. For each top-level branch in your primary knowledge system, count the maximum depth. Write the numbers down. If your deepest branch is more than three times deeper than your shallowest, you have a structural imbalance worth addressing.
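For a file-based system, this diagnostic can be automated. A sketch assuming a plain folder tree (the `branch_depths` helper and the 3x threshold mirror the rule above; neither is a standard tool):

```python
# Walk a notes directory, report the maximum folder depth under each
# top-level branch, and flag a deepest/shallowest ratio above 3x.
from pathlib import Path

def branch_depths(root):
    root = Path(root)
    depths = {}
    for branch in (p for p in root.iterdir() if p.is_dir()):
        max_depth = 1
        for sub in branch.rglob("*"):
            if sub.is_dir():
                depth = len(sub.relative_to(branch).parts) + 1
                max_depth = max(max_depth, depth)
        depths[branch.name] = max_depth
    return depths

def report(root):
    depths = branch_depths(root)
    for name, d in sorted(depths.items(), key=lambda kv: -kv[1]):
        print(f"{d:>3}  {name}")
    if depths and max(depths.values()) > 3 * min(depths.values()):
        print("warning: deepest branch exceeds 3x the shallowest")

# report(Path("~/notes").expanduser())  # point at your own vault root
```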
Correction 1: Collapse the deep side. Look at your deepest branch and ask at each intermediate level: does this level create a distinction that affects how I retrieve or use the items below it? If two adjacent levels could merge without losing meaningful differentiation, merge them. The AVL tree's rotation operation is the algorithmic version of this — when one side gets too heavy, restructure to redistribute depth.
Correction 2: Expand the shallow side. Look at your shallowest branch and ask: am I lumping together items that actually deserve separate treatment? "Health" as a single flat file might need to become three branches: "Physical," "Mental," "Medical" — each with two or three levels. Not because symmetry demands it, but because the domain has real sub-structure you've been ignoring.
Correction 3: Redistribute on a schedule. Graicunas' formula shows that relationship complexity grows geometrically. Small imbalances become large imbalances quickly. Review your hierarchy's shape quarterly. Treat it like organizational restructuring: not a sign of failure, but a sign of growth that the current structure hasn't accommodated.
Correction 4: Let usage data guide you. Track where you search, where you browse, and where you get lost. If you consistently use search instead of navigation in one branch, that branch is too deep. If you consistently browse another branch in seconds, it might be appropriately shallow — or it might lack the depth to hold what you'll eventually need.
The deeper principle
A hierarchy is a theory about how things relate. It says: these items belong together, at this level of detail, with this much structure around them. When your hierarchy is lopsided, your theory is lopsided. You've built a sophisticated model of one domain and a crude sketch of another.
That matters because you think through your structures. The depth of your hierarchy in a given domain determines the nuance of your reasoning in that domain. If "Career" has twelve levels of carefully nested sub-categories and "Purpose" has one flat note, your decisions about career will feel nuanced and your reflections on purpose will feel vague. Not because purpose is simpler than career — but because your hierarchy hasn't given you the structure to think about it with precision.
Balance your hierarchy, and you balance your capacity for thought across the domains that matter to you. The structure is not just organizing your knowledge. It is shaping what you are capable of knowing.