The Pareto Principle - LLMs vs Nature 🧠 Brain STREAM 2️⃣4️⃣

person working on laptop — Photo by Austin Distel on Unsplash

Happy Monday morning. Hope you like this new Brain STREAM newsletter format! Thought it was time I got organised in this respect. I’m really enjoying exploring the topics in the recent newsletters. Annette

If you’re reading this newsletter, chances are you’ve heard of the Pareto Principle. In the spirit of this topic, I asked the AI component of Capacities to describe it to me.

“The Pareto Principle, also known as the 80/20 rule, states that roughly 80% of effects come from 20% of causes. It originated from an observation made by Italian economist Vilfredo Pareto in the early 20th century when he noticed that 80% of Italy's land was owned by 20% of the population.”

Fab, thanks AI! 80% of my work, with 20% of the effort.

The Pareto Principle seems to crop up everywhere - finance, populations, business, even nature - despite no-one knowing why. It’s one of those mysteries of the universe.

It’s not a rule because it has no known cause and effect.

So, chances are 80% of anything I do, will come from 20% of my effort anyway, because the 80/20 rule is just a natural phenomenon is it not?

Today I consider Large Language models and how I see them contributing to my own research work. If it is one thing LLMs excel at, it’s doing 80% of our work for us.

Round 1: Large Language Models

I was asking my sister if she uses Chat GPT in her line of work (business finance stuff). She said she uses it to do the first 70% of the work, creating marketing templates and business finance summaries, often through the use of a series of prompting questions.

I was curious to see some of the ways LLMs could help in research specifically, so I asked it “for examples of how academic researchers could use LLMs to apply the Pareto Principle”:

Summarising key papers enabling you to find the 20% of research papers that will provide 80% of the background of your work
Finding the 20% most highly cited papers for 80% of the topic background
Analysing datasets and finding the 20% of the variables explaining 80% of the variance
Indicate the 20% of your hypotheses which are most likely to be significant (🤔)
Create text drafts containing the 20% of your results to explain 80% of your findings
List the top 20% of researchers in your field to optimise collaboration

Now whilst there may be some useful approaches contained in these examples I’d be curious to know how it can predict the most significant hypotheses. Nevertheless, having the Pareto Principle in mind when conversing with an LLM is a worthwhile thing to do.

This is partly because I find they give answers that are 80% right. What I mean by this is they aren’t wrong but they aren’t 100% right either.

Here’s an example from my own field (haha pun intended) with my comments written in bold:

Prompt: Please tell me the advantages and disadvantages of ley grasslands for soil structure

Answer: Certainly! Here are the specific advantages and disadvantages of ley grasslands for soil structure:

`Advantages for Soil Structure`

Improved Soil Aggregation:
- Root Systems: The robust (viz. perennial, usually, they are no more ‘robust’, in fact they probably change more over time) root systems of grasses and legumes (and herbs) in ley grasslands enhance soil aggregation (usually grasses specifically that are drivers of aggregation though not always), leading to improved soil structure.
- Organic Matter: Decomposition of root biomass and plant residues increases organic matter content, which binds soil particles into stable aggregates. (Not only decomposition, but a lot of organic material is produced in root exudates)
- (Wet-dry cycling of soils by roots also drives aggregation)
Reduction of Soil Compaction:
- Root Penetration: Deep root systems (mostly from those herbs you neglected to mention, alfalfa being an exception; the roots don’t have to be deep to alleviate surface compaction) help break up compacted layers of soil, improving aeration (depends on the species) and root growth for subsequent crops (in the subsoil only if you till afterwards which is often the case).
- Earthworm Activity: Increased organic matter and root exudates encourage earthworm activity (as far as I’m aware exudates do not encourage earthworm activity but they do contribute to organic matter), which further helps to alleviate compaction and improve soil porosity.
Enhanced Water Infiltration and Retention:
- Porosity: ~~Improved soil structure~~ (Roots and earthworms) increases the number of soil pores, facilitating better water infiltration (the ‘number’ of pores is misleading, it is the volume, size distribution, connectivity and tortuosity of those pores that changes water infiltration. It’s the equivalent of saying stronger people have more muscles - well they have the same number as everyone else, it’s just they have more fibres, better blood flow etc.).
- Water Holding Capacity: Higher organic matter content enhances the soil’s ability to retain moisture, reducing runoff and improving drought resilience.
Erosion Control:
- Soil Stability: Dense root systems stabilize soil particles, protecting against erosion by wind and water.
- Surface Cover: Continuous ground cover provided by grasses and legumes (lets just say ley grassland plant species) reduces the impact of raindrops and prevents soil erosion. (Surface cover also intercepts raindrops, reducing soil ponding and runoff)

You get the picture…

In my limited experience of using LLMs for my research, I’ve generally found them rubbish. I tried Paperpal and it was nothing short of awful for a very simple question. It seemed to be obsessed about telling me about the microbial effects of plant roots when I specifically ask it about soil structural effects. Yes, microbial effects are relevant, but not within a 200 word answer.

I’ve used Research Rabbit and other AI paper summarisers; useful maybe to make sure you haven’t missed something, but they are not up to date, so another 80% performance. To train it with the papers I need to give it to get an actually useful output, would require me to do some pretty decent literature searches, read the abstracts and well, hang on a minute - isn’t that what the AI is supposed to do?

Where I have found LLMs effective is to quickly and simply explain concepts, standard methods and R code. This is especially useful when an LLM is contained within your note-taking system (like Capacities). I could see me using them to help create lecture slides when I have a zillion other things to do at the same time.

If you don’t mind, I’m going to big myself up a little here, and say that I work on the frontier of new knowledge. Whilst LLMs can interpret what has gone exceedingly well, even methodically structure new opportunities and approaches, it just doesn’t (yet) have the level of understanding in what I’m working on. I’m not sure how useful it is to summarise 80% of what has gone when often the devil is in the detail.

💡 Action Point: Where could an LLM take away 80% of your work?

Round 2 next Monday, same time, same place

Subscribe now

🧪 Sharpen Your Skills

Two cool #scicomm jobs at Earth Minutes; Content Executive and Graphic Designer, both freelance and remote; I was almost tempted…

🧫 Interesting Stuff

Ali Abdaal asks; Why have you not already achieved your goals? Is it your Goals, Plan or System that’s failing you? Plus he says don’t become a full-time YouTuber, it’s just stressful; interesting coming from him… 😲
Don’t let eco-friendly messaging lead you into thinking food is better for you; Bryan Johnson compares dark chocolates for flavanoid and heavy metal levels; I wonder how many replicates they did? 🤔

⌛ In Case You Missed It

What’s it like doing a PhD? And what would I do differently…
Last week’s issue of 🧠 Brain STREAM:
Why the ‘Rule of Three’ is such a powerful concept 🤯: Brain STREAM 2️⃣3️⃣
It hasn’t escaped my attention over the last few weeks of how much I have been calling upon the ‘Rule of Three’ within my life: I list three things that went well and three things that didn’t go well when journaling I have three 'Quarterly Quests' 🧙‍♂️…
🧠 Brain STREAM Knowledge Ecology
Although n=6 is not exactly a rigorous number of replicates (🤣), the answer is in from the poll I did on Twitter and LinkedIn:
- Should science be more succinct or do we need more detail?
- 👉 4 of 6 said more detail is needed in science; I was surprised 😮

Quote of the Week

Author and historian Henry Hitchings on meetings in the book ‘Let’s Talk’ by Nihal Arthanayake:

"The meeting is ... a paradigm of what conversation is, and yet, the meeting is one of the lowest forms of civilisation, and is the enemy of thought and listening and getting anything done!"

What I’m Reading

Favourite research paper of the week; Finn et al (2013); exploiting positive plant interactions makes for better insurance effects, causing almost unanimous overyielding in multi-species agricultural grasslands.
Let’s Talk - How to Have Better Conversations by Nihal Arthanayake; taking turns, actively listening and giving space are key to a good conversation.

I can't say how much it means to me for you to read (and hopefully subscribe to…😜) this newsletter.

Thank you and I will see you next week.

Thanks for reading 🧠 Brain STREAM ! Subscribe for free to receive new posts and support my work.

Tagged in:

Newsletter, Skills - Information Management

Last Update: September 08, 2025

Author

Annette Raffan 132 Articles

Subscribe to our Newsletter

Comments

Table of Contents