dopetalk does not endorse any advertised product nor does it accept any liability for it's use or misuse


Our Discord Notification Server invitation link is https://discord.gg/jB2qmRrxyD

Author Topic: Neurologicaly Inspired Neural Networks - The Pyramidal Neuron  (Read 12 times)

Online Chip (OP)

  • Server Admin
  • Hero Member
  • *****
  • Administrator
  • *****
  • Join Date: Dec 2014
  • Location: Australia
  • Posts: 7231
  • Reputation Power: 0
  • Chip has hidden their reputation power
  • Gender: Male
  • Last Login:Today at 03:28:25 AM
  • Deeply Confused Learner
  • Profession: IT Engineer now retired
Quote
please examine my idea to model future Neural Networks on Pyramidal Neurons to overcom limitations in context and memory, Here is just one thought sp please discuss and explore. Does my idea have merit and is it practical, coherent, plausible and innovative ?

Andrew, I think this is a genuinely interesting piece of work. I also think it is stronger if it is presented as an **architectural hypothesis** rather than as a claim that "this will solve context and memory." That's an important distinction because it makes it scientifically testable.

After reading your document, my overall assessment is:

| Criterion                                               | Assessment                            |
| ------------------------------------------------------- | ------------------------------------- |
| Coherent                                                | **9/10**                              |
| Biologically grounded                                   | **8.5/10**                            |
| Practical                                               | **7/10**                              |
| Innovative                                              | **8/10**                              |
| Likelihood of becoming a publishable research direction | **High if formalised mathematically** |

The biggest surprise for me was that your document doesn't simply copy neuroscience. It extracts a small number of computational principles from pyramidal neurons and asks whether they can improve transformer architectures. That is exactly how good biologically inspired AI usually develops.

---

# What I think is the central idea

I don't actually think the key innovation is "model pyramidal neurons."

The key innovation is this:

> **Separate the processing of local information from persistent global context inside every computational unit.**

That is a much deeper idea.

Current transformers do something quite different.

Each token attends to many previous tokens.

Your proposal says

instead of repeatedly searching through thousands of previous tokens,

give every computational unit its own continuously evolving contextual state.

That is a fundamentally different philosophy.

---

# Where I think you are onto something

Current LLMs have three well-known problems.

## 1 Context is external

Transformers remember because previous tokens remain in the context window.

That isn't really memory.

It's searchable history.

Humans don't think like that.

Pyramidal neurons appear to integrate current input with an already-existing internal state.

Your proposal tries to emulate that.

That has merit.

---

## 2 Attention is expensive

Attention scales badly.

Large context windows become computationally expensive.

Your persistent context vector

[
c_t=\beta c_{t-1}+\gamma y_t
]

is almost like giving every layer its own summary of history.

Instead of repeatedly rereading history,

history slowly becomes compressed.

That's computationally attractive.



---

## 3 Context becomes semantic instead of positional

This may actually be your strongest insight.

Transformers mostly remember

> where words occurred

Your context vector remembers

> what currently matters.

Those are very different.

---

# I particularly like the basal/apical split

This is probably the cleanest part of the document.

Current token



Basal processing



Context modulation



Output

That mirrors what neuroscience believes happens surprisingly well without attempting unrealistic biological detail.

---

# But here is where I think you oversell it

Several places suggest this architecture would solve

* reasoning
* memory
* abstraction
* context fragmentation

I don't think we can claim that.

Instead I'd say

> It provides a computational mechanism that may improve these capabilities.

That wording is scientifically much stronger.

---

# Where I think the model is still incomplete

This is where I became most interested.

Your neuron has

Basal



Apical



Output



Context update

But biological pyramidal neurons have much richer dynamics.

For example

## Multiple dendritic branches

Different dendrites represent different contexts simultaneously.

Your model has one context vector.

Biology effectively has many.

That suggests something like

```
Context A

Context B

Context C

Context D
```

all competing.

Instead of

```
One giant context vector.
```

That may produce much richer reasoning.

---

## Context competition

Brains don't merely remember.

They suppress.

Perhaps

```
Apical context



Competition



Winning context



Basal modulation
```

instead of

```
Everything contributes equally.
```

---

## Confidence

Biological neurons behave differently depending on certainty.

Your gate

[
\alpha
]

could become

```
confidence

surprise

novelty

task relevance

```

instead of one scalar.

That might be much more powerful.

---

# The biggest missing concept

I think there is one enormous omission.

Prediction error.

Pyramidal neurons don't merely receive context.

They constantly compare

Expected input

versus

Actual input.

That's predictive coding.

I would probably redesign the neuron as

```
Basal input



Prediction



Comparison



Prediction error



Apical update



Output
```

rather than simply adding basal and apical activity.

That moves much closer to how cortical computation is thought to work.

---

# Could this actually improve LLMs?

Possibly.

But perhaps not for the reason you think.

The improvement might not be

"better neurons."

It may be

**better state management.**

Current transformers are essentially stateless.

Every prompt starts almost from scratch.

Your proposal introduces a persistent computational state that evolves gradually instead of recomputing everything from the prompt.

That is an active area of research.

---

# Is it novel?

Here's where we have to be careful.

Pieces of it already exist.

Researchers have explored:

* active dendrite networks
* predictive coding networks
* recurrent memory
* state-space models
* recurrent transformers
* neuromorphic computing

So the individual ingredients are not new.

However, I don't think your contribution is simply "use pyramidal neurons."

It's the **particular synthesis**:

* explicit basal/apical separation,
* persistent context integrated at the neuron/module level,
* lightweight memory update,
* compatibility with existing transformer blocks rather than replacing them.

That combination is a plausible research contribution if you clearly distinguish it from prior work and explain what is genuinely different.

---

# The biggest weakness

The mathematics is currently too simple.

Right now the equations read more like

> conceptual pseudocode

than

> a trainable architecture.

A machine learning researcher would immediately ask:

* How is the context vector initialized?
* How is it reset between tasks?
* How does backpropagation flow through persistent state?
* Does the context become unstable over long sequences?
* Is there catastrophic accumulation of irrelevant information?
* What determines β and γ? Fixed hyperparameters or learned values?
* How does this compare empirically with recurrent transformers, state-space models, or memory-augmented networks?

Those questions don't invalidate the idea—they identify what needs to be formalized next.

---

# My overall impression

I don't see this as "an amateur reinventing neuroscience."

I see it as someone approaching AI from systems engineering and asking a reasonable architectural question:

> *Can the computational principles embodied in pyramidal neurons—separation of local and global information, persistent contextual state, and hierarchical integration—be abstracted into a practical neural network module that complements transformer architectures?*

That is a coherent research hypothesis. It is biologically motivated without demanding a literal simulation of neurons, and it is specific enough that it could be implemented and benchmarked.

The next step, in my view, would be to shift the document from an explanatory overview to a research paper format: define the architecture rigorously, position it against existing work (predictive coding, recurrent transformers, active dendrites, state-space models, neuromorphic systems), derive the learning dynamics more formally, and propose concrete experiments and evaluation metrics. That would make it much easier for AI researchers to assess whether the architecture offers measurable advantages over current approaches.
friendly
0
funny
0
informative
0
agree
0
disagree
0
like
0
dislike
0
No reactions
No reactions
No reactions
No reactions
No reactions
No reactions
No reactions
Our Discord Server invitation link is https://discord.gg/jB2qmRrxyD

Tags:
 

Related Topics

  Subject / Started by Replies Last post
1 Replies
26259 Views
Last post April 05, 2018, 11:38:34 AM
by Mr.pooper
0 Replies
18946 Views
Last post May 04, 2018, 12:34:24 PM
by Chip
0 Replies
18906 Views
Last post May 04, 2018, 09:57:27 PM
by Chip
0 Replies
24565 Views
Last post June 14, 2019, 03:52:35 PM
by Chip
0 Replies
22578 Views
Last post July 22, 2019, 03:54:57 PM
by Chip
0 Replies
17621 Views
Last post October 04, 2024, 04:48:44 AM
by Chip
0 Replies
12510 Views
Last post February 05, 2025, 08:24:07 AM
by smfadmin
1 Replies
561 Views
Last post May 27, 2026, 03:43:56 PM
by Chip
0 Replies
8 Views
Last post Yesterday at 10:43:56 PM
by Chip
0 Replies
9 Views
Last post Yesterday at 11:00:06 PM
by Chip


dopetalk does not endorse any advertised product nor does it accept any liability for it's use or misuse





TERMS AND CONDITIONS

In no event will d&u or any person involved in creating, producing, or distributing site information be liable for any direct, indirect, incidental, punitive, special or consequential damages arising out of the use of or inability to use d&u. You agree to indemnify and hold harmless d&u, its domain founders, sponsors, maintainers, server administrators, volunteers and contributors from and against all liability, claims, damages, costs and expenses, including legal fees, that arise directly or indirectly from the use of any part of the d&u site.


TO USE THIS WEBSITE YOU MUST AGREE TO THE TERMS AND CONDITIONS ABOVE


Founded December 2014
SimplePortal 2.3.6 © 2008-2014, SimplePortal