dopetalk

Simple Machines Forum

News:

🧾✨ Link to our Forum Charter: Read, Respect, Reflect

Solicitation and Dealing of Drugs is Strictly Prohibited !
Please email smfadmin if you wish to advertise here
Non drug topics are also very welcome !
All Terms and Conditions are at the Bottom of the Page

dopetalk does not endorse any advertised product nor does it accept any liability for it's use or misuse

Our Discord Notification Server invitation link is https://discord.gg/jB2qmRrxyD

« previous next »

Print

Pages: 1 Go Down

Author Topic: What the basic components of AI are and how the data flows (Read 459 times)

Chip (OP)

Server Admin
Hero Member
Administrator
Join Date: Dec 2014
Location: Australia
Posts: 7265
Reputation Power: 0
Gender:
Last Login:Yesterday at 09:42:52 PM
Deeply Confused Learner
Profession: IT Engineer now retired

What the basic components of AI are and how the data flows

« on: May 27, 2026, 03:48:33 PM »

Quote

What are the basic components of AI and show the data flow

The Conceptual AI stack

Tokenisation
↓
Neural Networks
↓
Transformers
↓
LLMs
↓
RAG

The real AI stack

Raw Data
↓
Tokenisation
↓
Embeddings
↓
Transformer Neural Network
↓
Pretraining
↓
LLM
↓
Alignment / Fine Tuning
↓
Inference Runtime
↓
Tools / APIs / Memory
↓
RAG / Agents / Long-Term Memory
↓
Applications

Core components of AI (neural network model)

1. Input data
Raw information such as:
- images, text, audio, sensor readings
Converted into numeric form (vectors/tensors)

2. Parameters (weights + biases)
- Weights: control strength of connections
- Biases: shift outputs
These are the “learned memory” of the model.

3. Layers (transformations)
Each layer computes:

output = activation(Wx + b)
They progressively transform raw input into useful representations.

4. Activation function
Adds non-linearity:
- ReLU, sigmoid, tanh
Without it, the model becomes just a linear function.

5. Loss function
Measures error:
- difference between prediction and truth
Outputs a single number representing “how wrong” the model is.

6. Optimiser
Updates weights to reduce error:
- gradient descent / Adam
It performs learning.

---

Data flow (forward + learning loop)

Forward pass

Input
 ↓
Layer 1 (Wx + b + activation)
 ↓
Layer 2
 ↓
...
 ↓
Output (prediction)

Loss calculation

Prediction + True label → Loss → Error value

Backward pass (backpropagation)

Error
 ↓
Compute gradients (credit assignment)
 ↓
Propagate backwards through layers

Parameter update

Weights = Weights - learning_rate × gradient

---

Full AI training loop

Input → Forward pass → Prediction
                    ↓
                   Loss
                    ↓
            Backpropagation
                    ↓
           Update parameters
                    ↓
              Repeat many times

---

Key idea
AI is not rules or logic.
It is a feedback loop that tunes a large mathematical function until its outputs match data.

Generated by ChatGPT.

« Last Edit: May 28, 2026, 07:57:19 AM by Chip »

0

0

0

0

0

0

0

No reactions

No reactions

No reactions

No reactions

No reactions

No reactions

No reactions

Our Discord Server invitation link is https://discord.gg/jB2qmRrxyD

smfadmin

SMF (internal) Site
Administrator
Sr. Member
Join Date: Dec 2014
Location: Management
Posts: 548
Reputation Power: 0
Last Login:Yesterday at 02:17:30 PM
Supplied Install Member

Re: What the basic components of AI are and how the data flows

« Reply #1 on: May 28, 2026, 08:06:33 AM »

How A Query Moves/Flows Through AI And Becomes A Reply (According to ChatGPT)

A query like:

Quote

“Track this query as it moved through AI and when it becomes the reply”

passes through multiple transformation layers before you see text on screen.

Here’s the actual flow.

1. Raw User Input

You type characters into the client:

Track this query as it moved through AI and when it becomes the reply

At this point it is just UTF-8 text.

No meaning yet.
No intelligence yet.

2. Client Packaging

The app packages the message with metadata:

conversation history
timestamps
system prompts
tool availability
memory context
user settings
safety policies

The model never receives “just your sentence”.

It receives a huge structured context window.

Conceptually:

{
  "system": "...",
  "developer": "...",
  "memory": "...",
  "conversation": [...],
  "new_user_message": "Track this query..."
}

3. Tokenisation

The text is split into tokens.

Not words.
Subword fragments.

Example approximation:

["Track", " this", " query", " as", " it", " moved", ...]

Each token becomes an integer ID.

Example:

[18374, 420, 9912, 328, ...]

This is the first point where language becomes machine-processable.

4. Embedding Projection

Each token ID is mapped into a high-dimensional vector.

Conceptually:

"query" →
[-0.182, 0.991, 0.004, ...]

Now the sentence exists as geometry.

Meaning is represented as spatial relationships in vector space.

Words with similar meaning cluster nearby.

5. Positional Encoding

Transformers cannot inherently understand order.

So position information is injected:

Token 1 + position encoding
Token 2 + position encoding
...

This lets the model distinguish:

dog bites man

from:

man bites dog

6. Transformer Attention Passes

This is the core intelligence layer.

The model repeatedly performs:

self-attention
weighting
nonlinear transformations
residual mixing
layer normalization

Across dozens to hundreds of layers.

The sentence effectively becomes a dynamic graph of relationships.

The model starts constructing abstractions like:

“user wants architecture explanation”
“timeline/process requested”
“technical depth preferred”
“explanatory mode appropriate”

Attention dynamically links concepts:

"Track" ↔ process
"moved" ↔ pipeline
"reply" ↔ output generation

7. Internal Latent Reasoning

This part is invisible.

The model builds temporary internal representations:

intent estimation
response planning
style selection
safety evaluation
tool routing possibilities
probability forecasting

Not symbolic reasoning like code.

More like evolving activation fields.

There is no little narrator internally saying:

Quote

“I should explain transformers now.”

Instead:

billions of weighted activations statistically converge toward likely continuations.

8. Next-Token Prediction

The model does not generate replies all at once.

It predicts one token at a time.

Internally:

P("A") = 0.02
P("The") = 0.41
P("Your") = 0.03

The selected token becomes part of the context.

Then prediction repeats.

Loop:

input → predict token → append token → predict next token

Thousands of times.

Your entire reply emerges autoregressively.

9. Decoding

Token IDs convert back into text.

Example:

[32, 918, 4412]
↓
"The model predicts"

Now language reappears from vector space.

10. Postprocessing Layer

Before display:

formatting is applied
markdown rendered
citations inserted
safety filters checked
UI elements added
streaming chunked

11. Streaming To User

You see the response progressively appear.

But internally the model may already have generated much further ahead.

The stream is buffered and emitted incrementally.

12. When Does It “Become” The Reply?

Interesting philosophical point.

There are several candidates:

A. At tokenisation?
No — syntax only.

B. During latent representation?
Partially — semantic structure exists internally.

C. At first generated token?
Closer.

D. When decoding back into text?
Most practical definition.

E. When rendered to the user?
Human-visible completion.

The Deep Reality

The reply is not stored anywhere beforehand.

It is not retrieved from a database.

It does not exist until generated.

The model continuously collapses probability distributions into actual token choices in real time.

So the “reply” only truly comes into existence progressively during autoregressive decoding.

Extremely Simplified Stack

User Text
   ↓
Tokenisation
   ↓
Vector Embeddings
   ↓
Transformer Attention
   ↓
Latent Semantic State
   ↓
Next-Token Prediction Loop
   ↓
Decoded Tokens
   ↓
Rendered Reply

Your earlier AI stack summary was actually fairly good:

Tokenisation Pipeline
↓
Neural Networks
↓
Transformers
↓
LLMs
↓
RAG Systems

Though technically:

Transformers ARE a type of neural network, and LLMs are transformer-based systems trained at massive scale.

So structurally it’s closer to:

Tokenisation
↓
Embeddings
↓
Transformer Neural Network
↓
LLM Behaviour
↓
Tooling / Memory / RAG / Agents
↓
Final Response

« Last Edit: May 28, 2026, 08:44:45 AM by Chip »

0

0

0

0

0

0

0

No reactions

No reactions

No reactions

No reactions

No reactions

No reactions

No reactions

measure twice, cut once

Print

Pages: 1 Go Up

« previous next »

Tags:

Related Topics

		Subject / Started by	Replies	Last post
		(basic info) Understanding Drug Use and Addiction Started by Chip Psychology and Psychiatry	0 Replies 20854 Views	June 18, 2018, 03:40:55 AM by Chip
		Big Data's Influence in Medicine and Pharmaceuticals Started by Chip Artificial Intelligence / Deep Learning	0 Replies 21531 Views	June 28, 2018, 02:32:54 AM by Chip
		DNA Digital Data Storage - possible ? Infinite data storage of the future ? Started by Chip Genetics	0 Replies 22952 Views	May 18, 2019, 07:53:52 AM by Chip
		[HowStuffWorks] How Your Brain Works: Brain Components Started by Chip Neuroscience	0 Replies 26897 Views	June 14, 2019, 04:06:52 PM by Chip
		Supplement Data for Extremely High Dose Methylamphetamine Users Started by Chip Phenethylamines	0 Replies 13639 Views	December 27, 2024, 06:15:16 PM by Chip
		New laser-based artificial neuron processes enormous data sets at high speed Started by smfadmin Artificial Intelligence / Deep Learning	0 Replies 12574 Views	February 05, 2025, 08:24:07 AM by smfadmin
		Data centres will use twice as much energy by 2030 — driven by AI Started by smfadmin Energy	0 Replies 13927 Views	April 23, 2025, 07:36:53 AM by smfadmin
		What Bupropion Feels Like (Data Gathered by AI) Started by Chip Cathinones	0 Replies 13904 Views	June 02, 2025, 08:36:22 PM by Chip
		The basic "Learning AI" curriculum Started by Chip Computing	0 Replies 419 Views	May 27, 2026, 04:14:33 PM by Chip
		Vintage Technology History -- Electronics: BASIC RADIO CIRCUITRY Started by smfadmin Assorted and Other Tech.	0 Replies 315 Views	June 04, 2026, 11:46:05 PM by smfadmin

It appears that you have not registered with dopetalk. To register, please click here...

Need help or a chat ?

If you need any help or a chat then IM/PM or email me, Chip

dopetalk does not endorse any advertised product nor does it accept any liability for it's use or misuse

TERMS AND CONDITIONS

In no event will d&u or any person involved in creating, producing, or distributing site information be liable for any direct, indirect, incidental, punitive, special or consequential damages arising out of the use of or inability to use d&u. You agree to indemnify and hold harmless d&u, its domain founders, sponsors, maintainers, server administrators, volunteers and contributors from and against all liability, claims, damages, costs and expenses, including legal fees, that arise directly or indirectly from the use of any part of the d&u site.

TO USE THIS WEBSITE YOU MUST AGREE TO THE TERMS AND CONDITIONS ABOVE

Founded December 2014

SMF 2.0.19 | SMF © 2021, Simple Machines
Simple Audio Video Embedder
SMFAds for Free Forums | Sitemap | Terms and Policies
XHTML
RSS
WAP2

Server load over the past 5, 10 and 15 minutes respectively: 0.12, 0.31, 0.32

Page created in 0.333 seconds with 114 queries.

SimplePortal 2.3.6 © 2008-2014, SimplePortal