How a General AI Model Becomes a Helpful Assistant

UpdatedMay 26, 2026

Heya! 👋 I love helping people, and one of the best ways I do this is by sharing my knowledge and experiences. My journey reflects the power of growth and transformation, and I’m here to document and share it with you.

I started as a pharmacist, practicing at a tertiary hospital in the Northern Region of Ghana. There, I saw firsthand the challenges in healthcare delivery and became fascinated by how technology could offer solutions. This sparked my interest in digital health, a field I believe holds the key to revolutionizing healthcare.

Determined to contribute, I taught myself programming, mastering tools like HTML, CSS, JavaScript, React, PHP, and more. But I craved deeper knowledge and practical experience. That’s when I joined the ALX Software Engineering program, which became a turning point. Spending over 70 hours a week learning, coding, and collaborating, I transitioned fully into tech.

Today, I am a Software Engineer and Digital Health Solutions Architect, building and contributing to innovative digital health solutions. I combine my healthcare expertise with technical skills to create impactful tools that solve real-world problems in health delivery.

Imposter syndrome has been part of my journey, but I’ve learned to embrace it as a sign of growth. Livestreaming my learning process, receiving feedback, and building in public have been crucial in overcoming self-doubt. Each experience has strengthened my belief in showing up, staying consistent, and growing through challenges.

Through this platform, I document my lessons, challenges, and successes to inspire and guide others—whether you’re transitioning careers, exploring digital health, or diving into software development.

I believe in accountability and the value of shared growth. Your feedback keeps me grounded and motivated to continue this journey. Let’s connect, learn, and grow together! 🚀

Comments

Join the discussion

No comments yet. Be the first to comment.

AI Engineering

Part 7 of 12

Up next

System Prompts and Guardrails in AI models

At this point, you already understand two major stages in the life of an AI system. First: the model learns language patterns through pretraining Then: the model is shaped into a helpful assistant

The Difference Between a Base Model and an Assistant

Let’s start with an important distinction.

A base model is simply a model that completed pretraining.

It learned patterns from massive text datasets.

But it has not yet been carefully trained for human interaction.

You can think of it like this:

A base model has:

language ability
pattern recognition
general knowledge patterns

But it does not yet have:

polished behavior
conversational structure
safety tuning
helpful formatting
response discipline

We can explain it with the analogy of a fresh graduate entering a workplace.

The graduate may be intelligent.

But they still need training before becoming effective in a real job.

The same thing happens with AI.

What Is Fine-Tuning?

Fine-tuning is additional training performed after pretraining.

During fine-tuning, humans provide examples of:

good responses
preferred behavior
useful formats
acceptable interaction styles

The model studies these examples and adjusts its behavior accordingly.

For example, trainers may teach the AI:

how to answer politely
how to structure explanations
how to summarize clearly
how to avoid harmful responses
how to follow instructions carefully

Over time, the AI becomes more aligned with what humans consider helpful.

A Simple Analogy: Job Training

Imagine hiring someone fresh out of university.

They may know:

language
concepts
theory

But they still need workplace training.

A hospital trains doctors differently from how a law firm trains lawyers.

The same thing happens with AI.

A general model becomes specialized through additional instruction and feedback.

This is why the lesson is called:

“The Job Training Phase.”

Because the model is essentially being trained for a role.

What Actually Happens During Fine-Tuning?

During fine-tuning, the model sees examples like this:

User Prompt

Explain photosynthesis simply.

Preferred Response

Photosynthesis is the process plants use to turn sunlight into energy...

The model studies thousands or millions of examples like this.

Gradually, it learns:

preferred wording
useful structures
response style
formatting habits
conversational behavior

This is why many AI assistants naturally:

use headings
organize information clearly
answer in steps
explain ideas conversationally

Those behaviors were trained into them.

What Is RLHF?

Now we arrive at one of the most important ideas in modern AI alignment.

RLHF stands for:

Reinforcement Learning from Human Feedback

The name sounds intimidating, but the core idea is actually simple.

Humans compare AI responses and choose which one feels better.

The system then learns from those preferences.

How RLHF Works

Imagine the AI gives two answers to the same question.

Response A

vague
confusing
robotic

Response B

clearer
structured
more helpful

Humans choose Response B.

The model then adjusts itself slightly toward that style of response.

This process repeats millions of times.

Gradually, the AI becomes more aligned with human preferences.

Why AI Assistants Feel Different

This explains something many people notice quickly.

ChatGPT, Claude, and Gemini do not feel identical.

That is because they were trained differently.

Different companies:

prioritize different behaviors
use different training examples
apply different safety philosophies
optimize for different user experiences

Your source highlights this clearly.

For example:

ChatGPT

Often feels:

energetic
structured
action-oriented
list-heavy

Claude

Often feels:

cautious
reflective
careful with uncertainty
more paragraph-based

Gemini

Often feels:

exploratory
conversational
flexible
creative

These differences are not accidental.

They come from:

training decisions
feedback systems
fine-tuning priorities

In other words:

AI personality is engineered.

Fine-Tuning Shapes Behavior, Not Intelligence

This distinction matters a lot.

Fine-tuning can shape:

tone
helpfulness
formatting
conversational style
refusal behavior

But it does not magically create true understanding.

For example:

a model can sound like a doctor
without actually possessing medical expertise

Your source explains this directly:

Fine-tuning teaches behavior, not real knowledge.

That is why AI can still:

hallucinate
reason poorly
generate false information

even when sounding highly confident.

Why Human Feedback Matters So Much

Without human feedback, many AI systems feel awkward or difficult to use.

They may:

overcomplicate answers
ignore instructions
produce chaotic outputs
behave inconsistently

Human feedback teaches the model:

what users prefer
what feels clear
what feels useful
what feels trustworthy

This is one reason modern AI assistants feel surprisingly conversational.

Their communication style was heavily shaped by human evaluation.

But Human Feedback Also Introduces Bias

This is important to understand.

Human feedback is not perfectly objective.

Humans have:

cultural assumptions
preferences
blind spots
political views
communication biases

So when humans train AI systems, some of those biases can influence the model.

This means:

AI behavior is not neutral
AI reflects training choices
“helpfulness” is partly subjective

That is why different AI systems sometimes respond differently to the same request.

Why AI Can Feel Trustworthy Even When Wrong

One of the most important consequences of fine-tuning is this:

The AI becomes better at sounding trustworthy.

That does not necessarily mean it became more accurate.

This distinction is critical.

A well fine-tuned model may:

sound confident
communicate clearly
appear intelligent
feel persuasive

while still producing:

false facts
weak reasoning
fabricated information

Fine-tuning improves presentation.

It does not eliminate errors.

The Main Training Stages (Simple Summary)

At this point, it helps to zoom out and see the bigger picture.

Stage 1 — Pretraining

The AI learns language patterns from massive datasets.

Like general education.

Stage 2 — Supervised Fine-Tuning

Humans show the model examples of preferred responses.

Like guided workplace training.

Stage 3 — RLHF

Humans rank outputs and reinforce preferred behavior.

Like performance feedback.

Stage 4 — Deployment

The AI becomes available to users.

Companies continue refining the system over time.

Common Beginner Mistakes

Mistake 1: Thinking AI behavior is natural

Most assistant behavior was intentionally trained.

Mistake 2: Confusing politeness with intelligence

A polite answer can still be wrong.

Mistake 3: Thinking all AI systems are the same

Different training methods create different personalities and behaviors.

Mistake 4: Assuming fine-tuning solves everything

Fine-tuning improves behavior.

It does not solve all reasoning or factual problems.

Mental Model

Here is the clearest way to think about this phase:

Pretraining

teaches the AI:

how language works

Fine-tuning

teaches the AI:

how to behave around humans

That distinction is extremely important.

Practice Thinking

Think carefully through these questions:

Why might a raw base model behave strangely without fine-tuning?
Why do different AI assistants feel different?
Why can a helpful-sounding response still be inaccurate?
What kinds of human preferences might influence AI behavior?
Why does RLHF make AI feel more conversational?

Take your time with these.

These are foundational AI literacy questions.

Key Takeaways

Fine-tuning happens after pretraining
A base model learns language patterns but not polished assistant behavior
Fine-tuning teaches preferred response styles
RLHF uses human feedback to shape behavior
Different AI assistants feel different because they were trained differently
Fine-tuning improves helpfulness, not true understanding
Human feedback strongly shapes AI personality and behavior

What’s Next

At this stage, the AI has:

learned language patterns
learned how to behave more helpfully

But another important question remains:

Why does the AI sometimes refuse requests?

Why can the same prompt produce:

acceptance from one AI
refusal from another
or different levels of caution?

That takes us into:

system prompts
guardrails
moderation systems
and the invisible rules shaping AI behavior.

In the next lesson, we will explore the hidden instruction systems working behind every AI conversation.

Command Palette

Comments

AI Engineering

System Prompts and Guardrails in AI models

More from this blog

The Difference Between a Base Model and an Assistant

What Is Fine-Tuning?

A Simple Analogy: Job Training

What Actually Happens During Fine-Tuning?

User Prompt

Preferred Response

What Is RLHF?

How RLHF Works

Response A

Response B

Why AI Assistants Feel Different

ChatGPT

Claude

Gemini

Fine-Tuning Shapes Behavior, Not Intelligence

Why Human Feedback Matters So Much

But Human Feedback Also Introduces Bias

Why AI Can Feel Trustworthy Even When Wrong

The Main Training Stages (Simple Summary)

Stage 1 — Pretraining

Stage 2 — Supervised Fine-Tuning

Stage 3 — RLHF

Stage 4 — Deployment

Common Beginner Mistakes

Mistake 1: Thinking AI behavior is natural

Mistake 2: Confusing politeness with intelligence

Mistake 3: Thinking all AI systems are the same

Mistake 4: Assuming fine-tuning solves everything

Mental Model

Pretraining

Fine-tuning

Practice Thinking

Key Takeaways

What’s Next