Supplying Context and Grounding
Feed the model the right facts at the right time using structured context blocks, delimiters, and source pinning.
Citing and Linking Evidence — Make Your AI's Claims Walk a Straight Line
“If your model says it’s from the New York Times but can’t point to an article, that’s not confidence — that’s a party trick.”
You already learned how to ask an LLM to use retrieval summaries and how to ground outputs with source material. Now we do the responsible, slightly less sexy but absolutely crucial work: making the model actually cite and link its evidence so humans (and downstream systems) can verify, audit, and reuse what it says.
This lesson builds on:
- Retrieval Summaries in Prompts (you learned how to feed concise retrieved context into prompts), and
- Grounding With Sources (you learned to attach source metadata to retrieved chunks).
It also leans on Roles, Personas, and System Prompts: use those to enforce citation style, voice, and verification tasks across single- and multi-agent flows.
Why citation matters (beyond polite academia)
- Trust & Verifiability: Users can check claims quickly. No more “trust me” hallucinations.
- Attribution & Licensing: Respect IP and connect outputs to owners (DOIs, permalinks).
- Debugging: If something’s wrong, you can trace which source led the model astray.
- Chaining tools: Other agents or code can fetch the links for more processing.
Imagine a three-agent pipeline: Retriever (gets docs), Synthesizer (writes answers), Verifier (checks claims). If the Synthesizer doesn’t attach robust citations, the Verifier is blind. Roles matter.
Core principles for citing and linking evidence
- Always include source metadata: title, author, date, URL, and retrieval timestamp. If available: DOI, page number, or paragraph id.
- Prefer canonical links: DOI, publisher permalink, archive.org snapshot—URLs that are stable.
- Include exact text snippets or quote spans used to support a claim (with offsets or paragraph IDs) when possible.
- State confidence: is the citation direct support, partial support, or contradicted?
- Avoid invented citations: if the model can’t find a source, make it say so.
- Use citation formats consistently (inline, footnote, bracketed) based on user needs and system constraints.
Citation patterns and prompt templates
Below are practical templates. Use system prompts to enforce these formats across agents.
- Inline bracketed citations (compact, machine-friendly)
System: You are a Synthesizer. When you assert a fact, append [source_id] where source_id refers to the retrieval record.
User: Summarize the retrieved docs and answer: Is remote work productivity higher than office work?
Expected output: "Studies show mixed results; a 2021 meta-analysis found small productivity gains for remote work [SRC_12]."
- Markdown with clickable links (user-facing)
Instruction: Provide an answer with inline citations. Each citation should include: (Author, Year) - [title](url) and a one-sentence quote excerpt.
Output example: Remote work yielded modest productivity gains (Smith et al., 2021) - [A Meta-Analysis of Remote Work](https://doi.org/xxxx). Quote: "...average output rose by 4%..." (p. 12).
- Footnote-style numbered citations (good for long answers)
Write an answer, and number citations [1], [2], etc. At the end include a references section with full metadata and retrieval timestamps.
- Evidence-first answer (prioritize verifiability)
Start with evidence list: 1) [Title](URL) — excerpt. Then synthesize the claim and attach bracketed evidence IDs to each sentence.
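The templates above can be enforced programmatically. Here is a minimal sketch of a hypothetical helper that assembles a Synthesizer prompt from structured retrieval records, pinning each fact to a record id in the inline-bracket style; the function name and prompt wording are illustrative, not from any particular library.

```python
# Hypothetical helper: render retrieval records into a Synthesizer prompt
# that enforces inline [SRC_x] citations. Record fields (id, title, url,
# excerpt) follow the structured-record convention described below.

def build_synthesizer_prompt(question: str, records: list) -> str:
    """Assemble a prompt that pins each factual claim to a record id."""
    context_lines = [
        f"[{rec['id']}] {rec['title']} ({rec['url']}): \"{rec['excerpt']}\""
        for rec in records
    ]
    rules = (
        "Append [source_id] after every factual claim. "
        "If no record supports a claim, write 'No supporting source found'."
    )
    return (
        "Context:\n" + "\n".join(context_lines)
        + f"\n\nRules: {rules}\n\nQuestion: {question}"
    )

prompt = build_synthesizer_prompt(
    "Is remote work productivity higher than office work?",
    [{"id": "SRC_12",
      "title": "A Meta-Analysis of Remote Work",
      "url": "https://example.org/meta-remote",
      "excerpt": "Average output rose by 4%..."}],
)
```

Because the record ids appear verbatim in the context, the model only has to copy them, not invent them.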
Example retrieval record (good practice)
Use structured retrieval results so downstream components can render citations reliably.
{
  "id": "SRC_12",
  "title": "A Meta-Analysis of Remote Work",
  "author": "J. Smith et al.",
  "date": "2021-07-15",
  "url": "https://example.org/meta-remote",
  "doi": "10.1234/remote.2021",
  "excerpt": "Average output rose by 4%...",
  "page": 12,
  "retrieved_at": "2026-03-09T14:32:00Z",
  "confidence": 0.92
}
Feeding this object into the prompt makes it trivial for the model to reference it reliably and prevents hallucinated URLs.
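Before a record ever reaches the prompt, it pays to validate that the metadata is complete. A small sketch of such a check, assuming the field names from the record above:

```python
# Required metadata for a retrieval record, per the structure shown above.
# Optional extras (doi, page, confidence) are not enforced here.
REQUIRED_FIELDS = {"id", "title", "url", "excerpt", "retrieved_at"}

def validate_record(record: dict) -> list:
    """Return the sorted list of required fields missing from a record."""
    return sorted(REQUIRED_FIELDS - record.keys())

# A record missing its excerpt and timestamp fails fast, before prompting:
missing = validate_record({"id": "SRC_12",
                           "title": "A Meta-Analysis of Remote Work",
                           "url": "https://example.org/meta-remote"})
```

Rejecting incomplete records here is cheaper than debugging an uncitable answer later.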
Multi-agent citation choreography
Use roles to split responsibility:
- Retriever: returns structured records (id, metadata, excerpt, confidence).
- Synthesizer: writes the answer, attaching bracketed IDs and short quotes.
- Verifier: re-fetches top citations, checks quotes and pulls page/paragraph for verification.
System prompts lock this division of labor in. Example system instruction for the Synthesizer:
"Every factual claim must include one or more [SRC_x] tags corresponding to retrieval records. If no record supports the claim, say 'No supporting source found' instead of guessing. Provide a short quote (≤140 chars) for each source used."
That last line is vital — force the model to point at evidence, not just conjure it.
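The Verifier's first pass can be a plain string check, no re-fetching required: confirm every [SRC_x] tag in the answer maps to a real retrieval record. A minimal sketch, with an illustrative function name:

```python
import re

# Hypothetical Verifier-side audit: tags with no matching retrieval
# record are flagged as suspect (likely hallucinated citations).

def audit_citations(answer: str, records: dict) -> dict:
    """Split cited ids into known (verifiable) and suspect (no record)."""
    cited = set(re.findall(r"\[(SRC_\w+)\]", answer))
    known = cited & set(records)
    return {"known": sorted(known), "suspect": sorted(cited - known)}

answer = "Productivity rose modestly [SRC_12], though results vary [SRC_99]."
records = {"SRC_12": {"title": "A Meta-Analysis of Remote Work"}}
report = audit_citations(answer, records)
```

Anything in the `suspect` bucket goes back to the Synthesizer for revision before the answer ships.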
Formatting options: quick pros & cons
| Style | Pros | Cons |
|---|---|---|
| Inline [SRC_1] | Compact, machine-friendly | Less readable for lay users |
| Markdown links | User-friendly, click-to-verify | Long URLs clutter text |
| Footnotes | Good for dense text | Requires extra navigation |
| Excerpt-first | Extremely verifiable | Longer, may feel clunky |
Choose based on audience. For APIs and multi-agent flows, structured IDs + metadata win. For end-users, Markdown links + quote snippets feel friendlier.
Common mistakes (and how to avoid them)
- Hallucinated links: Always generate citations from retrieval records; never make up domains or DOIs.
- Vague metadata: include retrieval timestamps and document IDs.
- Overlinking: don’t cite every sentence if one source supports a whole paragraph — instead cite once and indicate it covers the paragraph.
- Ignoring licensing: don’t provide full-text links for paywalled content unless you have rights; provide DOI/abstract links instead.
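The hallucinated-link mistake in particular is easy to catch mechanically: every URL in the final answer should trace back to a retrieval record. A rough sketch, assuming records carry a `url` field as in the example above:

```python
import re

# Hypothetical guardrail: any URL in the answer that is not present in a
# retrieval record is reported as unverified (possibly invented).

def find_unverified_urls(answer: str, records: list) -> list:
    """Return URLs in the answer that no retrieval record vouches for."""
    known = {rec["url"] for rec in records}
    found = re.findall(r"https?://\S+", answer)
    # Strip trailing punctuation that regexes over prose tend to pick up.
    return sorted({u.rstrip("),.") for u in found} - known)

unverified = find_unverified_urls(
    "See https://example.org/meta-remote and https://fake.example/claim.",
    [{"url": "https://example.org/meta-remote"}],
)
```

An empty result means every link is grounded; anything else is a red flag worth blocking on.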
Quick checklist to include in your system prompt
- Require structured retrieval records with id, title, url, doi, excerpt, and retrieved_at.
- Enforce citation format (example: [SRC_x] + markdown link in references).
- Require a one-line quote for each cited source.
- If uncertain, say "No supporting source found".
- For multi-agent systems: require Verifier role to confirm citations before finalizing answers.
Final pep talk (and a tiny dare)
Citations aren’t academic red tape — they’re the scaffolding that keeps your app from becoming a hallucination factory. Use roles to separate duties, make retrieval records structured, and force the model (with system prompts) to point to actual evidence. Your users will love you for it. Your lawyers might too.
Try this: implement a tiny Synthesizer prompt that refuses to answer without citing at least one authoritative source. Then, watch how quickly the hallucinations drop.
Key takeaways:
- Always attach source metadata and retrieval timestamps.
- Use stable links (DOI, permalinks) and quote snippets for verifiability.
- Enforce citation behavior via system prompts and role separation.
Go forth and make your AI cite like it's going to therapy — honest, transparent, and a little more reliable than last Tuesday.