The Anatomy of a Production Prompt

The Core Components:
  1. Role/Persona - Sets behavior and expertise level
  2. Context - Background information needed
  3. Instructions - What to do, step by step
  4. Constraints - What NOT to do, output format
  5. Examples - Few-shot demonstrations (optional)
  6. Input - The actual data to process
Basic Template: Full runnable structured prompt notebook
prompt = f"""
[ROLE]
You are an experienced customer support specialist for AcmeCo.

[CONTEXT]
AcmeCo sells software subscriptions. Our refund policy: 30-day money-back guarantee, no questions asked. After 30 days, refunds require manager approval.

[INSTRUCTIONS]
Analyze the customer message below and:
- Identify the customer's primary issue
- Determine if this qualifies for automatic refund
- Suggest next steps for the support team

[CONSTRAINTS]
- Be empathetic but concise
- Never promise refunds outside policy
- If uncertain, recommend manager review

[INPUT]
Customer message: {customer_message}
"""
Practice Check:
  • Write a prompt that includes role, context, instructions, constraints, examples, and input.
  • Expected: All 6 components present with explicit constraints and a clear output schema.
  • Try this too: Compare outputs after removing the structure, and again with less capable models.
In Production:
  • Cost impact: Clear output schemas reduce parsing errors and retries (fewer re-runs).
  • Reliability: Structured prompts are easier to validate and monitor.
  • Performance: Slight token overhead; mitigate with caching (see Model Selection & Cost Optimization).
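As a complement to provider-side prompt caching (which is configured through the API itself), a client-side memo of identical prompts also cuts repeat cost. A minimal sketch, with `call_model` as a hypothetical client function; this is only safe with deterministic settings (e.g. temperature 0):

```python
import hashlib

# Client-side response cache keyed by a hash of the full prompt text.
_cache: dict[str, str] = {}

def cached_completion(prompt: str, call_model) -> str:
    """Return a cached response for an identical prompt, calling the model once."""
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    if key not in _cache:
        _cache[key] = call_model(prompt)  # only pay for the first call
    return _cache[key]
```

Repeated identical prompts (common in evaluation runs and retries) then never hit the API twice.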

XML Tags: Your Secret Weapon

Why XML Tags Work:
  • LLMs were trained on HTML/XML (web data)
  • Tags create clear boundaries in the context
  • Tags have been reported to reduce hallucinations in controlled comparisons
Before (Ambiguous):
prompt = f"""
System: You are a helpful assistant.
User question: {user_input}
Background context: {knowledge_base}
Previous conversation: {history}
"""
# LLM might confuse where context ends and question begins
After (Clear):
prompt = f"""
<role>You are a helpful assistant</role>

<context>
<knowledge_base>
{knowledge_base}
</knowledge_base>

<conversation_history>
{history}
</conversation_history>
</context>

<user_question>
{user_input}
</user_question>
"""
# Crystal clear boundaries
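If you assemble many tagged prompts, a tiny helper keeps the boundaries consistent. A sketch, not part of any SDK; the sample inputs are hypothetical:

```python
def tag(name: str, content: str) -> str:
    """Wrap content in an XML-style tag with newline boundaries."""
    return f"<{name}>\n{content}\n</{name}>"

# Hypothetical sample inputs
knowledge_base = "AcmeCo refund policy: 30-day money-back guarantee."
history = "User: Hi\nAssistant: Hello! How can I help?"
user_input = "Can I get a refund after 45 days?"

prompt = "\n\n".join([
    tag("role", "You are a helpful assistant"),
    tag("context",
        tag("knowledge_base", knowledge_base)
        + "\n\n"
        + tag("conversation_history", history)),
    tag("user_question", user_input),
])
```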
In Production:
  • Cost impact: Tagging adds tokens; mitigate with caching.
  • Reliability: Clear boundaries reduce off-context responses; improves evaluability.
  • Performance: Slight overhead; offset by fewer retries and clearer parsing.
  • Real Example: Teams report 40–60% fewer hallucinations when tags + validation are combined.
Pattern: Full runnable xml tags prompt notebook
def format_docs(docs):
    return "\n".join(f"- {doc}" for doc in docs)

def build_support_prompt(query: str, customer_data: dict, relevant_docs: list) -> str:
    return f"""
<system_role>
You are AcmeCo's senior customer support specialist.
</system_role>

<policy_documents>
{format_docs(relevant_docs)}
</policy_documents>

<customer_context>
<account_tier>{customer_data['tier']}</account_tier>
<account_age_days>{customer_data['age_days']}</account_age_days>
<previous_issues>{customer_data['issue_count']}</previous_issues>
</customer_context>

<customer_query>
{query}
</customer_query>

<response_instructions>
Analyze the query and provide:
1. Issue category (billing, technical, account)
2. Urgency level (low, medium, high)
3. Recommended action
4. Suggested response to customer

Output as JSON with keys: category, urgency, action, response
</response_instructions>
"""

Few-Shot Examples: Teaching by Showing

When to Use Few-Shot:
  • Complex or subjective tasks
  • Specific output format required
  • Edge cases need clarification
Pattern: Full runnable few-shot examples notebook
prompt = f"""
<task>Classify customer sentiment</task>

<examples>
<example>
<input>Your product is amazing! Best purchase ever.</input>
<output>positive</output>
</example>

<example>
<input>It's okay, does the job but nothing special.</input>
<output>neutral</output>
</example>

<example>
<input>Terrible quality. Broke after one week. Demanding refund!</input>
<output>negative</output>
</example>
</examples>

<input_to_classify>
{customer_message}
</input_to_classify>
"""
Quality Over Quantity:
  • 3-5 examples usually enough
  • More examples = more tokens = higher cost
  • Examples should cover edge cases, not just obvious ones
Production Note: Store examples separately and inject only relevant ones:
example_db = {
    "classification": [...],
    "extraction": [...],
    "summarization": [...]
}

def get_relevant_examples(task_type: str, limit: int = 3):
    # Naive selection: returns the first N stored examples. Production
    # systems often rank examples by similarity to the incoming input.
    return example_db[task_type][:limit]
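Injected examples still need to be rendered into the `<examples>` block from the pattern above. A minimal sketch, assuming each stored example is an `(input, output)` pair:

```python
def format_examples(pairs):
    """Render (input, output) pairs as an <examples> block for the prompt."""
    blocks = [
        f"<example>\n<input>{inp}</input>\n<output>{out}</output>\n</example>"
        for inp, out in pairs
    ]
    return "<examples>\n" + "\n\n".join(blocks) + "\n</examples>"

# e.g. format_examples(get_relevant_examples("classification"))
```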

Model-Specific Optimization

Different models have different “personalities.” Here’s what works best for each.
Pattern: Full runnable model-specific optimization notebook

GPT-4 / GPT-4 Turbo

# Strengths: Structured output, following complex instructions
# Best practices:
prompt = """
{
  "role": "senior_analyst",
  "task": "financial_analysis",
  "output_format": {
    "summary": "string",
    "key_metrics": ["string"],
    "recommendation": "buy|hold|sell"
  }
}

Input: {data}
"""
# Note: this is a plain (non-f) string, so {data} stays a literal placeholder;
# fill it with str.replace — str.format would trip on the JSON braces above.
# GPT-4 excels at JSON output and clear role definitions

Claude (Sonnet/Opus)

# Strengths: Natural language, complex reasoning, long context
# Best practices:
prompt = """
<thinking>
Let me work through this step by step...
</thinking>

[Your instructions here]
"""
# Claude benefits from explicit thinking blocks
# Excellent with XML tags and markdown

Gemini 1.5 Pro

# Strengths: Massive context (2M tokens), multimodal
# Best practices:
prompt = """
[Upload entire 500-page PDF]
[Upload 10 images]
[Provide conversation history]

Based on ALL of the above context, answer: {query}
"""
# Many teams place the query at the end; validate per task (see Gemini prompting strategies)
# Can handle entire codebases or document sets

Structured Outputs with JSON Schemas

Why: Schema enforcement reduces parsing errors and retries, and makes outputs machine-checkable.
Pattern: Full runnable structured outputs with JSON schemas notebook
import json

import jsonschema  # third-party: pip install jsonschema

CONTRACT_SCHEMA = {
    "type": "object",
    "properties": {
        "parties": {"type": "array", "items": {"type": "string"}},
        "key_dates": {"type": "array", "items": {"type": "string", "format": "date"}},
        "obligations": {"type": "array", "items": {"type": "string"}},
        "risk_flags": {"type": "array", "items": {"type": "string"}},
        "summary": {"type": "string"}
    },
    "required": ["parties", "key_dates", "obligations", "risk_flags", "summary"],
    "additionalProperties": False
}

PROMPT = f"""
<task>Summarize and extract fields from the contract</task>

<constraints>
- Output valid JSON only (no markdown, no comments)
- Conform EXACTLY to this JSON schema (no extra fields):
{json.dumps(CONTRACT_SCHEMA, indent=2)}
</constraints>

<contract>
{{contract_text}}
</contract>
"""

def validate_output(json_text: str) -> dict:
    data = json.loads(json_text)
    jsonschema.validate(instance=data, schema=CONTRACT_SCHEMA)
    return data
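In practice, teams wrap the validator in a retry loop. A minimal sketch, with `call_model` as a hypothetical client function and the validator passed in (e.g. `validate_output` above), which should raise on bad output:

```python
import json

def generate_with_validation(call_model, validate, prompt: str, max_attempts: int = 3):
    """Call the model, run the caller's validator, and retry on failure."""
    last_error = None
    for _ in range(max_attempts):
        raw = call_model(prompt)
        try:
            return validate(raw)  # raises on invalid JSON or schema mismatch
        except Exception as e:  # json.JSONDecodeError, jsonschema.ValidationError, ...
            last_error = e  # could also feed the error back into the prompt
    raise RuntimeError(f"no valid output after {max_attempts} attempts: {last_error}")
```

Bounded retries keep a persistently misbehaving model from looping forever, while cheap transient failures (truncated JSON, a stray markdown fence) are recovered automatically.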
See: OpenAI Structured Outputs (2025), Anthropic Prompt Engineering (2025), Gemini Prompting Strategies (2025) in Additional Resources.