Instruction & Prompt Guidelines for Reliable Labeling
Who This Is For
Core Principles for Reliable Instructions
- Write for the exact decision you want. One instruction should map to one decision.
- Define terms before you ask reviewers to use them.
- Prefer objective rules over subjective language.
- Show examples and counterexamples that match real difficulty.
- State how to handle ambiguity and when to escalate.
- Version guidelines and track changes so everyone stays aligned.
Labeling Guideline Template
Use this outline for any labeling or evaluation task.
1. Task Definition
Describe the goal in one or two sentences. State the expected input and the expected output.
2. Label Set And Definitions
List each label. Give a short definition and one clear example.
3. Decision Rules
Write step-by-step rules that decide the label. Keep each step a single action.
4. Examples And Counterexamples
Include three to five examples for each label. Add one tricky counterexample for edge cases.
5. Ambiguity Policy
State what to do when the rules do not apply. Tell reviewers to choose a default label or to escalate.
6. Escalation Path
Name who to ask and how to log a question. Include a one-line service level for responses.
7. Quality Targets
Set inter-annotator agreement targets and gold-test minimums. Give a short note on how you measure them.
8. Submission Checklist
Add a brief checklist that reviewers run before they submit.
9. Tooling Conventions
Describe hotkeys, file naming, and required fields.
10. Privacy And Security
List redaction rules and any restricted content.
11. Version And Ownership
Add the version number, date, and the owner of the guideline.
Prompt Writing Basics for Labeling and Evaluation
- Begin with the task and the expected output format.
- Specify the label set or the rubric dimension.
- Ask for a short rationale when it helps catch mistakes.
- Set limits on length, time, and scope.
- Remind reviewers what to do when they are unsure.
Good Prompt Template
Weak Prompt Example
Rubrics for RLHF and Preference Work
Use a simple, repeatable rubric that focuses on the real task.
- Helpfulness: Does the answer solve the user’s request and follow the instruction.
- Correctness: Is the content accurate and free of unsupported claims.
- Safety and Policy: Does the answer follow policy and avoid risky content.
- Style and Tone: Does the answer match the desired tone and format.
Ask raters to rank two answers side by side. Require a one-line rationale that cites the rule they used. Keep scales short and clear. For most tasks a three-point or five-point scale is enough.
Positive and Negative Instruction Examples
Clear Instruction
Unclear Instruction
Edge Cases and Ambiguity
Calibration and Change Management
- Run a small pilot with a gold set before full production.
- Review disagreements and update rules that cause confusion.
- Re-run the pilot until agreement meets the target.
- Announce changes with a short summary and a new version number.
- Keep old versions for reference and audit.
Quality Gates and Metrics
- Gold-Test Pass Rate: Use a small gold set in every batch.
- Inter-Annotator Agreement: Track agreement and coach when it drops.
- Escalation Resolution Time: Measure how quickly questions get answered.
- Re-Label Rate: Watch how often labels are corrected after review.
- Release Gates: Block training or release if a gate falls below the target.
Prompts for Synthetic Generation and Safe Use
Generation Prompt Template
Safety Checks
Frequently Asked Questions
Do I always need rationales in prompts?
How often should I update instructions?
Can I reuse prompts across modalities?
Ready to raise label quality?
