The Verification Checklist
Build a personal AI verification checklist and stress-test it against real AI outputs. 25 minutes.
One-liner: Build a personal AI verification system (a checklist you'll actually use) and stress-test it against real AI outputs to find its limits.
🔧 Jump in (Tinkerers start here)
You're going to build a verification checklist, then immediately try to break it.
Step 1: Generate something to verify. Ask AI to produce a piece of content you might actually use in your work:
Write a [deliverable type, e.g., client email, project proposal, market analysis, technical recommendation] about [topic relevant to your work]. Make it detailed and specific. Include data points, recommendations, and reasoning.
Step 2: Build your checklist. Before reading the output carefully, write your own verification checklist. Start with these categories and add your own:
| Check || Question || Pass/Fail
| Factual claims || Are specific numbers, dates, or statistics verifiable? ||
| Sources || Could I find the original source for any cited information? ||
| Reasoning || Does the logic hold? Are there hidden assumptions? ||
| Completeness || What important perspective or consideration is missing? ||
| Tone/audience || Is the tone appropriate? Would the intended audience trust this? ||
| Actionability || Are the recommendations specific enough to actually follow? ||
| Your domain check || [Add a check specific to your field] ||
| Your domain check || [Add another check specific to your field] ||
Step 3: Apply the checklist. Go through the AI output line by line using your checklist. Mark each check as pass or fail. For every fail, note what the issue is.
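If you would rather track results in a script than on paper, Step 3 can be sketched in Python. This is one possible shape, not a prescribed tool: the `Check` class, the `failures` helper, and the sample note are hypothetical names and values made up for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Check:
    """One row of the checklist (hypothetical structure, for illustration)."""
    name: str
    question: str
    passed: Optional[bool] = None  # None means "not reviewed yet"
    note: str = ""                 # what the issue is, for every fail


def failures(checks):
    """Return only the checks that were explicitly marked as failed."""
    return [c for c in checks if c.passed is False]


checklist = [
    Check("Factual claims", "Are specific numbers, dates, or statistics verifiable?"),
    Check("Sources", "Could I find the original source for any cited information?"),
    Check("Reasoning", "Does the logic hold? Are there hidden assumptions?"),
]

# Record results while reading the AI output (illustrative values, not real findings).
checklist[0].passed = False
checklist[0].note = "Example note: a statistic with no traceable origin."
checklist[1].passed = True

for check in failures(checklist):
    print(f"FAIL {check.name}: {check.note}")
```

Leaving `passed` as `None` until you have actually looked keeps "not reviewed" separate from "pass", so an unchecked row never counts as verified.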
Step 4: Stress-test the checklist. Now deliberately ask AI to produce something harder to verify:
Write the same type of [deliverable] but on a topic I'm less familiar with: [topic outside your expertise]. Make it equally detailed and authoritative.
Apply your checklist again. Where does it fail to catch problems? What check do you need to add?
Step 5: Finalize. Update your checklist based on what you learned. Save it where you'll actually use it: bookmark it, pin it, print it, whatever works.
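If a plain-text file is the format you are most likely to reach for, a few lines of Python can produce one. This is a minimal sketch: `render_checklist` and the file name in the comment are hypothetical, and the two rows are taken from the starter table only as sample input.

```python
def render_checklist(rows):
    """Format (check, question) pairs as a plain-text list with tick boxes."""
    lines = ["AI verification checklist", ""]
    for number, (check, question) in enumerate(rows, start=1):
        lines.append(f"[ ] {number}. {check}: {question}")
    return "\n".join(lines) + "\n"


rows = [
    ("Factual claims", "Are specific numbers, dates, or statistics verifiable?"),
    ("Actionability", "Are the recommendations specific enough to actually follow?"),
]

text = render_checklist(rows)
print(text)

# To keep a copy on disk, uncomment the next two lines (the file name is a placeholder).
# from pathlib import Path
# Path("verification_checklist.txt").write_text(text, encoding="utf-8")
```

Plain text prints cleanly and pastes into email, notes apps, or a pinned chat message without conversion.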
📋 Plan first (Planners start here)
Here's what you're about to do:
- Generate test content: Ask AI to produce a work-relevant deliverable. This gives you realistic material to verify.
- Draft your checklist: Build a structured verification list covering factual accuracy, reasoning quality, completeness, tone, and domain-specific concerns.
- Apply to familiar territory: Use the checklist on AI output about a topic you know. This lets you calibrate how well the checklist catches real errors.
- Apply to unfamiliar territory: Use the checklist on AI output about a topic you don't know well. This exposes gaps in your process, namely the errors you can only catch with domain knowledge.
- Iterate and save: Update the checklist based on what it missed. Save it in a format you'll actually reach for.
"Done" looks like: A tested, refined verification checklist (8-12 items) saved in a usable format, with evidence of at least one error it caught and one gap you identified and fixed.
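The countable parts of that "Done" definition can be written as a single condition. The sketch below is only an illustration: `is_done` and its parameter names are hypothetical, and it cannot judge whether the format you saved the checklist in is actually usable.

```python
def is_done(item_count, errors_caught, gaps_fixed):
    """Return True when the countable "Done" conditions all hold."""
    return 8 <= item_count <= 12 and errors_caught >= 1 and gaps_fixed >= 1


# Illustrative inputs, not real results.
print(is_done(item_count=9, errors_caught=2, gaps_fixed=1))  # True
print(is_done(item_count=6, errors_caught=1, gaps_fixed=1))  # False: fewer than 8 items
```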
🧠 Why this matters (Strategists start here)
In EP-Basic-01, you built a simple 3-question verification prompt. Here, you're building a systematic process: a checklist that works regardless of topic, catches both factual and reasoning errors, and is tuned to your specific work context. The community's 75% Ethical Prompting score means most people intend to verify AI output but lack a consistent method. A checklist turns good intentions into reliable behavior. At the advanced level, you'll scale this into a governance framework for a team; this exercise builds the individual practice first.
Reflection
- Which check caught the most problems? Which was least useful?
- How did your verification experience change between the familiar topic and the unfamiliar one?
- Is your checklist something you'd actually pull up before sending an AI-generated deliverable? What format makes it most likely you'll use it?
- 💬 Trade checklists with a colleague. Have them apply yours to an AI output from their work; their feedback will reveal blind spots specific to your domain. (Social Learners)
⬆️ Level up
Ready for more? Try EP-Advanced-01, where you'll design an AI governance framework for a team or project.