AI Best Practices
Comprehensive guide to maximizing AI performance, minimizing costs, and maintaining clinical safety in Paw & Care’s AI-powered features.
Clinical Safety
Human-in-the-Loop Workflow
AI Generates Draft
AI creates SOAP notes, clinical insights, or triage recommendations
Status: Draft (not visible to other staff, not part of legal record)
Veterinarian Reviews
DVM reads AI output, makes edits, adds missing information
Required Actions:
- Read all four SOAP sections
- Verify vitals and measurements
- Check medication names and dosages
- Confirm diagnosis accuracy
Veterinarian Approves
DVM clicks “Finalize” to approve record
Status: Finalized (immutable, part of legal medical record)
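The draft, review, and finalize steps above can be sketched as a small state machine. This is a minimal illustration, not Paw & Care’s actual implementation: the class `SoapRecord`, the check names in `REQUIRED_CHECKS`, and the `finalized_by` field are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Optional, Set


class Status(Enum):
    DRAFT = "draft"
    FINALIZED = "finalized"


# Hypothetical identifiers for the four required review actions.
REQUIRED_CHECKS = frozenset({
    "soap_sections_read",
    "vitals_verified",
    "medications_checked",
    "diagnosis_confirmed",
})


@dataclass
class SoapRecord:
    text: str
    status: Status = Status.DRAFT
    completed_checks: Set[str] = field(default_factory=set)
    finalized_by: Optional[str] = None

    def edit(self, new_text: str) -> None:
        # Only drafts may change; finalized records are immutable.
        if self.status is Status.FINALIZED:
            raise PermissionError("finalized records are immutable")
        self.text = new_text

    def finalize(self, dvm_id: str) -> None:
        # Refuse to finalize until every required review action is done.
        missing = REQUIRED_CHECKS - self.completed_checks
        if missing:
            raise ValueError(f"review incomplete: {sorted(missing)}")
        self.status = Status.FINALIZED
        self.finalized_by = dvm_id
```

The point of the design is that nothing enters the legal record without an explicit veterinarian action, and that no code path can edit a record once it is finalized.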
Review Checklists
- SOAP Notes
- Clinical Insights
- Triage Calls
Before Finalizing, Verify:
Subjective Section
- Chief complaint accurate
- Duration of symptoms correct
- Owner observations match conversation
Objective Section
- Vitals present and plausible (not hallucinated)
- Physical exam findings match your notes
- Diagnostic results included if performed
Assessment Section
- Diagnosis clinically appropriate
- Differential diagnoses reasonable
- No unsupported claims
Dictation Best Practices
Recording Technique
Content Structure
Recommended Dictation Flow:
Total dictation time: 3-5 minutes for comprehensive SOAP note
Medical Terminology Tips
- Spell Out First Use
- Use Full Medical Terms
- Clarify Ambiguous Sounds
- Correct Mistakes Immediately
Uncommon Terms: Spell first occurrence
After Spelling: Use normally
Cost Optimization
Token Management
- Use Browser SpeechRecognition
- Adjust Detail Level
- Batch Processing
- Cache Common Queries
Free Live Transcription:
Cost Savings: ~40% reduction in Whisper API calls
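For the "Cache Common Queries" item in the list above, one possible approach is an in-memory cache keyed by a hash of the model name and the normalized prompt. This is a sketch under assumptions: `ResponseCache` and the `call_model` callback are hypothetical, and the callback stands in for whatever function actually calls the AI provider.

```python
import hashlib
from typing import Callable, Dict


class ResponseCache:
    """In-memory cache keyed by a hash of model name and normalized prompt."""

    def __init__(self) -> None:
        self._store: Dict[str, str] = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(model: str, prompt: str) -> str:
        # Collapse whitespace and case so trivially different prompts share a key.
        normalized = " ".join(prompt.lower().split())
        return hashlib.sha256(f"{model}\n{normalized}".encode("utf-8")).hexdigest()

    def get_or_call(self, model: str, prompt: str,
                    call_model: Callable[[str], str]) -> str:
        key = self._key(model, prompt)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        # Cache miss: pay for one model call and remember the answer.
        self.misses += 1
        result = call_model(prompt)
        self._store[key] = result
        return result
```

Caching of this kind only makes sense for queries that are not patient-specific, such as general reference questions. Dictation-derived SOAP notes should not be served from a cache.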
Usage Monitoring
Set Monthly Budget:
Accuracy Optimization
Prompt Engineering
- Be Specific
- Provide Examples
- Set Constraints
- Request Structured Output
❌ Vague Prompt:
✅ Specific Prompt:
Model Selection
- gpt-4o-mini (Recommended)
- gpt-4-turbo
- gpt-3.5-turbo
Use For:
- Routine SOAP notes
- Clinical insights
- Billing extraction
Benefits:
- 70% cheaper than GPT-4
- Faster (8-12s vs 15-20s)
- Good structured output
Temperature Settings
Lower temperature = more consistent, factual, but potentially repetitive
Higher temperature = more creative, varied, but potentially inconsistent
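The effect of temperature can be illustrated with the standard temperature-scaled softmax, which is the textbook formulation and not a description of any provider’s internals. The logits below are made-up values for three candidate tokens.

```python
import math
from typing import List


def softmax_with_temperature(logits: List[float], temperature: float) -> List[float]:
    """Convert raw scores to probabilities, sharpened or flattened by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up scores for three candidate next tokens.
logits = [2.0, 1.0, 0.5]
low = softmax_with_temperature(logits, 0.2)   # mass concentrates on the top token
high = softmax_with_temperature(logits, 1.5)  # mass spreads across all tokens
```

At the low temperature nearly all probability sits on the highest-scoring token, which is why output becomes more consistent. At the high temperature the alternatives are sampled more often, which is why output becomes more varied.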
Quality Assurance
Regular Audits
Weekly Spot Checks
Sample: 10 random AI-generated SOAP notes
Check For:
- Hallucinated vitals (numbers not in dictation)
- Incorrect patient names
- Inappropriate diagnoses
- Missing information from dictation
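The first check in the list above, numbers that appear in the note but not in the dictation, can be partly automated. The sketch below is a rough heuristic, and `unsupported_numbers` is a hypothetical helper name. It assumes the transcript renders numbers as digits, and it will miss values that were dictated as words.

```python
import re
from typing import List

NUMBER = re.compile(r"\d+(?:\.\d+)?")


def unsupported_numbers(dictation: str, soap_note: str) -> List[str]:
    """Return numbers in the SOAP note that never occur in the dictation."""
    # Compare by numeric value so "101.5" and "101.50" count as the same number.
    stated = {float(n) for n in NUMBER.findall(dictation)}
    return [n for n in NUMBER.findall(soap_note) if float(n) not in stated]


flagged = unsupported_numbers(
    "Temp 101.5, heart rate 120.",
    "T 101.5 F, HR 120 bpm, RR 24",
)
```

Here `flagged` contains only `"24"`, the respiratory rate that was never dictated. A flagged number is a prompt for human review, not proof of a hallucination.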
Monthly Accuracy Review
Metrics to Track:
- Transcription WER (Word Error Rate)
- SOAP note veterinarian acceptance rate
- Clinical insight relevance score
- Emergency detection false positive rate
Targets:
- WER: < 5% (excellent)
- Acceptance: > 85% (good)
- Insight relevance: > 70% (good)
- Emergency false positives: < 3% (acceptable)
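Word Error Rate is conventionally defined as the word-level edit distance (substitutions, insertions, and deletions) between a reference transcript and the system output, divided by the number of reference words. A minimal sketch of that calculation, with simple lowercase and whitespace tokenization as an assumption:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the processed reference words and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1] / len(ref)


wer = word_error_rate(
    "rhodesian ridgeback with mild limp",
    "rosy and ridgeback with mild limp",
)
```

In this example one word is substituted and one is inserted against a five-word reference, so `wer` is 0.4 (40%), far above the 5% target. Real audits need a manually corrected reference transcript for each sampled recording.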
Error Reporting
Implement Feedback Loop:
- Hallucination: AI invented information not in dictation
- Misrecognition: Whisper transcribed word incorrectly
- Missing Information: AI omitted content from dictation
- Inappropriate Suggestion: Clinical insight not relevant/safe
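One way to capture these four categories is a small report record plus a per-category tally. The names `ErrorCategory`, `ErrorReport`, and `summarize` are hypothetical and are shown only to illustrate the shape of the data.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Iterable


class ErrorCategory(Enum):
    HALLUCINATION = "hallucination"
    MISRECOGNITION = "misrecognition"
    MISSING_INFORMATION = "missing_information"
    INAPPROPRIATE_SUGGESTION = "inappropriate_suggestion"


@dataclass(frozen=True)
class ErrorReport:
    record_id: str
    category: ErrorCategory
    note: str


def summarize(reports: Iterable[ErrorReport]) -> Dict[str, int]:
    """Count reports per category, including categories with zero reports."""
    counts = Counter(r.category for r in reports)
    return {c.value: counts.get(c, 0) for c in ErrorCategory}


summary = summarize([
    ErrorReport("rec-1", ErrorCategory.HALLUCINATION, "heart rate not dictated"),
    ErrorReport("rec-2", ErrorCategory.MISRECOGNITION, "breed name wrong"),
    ErrorReport("rec-3", ErrorCategory.HALLUCINATION, "weight invented"),
])
```

Tallying by category shows where to intervene: a high hallucination count points at the prompt, while a high misrecognition count points at recording quality or transcription.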
Training & Onboarding
Staff Training Checklist
For Veterinarians
Training Duration: 30 minutes
Topics:
- How to record high-quality dictations
- SOAP note review checklist
- When to reject AI suggestions
- How to report AI errors
- Understanding confidence scores
For Veterinary Technicians
Training Duration: 15 minutes
Topics:
- Monitoring AI call triage
- When to escalate emergency calls
- How to assist with dictation setup
- Understanding triage levels
Best Practice Documentation
Create Practice-Specific Guide:
Troubleshooting
AI Keeps Hallucinating Vitals
Symptom: SOAP notes include temperature/heart rate not mentioned in dictation
Root Cause: Prompt doesn’t emphasize “only use stated information”
Solution: Update system prompt:
Whisper Transcribes Breed Names Wrong
Symptom: “Rhodesian Ridgeback” becomes “Rosy and Ridgeback”
Solution 1: Spell breed name in dictation
Solution 2: Add to Whisper prompt parameter
Solution 3: Post-processing correction
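Post-processing correction (Solution 3) could be as simple as a lookup table of known mis-transcriptions applied to the transcript before SOAP generation. The sketch below assumes a hand-maintained table. `CORRECTIONS` and `correct_breed_names` are hypothetical names, and the single entry comes from the symptom described above.

```python
import re
from typing import Dict

# Known mis-transcriptions mapped to the intended breed name.
CORRECTIONS: Dict[str, str] = {
    "rosy and ridgeback": "Rhodesian Ridgeback",
}


def correct_breed_names(transcript: str,
                        corrections: Dict[str, str] = CORRECTIONS) -> str:
    """Replace known mis-transcribed phrases, ignoring case."""
    for wrong, right in corrections.items():
        pattern = re.compile(r"\b" + re.escape(wrong) + r"\b", re.IGNORECASE)
        # A function replacement avoids backslash handling in the new text.
        transcript = pattern.sub(lambda _match, r=right: r, transcript)
    return transcript


fixed = correct_breed_names("Patient is a Rosy and Ridgeback, 4 years old.")
```

After the call, `fixed` reads "Patient is a Rhodesian Ridgeback, 4 years old." The table only catches errors that have been seen before, so it complements spelling the name in dictation and does not replace it.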
Clinical Insights All Low Relevance
Symptom: Insights marked “Not Relevant” by vets >50% of the time
Causes:
- Insights too generic (“Consider blood work”)
- Not species-specific
- Suggests tests practice doesn’t offer
Luna AI Books Wrong Appointments
Symptom: Appointments scheduled at unavailable times
Cause: check_availability function returning incorrect data
Debug:
- Test function URL directly:
  curl "https://your-api.com/api/appointments/available?date=2026-03-15"
- Verify response format matches Retell expectations
- Check database query for off-by-one errors (timezone issues)
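The timezone off-by-one in the last debug step typically arises when slots are stored in UTC but filtered by the UTC calendar date instead of the clinic’s local day. The sketch below illustrates the bug and one fix. The fixed UTC-5 offset and the helper `clinic_day_bounds_utc` are assumptions for this example, and a real deployment would more likely use a named zone via `zoneinfo` to handle daylight saving time.

```python
from datetime import date, datetime, timedelta, timezone
from typing import Tuple

# Assumed clinic offset for illustration only (no daylight saving handling).
CLINIC_TZ = timezone(timedelta(hours=-5))


def clinic_day_bounds_utc(day: date, tz: timezone = CLINIC_TZ) -> Tuple[datetime, datetime]:
    """Return the UTC instants at which the clinic's local day starts and ends."""
    start_local = datetime(day.year, day.month, day.day, tzinfo=tz)
    end_local = start_local + timedelta(days=1)
    return start_local.astimezone(timezone.utc), end_local.astimezone(timezone.utc)


# A slot at 10:30 PM clinic time on March 15, stored in UTC.
slot_utc = datetime(2026, 3, 16, 3, 30, tzinfo=timezone.utc)
requested = date(2026, 3, 15)

# Buggy filter: compares the UTC calendar date, so the slot looks like March 16.
naive_match = slot_utc.date() == requested

# Correct filter: half-open range covering the clinic's local day in UTC.
start_utc, end_utc = clinic_day_bounds_utc(requested)
correct_match = start_utc <= slot_utc < end_utc
```

Here `naive_match` is `False` while `correct_match` is `True`, so a query written the naive way would report the wrong availability for late-evening slots.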
Summary Checklist
Next Steps
- SOAP Generation: Implement AI dictation workflow
- Clinical Insights: Configure diagnosis suggestions
- Voice Assistant: Set up Luna AI phone system
- Overview: Return to AI & ML overview