AI Agents Overview

Introduction

MedMitra’s AI system employs a multi-agent architecture powered by LangGraph to process complex medical cases. The system orchestrates specialized agents that work together to analyze patient data, extract insights from medical documents, and generate comprehensive clinical assessments.

Architecture Overview

The AI system consists of two primary specialized agents:

Medical Insights Agent

Analyzes lab reports, generates SOAP notes, and provides diagnostic insights

Vision Agent

Processes radiology images using vision models to extract medical findings

System Components

1. Agent Orchestration Layer

The agentic_process function in backend/agentic.py serves as the main orchestrator:

backend/agentic.py

async def agentic_process(
    case_id: str, 
    user_id: str,
    patient_name: str,
    patient_age: int,
    patient_gender: str,
    case_summary: Optional[str] = None,
    lab_files: Optional[List[Dict[str, Any]]] = None,
    radiology_files: Optional[List[Dict[str, Any]]] = None
):

2. State Management

The system uses TypedDict state models to track processing through the pipeline. The primary state container is MedicalAnalysisState:

backend/models/state_models.py

class MedicalAnalysisState(TypedDict):
    # Input data
    case_input: CaseInput
    
    # Processed documents
    processed_lab_docs: List[LabDocument]
    processed_radiology_docs: List[RadiologyDocument]
    
    # Analysis results
    case_summary: Optional[CaseSummary]
    soap_note: Optional[SOAPNote]
    primary_diagnosis: Optional[Diagnosis]
    
    # Final output
    medical_insights: Optional[MedicalInsights]
    
    # Processing metadata
    processing_errors: List[str]
    processing_stage: str
    confidence_scores: Dict[str, float]

3. Data Flow

Processing Pipeline

The complete case analysis follows this sequence:

File Upload & Categorization

Files are uploaded and categorized as either lab or radiology documents

Document Processing

Lab files: Extracted using PDF parser to text
Radiology files: Analyzed by Vision Agent for visual findings

Medical Analysis Workflow

The Medical Insights Agent processes documents through a LangGraph workflow:

Process lab documents
Process radiology documents
Generate case summary
Generate SOAP note
Generate primary diagnosis
Compile insights

Result Storage

Final insights are saved to Supabase with confidence scores

Key Features

Parallel Processing

The system can process multiple documents concurrently:

backend/agentic.py

if lab_files:
    for lab_file in lab_files:
        # Process each lab file
        result = await process_pdf_async(temp_file_path)
        
if radiology_files:
    # Process all radiology files for the case
    result = await vision_agent(case_id)

Confidence Scoring

Every analysis step generates a confidence score (0.0-1.0) that reflects the model’s certainty:

backend/agents/medical_ai_agent.py

confidence_scores = [
    state["case_summary"].confidence_score,
    state["soap_note"].confidence_score,
    state["primary_diagnosis"].confidence_score
]
overall_confidence = sum(confidence_scores) / len(confidence_scores)

Error Handling

The system tracks errors throughout the pipeline and updates case status accordingly:

try:
    medical_insights = await medical_agent.process(case_input)
    await supabase.update_case_status(case_id=case_id, status="completed")
except Exception as e:
    logger.error(f"Error in AI insights generation: {e}")
    await supabase.update_case_status(case_id=case_id, status="failed")

LangGraph Integration

MedMitra uses LangGraph to define the workflow as a directed graph. This provides:

Stateful processing: Each node updates the shared state
Clear dependencies: Edges define the execution order
Parallel execution: Independent nodes can run concurrently
Error recovery: Failed nodes can be retried or alternative paths taken

See the Workflow documentation for detailed graph structure.

Models Used

Agent	Model	Purpose
Medical Insights Agent	`llama-3.3-70b-versatile`	Text analysis, diagnosis generation
Vision Agent	`meta-llama/llama-4-scout-17b-16e-instruct`	Medical image analysis

Next Steps

Medical Insights Agent

Deep dive into the text analysis agent

Vision Agent

Learn about medical image processing

Workflow Details

Understand the complete processing pipeline

API Reference

Integrate with the case creation API

Overview

Getting Started

Core Features

User Guide

AI Agents

AI Agents Overview

Introduction

Architecture Overview

Medical Insights Agent

Vision Agent

System Components

1. Agent Orchestration Layer

2. State Management

3. Data Flow

Processing Pipeline

Key Features

Parallel Processing

Confidence Scoring

Error Handling

LangGraph Integration

Models Used

Next Steps

Medical Insights Agent

Vision Agent

Workflow Details

API Reference

Build docs developers (and LLMs) love

Overview

Getting Started

Core Features

User Guide

AI Agents

​Introduction

​Architecture Overview

Medical Insights Agent

Vision Agent

​System Components

​1. Agent Orchestration Layer

​2. State Management

​3. Data Flow

​Processing Pipeline

​Key Features

​Parallel Processing

​Confidence Scoring

​Error Handling

​LangGraph Integration

​Models Used

​Next Steps

Medical Insights Agent

Vision Agent

Workflow Details

API Reference

Build docs developers (and LLMs) love

Introduction

Architecture Overview

System Components

1. Agent Orchestration Layer

2. State Management

3. Data Flow

Processing Pipeline

Key Features

Parallel Processing

Confidence Scoring

Error Handling

LangGraph Integration

Models Used

Next Steps