Introduction

After extracting and normalizing code metadata, DocuGen AI leverages Google’s Gemini API to generate human-readable documentation. The AI acts as a Technical Writer, interpreting metadata and creating clear, professional documentation.
The AI generation layer is implemented in docugen/api/gemini_client.py using the google-genai Python SDK.

The GeminiClient Class

The GeminiClient class (gemini_client.py:23) encapsulates all AI interactions:
class GeminiClient:
    def __init__(
        self,
        api_key: str,
        model: str = DEFAULT_MODEL,
        system_prompt: str = SYSTEM_PROMPT,
        client: Any | None = None,
    ) -> None:
        if not api_key or not api_key.strip():
            raise ValueError("GEMINI_API_KEY is required.")
        
        self.model = model
        self.system_prompt = system_prompt
        self.client = client or self._build_client(api_key.strip())

Configuration

  • API Key: Required authentication for Gemini API
  • Model: Defaults to gemini-3.1-flash-lite-preview (gemini_client.py:11)
  • System Prompt: Defines the AI’s role and behavior
  • Client: Auto-initialized genai.Client instance
You can inject a custom client for testing or use a different Gemini model by passing the model parameter.
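Injecting a client makes this concrete. `FakeGeminiClient` below is a hypothetical test helper, not part of DocuGen; it only needs to expose the `models.generate_content` call that `generate_markdown` uses, so tests run without a network connection or API key:

```python
from types import SimpleNamespace

class FakeGeminiClient:
    """Stand-in for genai.Client that returns canned Markdown (test helper)."""
    def __init__(self, canned_text: str = "# Docs\n\nGenerated.") -> None:
        # Mimic the client.models.generate_content(...) call surface.
        self.models = SimpleNamespace(
            generate_content=lambda model, contents, config: SimpleNamespace(
                text=canned_text
            )
        )

# Intended usage with the real class (no network call is made):
#   client = GeminiClient(api_key="test-key", client=FakeGeminiClient())
#   client.generate_markdown({"summary": {}})  # returns the canned text
fake = FakeGeminiClient()
response = fake.models.generate_content(model="m", contents="c", config={})
print(response.text)
```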

The System Prompt

The system prompt (gemini_client.py:12-20) is critical—it defines how the AI behaves:
SYSTEM_PROMPT = (
    "Actua como un Technical Writer Senior experto en documentacion de software. "
    "Tu tarea es analizar los metadatos de codigo (clases, funciones, docstrings) que te proporcionare.\n"
    "1. Genera una descripcion tecnica de alto nivel pero facil de entender.\n"
    "2. Crea ejemplos de uso (Usage Examples) basados en las firmas de las funciones.\n"
    "3. Manten un tono profesional, conciso y utiliza formato Markdown limpio.\n"
    "4. Si el codigo carece de docstrings, infiere la utilidad basandote en el nombre de las variables y la logica.\n"
    "5. Responde unicamente con el contenido del Markdown, sin introducciones ni despedidas."
)

What the Prompt Achieves

Each instruction in the prompt serves a specific purpose:

  • "Technical Writer Senior": sets the expertise level and writing style
  • "Genera descripción técnica" (generate a technical description): ensures a high-level overview is included
  • "Crea ejemplos de uso" (create usage examples): forces the AI to generate usage examples
  • "Tono profesional, conciso" (professional, concise tone): maintains consistency and brevity
  • "Infiere la utilidad" (infer the purpose): handles codebases without docstrings
  • "Markdown limpio" (clean Markdown): ensures the output is properly formatted
  • "Sin introducciones ni despedidas" (no introductions or sign-offs): removes conversational fluff
The system prompt is in Spanish in this implementation. You can customize it by passing a different system_prompt parameter during initialization.

Metadata Preparation

Before sending data to the AI, the _build_content() method (gemini_client.py:44-56) formats the metadata:
def _build_content(self, project_metadata: dict[str, Any], user_prompt: str | None = None) -> str:
    metadata_json = json.dumps(project_metadata, ensure_ascii=False, indent=2)
    sections = [
        "Project metadata (JSON):",
        "```json",
        metadata_json,
        "```",
    ]
    
    if user_prompt and user_prompt.strip():
        sections.extend(["", f"Additional instruction: {user_prompt.strip()}"])
    
    return "\n".join(sections)

What Gets Sent to Gemini

  1. JSON-formatted metadata from the normalization phase:
    {
      "summary": {
        "file_count": 5,
        "class_count": 3,
        "function_count": 12,
        "method_count": 8
      },
      "files": [
        {
          "path": "docugen/core/parser.py",
          "classes": [...],
          "functions": [...],
          "metrics": {...}
        }
      ]
    }
    
  2. Optional user prompt for custom instructions:
    Additional instruction: Focus on the AST parsing logic.
    
The metadata is serialized with ensure_ascii=False to preserve Unicode characters (useful for non-English codebases) and indent=2 for readability.
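The effect of `ensure_ascii` is easy to see in isolation (the sample metadata below is illustrative):

```python
import json

# Metadata from a non-English codebase, with accented characters.
meta = {"descripcion": "Análisis de código"}

escaped = json.dumps(meta)                        # default: non-ASCII escaped
preserved = json.dumps(meta, ensure_ascii=False)  # Unicode kept verbatim

print(escaped)
print(preserved)
```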

Making the API Call

The generate_markdown() method (gemini_client.py:77-89) orchestrates the entire generation process:
def generate_markdown(self, project_metadata: dict[str, Any], user_prompt: str | None = None) -> str:
    content = self._build_content(project_metadata, user_prompt=user_prompt)
    
    try:
        response = self.client.models.generate_content(
            model=self.model,
            contents=content,
            config={"system_instruction": self.system_prompt},
        )
    except Exception as exc:
        raise RuntimeError(f"Gemini API call failed: {exc}") from exc
    
    return self._extract_text(response)

Request Configuration

  • model: The Gemini model ID (e.g., gemini-3.1-flash-lite-preview)
  • contents: The formatted metadata + user prompt
  • config.system_instruction: The Technical Writer system prompt
The Gemini Flash Lite model is optimized for speed and cost-efficiency while maintaining high-quality output for documentation tasks.

Response Extraction

Gemini API responses can vary in structure. The _extract_text() method (gemini_client.py:58-75) handles multiple response formats:
@staticmethod
def _extract_text(response: Any) -> str:
    # Try direct text attribute
    direct_text = getattr(response, "text", None)
    if isinstance(direct_text, str) and direct_text.strip():
        return direct_text.strip()
    
    # Fallback: iterate through candidates
    for candidate in getattr(response, "candidates", []) or []:
        content = getattr(candidate, "content", None)
        parts = getattr(content, "parts", None) if content is not None else None
        if not parts:
            continue
        
        text_parts = [getattr(part, "text", "") for part in parts if getattr(part, "text", None)]
        joined = "".join(text_parts).strip()
        if joined:
            return joined
    
    raise RuntimeError("Gemini response did not include text content.")

Why This Complexity?

  1. Direct access: Some models return response.text directly
  2. Structured responses: Others return response.candidates[0].content.parts[0].text
  3. Multi-part responses: Text may be split across multiple parts
The method uses getattr() with fallbacks to avoid AttributeError exceptions when the response structure differs from expectations.
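Both shapes can be exercised with lightweight stand-ins. The sketch below is a standalone copy of the same fallback logic, using `types.SimpleNamespace` to mimic the two response structures:

```python
from types import SimpleNamespace

def extract_text(response):
    # Mirrors GeminiClient._extract_text: direct text first, then candidates.
    direct = getattr(response, "text", None)
    if isinstance(direct, str) and direct.strip():
        return direct.strip()
    for candidate in getattr(response, "candidates", []) or []:
        content = getattr(candidate, "content", None)
        parts = getattr(content, "parts", None) if content is not None else None
        if not parts:
            continue
        joined = "".join(p.text for p in parts if getattr(p, "text", None)).strip()
        if joined:
            return joined
    raise RuntimeError("Gemini response did not include text content.")

# A flat response and a nested multi-part response:
flat = SimpleNamespace(text="# Docs")
nested = SimpleNamespace(
    text=None,
    candidates=[
        SimpleNamespace(content=SimpleNamespace(
            parts=[SimpleNamespace(text="# Do"), SimpleNamespace(text="cs")],
        ))
    ],
)
print(extract_text(flat))    # "# Docs"
print(extract_text(nested))  # "# Docs"
```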

Error Handling

API Call Failures

All exceptions during the API call are caught and wrapped with context (gemini_client.py:86-87):
except Exception as exc:
    raise RuntimeError(f"Gemini API call failed: {exc}") from exc
Common errors:
  • Invalid API key
  • Network timeouts
  • Rate limiting
  • Model unavailability
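Because the wrapper uses `raise ... from exc`, the original exception stays reachable through `__cause__`, so callers can log the root cause. A minimal sketch, with a hypothetical `flaky_call` standing in for the API call:

```python
def flaky_call():
    # Stand-in for client.models.generate_content hitting a network timeout.
    raise TimeoutError("connection timed out")

try:
    try:
        flaky_call()
    except Exception as exc:
        # Same wrapping pattern as generate_markdown.
        raise RuntimeError(f"Gemini API call failed: {exc}") from exc
except RuntimeError as err:
    wrapped = err

print(wrapped)                  # Gemini API call failed: connection timed out
print(type(wrapped.__cause__))  # <class 'TimeoutError'>
```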

Empty Responses

If the response doesn’t contain text, an error is raised (gemini_client.py:75):
raise RuntimeError("Gemini response did not include text content.")
When debugging API issues, check that your API key is valid and that you haven’t exceeded Gemini’s rate limits.

Client Initialization

The _build_client() static method (gemini_client.py:38-42) handles SDK initialization:
@staticmethod
def _build_client(api_key: str) -> Any:
    if genai is None:
        raise RuntimeError("google-genai is not installed. Install dependencies first.")
    return genai.Client(api_key=api_key)

Dependency Check

The module attempts to import google.genai at the top (gemini_client.py:6-9):
try:
    from google import genai
except ImportError:
    genai = None  # type: ignore[assignment]
This allows the module to be imported even if google-genai isn’t installed (useful for testing or alternative backends).

Integration with Templates

After generation, the AI-produced Markdown is combined with project metadata using the TemplateEngine (templates/engine.py:27):
def render_readme(ai_content: str, project_name: str, summary: dict[str, Any] | None = None) -> str:
    engine = TemplateEngine()
    return engine.render(
        DEFAULT_TEMPLATE,
        {
            "project_name": project_name,
            "ai_content": ai_content.strip(),
            "summary": summary or {},
        },
    )
This creates the final documentation file with:
  • Project name as the title
  • AI-generated content as the main body
  • Summary statistics (file count, class count, etc.)
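As a rough sketch of what this step produces, a `str.format` stand-in is shown below; the template text is illustrative only, not the actual DEFAULT_TEMPLATE or TemplateEngine from templates/engine.py:

```python
# Illustrative template: title, AI body, then summary stats.
TEMPLATE = "# {project_name}\n\n{ai_content}\n\n---\nFiles: {file_count}"

def render_readme_sketch(ai_content, project_name, summary=None):
    # Mirrors the render_readme signature; fills the template fields.
    summary = summary or {}
    return TEMPLATE.format(
        project_name=project_name,
        ai_content=ai_content.strip(),
        file_count=summary.get("file_count", 0),
    )

doc = render_readme_sketch("## Overview\n...", "MyProject", {"file_count": 5})
print(doc)
```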

Example Workflow

Here’s how the pieces fit together:
# 1. Initialize client
client = GeminiClient(api_key="your-key")

# 2. Prepare metadata (from previous phases)
metadata = prepare_for_ai(parsed_files)

# 3. Generate documentation
ai_markdown = client.generate_markdown(
    project_metadata=metadata,
    user_prompt="Focus on the public API"
)

# 4. Render final output
final_docs = render_readme(
    ai_content=ai_markdown,
    project_name="MyProject",
    summary=metadata["summary"]
)

Customization Options

Use a Different Model

client = GeminiClient(
    api_key="your-key",
    model="gemini-4.5-pro"  # More powerful model
)

Custom System Prompt

custom_prompt = """
You are a documentation expert. Generate API reference documentation
with a focus on code examples and best practices. Use TypeScript-style
type annotations when possible.
"""

client = GeminiClient(
    api_key="your-key",
    system_prompt=custom_prompt
)

Provide User Instructions

ai_markdown = client.generate_markdown(
    project_metadata=metadata,
    user_prompt="Include detailed examples for the GeminiClient class"
)
User prompts are appended to the metadata, allowing you to guide the AI’s focus without changing the system prompt.

Best Practices

  1. API Key Security: Store API keys in environment variables, never in code
  2. Error Handling: Wrap API calls in try/except blocks to handle network issues gracefully
  3. Rate Limiting: Implement retry logic with exponential backoff for production use
  4. Prompt Tuning: Experiment with system prompts to match your documentation style
  5. Model Selection: Use Flash models for speed, Pro models for complex codebases
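A minimal sketch of retry with exponential backoff: `with_retries` is not part of DocuGen, and it assumes transient failures surface as the RuntimeError raised by generate_markdown:

```python
import time

def with_retries(call, attempts=3, base_delay=0.5):
    """Retry `call` with exponential backoff; re-raise after the final attempt."""
    for attempt in range(attempts):
        try:
            return call()
        except RuntimeError:
            if attempt == attempts - 1:
                raise
            # Delay doubles each attempt: base, 2*base, 4*base, ...
            time.sleep(base_delay * (2 ** attempt))

# Usage (client and metadata as in the workflow above):
#   markdown = with_retries(lambda: client.generate_markdown(metadata))
```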

Next Steps

Architecture Overview

See how AI generation fits into the complete pipeline

CLI Reference

Learn how to use the CLI to generate documentation
