Overview

The OpenAI Research Agent is a sophisticated multi-agent application built with OpenAI’s Agents SDK and Streamlit. It leverages specialized AI agents working together to conduct comprehensive research on any topic and generate detailed, well-structured reports with source citations.

Features

Multi-Agent System

Three specialized agents work in coordination:
  • Triage Agent: Plans research strategy
  • Research Agent: Searches and collects information
  • Editor Agent: Compiles professional reports

Intelligent Research

  • Automated web search and fact collection
  • Source attribution for all information
  • Real-time research progress tracking
  • Structured data extraction

Professional Reports

  • Comprehensive 1000+ word reports
  • Structured outlines and sections
  • Markdown formatting
  • Downloadable format

Advanced Monitoring

  • Integrated tracing for entire workflow
  • Real-time fact collection display
  • Research process visualization
  • Error handling and fallbacks

Architecture

The system uses a coordinated multi-agent approach:

Agent Roles

Triage Agent
Responsibility: Research planning and coordination
  • Analyzes the user’s research topic
  • Creates a structured research plan with:
    • Clear topic statement
    • 3-5 specific search queries
    • 3-5 key focus areas
  • Coordinates handoffs between agents
  • Ensures workflow completion
Model: GPT-4o-mini

Research Agent
Responsibility: Web search and fact collection
  • Runs the planned search queries with the web search tool
  • Saves important facts along with their sources as it works
Model: GPT-4o-mini

Editor Agent
Responsibility: Report compilation
  • Creates a structured outline
  • Writes the final long-form markdown report with source citations
Model: GPT-4o-mini

Setup

1. Clone the Repository

git clone https://github.com/Shubhamsaboo/awesome-llm-apps.git
cd awesome-llm-apps/starter_ai_agents/openai_research_agent
2. Install Dependencies

pip install -r requirements.txt

Required packages:
  • openai-agents - OpenAI Agents SDK
  • openai - OpenAI API client
  • streamlit - Web interface
  • pydantic - Data validation
  • python-dotenv - Environment management

(uuid and asyncio are part of the Python standard library and do not need to be installed.)
3. Configure API Key

Set your OpenAI API key as an environment variable:

export OPENAI_API_KEY='your-api-key-here'

Or create a .env file:

OPENAI_API_KEY=your-api-key-here

Get your API key from the OpenAI Platform.
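The app reads the key from the environment at startup. A minimal fail-fast check (a sketch using only the standard library; the helper name is illustrative, not part of the app) looks like:

```python
import os

def require_api_key(var: str = "OPENAI_API_KEY") -> str:
    """Return the API key from the environment, failing fast with a clear error."""
    key = os.environ.get(var)
    if not key:
        raise RuntimeError(
            f"{var} is not set. Export it in your shell or add it to a .env file."
        )
    return key
```

If you use a .env file, call load_dotenv() from python-dotenv before this check so the variable is populated.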
4. Run the Application

streamlit run research_agent.py

Navigate to http://localhost:8501 in your browser.

Usage

Conducting Research

1. Enter Topic

Type your research topic in the sidebar or select one of the example topics.
2. Start Research

Click “Start Research” to begin the multi-agent workflow.
3. Monitor Progress

Watch real-time updates in the “Research Process” tab:
  • Research plan creation
  • Fact collection as it happens
  • Editor report compilation
4. View Report

Switch to the “Report” tab to see the final comprehensive report with:
  • Full markdown-formatted content
  • Structured outline
  • Source citations
  • Download option
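The download option serializes the structured report back into a single markdown document. One possible implementation (a sketch; the function name and layout are assumptions, built from the ResearchReport fields shown in the agent configuration below) is:

```python
def report_to_markdown(title: str, outline: list[str], report: str, sources: list[str]) -> str:
    """Flatten a structured research report into one markdown document for download."""
    lines = [f"# {title}", "", "## Outline"]
    lines += [f"- {item}" for item in outline]
    lines += ["", report, "", "## Sources"]
    lines += [f"- {src}" for src in sources]
    return "\n".join(lines)
```

The resulting string can then be handed to Streamlit’s st.download_button with a file_name such as "report.md".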

Code Example

Custom Fact Collection Tool

from agents import function_tool
import streamlit as st
from datetime import datetime
from typing import Optional

@function_tool
def save_important_fact(fact: str, source: Optional[str] = None) -> str:
    """Save an important fact discovered during research.
    
    Args:
        fact: The important fact to save
        source: Optional source of the fact
    
    Returns:
        Confirmation message
    """
    if "collected_facts" not in st.session_state:
        st.session_state.collected_facts = []
    
    st.session_state.collected_facts.append({
        "fact": fact,
        "source": source or "Not specified",
        "timestamp": datetime.now().strftime("%H:%M:%S")
    })
    
    return f"Fact saved: {fact}"

Agent Configuration

from agents import Agent, Runner, WebSearchTool, handoff, trace
from pydantic import BaseModel

# Define data models
class ResearchPlan(BaseModel):
    topic: str
    search_queries: list[str]
    focus_areas: list[str]

class ResearchReport(BaseModel):
    title: str
    outline: list[str]
    report: str
    sources: list[str]
    word_count: int

# Research Agent
research_agent = Agent(
    name="Research Agent",
    instructions="""
    You are a research assistant. Given a search term, you search 
    the web and produce a concise summary of results. The summary 
    must be 2-3 paragraphs and less than 300 words.
    """,
    model="gpt-4o-mini",
    tools=[WebSearchTool(), save_important_fact],
)

# Editor Agent
editor_agent = Agent(
    name="Editor Agent",
    handoff_description="A senior researcher who writes comprehensive reports",
    instructions="""
    You are a senior researcher tasked with writing a cohesive report. 
    Create an outline first, then generate a lengthy and detailed report 
    in markdown format. Aim for 5-10 pages, at least 1000 words.
    """,
    model="gpt-4o-mini",
    output_type=ResearchReport,
)

# Triage Agent (Coordinator)
triage_agent = Agent(
    name="Triage Agent",
    instructions="""
    You are the coordinator. Your job is to:
    1. Understand the user's research topic
    2. Create a research plan (topic, search_queries, focus_areas)
    3. Hand off to the Research Agent
    4. After research, hand off to the Editor Agent
    """,
    handoffs=[handoff(research_agent), handoff(editor_agent)],
    model="gpt-4o-mini",
    output_type=ResearchPlan,
)

Running the Research Workflow

import asyncio
import uuid

async def run_research(topic):
    # Create a unique conversation ID for tracing
    conversation_id = uuid.uuid4().hex[:16]
    
    # Trace the entire workflow
    with trace("News Research", group_id=conversation_id):
        # Start with triage agent
        triage_result = await Runner.run(
            triage_agent,
            f"Research this topic thoroughly: {topic}"
        )
        
        # Get the research plan
        research_plan = triage_result.final_output
        
        # Editor agent compiles the report
        report_result = await Runner.run(
            editor_agent,
            triage_result.to_input_list()
        )
        
        return report_result.final_output

# Run the async function
report = asyncio.run(run_research("AI developments in 2024"))

Example Topics

The app includes pre-configured example topics:

  • “What are the best cruise lines in USA for first-time travelers who have never been on a cruise?”
    Expected output: Comparison of cruise lines, pricing, routes, amenities, and first-timer tips
  • “What are the best affordable espresso machines for someone upgrading from a French press?”
    Expected output: Machine comparisons, price ranges, features, and upgrade recommendations
  • “What are the best off-the-beaten-path destinations in India for a first-time solo traveler?”
    Expected output: Hidden gems, safety tips, cultural insights, and travel logistics

Streamlit Interface Features

Two-Tab Layout

The interface is split into a “Research Process” tab and a “Report” tab. The Research Process tab provides real-time monitoring of the workflow:
  • Research plan display
  • Live fact collection with sources
  • Agent status updates
  • Progress indicators
  • Report preview snippet

State Management

import streamlit as st

# Initialize session state
if "conversation_id" not in st.session_state:
    st.session_state.conversation_id = uuid.uuid4().hex[:16]
if "collected_facts" not in st.session_state:
    st.session_state.collected_facts = []
if "research_done" not in st.session_state:
    st.session_state.research_done = False
if "report_result" not in st.session_state:
    st.session_state.report_result = None

Advanced Features

Tracing and Monitoring

The app includes integrated tracing for the entire workflow:

from agents import trace

with trace("News Research", group_id=conversation_id):
    # All agent operations are traced
    result = await Runner.run(agent, message)

Benefits:
  • Debug agent interactions
  • Monitor performance
  • Track token usage
  • Identify bottlenecks

Error Handling

Robust error handling with fallbacks:

try:
    report_result = await Runner.run(editor_agent, triage_result.to_input_list())
    st.session_state.report_result = report_result.final_output
except Exception as e:
    st.error(f"Error generating report: {str(e)}")
    # Fallback to display raw agent response
    if hasattr(triage_result, 'new_items'):
        messages = [item for item in triage_result.new_items]
        raw_content = "\n\n".join([str(m.content) for m in messages])
        st.session_state.report_result = raw_content

Use Cases

Market Research

Research competitors, market trends, and industry analysis

Academic Research

Gather information for papers, literature reviews, and studies

Product Research

Compare products, read reviews, and make informed purchase decisions

Travel Planning

Research destinations, accommodations, and travel tips

Performance Considerations

API Costs: Each research session uses multiple API calls. The Research Agent makes several web searches, and the Editor Agent generates long-form content. Monitor your OpenAI usage.
Processing Time: Complex topics may take 30-60 seconds to research and compile. The app provides real-time progress updates during this time.
Best Results: For more comprehensive reports, use specific topics. Instead of “AI”, try “Recent developments in AI safety and alignment research”.

Troubleshooting

API key issues

Ensure your OPENAI_API_KEY environment variable is set:

echo $OPENAI_API_KEY

If empty, set it in your .env file or export it in your shell.

Async errors

If you see asyncio-related errors, ensure you’re using Python 3.7+. The app uses asyncio.run(), which requires modern Python.

Empty or incomplete reports

If reports are empty or incomplete, check that:
  • The topic is specific enough
  • Your internet connection is stable
  • The OpenAI API is operational
  • You are not hitting rate limits
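Transient failures such as rate limits can often be resolved by retrying with exponential backoff. A generic wrapper (a sketch, not part of the app’s code) could look like:

```python
import time

def with_retries(fn, attempts: int = 3, base_delay: float = 1.0):
    """Call fn(), retrying with exponential backoff on any exception."""
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of retries, surface the error
            time.sleep(base_delay * (2 ** attempt))
```

For example, a synchronous entry point like asyncio.run(run_research(topic)) could be wrapped in a lambda and passed to this helper.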

Next Steps

Customize Agents

Modify agent instructions to specialize in specific research domains

Add More Tools

Integrate additional tools like academic databases or specialized APIs

Explore Examples

Check out more AI agent examples

GitHub Repository

View the complete source code
