What is AI research and benchmarking?

The ChatGPT Scraper API provides advanced features for researchers, data scientists, and developers who need to analyze ChatGPT’s behavior, response patterns, and internal query mechanisms. With access to raw response data and query fan-out insights, you can conduct deep analysis of how ChatGPT processes prompts and generates responses.

Why use the API for AI research?

Understanding how ChatGPT works internally is crucial for:
  • Model benchmarking: Compare ChatGPT’s performance across different prompts and topics
  • Response analysis: Study how ChatGPT structures and generates its answers
  • Query optimization: Learn how to craft better prompts based on internal query patterns
  • Academic research: Collect structured data for AI behavior studies and publications
  • Quality assurance: Test and validate ChatGPT’s responses for production applications
Advanced research features like raw response access and query fan-out insights require additional credits but provide invaluable data for serious research and development work.

Key features for research

Raw response access

Access the complete streaming response payload for advanced debugging, timing analysis, and low-level metadata

Query fan-out insights

See the actual search queries ChatGPT uses internally to gather information for its responses

Structured output

Retrieve responses as parsed JSON with consistent schema for easy analysis and storage

Multiple formats

Access responses in plain text, Markdown, or raw HTML for different research needs

Query fan-out insights

When you enable include.searchQueries (+2 credits), the API reveals how ChatGPT interprets and breaks down your prompt internally.

What you learn from query fan-out:

  • Prompt interpretation: How ChatGPT understands and decomposes your question
  • Search strategy: What specific searches it performs to gather information
  • Information sources: Which topics and queries it prioritizes
  • Optimization opportunities: How to refine prompts for better results
Analyze query fan-out patterns across multiple prompts to develop a systematic understanding of how ChatGPT processes different types of questions.

Example query fan-out analysis

import requests

payload = {
    'prompt': 'Compare the top 3 programming languages for web development in 2025',
    'country': 'US',
    'include': {
        'searchQueries': True
    }
}

response = requests.post(
    'https://api.cloro.dev/v1/monitor/chatgpt',
    headers={'Authorization': 'Bearer YOUR_API_KEY'},
    json=payload
)

data = response.json()

# Analyze the queries ChatGPT used internally
if 'searchQueries' in data['result']:
    print("ChatGPT used these queries:")
    for query in data['result']['searchQueries']:
        print(f"  - {query}")
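To study fan-out patterns across multiple prompts, you can aggregate the `searchQueries` lists from several results. Below is a minimal sketch; the `summarize_fanout` helper and the mock results are illustrative, not part of the API:

```python
from collections import Counter

def summarize_fanout(results):
    """Aggregate internal search queries across multiple API results.

    Expects each item to look like {'result': {'searchQueries': [...]}},
    matching the response shape used in the example above.
    """
    all_queries = []
    for data in results:
        all_queries.extend(data.get('result', {}).get('searchQueries', []))

    # Count recurring terms to spot which topics ChatGPT searches most often
    term_counts = Counter(
        word.lower()
        for query in all_queries
        for word in query.split()
    )
    return all_queries, term_counts

# Example with mock results (stand-ins for real API responses)
mock_results = [
    {'result': {'searchQueries': ['best web frameworks 2025', 'Python vs JavaScript 2025']}},
    {'result': {'searchQueries': ['top programming languages 2025']}},
]
queries, terms = summarize_fanout(mock_results)
print(f"Total queries: {len(queries)}")
print("Most common terms:", terms.most_common(3))
```

Running this over real results from a batch of prompts highlights which search terms ChatGPT falls back on repeatedly, which is a useful starting point for prompt-optimization studies.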

Raw response access

Enable include.rawResponse (+2 credits) to access the complete streaming response payload.

Use cases for raw response data:

  • Timing analysis: Understand response generation speed and patterns
  • Debugging: Investigate unexpected behaviors or errors
  • Metadata extraction: Access low-level response information not in structured output
  • Stream processing: Study how ChatGPT streams responses in real-time

Example raw response analysis

import requests
import json

payload = {
    'prompt': 'Explain quantum computing in simple terms',
    'include': {
        'rawResponse': True,
        'markdown': True
    }
}

response = requests.post(
    'https://api.cloro.dev/v1/monitor/chatgpt',
    headers={'Authorization': 'Bearer YOUR_API_KEY'},
    json=payload
)

data = response.json()

# Save raw response for detailed analysis
if 'rawResponse' in data['result']:
    with open('raw_response.json', 'w') as f:
        json.dump(data['result']['rawResponse'], f, indent=2)
    
    print(f"Raw response events: {len(data['result']['rawResponse'])}")

Research workflow

Step 1: Design your research study

Define your research questions and required data:
  • What aspects of ChatGPT do you want to analyze?
  • Do you need query fan-out, raw responses, or both?
  • What output format works best for your analysis pipeline?

Step 2: Collect structured data

Submit prompts with appropriate parameters:
import requests
import time

prompts = [
    "What is machine learning?",
    "Explain deep learning",
    "Compare supervised and unsupervised learning"
]

results = []

for prompt in prompts:
    payload = {
        'prompt': prompt,
        'include': {
            'markdown': True,
            'rawResponse': True,
            'searchQueries': True
        }
    }
    
    response = requests.post(
        'https://api.cloro.dev/v1/monitor/chatgpt',
        headers={'Authorization': 'Bearer YOUR_API_KEY'},
        json=payload,
        timeout=180  # 3-minute timeout
    )
    
    results.append(response.json())
    time.sleep(1)  # Rate limiting

Step 3: Analyze and benchmark

Process collected data for insights:
# Analyze response characteristics
for i, result in enumerate(results):
    data = result['result']
    
    print(f"\nPrompt {i+1}: {prompts[i]}")
    print(f"Model: {data['model']}")
    print(f"Response length: {len(data['text'])} characters")
    
    if 'searchQueries' in data:
        print(f"Search queries used: {len(data['searchQueries'])}")
        print(f"Queries: {', '.join(data['searchQueries'])}")
    
    if 'rawResponse' in data:
        print(f"Raw events: {len(data['rawResponse'])}")
When conducting large-scale research, implement rate limiting and error handling to ensure reliable data collection.
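One way to sketch that error handling is a retry wrapper with exponential backoff. This is a minimal illustration, not part of the API: `fetch_with_retries` and the `submit` callable (a stand-in for the `requests.post` call shown above) are assumptions.

```python
import time

def fetch_with_retries(submit, payload, max_retries=3, base_delay=2.0):
    """Call `submit(payload)` with exponential backoff on failure.

    `submit` stands in for the requests.post call from the examples above;
    it should raise an exception on transient errors.
    """
    for attempt in range(max_retries):
        try:
            result = submit(payload)
            if result is not None:
                return result
        except Exception as exc:
            print(f"Attempt {attempt + 1} failed: {exc}")
        # Exponential backoff: 2s, 4s, 8s, ...
        time.sleep(base_delay * (2 ** attempt))
    raise RuntimeError(f"Giving up after {max_retries} attempts")

# Usage with a flaky stand-in for the real API call
calls = {'n': 0}
def flaky_submit(payload):
    calls['n'] += 1
    if calls['n'] < 2:
        raise ConnectionError("temporary network error")
    return {'result': {'text': 'ok'}}

print(fetch_with_retries(flaky_submit, {'prompt': 'test'}, base_delay=0.01))
```

Combining a wrapper like this with the `time.sleep(1)` pacing from Step 2 keeps long collection runs resilient to transient network failures.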

Benchmarking best practices

Consistent parameters

Use the same API parameters across test runs to ensure comparable results

Multiple samples

Collect multiple responses for each prompt to account for response variability

Structured storage

Save responses in a structured format (JSON) for easy analysis and reproducibility

Timeout handling

Set appropriate timeouts (120-180 seconds) to handle complex prompts
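The "multiple samples" and "structured storage" practices can be sketched together. This is a hedged outline: `collect_samples` is a helper invented here, and `fake_response` (with its placeholder `'gpt-x'` model name) stands in for the real API call shown earlier.

```python
import json

def collect_samples(collect_response, prompts, samples_per_prompt=3):
    """Gather several responses per prompt in one JSON-friendly dict."""
    dataset = {}
    for prompt in prompts:
        # Multiple samples per prompt account for response variability
        dataset[prompt] = [collect_response(prompt) for _ in range(samples_per_prompt)]
    return dataset

# Stand-in for a real API call, so the sketch runs end to end
def fake_response(prompt):
    return {'result': {'text': f'answer to: {prompt}', 'model': 'gpt-x'}}

dataset = collect_samples(fake_response, ["What is machine learning?"], samples_per_prompt=2)

# Structured storage: one JSON file per study for reproducibility
with open('benchmark_samples.json', 'w') as f:
    json.dump(dataset, f, indent=2)

print(f"Collected {sum(len(v) for v in dataset.values())} samples")
```

Keying the dataset by prompt keeps each run self-describing, so a later analysis pass can recompute statistics without re-querying the API.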

Advanced research scenarios

Prompt optimization studies

Use query fan-out to understand how different prompt formulations affect information retrieval:
import requests

prompt_variations = [
    "Best programming languages 2025",
    "What are the top programming languages in 2025?",
    "Compare the best programming languages for 2025"
]

# Submit each variation and compare the internal search queries
for prompt in prompt_variations:
    response = requests.post(
        'https://api.cloro.dev/v1/monitor/chatgpt',
        headers={'Authorization': 'Bearer YOUR_API_KEY'},
        json={'prompt': prompt, 'include': {'searchQueries': True}}
    )
    queries = response.json()['result'].get('searchQueries', [])
    print(f"{prompt}: {queries}")

Model comparison

Track which ChatGPT model versions are used:
model_usage = {}

for result in results:
    model = result['result']['model']
    model_usage[model] = model_usage.get(model, 0) + 1

print("Model distribution:", model_usage)

Next steps

Explore other use cases for the ChatGPT Scraper API.