What is MCP?
The Model Context Protocol is a standard for connecting LLMs to external context and tools. It allows models to:- Access external data - Query databases, APIs, and file systems
- Execute actions - Perform operations beyond text generation
- Maintain context - Preserve state across multiple interactions
- Extend capabilities - Add custom functionality specific to your use case
MCP support in vLLora is based on the rmcp Rust implementation, providing both client and server functionality.
Architecture
vLLora implements MCP in two ways:1. Built-in MCP server
vLLora exposes its own MCP server that provides tools for querying traces and debugging:2. MCP client for external servers
vLLora can connect to external MCP servers and make their tools available to LLMs:Configuring MCP servers
Via Web UI
Configure MCP servers through the vLLora web interface:- Navigate to Settings → MCP Configuration
- Click Add MCP Server
- Provide server details (transport type, URL, authentication)
- Select which tools to enable
- Click Save
Via API
Configure MCP servers programmatically:Via Rust SDK
Define MCP servers in your Rust code:Transport types
vLLora supports multiple MCP transport mechanisms:- SSE (Server-Sent Events)
- HTTP
- WebSocket
Authentication
Secure MCP connections with various authentication methods:Bearer token
API key
Custom headers
Tool filtering
Control which tools are available to LLMs:Enable all tools
Specific tools only
Pattern matching
Using MCP tools in requests
Once configured, MCP tools are automatically available to LLMs:Built-in vLLora MCP tools
The vLLora MCP server provides debugging and observability tools:search_traces
Search through trace data:get_llm_call
Retrieve details of a specific LLM call:get_run_overview
Analyze a complete agent run:get_recent_stats
Monitor system health:Example MCP servers
DeepWiki
Search Wikipedia and retrieve article content:Tavily Search
Web search with AI-optimized results:File System
Read and write files:Environment variables
Pass configuration to MCP servers via environment variables:Debugging MCP connections
Check server status
View tool calls in traces
MCP tool executions appear astool spans in traces:
Enable debug logging
Best practices
Use tool filtering for security
Use tool filtering for security
Only enable the specific tools that your application needs. This reduces the attack surface and prevents unintended tool usage.
Implement authentication
Implement authentication
Always use authentication for production MCP servers. Bearer tokens and API keys provide basic protection.
Monitor tool usage
Monitor tool usage
Track tool calls via traces to understand how LLMs use external capabilities and identify potential issues.
Set environment variables for configuration
Set environment variables for configuration
Use environment variables to configure MCP servers rather than hardcoding values. This enables easier updates and environment-specific settings.
Test tools independently
Test tools independently
Verify MCP server functionality before integrating with vLLora. Use curl or the MCP client directly to test tool behavior.
Next steps
MCP configuration guide
Detailed MCP server configuration
MCP servers
Available MCP servers and tools
Tracing
Monitor MCP tool usage
API reference
Tool calling API documentation