A powerful MCP (Model Context Protocol) server that provides intelligent AI routing and multi-advisor consultations through the OpenRouter API. Get expert coding advice from 5 premium AI models with smart routing, advanced caching, rate limiting, and security features.
- 🧠 Smart Routing System - Intelligent provider selection based on task requirements
- 🎲 Random Mode - NEW! Randomly selects from all available providers for unpredictable results
- 🤖 5 Premium AI Providers - Claude Sonnet 4, OpenAI o3, xAI Grok, Google Gemini Flash, DeepSeek
- ⚡ Intelligent Caching - LRU cache with configurable TTL to reduce API costs
- 🛡️ Security First - Input validation, prompt injection detection, and rate limiting
- 📊 Comprehensive Logging - Structured logging with cache metrics and health monitoring
- ⚙️ Fully Configurable - Environment variables for all settings
- 🔄 Retry Logic - Exponential backoff with Promise.allSettled for resilient API calls
- 🔄 Circuit Breakers - Individual provider protection with automatic failover
```bash
# Run directly with npx
npx mcp-smart@1.5.3

# Or install globally
npm install -g mcp-smart@1.5.3
mcp-smart
```

```bash
# Install and test the new random routing feature
npx mcp-smart@1.5.3

# In your MCP client, try:
# model: "random" - for unpredictable AI provider selection
```
Set your OpenRouter API key and configure the server:
```bash
export OPENROUTER_API_KEY="your-api-key-here"
export MAX_RETRIES=3
export REQUEST_TIMEOUT=30000
export CACHE_TTL=300000
export MAX_TOKENS=4000
export MAX_CACHE_SIZE=100
export RATE_LIMIT_REQUESTS=10
export RATE_LIMIT_WINDOW=60000

# Circuit Breaker Configuration
export CIRCUIT_BREAKER_FAILURE_THRESHOLD=5
export CIRCUIT_BREAKER_RECOVERY_TIMEOUT=60000
export CIRCUIT_BREAKER_MONITORING_PERIOD=300000
export CIRCUIT_BREAKER_HALF_OPEN_MAX_CALLS=3
```
Variable | Default | Description |
---|---|---|
`OPENROUTER_API_KEY` | Required | Your OpenRouter API key |
`MAX_RETRIES` | 3 | Maximum retry attempts for failed requests |
`REQUEST_TIMEOUT` | 30000 | Request timeout in milliseconds |
`CACHE_TTL` | 300000 | Cache time-to-live in milliseconds (5 min) |
`MAX_TOKENS` | 4000 | Maximum tokens per API request |
`MAX_CACHE_SIZE` | 100 | Maximum number of cached responses |
`MAX_TASK_LENGTH` | 10000 | Maximum task input length |
`MAX_CONTEXT_LENGTH` | 20000 | Maximum context input length |
`RATE_LIMIT_REQUESTS` | 10 | Requests per rate limit window |
`RATE_LIMIT_WINDOW` | 60000 | Rate limit window in milliseconds (1 min) |
`CIRCUIT_BREAKER_FAILURE_THRESHOLD` | 5 | Consecutive failures before opening circuit breaker |
`CIRCUIT_BREAKER_RECOVERY_TIMEOUT` | 60000 | Time to wait before attempting recovery (ms) |
`CIRCUIT_BREAKER_MONITORING_PERIOD` | 300000 | Circuit breaker monitoring window (ms) |
`CIRCUIT_BREAKER_HALF_OPEN_MAX_CALLS` | 3 | Max calls allowed in half-open state |
Add the following to your `claude-code-config.json`:
```json
{
  "mcpServers": {
    "smart-advisor": {
      "command": "npx",
      "args": ["mcp-smart@1.5.3"],
      "env": {
        "OPENROUTER_API_KEY": "your-openrouter-api-key-here"
      }
    }
  }
}
```
To enable automatic Smart advisor access in Claude Code, add this to your `~/.claude/CLAUDE.md` file (or a local project `CLAUDE.md`). This hooks the MCP server into Claude's agent system:
```markdown
When facing uncertainty and needing advice, you have exclusive access to an L7 Google programmer named Smart. Ask him when appropriate using these terms: smart_advisor, code_review, get_advice, expert_opinion, smart_llm. You can specify what you're looking for; these are the available modes:

- auto - GPT-4o-mini intelligently selects the best provider
- intelligence - Routes to Claude Sonnet 4 (ultimate reasoning)
- premium - Routes to OpenAI o3 (high-end reasoning)
- speed - Routes to xAI Grok (fast responses)
- balance - Routes to Google Gemini Flash (cost/performance balance)
- cost - Routes to DeepSeek (budget-friendly)
- all - Multi-provider consultation (all providers)
```
This integration allows Claude Code to automatically consult the Smart advisor when encountering complex coding problems or needing expert opinions.
Provider | Intelligence | Cost | Speed | Context | Strengths |
---|---|---|---|---|---|
Claude Sonnet 4 | 🔥 Ultimate | High | Medium | Very High | Supreme reasoning, ethical coding, comprehensive solutions |
OpenAI o3 | 🏆 Highest | Very High | Slow | Medium | Complex reasoning, creativity, advanced coding |
xAI Grok | ⭐ Very High | Medium | Fast | High | Real-time data, creative thinking, fast responses |
Google Gemini Flash | ⭐ Very High | Low | Fast | Highest (2M) | Multimodal, research, long-context, speed |
DeepSeek | ✅ High | Low | Fast | Medium | Cost-effective, coding/logic/math, analysis |
Strategy | Provider | Use Case | Description |
---|---|---|---|
`auto` ⚡ | GPT-4o-mini decides | Default smart routing | Intelligent provider selection based on task |
`intelligence` 🔥 | Claude Sonnet 4 | Ultimate reasoning | Most capable model for complex problems |
`premium` 🏆 | OpenAI o3 | Premium alternative | High-end reasoning and creativity |
`speed` 🚀 | xAI Grok | Fast responses | Quick turnaround with real-time data |
`balance` ⚖️ | Google Gemini Flash | Cost/performance | Optimal balance of speed, cost, and capability |
`cost` 💰 | DeepSeek | Budget-friendly | Maximum cost efficiency |
`random` 🎲 | Random provider | Unpredictable | Randomly selects from all available providers |
`all` 🌟 | All providers | Comprehensive | Multi-provider consultation |
- `claude` - Force Claude Sonnet 4
- `openai` - Force OpenAI o3
- `xai` - Force xAI Grok
- `google` - Force Google Gemini Flash
- `deepseek` - Force DeepSeek
```javascript
// Let GPT-4o-mini choose the best provider
await smart_advisor({
  model: "auto",
  task: "Optimize this React component for performance",
  context: "Component renders 10,000+ items with complex state"
});

// Maximum intelligence for complex problems
await smart_advisor({
  model: "intelligence", // Routes to Claude Sonnet 4
  task: "Design a fault-tolerant distributed system architecture",
  context: "Microservices with 99.99% uptime requirement"
});

// Speed-optimized responses
await smart_advisor({
  model: "speed", // Routes to xAI Grok
  task: "Quick debugging help for this JavaScript error",
  context: "TypeError in production, need fast solution"
});

// Cost-effective solutions
await smart_advisor({
  model: "cost", // Routes to DeepSeek
  task: "Write a simple sorting algorithm",
  context: "Basic coding task for learning"
});

// Random provider selection
await smart_advisor({
  model: "random", // Randomly selects from all providers
  task: "Refactor this function for better readability",
  context: "Legacy code that needs modernization"
});
```
The random routing strategy offers unique advantages:
- 🔍 Provider Testing - Compare different AI approaches to the same problem
- ⚖️ Load Balancing - Distribute requests across providers automatically
- 🎯 Bias Reduction - Avoid over-reliance on a single provider
- 🚀 Discovery - Uncover unexpected solutions from different AI models
- 🔄 Experimentation - Perfect for A/B testing AI provider performance
```javascript
// Great for testing different perspectives
for (let i = 0; i < 5; i++) {
  const result = await smart_advisor({
    model: "random",
    task: "Explain this complex algorithm",
    context: "University-level computer science"
  });
  console.log(`Attempt ${i + 1}: Different AI perspective`);
}
```
```javascript
// Get perspectives from all providers
await smart_advisor({
  model: "all",
  task: "Review this security-critical authentication system",
  context: "OAuth2 implementation handling sensitive user data"
});

// Use a specific provider directly
await smart_advisor({
  model: "claude",
  task: "Ethical considerations for AI system design",
  context: "Building recommendation engine for social media"
});
```
The server implements a structured 4-persona prompt system:
- Manager - Defines clear requirements and ensures understanding
- Smart Technical Advisor - Creates detailed technical architecture with deep insights
- QA - Implements comprehensive tests covering edge cases and bottlenecks
- Engineer - Provides production-ready, secure, and efficient code
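The persona composition above might be sketched as follows. The persona briefs and the `buildPersonaPrompt` helper are illustrative assumptions, not the server's actual prompt templates:

```javascript
// Hypothetical sketch: compose the four personas into one structured prompt.
const personas = [
  { role: "Manager", brief: "Define clear requirements and confirm understanding." },
  { role: "Smart Technical Advisor", brief: "Design the technical architecture with deep insights." },
  { role: "QA", brief: "Write comprehensive tests covering edge cases and bottlenecks." },
  { role: "Engineer", brief: "Deliver production-ready, secure, and efficient code." },
];

function buildPersonaPrompt(task, context) {
  // Each persona becomes a markdown section, followed by the user's task/context
  const sections = personas.map((p) => `## ${p.role}\n${p.brief}`).join("\n\n");
  return `${sections}\n\n## Task\n${task}\n\n## Context\n${context}`;
}
```

Structuring the prompt this way lets a single completion walk through requirements, design, testing, and implementation in order.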
```
User Request → GPT-4o-mini Analysis → Provider Selection → Response
                      ↓
       [Task Complexity Assessment]
                      ↓
 [Cost/Performance/Speed Requirements]
                      ↓
        [Optimal Provider Routing]
```
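The flow above can be sketched in code. The strategy-to-provider mapping follows the routing table earlier, but the `routeRequest` helper and the `classify` callback (standing in for the GPT-4o-mini analysis step) are illustrative assumptions:

```javascript
// Strategy names from the routing table; provider keys are the direct names.
const STRATEGY_MAP = {
  intelligence: "claude",
  premium: "openai",
  speed: "xai",
  balance: "google",
  cost: "deepseek",
};
const ALL_PROVIDERS = ["claude", "openai", "xai", "google", "deepseek"];

function routeRequest(model, classify) {
  if (model === "random") {
    // Pick uniformly from every available provider
    return ALL_PROVIDERS[Math.floor(Math.random() * ALL_PROVIDERS.length)];
  }
  if (model === "all") return ALL_PROVIDERS;        // multi-provider consultation
  if (model === "auto") return classify();          // delegate choice to the LLM classifier
  return STRATEGY_MAP[model] ?? model;              // strategies map; direct names pass through
}
```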
- Prompt Injection Detection - Advanced pattern recognition for malicious inputs
- Script Injection Prevention - Blocks XSS and code injection attempts
- Input Validation - Length limits and comprehensive sanitization
- Rate Limiting - Configurable request limits per time window
- Security Logging - Detailed audit trail for security events
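A minimal sketch of how pattern-based input screening like the above can work; the regex patterns and the `validateInput` helper are illustrative assumptions, not the server's actual rules:

```javascript
// Illustrative patterns for common prompt-injection and script-injection attempts
const INJECTION_PATTERNS = [
  /ignore (all |any )?(previous|prior) instructions/i,
  /<script\b/i,
  /\bsystem prompt\b/i,
];

function validateInput(text, maxLength) {
  // Length limit first (cheapest check), then pattern screening
  if (text.length > maxLength) {
    return { ok: false, reason: "input exceeds length limit" };
  }
  const hit = INJECTION_PATTERNS.find((re) => re.test(text));
  return hit
    ? { ok: false, reason: "possible prompt injection detected" }
    : { ok: true };
}
```

Rejected inputs would also be written to the security log before the request is dropped.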
- Provider Protection - Individual circuit breakers for each AI provider
- Automatic Failover - Smart fallback to healthy providers when others fail
- Self-Healing - Automatic recovery testing with configurable timeouts
- State Management - CLOSED, OPEN, and HALF_OPEN states with proper transitions
- Failure Thresholds - Configurable consecutive failure limits before opening
- Monitoring - Real-time circuit breaker status and metrics tracking
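The three-state machine described above can be sketched as follows. The thresholds mirror the `CIRCUIT_BREAKER_*` environment variables, but this class is a simplified illustration, not the shipped implementation:

```javascript
// Per-provider circuit breaker: CLOSED → OPEN after repeated failures,
// OPEN → HALF_OPEN after the recovery timeout, HALF_OPEN → CLOSED on success.
class CircuitBreaker {
  constructor({ failureThreshold = 5, recoveryTimeout = 60000 } = {}) {
    this.failureThreshold = failureThreshold;
    this.recoveryTimeout = recoveryTimeout;
    this.failures = 0;
    this.state = "CLOSED";
    this.openedAt = 0;
  }
  canRequest(now = Date.now()) {
    if (this.state === "OPEN" && now - this.openedAt >= this.recoveryTimeout) {
      this.state = "HALF_OPEN"; // allow a probe request to test recovery
    }
    return this.state !== "OPEN";
  }
  recordSuccess() {
    this.failures = 0;
    this.state = "CLOSED";
  }
  recordFailure(now = Date.now()) {
    this.failures += 1;
    // A failed probe reopens immediately; otherwise open at the threshold
    if (this.state === "HALF_OPEN" || this.failures >= this.failureThreshold) {
      this.state = "OPEN";
      this.openedAt = now;
    }
  }
}
```

While a provider's breaker is OPEN, the router would fall back to the next healthy provider in the capability ranking.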
- LRU Caching - Intelligent cache eviction based on usage patterns
- Cache Metrics - Real-time hit/miss ratios and performance tracking
- TTL Management - Configurable cache expiration
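The caching behavior above can be sketched with a `Map`-based LRU that honors a TTL; the defaults echo `MAX_CACHE_SIZE` and `CACHE_TTL`, but the class itself is an illustrative assumption:

```javascript
// LRU cache with TTL: Map iteration order doubles as the recency order.
class LruCache {
  constructor(maxSize = 100, ttl = 300000) {
    this.maxSize = maxSize;
    this.ttl = ttl;
    this.map = new Map();
    this.hits = 0;
    this.misses = 0;
  }
  get(key, now = Date.now()) {
    const entry = this.map.get(key);
    if (!entry || now - entry.at > this.ttl) {
      if (entry) this.map.delete(key); // drop expired entry
      this.misses += 1;
      return undefined;
    }
    this.map.delete(key);   // re-insert to mark as most recently used
    this.map.set(key, entry);
    this.hits += 1;
    return entry.value;
  }
  set(key, value, now = Date.now()) {
    if (this.map.size >= this.maxSize && !this.map.has(key)) {
      this.map.delete(this.map.keys().next().value); // evict least recently used
    }
    this.map.set(key, { value, at: now });
  }
}
```

Tracking `hits` and `misses` alongside the entries is what makes the hit/miss ratio metrics cheap to report.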
- Circuit Breakers - Individual protection for each AI provider with automatic failover
- Promise.allSettled - Graceful handling of provider failures
- Intelligent Fallback - Hierarchical fallback to healthy providers based on capability ranking
- Exponential Backoff - Smart retry logic for transient failures
- Health Monitoring - Comprehensive system health checks including circuit breaker status
- Provider Recovery - Automatic testing and recovery of failed providers
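The retry and multi-provider patterns above can be sketched together: exponential backoff per call, with `Promise.allSettled` so one failing provider cannot reject the whole consultation. The `withRetry` and `consultAll` helper names are illustrative assumptions:

```javascript
// Retry a single call with exponential backoff (delay doubles each attempt).
async function withRetry(fn, maxRetries = 3, baseDelayMs = 500) {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      if (attempt >= maxRetries) throw err;
      const delay = baseDelayMs * 2 ** attempt; // 500, 1000, 2000, ...
      await new Promise((resolve) => setTimeout(resolve, delay));
    }
  }
}

// Fan out to every provider; allSettled keeps partial results on failure.
async function consultAll(providers, callProvider, maxRetries = 3) {
  const results = await Promise.allSettled(
    providers.map((p) => withRetry(() => callProvider(p), maxRetries))
  );
  return results.map((r, i) => ({
    provider: providers[i],
    ok: r.status === "fulfilled",
    answer: r.status === "fulfilled" ? r.value : String(r.reason),
  }));
}
```

With `Promise.all` a single rejected provider would discard every other response; `allSettled` preserves them.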
```javascript
// Access real-time metrics
const health = server.getHealthCheck();
const cacheMetrics = server.getCacheMetrics();
```
- System uptime tracking
- Cache performance metrics
- Rate limit monitoring
- Provider failure tracking
- Version information
- Hit/miss ratio tracking
- Eviction monitoring
- Size management
- Performance optimization
- Injection attempt detection
- Rate limit violations
- Input validation failures
- Security event logging
```bash
# Clone the repository
git clone https://github.com/sliday/mcp-smart.git
cd mcp-smart

# Install dependencies
npm install

# Build the project
npm run build

# Run in development mode
npm run dev

# Run tests
npm test

# Run tests with coverage
npm run test:coverage
```
The project maintains high test coverage across:
- Unit tests for all providers and routing strategies
- Integration tests for end-to-end workflows
- Security tests for injection detection
- Performance tests for caching and rate limiting
- Error handling and fallback testing
- ✨ Added random routing strategy for unpredictable provider selection
- 🎲 New `random` mode randomly selects from all available providers
- 📚 Enhanced documentation with random mode benefits and use cases
- 🎯 Added detailed examples for provider testing and load balancing
- 🧪 Perfect test coverage - 47/47 tests passing (100%)
- 🔧 Improved routing logic with better error handling
- ✨ Highlighted random mode feature in Quick Start section
- 🛠️ Enhanced test isolation and mock setup for reliability
- ✨ Added Claude Sonnet 4 (ultimate intelligence)
- ✨ Added xAI Grok-3-beta (speed optimization)
- ✨ Updated Google to Gemini Flash (cost-effective)
- ✨ Enhanced smart routing with 7 strategies
- ✨ Improved intelligence hierarchy and provider rankings
- 🔧 Enhanced auto-routing decision logic
- 📈 44/45 test coverage
- ✨ Smart routing system with auto-selection
- 🔧 Cost/performance-aware provider selection
- 📈 Comprehensive test coverage
- 🛡️ Enhanced security with prompt injection detection
- ⚡ Improved caching with metrics tracking
- 🔄 Promise.allSettled for better error handling
MIT License - see LICENSE file for details.
Contributions are welcome! Please read our contributing guidelines and submit pull requests.
Report bugs and request features at GitHub Issues.
If you find this project helpful, please give it a star on GitHub!
Built with ❤️ for the MCP ecosystem
Powered by Claude Sonnet 4, OpenAI o3, xAI Grok, Google Gemini Flash, and DeepSeek