Axon.MCP.Server

System Architecture Overview

📋 Executive Summary

Axon.MCP.Server is an AI-powered code intelligence platform that turns codebases into queryable knowledge bases. It runs as a set of containerized services that combine FastAPI (REST API), the Model Context Protocol (AI assistant integration), Celery (distributed background processing), and a hybrid analysis engine (Tree-sitter for fast syntactic parsing, Roslyn for C# semantic analysis).

🏗️ High-Level Architecture

System Components Diagram

graph TB
    subgraph "Client Layer"
        A1[AI Assistants<br/>ChatGPT/Claude/Cursor]
        A2[React Dashboard<br/>Port 80]
        A3[REST Clients<br/>External Tools]
    end
    
    subgraph "API Layer"
        B1[MCP Server<br/>:8001]
        B2[REST API<br/>:8080]
    end
    
    subgraph "Processing Layer"
        C1[Celery Worker<br/>Repository Sync]
        C2[Enrichment Worker<br/>AI Analysis - 8 concurrency]
        C3[Beat Scheduler<br/>Periodic Tasks]
    end
    
    subgraph "Analysis Layer"
        D1[Tree-sitter Parsers<br/>Python, JS, TS, C#]
        D2[Roslyn Analyzer<br/>C# Semantic Analysis]
        D3[EF Core Analyzer<br/>Entity Detection]
        D4[Knowledge Extractor<br/>Call Graph, Patterns]
    end
    
    subgraph "Data Layer"
        E1[(PostgreSQL + pgvector<br/>:5432)]
        E2[(Redis<br/>Cache/Queue :6379)]
    end
    
    subgraph "Monitoring"
        F1[Prometheus<br/>:9090]
        F2[Grafana<br/>:3000]
    end
    
    subgraph "Source Control"
        G1[GitLab API]
        G2[Azure DevOps API]
    end
    
    A1 --> B1
    A2 --> B2
    A3 --> B2
    B1 --> E1
    B2 --> E1
    B2 --> E2
    B2 --> C1
    C1 --> D1
    C1 --> D2
    C1 --> D3
    C1 --> D4
    C2 --> E1
    C1 --> E1
    C1 --> E2
    D2 --> E1
    G1 --> C1
    G2 --> C1
    B2 --> F1
    F1 --> F2

🔧 Technology Stack

Backend Stack

Web Framework: FastAPI 0.110.0 (async, high-performance)
Background Processing: Celery 5.3.6 + Redis
Database ORM: SQLAlchemy 2.0.29 (async)
Authentication: PyJWT 2.10.1, python-jose 3.3.0
Code Parsing:
- Tree-sitter 0.25.0+ (Python, JS, TS, C#)
- Roslyn (C# semantic analysis via subprocess)
Embeddings:
- Local: sentence-transformers 2.5.1+
- Cloud: OpenAI API (openai Python SDK 1.14.0)
LLM Integration: OpenRouter/Ollama/OpenAI
Monitoring: Prometheus Python client 0.20.0, structlog 24.1.0

Frontend Stack

Framework: React 18 + TypeScript
Build Tool: Vite 5
Styling: Custom CSS with dark theme
State Management: React hooks + Context API

Infrastructure

Database: PostgreSQL 15 + pgvector extension
Cache/Queue: Redis 7
Container: Docker + Docker Compose
Monitoring: Prometheus + Grafana
Migrations: Alembic 1.13.1

📦 Core Components

1. API Layer (`src/api`)

Purpose: REST API endpoints for UI and external integrations

Key Files:

main.py - FastAPI application entry point
auth.py - JWT + API Key authentication
routes/ - API endpoint handlers

Responsibilities:

Handle HTTP requests
JWT authentication and authorization
Rate limiting (SlowAPI)
Request validation (Pydantic)
Response formatting

Authentication Flow:

sequenceDiagram
    participant C as Client
    participant A as API
    participant D as Database
    
    C->>A: POST /auth/login {username, password}
    A->>D: Verify credentials
    D-->>A: User data
    A->>A: Generate JWT token
    A-->>C: Set HTTP-only cookie + token
    
    C->>A: GET /api/v1/repositories (with cookie)
    A->>A: Validate JWT
    A->>D: Query repositories
    D-->>A: Repository list
    A-->>C: JSON response
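
The token steps in the flow above ("Generate JWT token", "Validate JWT") can be sketched with the standard library alone. This is a hand-rolled HS256 encoder and verifier for illustration only; the stack lists PyJWT, which would normally do this work. The secret, the claim names, and the function names are assumptions, not taken from `auth.py`.

```python
import base64
import hashlib
import hmac
import json
import time
from typing import Optional

SECRET = b"change-me"  # hypothetical signing key, not from the project


def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as the JWT format requires."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    """Decode base64url, restoring the padding that JWT strips."""
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def issue_token(username: str, role: str, ttl_seconds: int = 3600,
                now: Optional[float] = None) -> str:
    """Build a signed HS256 token carrying the user, role, and expiry."""
    issued = int(time.time() if now is None else now)
    header = {"alg": "HS256", "typ": "JWT"}
    payload = {"sub": username, "role": role, "exp": issued + ttl_seconds}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode())
        for part in (header, payload)
    )
    signature = hmac.new(SECRET, signing_input.encode(), hashlib.sha256).digest()
    return f"{signing_input}.{_b64url(signature)}"


def validate_token(token: str, now: Optional[float] = None) -> Optional[dict]:
    """Return the claims if signature and expiry check out, else None."""
    try:
        header_b64, payload_b64, signature_b64 = token.split(".")
        expected = hmac.new(
            SECRET, f"{header_b64}.{payload_b64}".encode(), hashlib.sha256
        ).digest()
        # Constant-time comparison avoids leaking how many bytes matched.
        if not hmac.compare_digest(expected, _b64url_decode(signature_b64)):
            return None
        claims = json.loads(_b64url_decode(payload_b64))
    except (ValueError, TypeError):
        return None
    current = time.time() if now is None else now
    return claims if claims.get("exp", 0) > current else None
```

In the flow above, the string returned by `issue_token` would be placed in the HTTP-only cookie, and `validate_token` would run on each later request before the database is queried.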

2. MCP Server (`src/mcp_server`)

Purpose: Model Context Protocol server for AI assistant integration

Key Files:

server.py - MCP protocol implementation
tools/ - 12 MCP tools for AI assistants
resources/ - MCP resource handlers

12 Available Tools:

search - Semantic + full-text code search
get_symbol_details - Detailed symbol information
get_call_graph - Function call relationships
get_inheritance_hierarchy - Class inheritance tree
get_module_summary - AI-generated code summaries
get_file_symbols - List symbols in a file
get_repository_structure - Project/solution organization
get_api_endpoints - List REST API routes
get_ef_entities - Entity Framework mappings
explore_service - Navigate service architecture
find_implementations - Interface implementations
get_system_architecture_map - Architecture diagrams

Transport Modes:

HTTP (default): http://localhost:8001 - For remote AI clients
Stdio: Standard input/output - For local Claude Desktop

3. Workers (`src/workers`)

Purpose: Distributed background processing with Celery

Key Files:

celery_app.py - Celery configuration
sync_worker.py - Main repository sync orchestrator (605 lines)

Worker Types:

Core Worker (1 concurrency): CPU-bound tasks (parsing, cloning)
Enrichment Worker (8 concurrency): IO-bound tasks (LLM calls)
Beat Scheduler: Periodic tasks (auto-sync, cleanup)

Queues:

repository_sync - Repository cloning and parsing
file_parsing - Individual file processing
embeddings - Vector embedding generation
ai_enrichment - LLM-based enrichment
default - General tasks
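
As an illustration of how tasks could be steered to the queues above, the sketch below uses a routing table in the shape Celery accepts for its `task_routes` setting (glob pattern mapped to a queue). The task-name patterns are hypothetical; the real names live in `celery_app.py` and are not shown in this document. The small `queue_for` helper only mimics the lookup so the mapping can be checked without a broker.

```python
from fnmatch import fnmatch

# Hypothetical task-name patterns mapped to the queues listed above.
TASK_ROUTES = {
    "workers.sync.*": {"queue": "repository_sync"},
    "workers.parse.*": {"queue": "file_parsing"},
    "workers.embed.*": {"queue": "embeddings"},
    "workers.enrich.*": {"queue": "ai_enrichment"},
}
DEFAULT_QUEUE = "default"


def queue_for(task_name: str) -> str:
    """Return the queue of the first matching pattern, else the default queue."""
    for pattern, route in TASK_ROUTES.items():
        if fnmatch(task_name, pattern):
            return route["queue"]
    return DEFAULT_QUEUE
```

With a table like this, each worker type would consume only its own queues, for example by starting the enrichment worker with Celery's `-Q ai_enrichment --concurrency=8` options.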

4. Analysis Layer

4.1 Tree-sitter Parsers (`src/parsers`)

Purpose: Fast syntactic parsing for multiple languages

Supported Languages:

C# (csharp_parser.py)
Python (python_parser.py)
JavaScript/TypeScript (javascript_parser.py)
Vue (vue_parser.py)

Extracts:

Symbols (classes, functions, variables)
Docstrings/comments
Import statements
Basic relationships
Complexity scores (cyclomatic complexity)
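
To make the complexity score concrete, here is a minimal sketch of a cyclomatic-complexity count. It uses Python's built-in `ast` module as a stand-in for a Tree-sitter syntax tree, so it only handles Python source, and the set of node types counted as decision points is a simplifying assumption rather than the project's actual rule set.

```python
import ast

# Node types that each add one decision point in this simplified count.
_BRANCH_NODES = (ast.If, ast.For, ast.While, ast.ExceptHandler, ast.IfExp)


def cyclomatic_complexity(source: str) -> int:
    """Return 1 plus the number of decision points found in the source."""
    score = 1
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, _BRANCH_NODES):
            score += 1
        elif isinstance(node, ast.BoolOp):
            # `a and b and c` adds one extra path per additional operand.
            score += len(node.values) - 1
    return score


SAMPLE = '''
def login(user, password):
    if user is None or password is None:
        return False
    for attempt in range(3):
        if check(user, password):
            return True
    return False
'''
```

For `SAMPLE`, the count is 5: the base path, two `if` statements, one `for` loop, and one extra operand in the `or` expression.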

4.2 Roslyn Analyzer (`roslyn_analyzer/`)

Purpose: Deep semantic analysis for C# code

Architecture: Persistent C# subprocess communicating via JSON over stdin/stdout

Capabilities:

Type resolution (var user → User class)
Cross-file references
Namespace resolution
Generic type inference
Method signature extraction with full type info

Communication Protocol:

// Request
{"operation": "analyze", "filePath": "UserService.cs", "content": "..."}

// Response
{
  "symbols": [...],
  "relations": [
    {"from": "UserController.Login", "to": "AuthService.Authenticate", "type": "calls"}
  ]
}

4.3 Knowledge Extractor (`src/extractors`)

Purpose: Extract high-level patterns and relationships

Components:

knowledge_extractor.py - Main coordinator
api_extractor.py - Detect REST endpoints
call_graph_builder.py - Build function call graph
pattern_detector.py - Detect design patterns

Extracts:

API endpoints (routes, HTTP methods)
Call graphs (function dependencies)
Import relationships
Design patterns (Repository, Factory, etc.)

4.4 EF Core Analyzer (`src/analyzers`)

Purpose: Extract Entity Framework Core mappings

Extracts:

Entity classes → Database tables
Properties → Columns
Relationships (one-to-many, many-to-many)
Foreign keys, navigation properties

5. Data Layer

5.1 Database (`src/database`)

Models (models.py):

14 SQLAlchemy models
Optimized indexes for common queries
Cascading deletes for data integrity
pgvector column for embeddings

Session Management (session.py):

Async session factory
Connection pooling (20 connections, max 40)
Automatic session cleanup

5.2 Redis Cache

Uses:

Celery message broker
Search result caching
LLM response caching
Real-time sync progress (Pub/Sub)

6. Embeddings (`src/embeddings`)

Purpose: Generate vector embeddings for semantic search

Components:

generator.py - Embedding generation
summarizer.py - LLM-based summarization
chunking.py - Code chunking strategies

Providers:

Local: sentence-transformers/all-mpnet-base-v2 (768 dims)
OpenAI: text-embedding-3-small (1536 dims)

Chunking Strategy:

Function-level chunks (entire function body)
Class-level chunks (class definition + methods)
Module-level chunks (file overview)
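
The three chunk levels above can be illustrated with a short sketch. It uses Python's `ast` module instead of the project's parsers, so it only handles Python files, and the `Chunk` type, the function name, and the exact text of the module overview are assumptions made for this example.

```python
import ast
from dataclasses import dataclass

_DEFS = (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)


@dataclass
class Chunk:
    level: str  # "module", "class", or "function"
    name: str
    text: str


def chunk_python_source(path: str, source: str) -> list:
    """Split one file into a module overview plus class and function chunks."""
    tree = ast.parse(source)
    docstring = ast.get_docstring(tree) or ""
    top_level = [node.name for node in tree.body if isinstance(node, _DEFS)]
    overview = f"{docstring}\nDefines: {', '.join(top_level)}".strip()
    chunks = [Chunk("module", path, overview)]
    for node in ast.walk(tree):
        if isinstance(node, ast.ClassDef):
            # Class chunk: the definition together with all of its methods.
            chunks.append(Chunk("class", node.name, ast.get_source_segment(source, node)))
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            # Function chunk: the entire function body, including methods.
            chunks.append(Chunk("function", node.name, ast.get_source_segment(source, node)))
    return chunks


SAMPLE = '''"""User helpers."""

class UserService:
    def login(self, name):
        return name.lower()

def ping():
    return "pong"
'''
```

In this sketch a method appears twice, once inside its class chunk and once as its own function chunk, so a query can match either the broad or the narrow context.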

⚙️ Processing Pipeline

The sync_repository task orchestrates the complete analysis pipeline:

graph LR
    A[1. Clone<br/>Repository] --> B[2. Detect<br/>Solutions]
    B --> C[3. Restore<br/>.NET Deps]
    C --> D[4. Initialize<br/>Roslyn]
    D --> E[5. Discover<br/>Files]
    E --> F[6. Parse<br/>Tree-sitter]
    F --> G[7. Analyze<br/>Roslyn C#]
    G --> H[8. Extract<br/>APIs]
    H --> I[9. Resolve<br/>Imports]
    I --> J[10. Build<br/>Call Graph]
    J --> K[11. Detect<br/>Services]
    K --> L[12. Analyze<br/>EF Entities]
    L --> M[13. Generate<br/>Embeddings]
    M --> N[14. AI<br/>Enrichment]

Pipeline Steps Detail

1. Clone Repository: GitLab/Azure DevOps API → local cache
2. Detect Solutions/Projects: Scan .sln and .csproj files
3. Restore .NET Dependencies: Run dotnet restore
4. Initialize Roslyn: Start C# subprocess, load solution
5. Discover Files: List source files, compute hashes
6. Parse (Tree-sitter): Extract symbols, imports, complexity
7. Analyze (Roslyn): Deep C# semantic analysis
8. Extract APIs: Detect REST endpoints, routes
9. Resolve Imports: Map import statements to symbols
10. Build Call Graph: Track function calls
11. Detect Services: Identify APIs, workers, libraries
12. Analyze EF Entities: Extract database mappings
13. Generate Embeddings: Create vector embeddings
14. AI Enrichment: LLM-generated summaries (optional)

Memory Management:

Keyset pagination (50 files per batch)
session.expunge_all() every 50 files
Re-fetch repository context per batch
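
Keyset pagination, as used above, resumes each batch from the last primary key seen instead of using an ever-growing OFFSET. The sketch below shows the idea against an in-memory SQLite table; the `files` table, its columns, and the function name are hypothetical, and the real pipeline runs through SQLAlchemy against PostgreSQL.

```python
import sqlite3
from typing import Iterator

BATCH_SIZE = 50  # batch size quoted above


def iter_file_batches(conn: sqlite3.Connection, repository_id: int,
                      batch_size: int = BATCH_SIZE) -> Iterator[list]:
    """Yield batches of (id, path) rows ordered by id, resuming after the last id."""
    last_id = 0
    while True:
        rows = conn.execute(
            "SELECT id, path FROM files "
            "WHERE repository_id = ? AND id > ? ORDER BY id LIMIT ?",
            (repository_id, last_id, batch_size),
        ).fetchall()
        if not rows:
            return
        yield rows
        # The last id in this batch becomes the lower bound of the next query.
        last_id = rows[-1][0]


def make_demo_db(file_count: int) -> sqlite3.Connection:
    """Create an in-memory table with file_count rows for repository 1."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE files (id INTEGER PRIMARY KEY, repository_id INTEGER, path TEXT)"
    )
    conn.executemany(
        "INSERT INTO files (repository_id, path) VALUES (?, ?)",
        [(1, f"src/File{i}.cs") for i in range(file_count)],
    )
    return conn
```

In the SQLAlchemy version, the end of each loop iteration is the natural place for the `session.expunge_all()` call mentioned above, so that already-processed ORM objects can be garbage-collected.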

🐳 Deployment Architecture

Docker Services (10 Containers)

┌─────────────────────────────────────────────────────────────┐
│                    Docker Network: axon-network             │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐     │
│  │ PostgreSQL   │  │    Redis     │  │   React UI   │     │
│  │  + pgvector  │  │   Cache/MQ   │  │   (Nginx)    │     │
│  │    :5432     │  │    :6379     │  │     :80      │     │
│  └──────────────┘  └──────────────┘  └──────────────┘     │
│                                                             │
│  ┌──────────────┐  ┌──────────────┐                        │
│  │  REST API    │  │  MCP Server  │                        │
│  │  (FastAPI)   │  │    (HTTP)    │                        │
│  │    :8080     │  │    :8001     │                        │
│  └──────────────┘  └──────────────┘                        │
│                                                             │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐     │
│  │   Worker     │  │  Enrichment  │  │     Beat     │     │
│  │ (Sync - 1)   │  │ Worker (8)   │  │  Scheduler   │     │
│  └──────────────┘  └──────────────┘  └──────────────┘     │
│                                                             │
│  ┌──────────────┐  ┌──────────────┐                        │
│  │  Prometheus  │  │   Grafana    │                        │
│  │    :9090     │  │    :3000     │                        │
│  └──────────────┘  └──────────────┘                        │
│                                                             │
└─────────────────────────────────────────────────────────────┘

Resource Allocation:

Core Worker: 1 concurrency (CPU-bound parsing)
Enrichment Worker: 8 concurrency (IO-bound LLM calls)
PostgreSQL: 20 connection pool, max 40
Redis: 50 max connections

🔍 Search Architecture

Hybrid Search Flow

sequenceDiagram
    participant C as Client
    participant API as Search Service
    participant PG as PostgreSQL
    participant V as pgvector
    participant R as Redis
    
    C->>API: search(query="authentication")
    API->>R: Check cache
    alt Cache Hit
        R-->>API: Cached results
        API-->>C: Return results
    else Cache Miss
        API->>API: Generate query embedding
        par Vector Search
            API->>V: Cosine similarity search
            V-->>API: Top 10 results
        and Keyword Search
            API->>PG: Full-text search
            PG-->>API: Top 10 results
        end
        API->>API: Merge + re-rank results
        API->>R: Cache results (TTL: 3600s)
        API-->>C: Return ranked results
    end
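
The cache-hit and cache-miss branches of the flow above follow the cache-aside pattern. The sketch below uses an in-memory dictionary with expiry as a stand-in for Redis; the key format, class name, and function names are assumptions, and only the 3600-second TTL comes from the diagram.

```python
import hashlib
import json
import time
from typing import Callable

CACHE_TTL_SECONDS = 3600  # TTL shown in the diagram


class TTLCache:
    """In-memory stand-in for Redis: entries expire after their TTL."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        self._store = {}

    def get(self, key: str):
        entry = self._store.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if self._clock() >= expires_at:
            del self._store[key]  # drop the expired entry
            return None
        return value

    def set(self, key: str, value, ttl: float) -> None:
        self._store[key] = (self._clock() + ttl, value)


def cache_key(query: str, limit: int) -> str:
    """Derive a stable key from the parameters that affect the result."""
    digest = hashlib.sha256(json.dumps([query, limit]).encode()).hexdigest()
    return f"search:{digest}"


def cached_search(cache: TTLCache, query: str, limit: int,
                  run_search: Callable[[str, int], list]) -> list:
    """Return cached results on a hit; otherwise search, store, and return."""
    key = cache_key(query, limit)
    hit = cache.get(key)
    if hit is not None:
        return hit
    results = run_search(query, limit)
    cache.set(key, results, CACHE_TTL_SECONDS)
    return results
```

Here `run_search` stands for the whole cache-miss branch (embedding, the two parallel searches, and re-ranking), so the expensive work runs at most once per key per TTL window.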

Search Strategies

Semantic Search: Vector similarity using pgvector
- Embed query using same model as code chunks
- Cosine distance: embedding <=> query_vector (pgvector's <=> operator returns distance; similarity = 1 - distance)
- Fast with HNSW index
Full-Text Search: PostgreSQL tsvector
- Indexes symbol names, docstrings, signatures
- Supports fuzzy matching
- Language-specific stemming
Hybrid: Combine both with score fusion
- Vector search weight: 0.6
- Keyword search weight: 0.4
- Re-rank by relevance
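
A minimal sketch of the weighted score fusion described above, using the 0.6 and 0.4 weights. How the real service normalizes the two score scales is not stated, so dividing by the maximum is an assumption, as are the function names. Vector scores are expected as similarities (higher is better), so a pgvector cosine distance would first be converted with 1 - distance.

```python
VECTOR_WEIGHT = 0.6
KEYWORD_WEIGHT = 0.4


def _normalize(scores: dict) -> dict:
    """Scale scores into [0, 1] by dividing by the largest score."""
    top = max(scores.values(), default=0.0)
    return {key: (value / top if top > 0 else 0.0) for key, value in scores.items()}


def fuse(vector_scores: dict, keyword_scores: dict, limit: int = 10) -> list:
    """Combine both result sets into one ranked list of (symbol, score) pairs."""
    vec = _normalize(vector_scores)
    kw = _normalize(keyword_scores)
    combined = {
        # A symbol missing from one result set contributes 0 for that side.
        symbol: VECTOR_WEIGHT * vec.get(symbol, 0.0) + KEYWORD_WEIGHT * kw.get(symbol, 0.0)
        for symbol in vec.keys() | kw.keys()
    }
    # Highest score first; ties are broken by name for a stable order.
    return sorted(combined.items(), key=lambda item: (-item[1], item[0]))[:limit]
```

A symbol found by both searches can outrank one that tops only a single list, which is the point of combining the two signals.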

🔐 Security Architecture

Authentication Flow

graph LR
    A[Client Request] --> B{Has Cookie?}
    B -->|Yes| C[Validate JWT]
    B -->|No| D{Has X-API-Key?}
    D -->|Yes| E[Validate API Key]
    D -->|No| F[401 Unauthorized]
    C -->|Valid| G[Authorize Role]
    C -->|Invalid| F
    E -->|Valid| G
    E -->|Invalid| F
    G -->|Authorized| H[Process Request]
    G -->|Forbidden| I[403 Forbidden]
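
The decision flow above can be written as one function that returns the resulting HTTP status. This is a sketch under assumptions: the cookie name `access_token`, the `Request` type, and the two validator callbacks are hypothetical, and the role check simply treats `admin` as a superset of `readonly`.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Request:
    cookies: dict = field(default_factory=dict)
    headers: dict = field(default_factory=dict)


def authorize(
    request: Request,
    required_role: str,
    validate_jwt: Callable[[str], Optional[str]],
    validate_api_key: Callable[[str], Optional[str]],
) -> int:
    """Return the HTTP status the flow above would produce (200, 401, or 403)."""
    token = request.cookies.get("access_token")  # hypothetical cookie name
    if token is not None:
        # A cookie is present: its JWT decides, with no fallback to API keys.
        role = validate_jwt(token)
    else:
        api_key = request.headers.get("X-API-Key")
        if api_key is None:
            return 401
        role = validate_api_key(api_key)
    if role is None:
        return 401  # invalid or expired credential
    if required_role == "admin" and role != "admin":
        return 403  # authenticated, but the role is insufficient
    return 200
```

Each validator returns the caller's role, or `None` when the credential is rejected, which keeps the 401 (not authenticated) and 403 (not allowed) cases separate, as in the diagram.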

Auth Mechanisms:

JWT Tokens: For UI authentication (HTTP-only cookies)
API Keys: For service-to-service calls
Role-Based Access: admin, readonly

Secure Defaults:

HTTPS enforced in production
CORS configured for allowed origins
Rate limiting (100 req/min per IP)
Audit logging for all API calls

📊 Monitoring & Observability

Metrics Stack

Application Metrics → Prometheus → Grafana Dashboards
Logs (structlog) → Redis Pub/Sub → Dashboard
Celery Events → Flower (optional)

Key Metrics:

API latency (p50, p95, p99)
Search queries per second
Repository sync duration
Worker queue depth
Database connection pool usage
Cache hit ratio

Pre-configured Dashboards:

API Performance
Repository Sync Status
Search Analytics
System Health

🌐 Source Control Integration

GitLab Integration

Features:

Auto-discover all accessible projects
Webhook support for auto-sync on push
Personal Access Token (PAT) authentication

Azure DevOps Integration

Features:

Repository scanning per project
NTLM authentication support
SSL verification bypass (for on-prem)

API Endpoints:

POST /api/v1/repositories/discover - Scan source control
POST /api/v1/repositories/sync/{id} - Trigger manual sync
GET /api/v1/repositories - List all repositories

🔮 Future Architecture Enhancements

Planned Improvements

RAG Pipeline: Add ask_codebase tool for conversational queries
Microservices Split: Separate parsing, embeddings, and API into independent services
Kubernetes: Production-grade orchestration with Helm charts
Event Sourcing: Audit trail for all symbol changes
GraphQL API: Supplement REST API for flexible queries
Multi-tenancy: Organization-level isolation

📖 Additional Resources

Data Models - Database schema details
Infrastructure - Deployment configurations
API Reference - REST API documentation
MCP Tools - MCP protocol tools
