mistral-reference-architecture

📁 jeremylongshore/claude-code-plugins-plus-skills 📅 Jan 27, 2026

总安装量

周安装量

#30558

全站排名

安装命令

npx skills add https://github.com/jeremylongshore/claude-code-plugins-plus-skills --skill mistral-reference-architecture

Agent 安装分布

cursor 1

claude-code 1

gemini-cli 1

Skill 文档

Mistral AI Reference Architecture

Overview

Production-ready architecture patterns for Mistral AI integrations.

Prerequisites

Understanding of layered architecture
Mistral AI SDK knowledge
TypeScript project setup
Testing framework configured

Project Structure

my-mistral-project/
âââ src/
â   âââ mistral/
â   â   âââ client.ts           # Singleton client wrapper
â   â   âââ config.ts           # Environment configuration
â   â   âââ types.ts            # TypeScript types
â   â   âââ errors.ts           # Custom error classes
â   â   âââ prompts/            # Prompt templates
â   â   â   âââ system.ts       # System prompts
â   â   â   âââ templates.ts    # Reusable templates
â   â   âââ handlers/
â   â       âââ chat.ts         # Chat completion handlers
â   â       âââ embeddings.ts   # Embedding handlers
â   â       âââ tools.ts        # Function calling handlers
â   âââ services/
â   â   âââ ai/
â   â       âââ index.ts        # Service facade
â   â       âââ chat.ts         # Chat service
â   â       âââ rag.ts          # RAG implementation
â   â       âââ cache.ts        # Caching layer
â   âââ api/
â   â   âââ chat/
â   â       âââ route.ts        # API endpoint
â   â       âââ stream.ts       # Streaming endpoint
â   âââ jobs/
â       âââ ai/
â           âââ batch.ts        # Background processing
âââ tests/
â   âââ unit/
â   â   âââ mistral/
â   âââ integration/
â       âââ mistral/
âââ config/
â   âââ mistral.development.json
â   âââ mistral.staging.json
â   âââ mistral.production.json
âââ docs/
    âââ mistral/
        âââ SETUP.md
        âââ RUNBOOK.md

Layer Architecture

âââââââââââââââââââââââââââââââââââââââââââ
â             API Layer                    â
â   (Routes, Controllers, Middleware)      â
âââââââââââââââââââââââââââââââââââââââââââ¤
â           Service Layer                  â
â  (Business Logic, Orchestration)         â
âââââââââââââââââââââââââââââââââââââââââââ¤
â          Mistral Layer                   â
â   (Client, Prompts, Error Handling)      â
âââââââââââââââââââââââââââââââââââââââââââ¤
â         Infrastructure Layer             â
â    (Cache, Queue, Monitoring)            â
âââââââââââââââââââââââââââââââââââââââââââ

Key Components

Step 1: Client Wrapper

// src/mistral/client.ts
import Mistral from '@mistralai/mistralai';
import { getMistralConfig } from './config';

let instance: Mistral | null = null;

export function getMistralClient(): Mistral {
  if (!instance) {
    const config = getMistralConfig();
    instance = new Mistral({
      apiKey: config.apiKey,
      timeout: config.timeout,
    });
  }
  return instance;
}

export function resetMistralClient(): void {
  instance = null;
}

Step 2: Configuration Management

// src/mistral/config.ts
import { z } from 'zod';

const configSchema = z.object({
  apiKey: z.string().min(1),
  model: z.string().default('mistral-small-latest'),
  timeout: z.number().default(30000),
  maxRetries: z.number().default(3),
  cache: z.object({
    enabled: z.boolean().default(true),
    ttlSeconds: z.number().default(300),
  }).default({}),
});

export type MistralConfig = z.infer<typeof configSchema>;

export function getMistralConfig(): MistralConfig {
  const env = process.env.NODE_ENV || 'development';

  const baseConfig = {
    apiKey: process.env.MISTRAL_API_KEY,
    model: process.env.MISTRAL_MODEL,
    timeout: parseInt(process.env.MISTRAL_TIMEOUT || '30000'),
    maxRetries: parseInt(process.env.MISTRAL_MAX_RETRIES || '3'),
  };

  return configSchema.parse(baseConfig);
}

Step 3: Error Handling

// src/mistral/errors.ts
export class MistralServiceError extends Error {
  constructor(
    message: string,
    public readonly code: string,
    public readonly status?: number,
    public readonly retryable: boolean = false,
    public readonly originalError?: Error
  ) {
    super(message);
    this.name = 'MistralServiceError';
  }
}

export function wrapMistralError(error: unknown): MistralServiceError {
  if (error instanceof MistralServiceError) return error;

  const err = error as any;

  // Rate limit
  if (err.status === 429) {
    return new MistralServiceError(
      'Rate limit exceeded',
      'RATE_LIMIT',
      429,
      true,
      err
    );
  }

  // Auth error
  if (err.status === 401) {
    return new MistralServiceError(
      'Authentication failed',
      'AUTH_ERROR',
      401,
      false,
      err
    );
  }

  // Server error
  if (err.status >= 500) {
    return new MistralServiceError(
      'Mistral service error',
      'SERVICE_ERROR',
      err.status,
      true,
      err
    );
  }

  return new MistralServiceError(
    err.message || 'Unknown error',
    'UNKNOWN',
    err.status,
    false,
    err
  );
}

Step 4: Service Layer

// src/services/ai/chat.ts
import { getMistralClient } from '../../mistral/client';
import { wrapMistralError } from '../../mistral/errors';
import { withRetry } from '../../utils/retry';
import { cache } from './cache';

export interface ChatOptions {
  model?: string;
  temperature?: number;
  maxTokens?: number;
  useCache?: boolean;
}

export class ChatService {
  async complete(
    messages: Array<{ role: string; content: string }>,
    options: ChatOptions = {}
  ): Promise<string> {
    const client = getMistralClient();
    const model = options.model || 'mistral-small-latest';

    // Check cache for deterministic requests
    if (options.useCache && options.temperature === 0) {
      const cached = await cache.get(messages, model);
      if (cached) return cached;
    }

    try {
      const response = await withRetry(() =>
        client.chat.complete({
          model,
          messages,
          temperature: options.temperature,
          maxTokens: options.maxTokens,
        })
      );

      const content = response.choices?.[0]?.message?.content ?? '';

      // Cache deterministic responses
      if (options.useCache && options.temperature === 0) {
        await cache.set(messages, model, content);
      }

      return content;
    } catch (error) {
      throw wrapMistralError(error);
    }
  }

  async *stream(
    messages: Array<{ role: string; content: string }>,
    options: ChatOptions = {}
  ): AsyncGenerator<string> {
    const client = getMistralClient();
    const model = options.model || 'mistral-small-latest';

    try {
      const stream = await client.chat.stream({
        model,
        messages,
        temperature: options.temperature,
        maxTokens: options.maxTokens,
      });

      for await (const event of stream) {
        const content = event.data?.choices?.[0]?.delta?.content;
        if (content) yield content;
      }
    } catch (error) {
      throw wrapMistralError(error);
    }
  }
}

export const chatService = new ChatService();

Step 5: Health Check

// src/mistral/health.ts
import { getMistralClient } from './client';

export interface HealthStatus {
  status: 'healthy' | 'degraded' | 'unhealthy';
  latencyMs: number;
  error?: string;
}

export async function checkMistralHealth(): Promise<HealthStatus> {
  const client = getMistralClient();
  const start = Date.now();

  try {
    await client.models.list();
    return {
      status: 'healthy',
      latencyMs: Date.now() - start,
    };
  } catch (error: any) {
    return {
      status: 'unhealthy',
      latencyMs: Date.now() - start,
      error: error.message,
    };
  }
}

Step 6: Prompt Templates

// src/mistral/prompts/templates.ts
export interface PromptTemplate {
  system: string;
  user: (vars: Record<string, string>) => string;
}

export const templates: Record<string, PromptTemplate> = {
  summarize: {
    system: 'You are a helpful assistant that creates concise summaries.',
    user: ({ text, maxWords }) =>
      `Summarize the following text in ${maxWords || '100'} words or less:\n\n${text}`,
  },

  classify: {
    system: 'You are a classifier. Respond with only the category name.',
    user: ({ text, categories }) =>
      `Classify the following text into one of these categories: ${categories}\n\nText: ${text}`,
  },

  codeReview: {
    system: 'You are an expert code reviewer. Be concise and actionable.',
    user: ({ code, language }) =>
      `Review this ${language} code and suggest improvements:\n\n\`\`\`${language}\n${code}\n\`\`\``,
  },
};

export function buildPrompt(
  templateName: keyof typeof templates,
  vars: Record<string, string>
): Array<{ role: string; content: string }> {
  const template = templates[templateName];
  return [
    { role: 'system', content: template.system },
    { role: 'user', content: template.user(vars) },
  ];
}

Data Flow Diagram

User Request
     â
     â¼
âââââââââââââââ
â   API       â
â   Gateway   â
ââââââââ¬âââââââ
       â
       â¼
âââââââââââââââ    âââââââââââââââ
â   Service   âââââ¶â   Cache     â
â   Layer     â    â   (Redis)   â
ââââââââ¬âââââââ    âââââââââââââââ
       â
       â¼
âââââââââââââââ
â   Mistral   â
â   Client    â
ââââââââ¬âââââââ
       â
       â¼
âââââââââââââââ
â  Mistral    â
â   API       â
âââââââââââââââ

Instructions

Step 1: Create Directory Structure

mkdir -p src/mistral/{handlers,prompts}
mkdir -p src/services/ai
mkdir -p src/api/chat
mkdir -p tests/{unit,integration}/mistral
mkdir -p config docs/mistral

Step 2: Implement Core Components

Create client wrapper, config, and error handling.

Step 3: Build Service Layer

Implement chat service with caching and retry.

Step 4: Add Health Checks

Configure health endpoint for monitoring.

Output

Structured project layout
Client wrapper with retry
Error handling implemented
Health checks configured

Resources

Next Steps

For multi-environment setup, see mistral-multi-env-setup.

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台