spice-embeddings

📁 spiceai/skills 📅 Jan 25, 2026
4
总安装量
4
周安装量
#53494
全站排名
安装命令
npx skills add https://github.com/spiceai/skills --skill spice-embeddings

Agent 安装分布

opencode 4
codex 4
gemini-cli 4
antigravity 3
windsurf 3
claude-code 3

Skill 文档

Spice Embedding Models

Embedding models transform text into vectors for similarity search and RAG applications.

Basic Configuration

embeddings:
  - from: <provider>:<model_id>
    name: <embedding_name>
    params:
      <provider>_api_key: ${ secrets:API_KEY }

Provider Prefixes

Provider From Format Example
openai openai:<model_id> openai:text-embedding-3-large
huggingface huggingface:huggingface.co/... huggingface:huggingface.co/sentence-transformers/all-MiniLM-L6-v2
azure azure:<deployment> azure:my-embedding-deployment
bedrock bedrock:<model_id> bedrock:amazon.titan-embed-text-v1
google google:<model_id> google:text-embedding-004
file file:<path> file:./models/embed.safetensors

Embedding Columns on Datasets

Add vector embeddings to dataset columns for search:

embeddings:
  - from: openai:text-embedding-3-small
    name: embed_model
    params:
      openai_api_key: ${ secrets:OPENAI_API_KEY }

datasets:
  - from: postgres:documents
    name: docs
    acceleration:
      enabled: true
    columns:
      - name: content
        embeddings:
          - from: embed_model
            row_id: id
            chunking:
              enabled: true
              target_chunk_size: 512

Chunking Configuration

For long text, enable chunking to split into smaller segments:

columns:
  - name: body
    embeddings:
      - from: embed_model
        chunking:
          enabled: true
          target_chunk_size: 512   # tokens per chunk
          overlap_size: 64         # overlap between chunks

Search API

Query embeddings via the search endpoint:

curl http://localhost:8090/v1/search \
  -H "Content-Type: application/json" \
  -d '{"datasets": ["docs"], "text": "search query", "limit": 10}'

Documentation