spice-embeddings
4
总安装量
4
周安装量
#53494
全站排名
安装命令
npx skills add https://github.com/spiceai/skills --skill spice-embeddings
Agent 安装分布
opencode
4
codex
4
gemini-cli
4
antigravity
3
windsurf
3
claude-code
3
Skill 文档
Spice Embedding Models
Embedding models transform text into vectors for similarity search and RAG applications.
Basic Configuration
embeddings:
- from: <provider>:<model_id>
name: <embedding_name>
params:
<provider>_api_key: ${ secrets:API_KEY }
Provider Prefixes
| Provider | From Format | Example |
|---|---|---|
openai |
openai:<model_id> |
openai:text-embedding-3-large |
huggingface |
huggingface:huggingface.co/... |
huggingface:huggingface.co/sentence-transformers/all-MiniLM-L6-v2 |
azure |
azure:<deployment> |
azure:my-embedding-deployment |
bedrock |
bedrock:<model_id> |
bedrock:amazon.titan-embed-text-v1 |
google |
google:<model_id> |
google:text-embedding-004 |
file |
file:<path> |
file:./models/embed.safetensors |
Embedding Columns on Datasets
Add vector embeddings to dataset columns for search:
embeddings:
- from: openai:text-embedding-3-small
name: embed_model
params:
openai_api_key: ${ secrets:OPENAI_API_KEY }
datasets:
- from: postgres:documents
name: docs
acceleration:
enabled: true
columns:
- name: content
embeddings:
- from: embed_model
row_id: id
chunking:
enabled: true
target_chunk_size: 512
Chunking Configuration
For long text, enable chunking to split into smaller segments:
columns:
- name: body
embeddings:
- from: embed_model
chunking:
enabled: true
target_chunk_size: 512 # tokens per chunk
overlap_size: 64 # overlap between chunks
Search API
Query embeddings via the search endpoint:
curl http://localhost:8090/v1/search \
-H "Content-Type: application/json" \
-d '{"datasets": ["docs"], "text": "search query", "limit": 10}'