funsloth-upload

Install command
npx skills add https://github.com/chrisvoncsefalvay/funsloth --skill funsloth-upload

Skill Documentation

Model Upload & Card Generator

Create model cards and upload fine-tuned models to Hugging Face Hub.

Gather Context

If invoked from the training manager, you should already have:

  • model_path, base_model, dataset, technique
  • training_config (LoRA rank, LR, epochs)
  • final_loss, training_time, hardware

If any of these are missing, ask the user for the essential details, as in the sketch below.
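
A minimal sketch of that check, assuming the training manager hands off a plain dict (the dict format and helper name are illustrative, not part of the skill):

REQUIRED = ["model_path", "base_model", "dataset", "technique",
            "training_config", "final_loss", "training_time", "hardware"]

def missing_fields(context: dict) -> list[str]:
    # Return the required keys the handoff did not provide
    return [key for key in REQUIRED if key not in context]

# Prompt the user only for what is absent
gaps = missing_fields({"model_path": "./outputs/lora_adapter"})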

Configuration

1. Repository Settings

Ask for:

  • Repo name: username/model-name
  • Visibility: Public or Private
  • License: MIT, Apache 2.0, CC-BY-4.0, Llama 3 Community, etc.

2. Export Formats

Options:

  1. LoRA adapter only (~50-200MB) – users merge it with the base model themselves
  2. Merged 16-bit (15-140GB) – Ready to use
  3. GGUF quantized (4-8GB) – For llama.cpp/Ollama
  4. All of the above (Recommended)

3. GGUF Quantization

If GGUF is selected, ask which quantization levels to produce. See references/GGUF_GUIDE.md.

Method   Size  Quality
Q4_K_M   ~4GB  Good (Recommended)
Q5_K_M   ~5GB  Better
Q8_0     ~8GB  Best

Generate Model Card

Create README.md with:

  1. YAML Metadata – license, tags, base_model, datasets (see the sketch after this list)
  2. Model Description – Table with key attributes
  3. Training Details – Hyperparameters, LoRA config, results
  4. Usage Examples – Transformers, Unsloth, Ollama, llama.cpp
  5. Intended Use – Primary use cases, out-of-scope
  6. Limitations – Biases, known issues
  7. Citation – BibTeX entry
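
For item 1, a minimal sketch of the front matter as a Python constant, following the standard Hugging Face model card format (the license, base model, and dataset values are placeholders, not the skill's defaults):

# Placeholder YAML front matter for README.md; swap in the real values
MODEL_CARD_HEADER = """---
license: apache-2.0
base_model: unsloth/llama-3-8b
datasets:
  - yahma/alpaca-cleaned
tags:
  - unsloth
  - lora
  - text-generation
---
"""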

Execute Upload

1. Create Repository

from huggingface_hub import create_repo

# exist_ok=True makes this idempotent if the repo already exists
create_repo("username/model-name", private=False, exist_ok=True)
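
Creating and uploading both require a token with write access. If the environment is not already authenticated, one option is huggingface_hub's built-in helper:

from huggingface_hub import login

# Prompts for a token interactively; alternatively set the HF_TOKEN environment variable
login()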

2. Upload Files

from huggingface_hub import HfApi

api = HfApi()

# LoRA adapter weights and config
api.upload_folder(folder_path="./outputs/lora_adapter", repo_id="username/model-name")

# Model card
api.upload_file(path_or_fileobj="README.md", path_in_repo="README.md", repo_id="username/model-name")

3. Generate GGUF (if selected)

from unsloth import FastLanguageModel

# Reload the fine-tuned model from the saved adapter
model, tokenizer = FastLanguageModel.from_pretrained("./outputs/lora_adapter")

# Merge the adapter and write a GGUF at the chosen quantization level
model.save_pretrained_gguf("./gguf", tokenizer, quantization_method="q4_k_m")

Use scripts/convert_gguf.py for multiple quantizations.
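
If you prefer to stay in Python, a minimal sketch that loops the Unsloth exporter over several levels (the per-method output directories are an assumption):

# One GGUF per quantization level; scripts/convert_gguf.py automates the same ground
for method in ("q4_k_m", "q5_k_m", "q8_0"):
    model.save_pretrained_gguf(f"./gguf/{method}", tokenizer, quantization_method=method)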

4. Verify

from huggingface_hub import list_repo_files

print(list_repo_files("username/model-name"))
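
A slightly stricter check, asserting that the expected artifacts landed (the filename set assumes a standard PEFT adapter layout):

files = set(list_repo_files("username/model-name"))

# adapter_config.json / adapter_model.safetensors are the standard PEFT adapter files
expected = {"README.md", "adapter_config.json", "adapter_model.safetensors"}
missing = expected - files
assert not missing, f"Upload incomplete, missing: {missing}"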

Final Report

Upload Complete!

Model: https://huggingface.co/{repo_name}

Uploaded:

  • LoRA adapter
  • Model card
  • GGUF files (if selected)

Next steps:

  • Verify model page
  • Add example outputs
  • Run benchmarks
  • Share on social media

Model Card Best Practices

  1. Be specific about limitations
  2. Include usage examples – copy-pasteable (see the sketch after this list)
  3. Document training details
  4. Credit sources – base model, dataset, tools
  5. Use tables – easier to scan
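
For item 2, one hedged example of a copy-pasteable snippet to embed in the card, using the standard transformers + peft adapter-loading pattern (model names are placeholders; assumes the tokenizer was uploaded alongside the adapter):

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Placeholder names: substitute the real base model and your repo
base = AutoModelForCausalLM.from_pretrained("unsloth/llama-3-8b")
tokenizer = AutoTokenizer.from_pretrained("username/model-name")
model = PeftModel.from_pretrained(base, "username/model-name")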

Error Handling

Error              Resolution
Repo exists        Use exist_ok=True
Permission denied  Check HF token has write access
Upload timeout     Use chunked upload
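
For the transient cases, a hedged sketch of a simple retry loop (the backoff policy is an assumption; HfHubHTTPError is huggingface_hub's HTTP-level error type):

import time
from huggingface_hub import HfApi
from huggingface_hub.utils import HfHubHTTPError

api = HfApi()
for attempt in range(3):
    try:
        api.upload_folder(folder_path="./outputs/lora_adapter", repo_id="username/model-name")
        break
    except HfHubHTTPError:
        # Transient API failure; back off exponentially before retrying
        time.sleep(2 ** attempt)
else:
    raise RuntimeError("Upload failed after 3 attempts")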

Bundled Resources