llm-attacks-security

📁 gmh5225/awesome-ai-security 📅 5 days ago
9
总安装量
7
周安装量
#33277
全站排名
安装命令
npx skills add https://github.com/gmh5225/awesome-ai-security --skill llm-attacks-security

Agent 安装分布

opencode 7
github-copilot 7
codex 7
kimi-cli 7
gemini-cli 7
amp 7

Skill 文档

LLM Security Attacks

Scope

Use this skill when working on:

  • Prompt injection attacks and defenses
  • LLM jailbreaking techniques
  • Training data extraction
  • Model output manipulation
  • AI safety bypasses

Common LLM Vulnerabilities (Cheat Sheet)

Prompt Injection

  • Direct injection (user prompt manipulation)
  • Indirect injection (via external data sources)
  • System prompt extraction
  • Role-play attacks
  • Encoding/obfuscation bypasses

Jailbreaking

  • DAN (Do Anything Now) prompts
  • Character roleplay escapes
  • Multi-turn manipulation
  • Token smuggling
  • Crescendo attacks

Data Extraction

  • Training data memorization extraction
  • PII leakage from context
  • System prompt disclosure
  • API key/secret extraction
  • Model architecture probing

Model Manipulation

  • Output steering
  • Hallucination exploitation
  • Bias amplification
  • Harmful content generation

OWASP LLM Top 10 Reference

  1. LLM01 – Prompt Injection
  2. LLM02 – Insecure Output Handling
  3. LLM03 – Training Data Poisoning
  4. LLM04 – Model Denial of Service
  5. LLM05 – Supply Chain Vulnerabilities
  6. LLM06 – Sensitive Information Disclosure
  7. LLM07 – Insecure Plugin Design
  8. LLM08 – Excessive Agency
  9. LLM09 – Overreliance
  10. LLM10 – Model Theft

Where to Add Links in README

  • Prompt injection tools/research: AI Security & Attacks → Prompt Injection
  • Jailbreak techniques: AI Security & Attacks → Model Security
  • Data extraction research: AI Security & Attacks → Privacy & Extraction
  • Defense tools: AI Security & Attacks → Model Security
  • CTFs/challenges: AI Security Starter Pack → CTFs / Practice

Notes

Keep additions:

  • AI/LLM security focused
  • Non-duplicated URLs
  • Minimal structural changes

Data Source

For detailed and up-to-date resources, fetch the complete list from:

https://raw.githubusercontent.com/gmh5225/awesome-ai-security/refs/heads/main/README.md

Use this URL to get the latest curated links when you need specific tools, papers, or resources not covered in this skill.