docs-to-notebooklm
2
总安装量
1
周安装量
#67820
全站排名
安装命令
npx skills add https://github.com/zstmfhy/docs-to-notebooklm --skill docs-to-notebooklm
Agent 安装分布
claude-code
1
antigravity
1
gemini-cli
1
Skill 文档
Docs to NotebookLM
ä»åç§ææ¯ææ¡£ç½ç«æ¹éæåå å®¹å¹¶åæ¥å° Google NotebookLM çå·¥å ·éã
åè½ç¹æ§
- ð 夿¡æ¶æ¯æ: VitePress, Docusaurus, GitBook, VuePress
- ð¤ æºè½æå: èªå¨è¯å«ææ¡£ç»æï¼æåçº¯ææ¬
- ð¦ èªå¨åæ¹: NotebookLM éå¶ 50 æä»¶/ç¬è®°æ¬ï¼èªå¨å建å¤ä¸ª
- ð å¢é忥: æ¯ææç¹ç»ä¼ åè¿åº¦ä¿å
- ð¯ ç²¾åæå: Playwright å¤ç卿å 载页é¢
å¿«éå¼å§
å®è£ ä¾èµ
pip install playwright beautifulsoup4 html2text
playwright install chromium
npm install -g @notebooklm/cli
notebooklm login
åºæ¬ä½¿ç¨
# 1. æåææ¡£é¾æ¥
python scripts/extract_sidebar_iterative.py "https://docs.example.com" \
--output links.json --delay 1.5
# 2. ä¸è½½ä¸º Markdown
python scripts/download_markdown.py --input links.json \
--output docs/ --delay 2.0
# 3. ä¸ä¼ å° NotebookLM
python scripts/upload_markdown_to_notebooklm.py --input docs/ \
--notebook "ææ¯ææ¡£" --yes
èæ¬è¯´æ
extract_sidebar_iterative.py
æåææ¡£ç½ç«ç导èªé¾æ¥ã
python scripts/extract_sidebar_iterative.py <start_url> \
--output links.json \
--delay 1.5 \
--max-pages 1000
åæ°:
start_url: ææ¡£ç½ç«èµ·å§ URL--output: è¾åºæä»¶è·¯å¾ï¼JSON æ ¼å¼ï¼--delay: 页é¢å 载延è¿ï¼ç§ï¼ï¼é»è®¤ 1.0--max-pages: æå¤§æå页颿°ï¼é»è®¤ 1000--headless: æ 头模å¼è¿è¡--cookie: æ·»å è®¤è¯ cookie
è¾åº:
links.json: ææææ¡£é¾æ¥links.txt: ææ¬æ ¼å¼é¾æ¥å表extract_progress.json: è¿åº¦æä»¶ï¼æ¯ææç¹ç»ä¼ ï¼
download_markdown.py
ä¸è½½ææ¡£é¡µé¢å¹¶è½¬æ¢ä¸º Markdownã
python scripts/download_markdown.py \
--input links.json \
--output docs/ \
--delay 2.0
åæ°:
--input: 龿¥æä»¶ï¼JSON æ TXT æ ¼å¼ï¼--output: è¾åºç®å½--delay: 请æ±å»¶è¿ï¼ç§ï¼ï¼é»è®¤ 1.5--concurrent: å¹¶åæ°ï¼é»è®¤ 1--max-files: æå¤§ä¸è½½æ°é
è¾åº:
docs/: Markdown æä»¶ç®å½README.md: çæçç´¢å¼download_progress.json: è¿åº¦æä»¶
upload_markdown_to_notebooklm.py
ä¸ä¼ Markdown æä»¶å° NotebookLMã
python scripts/upload_markdown_to_notebooklm.py \
--input docs/ \
--notebook "æçææ¡£" \
--yes \
--batch-size 50
åæ°:
--input: è¾å ¥ç®å½--notebook: ç¬è®°æ¬åç§°--pattern: æä»¶å¹é 模å¼ï¼é»è®¤*.md--yes: è·³è¿ç¡®è®¤--delay: ä¸ä¼ å»¶è¿ï¼ç§ï¼ï¼é»è®¤ 0.5--batch-size: æ¯æ¹æä»¶æ°ï¼æå¤§ 50
ç¹æ§:
- èªå¨åæ¹ï¼è¶ è¿ 50 æä»¶æ¶å建å¤ä¸ªç¬è®°æ¬
- è¿åº¦è·è¸ªï¼æ¯ 10 个æä»¶æ¾ç¤ºè¿åº¦
- é误å¤çï¼å¤±è´¥æä»¶ä¿åå°
_failed_uploads.txt
使ç¨ç¤ºä¾
ç¤ºä¾ 1: ç«å±±å¼æ GPU ææ¡£
# æå龿¥
python scripts/extract_sidebar_iterative.py \
"https://www.volcengine.com/docs/6419/70481?lang=zh" \
--output volc_links.json
# ä¸è½½ææ¡£
python scripts/download_markdown.py \
--input volc_links.json \
--output volc_docs/ \
--delay 2.0
# ä¸ä¼ ï¼96 个æä»¶ â 2 个ç¬è®°æ¬ï¼
python scripts/upload_markdown_to_notebooklm.py \
--input volc_docs/ \
--notebook "ç«å±±å¼æGPUææ¡£" \
--yes
ç¤ºä¾ 2: éè¦ç»å½çææ¡£
# æ·»å è®¤è¯ cookie
python scripts/extract_sidebar_iterative.py \
"https://docs.internal.com" \
--output internal_links.json \
--cookie "session_id=xxxxx"
ç¤ºä¾ 3: 大鿿¡£ï¼èªå¨åæ¹ï¼
# 120 个æä»¶ä¼èªå¨å建 3 个ç¬è®°æ¬
python scripts/upload_markdown_to_notebooklm.py \
--input large_docs/ \
--notebook "大忿¡£é" \
--batch-size 40 \
--yes
NotebookLM éå¶å¤ç
NotebookLM æ¯ä¸ªç¬è®°æ¬æå¤ 50 ä¸ªæ¥æºã
èªå¨åæ¹é»è¾:
96 个æä»¶ ÷ 50 æ¯æ¹ = 2 个ç¬è®°æ¬
ç¬è®°æ¬ 1: "大忿¡£é" (50 个æä»¶)
ç¬è®°æ¬ 2: "大忿¡£é (2)" (46 个æä»¶)
常è§é®é¢
Q: Playwright æµè§å¨æªå®è£ ï¼
playwright install chromium
Q: æåä¸å°é¾æ¥ï¼
ç§»é¤ --headless é项æ¥çæµè§å¨è¡ä¸ºï¼
python scripts/extract_sidebar_iterative.py \
"https://docs.example.com" \
--output links.json
Q: ä¸ä¼ 失败ï¼
- æ£æ¥ç»å½ç¶æ:
notebooklm status - åå°å»¶è¿:
--delay 0.3 - æ¥ç失败æä»¶:
_failed_uploads.txt
Q: æä»¶å 容ä¸å®æ´ï¼
å¢å å»¶è¿çå¾ JS å è½½ï¼
python scripts/extract_sidebar_iterative.py \
"https://docs.example.com" \
--output links.json \
--delay 3.0
ææ¯æ
- Playwright: æµè§å¨èªå¨åï¼å¤ç卿å 容
- BeautifulSoup: HTML è§£æåå 容æå
- html2text: HTML 转 Markdown
- NotebookLM CLI: ä¸ Google NotebookLM 交äº
ä¾èµé¡¹
playwright>=1.40.0
beautifulsoup4>=4.12.0
html2text>=2020.1.16
æä½³å®è·µ
- éçéå¶: 使ç¨åççå»¶è¿ï¼1-2 ç§ï¼é¿å 被å°
- æ¹éå¤ç: 大鿿¡£èªå¨åæ¹ï¼æ éæå¨å¹²é¢
- è¿åº¦ä¿å: ææèæ¬æ¯ææç¹ç»ä¼
- é误å¤ç: æ£æ¥å¤±è´¥æä»¶å表并éè¯
è¾åºæä»¶
龿¥æå
links.json: JSON æ ¼å¼é¾æ¥links.txt: çº¯ææ¬é¾æ¥extract_progress.json: è¿åº¦ä¿¡æ¯
ä¸è½½
docs/*.md: Markdown æä»¶docs/README.md: ç´¢å¼æä»¶download_progress.json: ä¸è½½è¿åº¦
ä¸ä¼
.notebooklm_info.json: ç¬è®°æ¬ ID.upload_summary.json: ä¸ä¼ æè¦_failed_uploads.txt: 失败å表
许å¯è¯
MIT License