paper-analyzer
npx skills add https://github.com/zsyggg/paper-craft-skills --skill paper-analyzer
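Academic Paper Analyzer – In-Depth Analysis of Academic Papers
Core capabilities
- High-precision PDF parsing via the MinerU Cloud API
- Automatic extraction of images, tables, and LaTeX formulas
- Multiple writing styles: storytelling / academic / concise
- Optional formula walkthroughs: insert formula images and explain them in detail
- Optional code analysis: explain the paper alongside its open-source GitHub code
- Markdown + HTML output (images embedded as base64)
Prerequisites
MinerU API Token
- Register an account at https://mineru.net
- Obtain an API token
- Set the environment variable (recommended):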
export MINERU_TOKEN="your_token_here"
Install dependencies
pip install requests markdown
Steps
Step 1: Parse the PDF (using the MinerU API)
python scripts/mineru_api.py <pdf_path> <output_dir>
Or pass the token directly as the third argument:
python scripts/mineru_api.py paper.pdf ./output YOUR_TOKEN
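The two invocations above imply that the token can come either from the command line or from the `MINERU_TOKEN` environment variable. The sketch below shows one way such a lookup could work. The precedence (argument first, environment second) and the helper name `resolve_token` are assumptions for illustration, not the confirmed behavior of `scripts/mineru_api.py`.

```python
import os


def resolve_token(argv, env=None):
    """Return the MinerU token: CLI argument first, then MINERU_TOKEN.

    Hypothetical helper; the argv layout (script, pdf_path, output_dir,
    optional token) is inferred from the commands shown above.
    """
    env = os.environ if env is None else env
    # An explicit third positional argument wins over the environment.
    if len(argv) > 3 and argv[3].strip():
        return argv[3].strip()
    token = env.get("MINERU_TOKEN", "").strip()
    if not token:
        raise SystemExit("No token: set MINERU_TOKEN or pass it as the third argument")
    return token
```

Calling `resolve_token(sys.argv)` at the top of a script would then cover both usage forms with a single code path.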
Output:
- output_dir/*.md – Markdown files (including formulas and tables)
- output_dir/images/ – high-quality extracted images
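To confirm that parsing produced the expected layout, the output directory can be inspected before moving on. This is a minimal sketch: `collect_outputs` is a hypothetical helper, and the set of image suffixes is an assumption, since the source does not say which image formats MinerU writes.

```python
from pathlib import Path

# Assumed image extensions; adjust to whatever the parser actually emits.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".webp"}


def collect_outputs(output_dir):
    """Return (markdown_files, image_files) found under output_dir."""
    root = Path(output_dir)
    markdown_files = sorted(root.glob("*.md"))
    images_dir = root / "images"
    images = []
    if images_dir.is_dir():
        # Keep only files that look like images, in a stable order.
        images = sorted(
            p for p in images_dir.iterdir()
            if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
        )
    return markdown_files, images
```

An empty Markdown list after Step 1 is a reasonable signal to stop and check the API call rather than continue to Step 2.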
Step 2: Extract paper information
python scripts/extract_paper_info.py <output_dir>/*.md paper_info.json
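Step 3: Choose a style (ask the user)
Before generating the article, you must ask the user about the following options:
1. Writing style (required)
| Style | Characteristics | Best for |
|---|---|---|
| storytelling | Starts from intuition, uses analogies and examples, reads like a story | WeChat official accounts, tech blogs, popular science |
| academic | Technical terminology, rigorous wording, keeps the paper's original concepts | Academic talks, literature reviews, research group sharing |
| concise | Straight to the point, tables and lists, high information density | Quick overviews, paper skims, technical surveys |
2. Formula option (optional)
| Option | Description |
|---|---|
| with-formulas | Insert formula images and explain the meaning of each symbol in detail |
| no-formulas (default) | Text-only description, no formula images |
3. Code option (optional, only when the paper has a GitHub repository)
| Option | Description |
|---|---|
| with-code | Clone the repository, quote key source code, and explain the code side by side with the paper |
| no-code (default) | No code analysis |
Example prompt:
Please choose an article style:
- academic – rigorous and professional (recommended default)
- storytelling – more direct and down-to-earth
- concise – quick to read
Do you need formula explanations? (Recommended when the paper contains mathematical formulas.)
Do you need code analysis based on the GitHub repository? (Open-source repository detected: xxx)
If the user is unsure which style to pick, default to the academic style.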
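The three choices and their defaults can be captured in a small value object. The sketch below is illustrative only: `ArticleOptions` and `build_options` are hypothetical names, and dropping `with_code` when no repository was detected is one reading of the rule that the code option applies only to papers with a GitHub repository.

```python
from dataclasses import dataclass

STYLES = ("storytelling", "academic", "concise")


@dataclass(frozen=True)
class ArticleOptions:
    style: str = "academic"      # default when the user is unsure
    with_formulas: bool = False  # no-formulas is the default
    with_code: bool = False      # no-code is the default


def build_options(style=None, with_formulas=False, with_code=False,
                  has_github_repo=False):
    """Validate the user's answers and fill in the documented defaults."""
    chosen = style or "academic"
    if chosen not in STYLES:
        raise ValueError(f"unknown style: {chosen!r}")
    # Code analysis only makes sense when a repository was detected.
    return ArticleOptions(chosen, bool(with_formulas),
                          bool(with_code) and has_github_repo)
```

For example, `build_options()` yields the academic style with both optional features off, which matches the defaults in the tables above.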
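Step 4: Generate the article
Based on the style the user chose, read the corresponding style definition files:
- styles/storytelling.md – storytelling style guide
- styles/academic.md – academic style guide
- styles/concise.md – concise style guide
- styles/with-formulas.md – formula explanation guide
- styles/with-code.md – code analysis guide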
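Only the guides that match the user's choices need to be read, which also keeps context usage down. A minimal sketch of that selection follows; the function name `style_files` is hypothetical, while the paths are the ones listed above.

```python
def style_files(style, with_formulas=False, with_code=False):
    """Return the style guide paths to read for the chosen options."""
    if style not in ("storytelling", "academic", "concise"):
        raise ValueError(f"unknown style: {style!r}")
    files = [f"styles/{style}.md"]  # exactly one base style guide
    if with_formulas:
        files.append("styles/with-formulas.md")
    if with_code:
        files.append("styles/with-code.md")
    return files
```

With the defaults from Step 3, this returns just `styles/academic.md`.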
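Lightweight mode (saving context)
Important: to avoid exhausting the context window, follow these principles:
- Do not read image files repeatedly – MinerU has already extracted high-quality images, so reference them by path
- Trust paper_info.json – it contains the image list and metadata, so no visual confirmation is needed
- Only look at key figures – read at most 1-2 core architecture diagrams and reference the rest directly
- Let the user verify – after generating the HTML, have the user check that the images are correct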
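General writing principles
Avoid:
- Stock AI phrases ("delve into", "crucial", "in the field of ...")
- Mechanical section headings
- LaTeX formula syntax (e.g. $\mathcal{O}(1)$); use the extracted formula images instead
- Flat, matter-of-fact technical description
Prefer:
- Narrative in natural paragraphs
- Full use of the images extracted by MinerU
- An explanation of every key figure in the paper
- Formula screenshots, which are easier to read than LaTeX syntax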
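The rule against raw LaTeX in the finished article can be checked mechanically before the draft is shown to the user. The sketch below is an assumption-laden heuristic, not part of the skill's scripts: it flags display math, `\(...\)` and `\[...\]` blocks, and inline `$...$` spans that contain a backslash command, and it deliberately ignores plain dollar amounts.

```python
import re

# Heuristic pattern; an illustrative assumption, not an exhaustive LaTeX grammar.
LATEX_PATTERN = re.compile(
    r"\$\$.+?\$\$"                        # display math: $$ ... $$
    r"|\$[^$\n]*\\[A-Za-z]+[^$\n]*\$"     # inline math containing a \command
    r"|\\\(.+?\\\)"                       # \( ... \)
    r"|\\\[.+?\\\]",                      # \[ ... \]
    re.DOTALL,
)


def find_latex(markdown_text):
    """Return every LaTeX-looking span found in the draft article."""
    return [m.group(0) for m in LATEX_PATTERN.finditer(markdown_text)]
```

Any span returned here is a candidate for replacement with the corresponding formula image extracted by MinerU.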
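Storytelling methodology (storytelling style only)
Apply the following methodology only when the user chooses the storytelling style:
1. Start from intuition; do not jump straight into the technique
- Wrong: "This paper proposes a hash-table-based conditional memory module"
- Right: "Have you ever considered that large models do not actually have a memory function?"
2. Give the historical background first, then the innovation
- Before introducing a new technique, explain the older technique it relates to
- Help the reader understand why the innovation is needed
3. Carry one simple example through the whole article
- Pick one simple example and reuse it
- For example: "The capital of China is Beijing"
4. Use vivid analogies
- "Using a cannon to kill a mosquito", "looking a word up in the dictionary vs. memorizing the dictionary"
- Make abstract concepts concrete
5. Build the logic step by step
- Simple problem → complex problem → solution
6. Distill the core insight
- Summarize it in one sentence, e.g. "Let memory handle memory and computation handle computation"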
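Article structure
1. Paper information
**Paper title**: xxx
**Paper link**: [arXiv](https://arxiv.org/abs/xxxx)
**Authors**: xxx
2. Intuitive introduction (2-3 paragraphs)
- Open with a question or a scenario
- Spark the reader's curiosity
- Lead into why this research is needed
3. Background (3-4 paragraphs)
- Explain the relevant underlying techniques or earlier methods
- Illustrate with a simple example
- Help the reader understand the limitations of existing approaches
4. Core innovations (4-5 paragraphs)
- Explain the paper's innovations in detail
- Support every innovation with a figure
- Use analogies and examples to make abstract concepts concrete
- Show formulas as images, not as LaTeX syntax
5. Experimental validation (2-3 paragraphs)
- Key result figures and tables
- Comparative analysis and interpretation of the data
- Highlight the most striking results
6. Deeper analysis (2-3 paragraphs)
- Mechanism analysis, ablation studies, and so on
- Explain why the method works
- Offer a deeper level of understanding
7. Reflections and outlook (1-2 paragraphs)
- Distill the core insight
- Predict future directions
- Give a personal view and assessment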
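Step 5: Output format (ask the user)
Output Markdown by default. Once the article is written, ask the user whether another format is needed:
"The article has been generated: article.md. Do you need an HTML version? (The HTML embeds the images, which makes it easy to share directly.)"
Format comparison:
| Format | Advantages | Best for |
|---|---|---|
| MD (default) | Lightweight, easy to edit, can be imported directly into WeChat official accounts | Everyday use |
| HTML | Embedded images, single-file sharing | Previewing the result, sharing with others |
If the user needs HTML: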
python scripts/generate_html.py <article.md> <output.html>
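The source does not show how `scripts/generate_html.py` works internally. The sketch below illustrates only the general idea of base64 embedding: each local image reference in the Markdown is rewritten to a `data:` URI, so the resulting single file carries its images with it. The helper name `embed_images` and the choice to rewrite the Markdown before HTML conversion are assumptions for illustration.

```python
import base64
import mimetypes
import re
from pathlib import Path

# Matches Markdown image references of the form ![alt](path)
IMAGE_REF = re.compile(r"!\[([^\]]*)\]\(([^)\s]+)\)")


def embed_images(markdown_text, base_dir):
    """Replace local image paths with base64 data URIs."""
    base = Path(base_dir)

    def to_data_uri(match):
        alt, target = match.group(1), match.group(2)
        # Leave remote and already-embedded images untouched.
        if target.startswith(("http://", "https://", "data:")):
            return match.group(0)
        path = base / target
        if not path.is_file():
            return match.group(0)  # keep the reference so the gap stays visible
        mime = mimetypes.guess_type(path.name)[0] or "application/octet-stream"
        payload = base64.b64encode(path.read_bytes()).decode("ascii")
        return f"![{alt}](data:{mime};base64,{payload})"

    return IMAGE_REF.sub(to_data_uri, markdown_text)
```

The rewritten text can then be passed to a Markdown-to-HTML converter such as the `markdown` package installed earlier. Embedded files grow by roughly a third because of base64 encoding, which is the trade-off for single-file sharing.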
Resource index
Style definitions:
- styles/storytelling.md – storytelling style
- styles/academic.md – academic style
- styles/concise.md – concise style
- styles/with-formulas.md – formula explanation
- styles/with-code.md – code analysis
Scripts:
- scripts/mineru_api.py – MinerU Cloud API client (recommended)
- scripts/convert_pdf.py – local conversion (fallback; requires PyMuPDF)
- scripts/extract_paper_info.py – extracts paper metadata
- scripts/generate_html.py – generates HTML (base64 images)
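Notes
- Prefer the MinerU API: it gives the highest accuracy and supports formulas and tables
- Save context: do not read images repeatedly; trust the metadata
- Do not output the analysis process: the user only sees the final article
- Avoid bullet lists: write in natural paragraphs
- Select 3-5 key figures and tables
API limits
- Maximum file size: 200MB
- Maximum 600 pages per file
- Supports PDF, DOC, PPT, images, and other formats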
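A quick local check against these limits can save a failed upload. The sketch below is hypothetical: it treats "200MB" as 200 × 1024 × 1024 bytes, uses an assumed suffix list drawn from the formats named above, and skips the 600-page limit because counting pages would need a PDF library.

```python
from pathlib import Path

MAX_BYTES = 200 * 1024 * 1024  # assumption: "200MB" read as 200 MiB
# Assumed suffixes for "PDF, DOC, PPT, images"; the full list is not given.
ALLOWED_SUFFIXES = {".pdf", ".doc", ".ppt", ".png", ".jpg", ".jpeg"}


def preflight(path, size_bytes=None):
    """Return a list of problems; an empty list means the file looks acceptable."""
    p = Path(path)
    problems = []
    if p.suffix.lower() not in ALLOWED_SUFFIXES:
        problems.append(f"unsupported file type: {p.suffix or '(none)'}")
    # size_bytes can be injected for testing; otherwise read it from disk.
    size = p.stat().st_size if size_bytes is None else size_bytes
    if size > MAX_BYTES:
        problems.append(f"file is {size} bytes, above the {MAX_BYTES}-byte limit")
    return problems
```

Running `preflight("paper.pdf")` before Step 1 surfaces size or format problems without spending an API call.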