generate-import-html

📁 adobe/helix-website 📅 7 days ago

Install command

npx skills add https://github.com/adobe/helix-website --skill generate-import-html

Skill Documentation

Generate Import HTML

Create a plain HTML file with block structure from the authoring analysis.

When to Use This Skill

Use this skill when:

You have a complete authoring analysis (all sequences have decisions)
You have section styling validation (from authoring-analysis)
You are ready to generate the HTML file for preview

Invoked by: page-import skill (Step 4)

Prerequisites

From previous skills, you need:

✅ Authoring analysis with block selections (from authoring-analysis)
✅ Section styling decisions (from authoring-analysis Step 3e)
✅ metadata.json with paths and metadata (from scrape-webpage)
✅ cleaned.html with content (from scrape-webpage)
✅ Block structures fetched (from authoring-analysis Step 3d)

Related Skills

page-import – Orchestrator that invokes this skill
authoring-analysis – Provides authoring decisions and styling validation
scrape-webpage – Provides metadata, paths, cleaned HTML, images
preview-import – Uses this skill’s HTML output

⚠️ CRITICAL REQUIREMENT: Complete Content Import

YOU MUST IMPORT ALL CONTENT FROM THE PAGE. PARTIAL IMPORT IS UNACCEPTABLE.

❌ NEVER truncate or skip sections due to length concerns
❌ NEVER summarize or abbreviate content
❌ NEVER use placeholders like “”
❌ NEVER omit content because the page is “too long”
✅ ALWAYS import every section from authoring analysis
✅ ALWAYS include all text, images, and structure from cleaned.html
✅ If you encounter length issues, generate the FULL HTML anyway

Validation requirement: You MUST verify that the number of sections in your HTML matches the number of sections from identify-page-structure. If they don’t match, you have made an error.

HTML Generation Workflow

Structure Requirements

IMPORTANT CHANGE: The AEM CLI now automatically wraps HTML content with headful structure (head, header, footer). You MUST generate ONLY the section content.

What to generate:

✅ Section divs with content: <div>...</div> (one per section)
✅ Blocks as <div class="block-name"> with nested divs
✅ Default content (headings, paragraphs, links, images)
✅ Section metadata blocks where validated in authoring-analysis

What NOT to generate:

❌ NO <html>, <head>, or <body> tags
❌ NO <header> or <footer> elements
❌ NO <main> wrapper element
❌ NO head content (meta tags, title, etc. – this comes from project’s head.html)
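
The "What NOT to generate" rules lend themselves to an automated check. The following is a minimal sketch, assuming the generated markup is available as a string; the helper name `find_forbidden_tags` and the exact tag set are illustrative choices, not part of the skill itself.

```python
from html.parser import HTMLParser

# Tags that the rules above exclude from the generated file.
# The set is an assumption derived from that list; adjust it to the project.
FORBIDDEN_TAGS = {"html", "head", "body", "header", "footer", "main", "title", "meta"}


class _TagCollector(HTMLParser):
    """Records every forbidden start tag encountered while parsing."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag names in lower case.
        if tag in FORBIDDEN_TAGS:
            self.found.append(tag)


def find_forbidden_tags(html_text):
    """Return the forbidden tags present in html_text, in document order."""
    collector = _TagCollector()
    collector.feed(html_text)
    collector.close()
    return collector.found
```

An empty result means the file contains only section-level content; any returned tag name points at wrapper markup that should be removed before saving.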

Structure format:

<div>
  <!-- Section 1 content -->
</div>
<div>
  <!-- Section 2 content with section-metadata if needed -->
  <div class="section-metadata">
    <div>
      <div>Style</div>
      <div>grey</div>
    </div>
  </div>
  <!-- Section 2 blocks/content -->
</div>
<div>
  <!-- Section 3 content -->
</div>

For detailed block structure patterns: See ../page-import/resources/html-structure.md

Section Metadata Application

Apply validated decisions from authoring-analysis Step 3e:

WITH section-metadata (section provides container styling):

<div>
  <div class="section-metadata">
    <div>
      <div>Style</div>
      <div>dark</div>
    </div>
  </div>
  <div class="tabs">
    <!-- Tabs block content -->
  </div>
</div>

WITHOUT section-metadata (background is block-specific):

<div>
  <div class="hero">
    <!-- Hero block content with its own dark background -->
  </div>
</div>

Important:

Only migrate visible body content sections (skip header, navigation, footer – auto-generated)
Use consistent style names from identify-page-structure
Apply validated decisions from authoring-analysis Step 3e: skip section-metadata for single-block sections where the background is block-specific
Place section-metadata div at the start of each section that needs styling
The metadata div will be processed and removed by the platform
Each section is a separate top-level <div> element
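
The section rules above can be sketched as a small rendering helper. This is only an illustration: `render_section` and its `style` parameter are hypothetical names, and the block or default-content HTML is assumed to have been built already.

```python
from html import escape


def render_section(content_html, style=None):
    """Wrap already-built content in one top-level section <div>.

    style is the validated section style name (hypothetical parameter).
    When it is None, no section-metadata block is emitted, which matches
    the single-block case where the background is block-specific.
    """
    parts = ["<div>"]
    if style:
        # section-metadata goes at the start of the section.
        parts.append(
            '  <div class="section-metadata">\n'
            "    <div>\n"
            "      <div>Style</div>\n"
            f"      <div>{escape(style)}</div>\n"
            "    </div>\n"
            "  </div>"
        )
    parts.append(content_html)
    parts.append("</div>")
    return "\n".join(parts)
```

Calling `render_section('  <div class="tabs"></div>', style="dark")` yields a section shaped like the "WITH section-metadata" example, while omitting `style` yields the "WITHOUT" shape.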

Page Metadata Block

Unless the user explicitly requested to skip metadata, use the metadata extracted from scrape-webpage to generate a metadata block.

Process:

1. Review extracted metadata from metadata.json

2. Map each property to standard format:

Title:

Compare source title (or og:title) with first H1 on page
If matches first H1 → Omit (platform defaults to H1)
If differs → Include as title property

Description:

Compare source description (or og:description) with first paragraph
If matches first paragraph → Consider omitting (platform defaults to first paragraph)
If differs OR more descriptive → Include as description property
Check: 150-160 characters ideal

Image:

Check source og:image
If matches first content image → Consider omitting (platform defaults to first image)
If custom social image → Include as image property
Ensure absolute URL or correct relative path
Check: 1200×630 pixels recommended

Canonical:

If points to same page URL → Omit (platform auto-generates)
If points to different page → Include as canonical property

Tags:

Map article:tag or keywords → comma-separated tags property

Properties to SKIP (platform auto-populates):

og:url, og:title, og:description, twitter:title, twitter:description, twitter:image
viewport, charset, X-UA-Compatible (belong in head.html)

3. Generate metadata block HTML:

<div>
  <div class="metadata">
    <div>
      <div>title</div>
      <div>[Your mapped title]</div>
    </div>
    <div>
      <div>description</div>
      <div>[Your mapped description]</div>
    </div>
    <!-- Only include image if custom -->
    <!-- Only include canonical if differs from page URL -->
    <!-- Only include tags if present -->
  </div>
</div>

Append metadata block as the last section div at the end of the HTML file.
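
Generating and appending the block can be sketched as follows. The helper names are hypothetical, and each value is written as escaped plain text, which is an assumption; check resources/metadata-mapping.md for how image values should be expressed.

```python
from html import escape


def render_metadata_block(properties):
    """Render mapped properties as a metadata block inside its own section div."""
    lines = ["<div>", '  <div class="metadata">']
    for name, value in properties.items():
        lines += [
            "    <div>",
            f"      <div>{escape(name)}</div>",
            f"      <div>{escape(str(value))}</div>",
            "    </div>",
        ]
    lines += ["  </div>", "</div>"]
    return "\n".join(lines)


def append_metadata(sections_html, properties):
    """Append the metadata section last; return the input unchanged if empty."""
    if not properties:
        return sections_html
    return sections_html.rstrip("\n") + "\n" + render_metadata_block(properties) + "\n"
```

Keeping the append step separate from rendering makes it easy to skip the block entirely when the user asked for no metadata or when every property fell back to a platform default.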

Detailed guidance: See resources/metadata-extraction.md and resources/metadata-mapping.md

Images Folder Management (CRITICAL)

The images are currently in ./import-work/images/ and the HTML references them as ./images/.... You MUST handle the images folder correctly:

Step 1: Determine the correct images folder location

Based on paths.htmlFilePath from metadata.json:

HTML file: us/en/about.plain.html → Images should be at: us/en/images/
HTML file: products/widget.plain.html → Images should be at: products/images/
HTML file: index.plain.html → Images should be at: images/

Rule: Images folder goes in the same directory as the HTML file.
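
The rule can be expressed in a few lines. In this sketch, `images_dir_for` is a hypothetical helper name, and the path is treated as a POSIX-style relative path like the examples above.

```python
from pathlib import PurePosixPath


def images_dir_for(html_file_path):
    """Return the images folder that sits beside the given HTML file."""
    return (PurePosixPath(html_file_path).parent / "images").as_posix()
```

For the three examples above this returns `us/en/images`, `products/images`, and `images` (without the trailing slash).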

Step 2: Copy the images folder

# Example: If HTML is at us/en/about.plain.html
mkdir -p us/en/images
cp -r ./import-work/images/* us/en/images/

Step 3: Verify image paths in HTML are correct

The HTML should already reference images as ./images/... which is correct for files in the same directory. No path changes needed in the HTML.

Example:

HTML location: us/en/about.plain.html
Images location: us/en/images/
Image reference in HTML: <img src="./images/abc123.jpg">
Result: ✅ Correct - browser resolves to us/en/images/abc123.jpg
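
To confirm that the copied folder actually satisfies every reference, a check along these lines may help. It is a sketch only: `missing_images` is a hypothetical helper, and the regular expression assumes double-quoted `src` attributes like the example above.

```python
import re
from pathlib import Path

# Matches the src value of <img> tags with double-quoted attributes (assumed form).
IMG_SRC = re.compile(r'<img\b[^>]*?\bsrc="([^"]+)"', re.IGNORECASE)


def missing_images(html_path):
    """List ./images/... references in the HTML that have no file on disk."""
    html_path = Path(html_path)
    text = html_path.read_text(encoding="utf-8")
    missing = []
    for src in IMG_SRC.findall(text):
        # References resolve relative to the directory holding the HTML file.
        if src.startswith("./images/") and not (html_path.parent / src).is_file():
            missing.append(src)
    return missing
```

An empty list means every `./images/...` reference resolves to a file next to the HTML; anything returned indicates the copy step was skipped or incomplete.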

Save HTML File

Save to: Use paths.htmlFilePath from metadata.json (e.g., us/en/about.plain.html)

Read the metadata.json file from scrape-webpage to get the correct file path.
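
Reading the path and writing the file might look like the sketch below. Only `paths.htmlFilePath` comes from the text above; the `save_html` name, the `project_root` parameter, and the assumption that the path is relative to the project root are illustrative.

```python
import json
from pathlib import Path


def save_html(metadata_json_path, html_text, project_root="."):
    """Write html_text to paths.htmlFilePath and return the path written."""
    metadata = json.loads(Path(metadata_json_path).read_text(encoding="utf-8"))
    # Assumed to be relative to the project root, e.g. us/en/about.plain.html.
    target = Path(project_root) / metadata["paths"]["htmlFilePath"]
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(html_text, encoding="utf-8")
    return target
```

Creating the parent directories up front avoids a failure when the page lives in a folder that does not exist yet in the project.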

Validation Checklist (MANDATORY)

Before proceeding to preview-import skill, verify:

✅ Section count: HTML has the same number of top-level <div> sections as identified in identify-page-structure
✅ All sequences: Every content sequence from authoring-analysis appears in the HTML
✅ No truncation: No “…” or “” or similar placeholders
✅ Complete text: All headings, paragraphs, and text from cleaned.html are present
✅ All images: Every image reference from the scraped page is included
✅ HTML file saved: HTML file written to disk at the correct path
✅ Images folder copied: Images folder exists in the same directory as the HTML file
✅ Images accessible: Verify that at least one image file exists in the copied images folder

If any validation check fails, STOP and fix before proceeding.
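
Part of the checklist (section count, truncation markers, image references) can be automated. The following is a heuristic sketch, not a replacement for the manual review: `validate_import` and its parameters are hypothetical, and the placeholder check only flags text nodes that consist of an ellipsis and comments that contain one. If a page metadata block was appended, it adds one top-level div, so the caller needs to account for it in `expected_sections`.

```python
from html.parser import HTMLParser

ELLIPSES = ("…", "...")


class _Outline(HTMLParser):
    """Counts top-level <div> elements, image sources, and ellipsis placeholders."""

    def __init__(self):
        super().__init__()
        self.div_depth = 0
        self.top_level_divs = 0
        self.images = []
        self.placeholders = 0

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            self.images.append(dict(attrs).get("src"))
        elif tag == "div":
            if self.div_depth == 0:
                self.top_level_divs += 1  # a div that is not nested in another div
            self.div_depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self.div_depth > 0:
            self.div_depth -= 1

    def handle_data(self, data):
        if data.strip() in ELLIPSES:
            self.placeholders += 1

    def handle_comment(self, data):
        if any(mark in data for mark in ELLIPSES):
            self.placeholders += 1


def validate_import(html_text, expected_sections, expected_images):
    """Return a list of problems found; an empty list means these checks passed."""
    outline = _Outline()
    outline.feed(html_text)
    outline.close()
    problems = []
    if outline.top_level_divs != expected_sections:
        problems.append(
            f"section count {outline.top_level_divs} != expected {expected_sections}"
        )
    if outline.placeholders:
        problems.append(f"{outline.placeholders} possible truncation placeholder(s)")
    missing = [src for src in expected_images if src not in outline.images]
    if missing:
        problems.append("missing images: " + ", ".join(missing))
    return problems
```

The remaining checklist items (complete text, all sequences present, files on disk) still need a comparison against cleaned.html, the authoring analysis, and the file system.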

Output

This skill provides:

✅ HTML file at correct path (e.g., us/en/about.plain.html)
✅ Images folder in same directory (e.g., us/en/images/)
✅ Complete content import (all sections)
✅ Proper block structure
✅ Section metadata applied per validation
✅ Page metadata block included

Next step: Pass HTML file path to preview-import skill
