PDF Document Processor
PDF Document Processing
Intelligent PDF parsing with text, table and image extraction. Supports batch processing of energy industry reports.
Features
Use Cases
Installation
npx openclaw skills install pdf-processor
Usage Examples
# Extract text from PDF
pdf-processor extract --input report.pdf --format text
# Extract tables to CSV
pdf-processor extract --input report.pdf --format table --output data.csv
# Batch process directory
pdf-processor batch --input ./reports/ --format text
# Energy report specific extraction
pdf-processor extract --input energy-report.pdf --template energy --include charts
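The `batch` example above is only shown with `--format text`. If one CSV per report is wanted, a loop over the documented `extract --format table` command is one possible approach. The following is a minimal sketch under stated assumptions: the helper names `csv_path_for` and `extract_tables`, the `DRY_RUN` variable, and the naming of each CSV after its input file are all hypothetical and not part of pdf-processor itself.

```shell
#!/usr/bin/env bash
# Sketch: run the documented `extract --format table` command once per PDF.
# Assumptions (not from the pdf-processor docs): one CSV per PDF, named after
# the input file; DRY_RUN is a convenience of this script, not a tool flag.
set -euo pipefail

# Map an input PDF path to an output CSV path,
# e.g. reports/q1.pdf with out_dir csv becomes csv/q1.csv
csv_path_for() {
  local pdf="$1" out_dir="$2"
  local base
  base="$(basename "$pdf" .pdf)"
  printf '%s/%s.csv\n' "$out_dir" "$base"
}

# Extract tables from every *.pdf in in_dir into out_dir.
# With DRY_RUN set to a non-empty value, the commands are printed, not run.
extract_tables() {
  local in_dir="$1" out_dir="$2"
  local pdf
  mkdir -p "$out_dir"
  for pdf in "$in_dir"/*.pdf; do
    [ -e "$pdf" ] || continue   # skip when the glob matches nothing
    ${DRY_RUN:+echo} pdf-processor extract --input "$pdf" \
      --format table --output "$(csv_path_for "$pdf" "$out_dir")"
  done
}
```

After sourcing the script, `DRY_RUN=1 extract_tables ./reports ./csv` prints the commands that would run, which is a cheap way to check the output paths before processing a large directory.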
Supported Formats
| Output | Description |
|--------|-------------|
| text | Plain text with paragraph structure |
| markdown | Markdown formatted output |
| csv | Structured table data |
| json | JSON with metadata and content |
| excel | Multi-sheet Excel workbook |