Editor's Note
ocr-document-processor
Extract text and structure from scans, images, and scanned PDFs. Use for OCR, searchable PDFs, table extraction, receipt parsing, and business card parsing.
Install
npx skills add https://github.com/dkyazzentwatwa/chatgpt-skills --skill ocr-document-processor
SKILL.md
OCR Document Processor
Handle OCR-heavy inputs where text must be recovered from images or scanned pages.
Use This For
- OCR on images and scanned PDFs
- Searchable PDF export
- Structured extraction to text, markdown, JSON, or HTML
- Table extraction from scanned material
- Receipt parsing and business card parsing
Workflow
- Decide whether plain OCR, structured extraction, or document-specific parsing is needed.
- Preprocess noisy inputs before extraction when skew, blur, or shadows are present.
- Use scripts/ocr_processor.py for core OCR tasks.
- Use the focused helpers when the input is specialized: scripts/business_card_scanner.py and scripts/receipt_scanner.py.
- Return confidence caveats when the source is low quality, rotated, handwritten, or multilingual.
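The workflow above can be sketched as a small planning step. This is only an illustration: the script paths come from the list above, but the task keys, the `Plan` type, and the `plan_job` function are hypothetical names, and how each script is actually invoked is not specified here.

```python
from dataclasses import dataclass, field

# Script paths are taken from the workflow; the dictionary keys are hypothetical task names.
SCRIPTS = {
    "ocr": "scripts/ocr_processor.py",
    "business_card": "scripts/business_card_scanner.py",
    "receipt": "scripts/receipt_scanner.py",
}

# Input defects the workflow says should trigger preprocessing.
PREPROCESS_DEFECTS = {"skew", "blur", "shadows"}

# Source conditions the workflow says warrant a confidence caveat.
CAVEAT_CONDITIONS = ("low quality", "rotated", "handwritten", "multilingual")


@dataclass
class Plan:
    script: str
    preprocess: bool
    caveats: list = field(default_factory=list)


def plan_job(task, defects, conditions):
    """Pick a script, decide on preprocessing, and collect caveats (hypothetical helper)."""
    # Unknown tasks fall back to the core OCR script.
    script = SCRIPTS.get(task, SCRIPTS["ocr"])
    preprocess = bool(set(defects) & PREPROCESS_DEFECTS)
    caveats = [
        f"Source is {c}; treat extracted values as approximate."
        for c in CAVEAT_CONDITIONS
        if c in conditions
    ]
    return Plan(script, preprocess, caveats)
```

For example, `plan_job("receipt", {"blur"}, {"handwritten"})` selects scripts/receipt_scanner.py, requests preprocessing, and carries one caveat to include in the final answer.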
Guardrails
- Prefer explicit language selection when accuracy matters.
- Do not claim fields are exact when OCR confidence is weak.
- Route non-scanned digital PDFs to document-converter-suite instead of OCR by default.
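Two of these guardrails lend themselves to a short sketch: deciding whether a PDF already has a text layer, and labeling fields whose OCR confidence is weak. The function names, the 20-character and 50% heuristics, and the 0.8 confidence threshold are all assumptions chosen for illustration, not values from the skill. The per-page text is assumed to come from whatever PDF library the caller already uses.

```python
def has_text_layer(page_texts, min_chars_per_page=20, min_page_ratio=0.5):
    """Heuristic: True when enough pages already carry extractable text.

    page_texts is a list of strings, one per page, obtained by the caller
    from a PDF text extractor (assumption: not part of this sketch).
    """
    if not page_texts:
        return False
    pages_with_text = sum(
        1 for text in page_texts if len(text.strip()) >= min_chars_per_page
    )
    return pages_with_text / len(page_texts) >= min_page_ratio


def route_pdf(page_texts):
    """Send digital PDFs to the converter skill, scanned ones to OCR."""
    if has_text_layer(page_texts):
        return "document-converter-suite"
    return "ocr-document-processor"


def label_field(name, value, confidence, threshold=0.8):
    """Attach an 'uncertain' flag instead of presenting a weak value as exact."""
    return {
        "field": name,
        "value": value,
        "confidence": confidence,
        "uncertain": confidence < threshold,
    }
```

A scanned PDF typically yields empty or near-empty strings per page, so `route_pdf(["", " "])` stays with OCR, while a report with real text on most pages is routed to document-converter-suite.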
Installs: 3.3K
GitHub Stars: 53
Language: Python
Added: Dec 14, 2025