mcp-pdf-tools

MCP/mcp-pdf-tools

Fork 0

Commit Graph

Author	SHA1	Message	Date
Ryan Malloy	31b8b2e6d4	docs: flag texlive-latex-extra requirement, recommend tectonic Some checks are pending Security Scan / security-scan (push) Waiting to run Details texlive-xetex alone is rarely enough — pandoc's default template needs packages from texlive-latex-extra (Debian) / texlive-latexextra (Arch): lastpage, xcolor, framed, fancyhdr, etc. Real markdown docs fail with "File 'X.sty' not found" without them. Restructure system deps to present three engine routes per platform: - tectonic (recommended): ~30 MB static binary, downloads packages on demand - full TeX: texlive-xetex + texlive-latex-extra + texlive-fonts-extra - weasyprint: skip TeX entirely, pip-installable Add an engine comparison table in the README explaining the disk-size and quality trade-offs so users can pick informed.	2026-05-05 16:29:05 -06:00
Ryan Malloy	964fd14a26	docs: cover markdown_to_pdf, [markdown] extra, uvx + pacman install README: - bump tool count 46 → 47, add Format Conversion bullet - fix `claude mcp add` syntax (needs `--` separator before uvx) - show `uvx --from "mcp-pdf[markdown]" mcp-pdf` for the new tool - note about uvx caching + `--refresh` - new "Format Conversion" tools subsection (markdown_to_pdf alongside pdf_to_markdown) - new "Optional Extras" section explaining [forms], [tables], [markdown], [all] - expand System Dependencies with Arch (pacman) and macOS (brew) recipes for pandoc + a PDF engine QUICKSTART: - replace stale `mcp-pdf-tools` package name with current `mcp-pdf` - add uvx as the recommended end-user install path - add pip install patterns including all optional extras - add pacman block alongside apt-get and brew - add markdown_to_pdf troubleshooting (mktexfmt errors, engine fallback) - add a smoke-test snippet using the new tool	2026-05-05 16:27:28 -06:00
Ryan Malloy	c902e81e4d	Initial commit: Complete MCP PDF Tools server implementation Features: - 8 comprehensive PDF processing tools with intelligent fallbacks - Text extraction (PyMuPDF, pdfplumber, pypdf with auto-selection) - Table extraction (Camelot → pdfplumber → Tabula fallback chain) - OCR processing with Tesseract and preprocessing options - Document analysis (structure, metadata, scanned detection) - Image extraction with filtering capabilities - PDF to markdown conversion with metadata - Built on FastMCP framework with full MCP protocol support - Comprehensive error handling and user-friendly messages - Docker support and cross-platform compatibility - Complete test suite and examples 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-10 16:36:21 -06:00

Author

SHA1

Message

Date

Ryan Malloy

31b8b2e6d4

docs: flag texlive-latex-extra requirement, recommend tectonic

Security Scan / security-scan (push) Waiting to run

Details

texlive-xetex alone is rarely enough — pandoc's default template needs
packages from texlive-latex-extra (Debian) / texlive-latexextra (Arch):
lastpage, xcolor, framed, fancyhdr, etc. Real markdown docs fail with
"File 'X.sty' not found" without them.

Restructure system deps to present three engine routes per platform:
- tectonic (recommended): ~30 MB static binary, downloads packages on demand
- full TeX: texlive-xetex + texlive-latex-extra + texlive-fonts-extra
- weasyprint: skip TeX entirely, pip-installable

Add an engine comparison table in the README explaining the disk-size
and quality trade-offs so users can pick informed.

2026-05-05 16:29:05 -06:00

Ryan Malloy

964fd14a26

docs: cover markdown_to_pdf, [markdown] extra, uvx + pacman install

README:
- bump tool count 46 → 47, add Format Conversion bullet
- fix `claude mcp add` syntax (needs `--` separator before uvx)
- show `uvx --from "mcp-pdf[markdown]" mcp-pdf` for the new tool
- note about uvx caching + `--refresh`
- new "Format Conversion" tools subsection (markdown_to_pdf alongside pdf_to_markdown)
- new "Optional Extras" section explaining [forms], [tables], [markdown], [all]
- expand System Dependencies with Arch (pacman) and macOS (brew) recipes for
  pandoc + a PDF engine

QUICKSTART:
- replace stale `mcp-pdf-tools` package name with current `mcp-pdf`
- add uvx as the recommended end-user install path
- add pip install patterns including all optional extras
- add pacman block alongside apt-get and brew
- add markdown_to_pdf troubleshooting (mktexfmt errors, engine fallback)
- add a smoke-test snippet using the new tool

2026-05-05 16:27:28 -06:00

Ryan Malloy

c902e81e4d

Initial commit: Complete MCP PDF Tools server implementation

Features:
- 8 comprehensive PDF processing tools with intelligent fallbacks
- Text extraction (PyMuPDF, pdfplumber, pypdf with auto-selection)
- Table extraction (Camelot → pdfplumber → Tabula fallback chain)
- OCR processing with Tesseract and preprocessing options
- Document analysis (structure, metadata, scanned detection)
- Image extraction with filtering capabilities
- PDF to markdown conversion with metadata
- Built on FastMCP framework with full MCP protocol support
- Comprehensive error handling and user-friendly messages
- Docker support and cross-platform compatibility
- Complete test suite and examples

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-08-10 16:36:21 -06:00

3 Commits