Server data from the Official MCP Registry
Turn any PDF into structured JSON via AI + OCR: invoices, bank statements, contracts.
Remote endpoints: streamable-http: https://mcp.apify.com/?actors=opportunity-biz/document-to-json-mcp
This MCP server implements a PDF-to-JSON document extraction service with reasonable authentication (PayMCP/x402 payments) and appropriate permissions for its stated purpose. However, several security and code quality concerns lower the score: hardcoded configuration imports without validation, broad exception handling that masks errors, incomplete API credential handling, and truncated code that prevents full analysis. The server also lacks input size validation and has potential issues with sensitive data handling during PDF extraction. Supply chain analysis found 14 known vulnerabilities in dependencies (1 critical, 8 high severity).
6 files analyzed · 23 issues found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
This plugin requests these system permissions. Most are normal for its category.
Available as Local & Remote
This plugin can run on your machine or connect to a hosted endpoint; the choice is made during install.
From the project's GitHub README.
Turn PDFs into structured JSON in seconds. AI-powered, no coding needed.
Paste a PDF URL → get structured JSON. That's it.
Perfect for:
| Type | What you get | Price |
|---|---|---|
| Invoice | Seller, buyer, line items, totals, VAT, IBAN, payment info | $0.01 |
| Bank Statement | All transactions, balances, fees, account holder | $0.015 |
| Contract | Parties, key clauses, dates, financial terms, jurisdiction | $0.02 |
| Generic | Full text + tables from any document | Free during launch |
{
  "success": true,
  "data": {
    "document_type": "invoice",
    "confidence": 0.97,
    "metadata": {
      "invoice_number": "INV-2024-00123",
      "invoice_date": "2024-03-15",
      "currency": "EUR"
    },
    "seller": {
      "name": "Acme S.p.A.",
      "vat_id": "IT01234567890"
    },
    "line_items": [
      {
        "description": "Consulting services",
        "quantity": 1,
        "unit_price": 5000.00,
        "net_amount": 5000.00,
        "vat_rate": 22.0,
        "total": 6100.00
      }
    ],
    "totals": {
      "net_total": 5000.00,
      "vat_total": 1100.00,
      "grand_total": 6100.00
    },
    "payment_info": {
      "iban": "IT60X0542811101000000123456"
    }
  }
}
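The arithmetic in the sample response can be re-checked on the client side. The following is a minimal sketch, assuming the response always has the `totals` and `line_items` shape shown in the sample; `check_invoice_totals` and its tolerance are hypothetical and are not part of the server.

```python
from decimal import Decimal


def check_invoice_totals(data: dict, tolerance: Decimal = Decimal("0.01")) -> list[str]:
    """Return a list of inconsistencies found in an extracted invoice (empty if none).

    Assumes the `totals` / `line_items` layout of the sample response.
    """
    problems = []
    totals = data["totals"]
    # Go through str() so binary float noise does not leak into the comparison.
    net = Decimal(str(totals["net_total"]))
    vat = Decimal(str(totals["vat_total"]))
    grand = Decimal(str(totals["grand_total"]))

    if abs(net + vat - grand) > tolerance:
        problems.append(f"net_total + vat_total = {net + vat}, grand_total = {grand}")

    line_sum = sum(
        (Decimal(str(item["total"])) for item in data["line_items"]),
        Decimal("0"),
    )
    if abs(line_sum - grand) > tolerance:
        problems.append(f"line item totals = {line_sum}, grand_total = {grand}")

    return problems


# The figures from the sample response.
sample = {
    "line_items": [{"net_amount": 5000.00, "vat_rate": 22.0, "total": 6100.00}],
    "totals": {"net_total": 5000.00, "vat_total": 1100.00, "grand_total": 6100.00},
}
print(check_invoice_totals(sample))  # → []
```

An empty list means the totals agree within the tolerance; each string in a non-empty list describes one mismatch.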
| Feature | Benefit |
|---|---|
| Multi-language | English, Italian, Spanish (OCR) |
| OCR included | Works with scanned documents too |
| Validation | Auto-checks totals and dates for accuracy |
| Pay per use | No subscription, pay only for what you process |
That's it. No configuration, no API keys needed.
| Document type | Price |
|---|---|
| Invoice | $0.01 ($10/1000) |
| Bank statement | $0.015 ($15/1000) |
| Contract | $0.02 ($20/1000) |
| Generic | Free during launch |
Pay-per-event via Apify. Pay only for successful extractions. No subscription, no hidden fees.
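For budgeting, the price table can be turned into a small estimate. This is a sketch under stated assumptions: the per-document prices come from the table above, but the dictionary keys other than `invoice` are guessed names, and `estimate_cost` is a hypothetical helper rather than anything the Actor provides.

```python
from decimal import Decimal

# Prices in USD per successful extraction, copied from the pricing table.
# Keys other than "invoice" are assumed names for illustration.
PRICE_PER_DOCUMENT = {
    "invoice": Decimal("0.01"),
    "bank_statement": Decimal("0.015"),
    "contract": Decimal("0.02"),
    "generic": Decimal("0"),  # listed as free during launch
}


def estimate_cost(successful_counts: dict[str, int]) -> Decimal:
    """Sum price * count over document types; only successful extractions are billed."""
    return sum(
        (PRICE_PER_DOCUMENT[doc_type] * count for doc_type, count in successful_counts.items()),
        Decimal("0"),
    )


# 1000 invoices ($10) + 200 bank statements ($3) + 50 contracts ($1) = $14.
print(estimate_cost({"invoice": 1000, "bank_statement": 200, "contract": 50}))
```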
eng (English), ita (Italian), spa (Spanish)
Combine with + for multi-language scanned documents: eng+ita+spa (default)
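A small helper can build the `+`-joined language string and reject unsupported codes before a request is sent. This is only a sketch: `build_ocr_languages` is hypothetical, and the name of the input field that carries the string is not stated here, so the example stops at producing the string.

```python
SUPPORTED_OCR_LANGUAGES = ("eng", "ita", "spa")


def build_ocr_languages(*codes: str) -> str:
    """Join OCR language codes with '+', falling back to the documented default."""
    if not codes:
        return "+".join(SUPPORTED_OCR_LANGUAGES)  # eng+ita+spa
    unknown = [code for code in codes if code not in SUPPORTED_OCR_LANGUAGES]
    if unknown:
        raise ValueError(f"unsupported OCR language code(s): {', '.join(unknown)}")
    # dict.fromkeys drops duplicates while keeping the caller's order.
    return "+".join(dict.fromkeys(codes))


print(build_ocr_languages())              # → eng+ita+spa
print(build_ocr_languages("ita", "eng"))  # → ita+eng
```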
This Actor is an MCP server. AI agents can call it directly as a tool to turn any
PDF into JSON with zero configuration: just pass a public file_url. Specialized
tools (parse_invoice, parse_bank_statement, parse_contract) and a free
parse_generic_document make it easy for an LLM to pick the right one for the task.
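To make the tool call concrete, the sketch below builds the JSON-RPC `tools/call` message that an MCP client would send for one of these tools. The tool names and the `file_url` argument come from the description above; the document-type keys, the `build_tool_call` helper, and the assumption that no other arguments are required are illustrative. In practice an MCP client library constructs this message for you.

```python
import json

# Tool names as listed above; the dictionary keys are assumed labels.
TOOL_FOR_DOCUMENT_TYPE = {
    "invoice": "parse_invoice",
    "bank_statement": "parse_bank_statement",
    "contract": "parse_contract",
    "generic": "parse_generic_document",
}


def build_tool_call(document_type: str, file_url: str, request_id: int = 1) -> dict:
    """Build an MCP `tools/call` request (JSON-RPC 2.0) for the matching tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": TOOL_FOR_DOCUMENT_TYPE[document_type],
            "arguments": {"file_url": file_url},
        },
    }


request = build_tool_call("invoice", "https://example.com/invoice.pdf")
print(json.dumps(request, indent=2))
```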
Replace YOUR_APIFY_TOKEN with your token from Apify → Settings → Integrations.
claude mcp add --transport http apify \
  "https://mcp.apify.com/?actors=opportunity-biz/document-to-json-mcp"
Add to your MCP config (claude_desktop_config.json or Cursor's mcp.json):
{
  "mcpServers": {
    "document-to-json": {
      "command": "npx",
      "args": [
        "-y", "mcp-remote",
        "https://mcp.apify.com/?actors=opportunity-biz/document-to-json-mcp",
        "--header", "Authorization: Bearer YOUR_APIFY_TOKEN"
      ]
    }
  }
}
The agent then sees parse_invoice, parse_bank_statement, parse_contract, and
parse_generic_document as tools and calls them on its own.
One call in, JSON out. The run-sync-get-dataset-items endpoint returns the result directly:
curl -X POST \
  "https://api.apify.com/v2/acts/opportunity-biz~document-to-json-mcp/run-sync-get-dataset-items?token=YOUR_APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"file_url": "https://example.com/invoice.pdf", "document_type": "invoice", "validate_totals": true}'
from apify_client import ApifyClient

client = ApifyClient("YOUR_APIFY_TOKEN")

# Start the Actor and wait for the run to finish.
run = client.actor("opportunity-biz/document-to-json-mcp").call(run_input={
    "file_url": "https://example.com/invoice.pdf",
    "document_type": "invoice",
})

# Print every item the run stored in its default dataset.
for item in client.dataset(run["defaultDatasetId"]).iterate_items():
    print(item)
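Rather than printing every item, a caller may want to keep only confident extractions. The sketch below assumes each dataset item has the same `success` / `data` / `confidence` shape as the sample response; that shape, the `usable_extraction` helper, and the 0.9 threshold are all assumptions for illustration.

```python
from typing import Optional


def usable_extraction(item: dict, min_confidence: float = 0.9) -> Optional[dict]:
    """Return the extracted payload if the item succeeded and is confident enough."""
    if not item.get("success"):
        return None
    data = item.get("data") or {}
    # A missing confidence value is treated as zero, i.e. not trusted.
    if data.get("confidence", 0.0) < min_confidence:
        return None
    return data


item = {"success": True, "data": {"document_type": "invoice", "confidence": 0.97}}
print(usable_extraction(item) is not None)  # → True
```

Items that return `None` can be routed to manual review instead of being dropped silently.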
import { ApifyClient } from 'apify-client';

const client = new ApifyClient({ token: 'YOUR_APIFY_TOKEN' });

// Start the Actor and wait for the run to finish.
const run = await client.actor('opportunity-biz/document-to-json-mcp').call({
    file_url: 'https://example.com/invoice.pdf',
    document_type: 'invoice',
});

// Read the extracted items from the run's default dataset.
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(items);
Use an HTTP Request node (POST) to the REST API URL above, or the official
Apify node → select document-to-json-mcp → set file_url and document_type.
Great for "watch inbox → extract invoice → append to Google Sheet" workflows.
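The "append to Google Sheet" step amounts to flattening the nested invoice JSON into one row. The sketch below shows that flattening with CSV output standing in for the sheet; the column choice and the `invoice_to_row` helper are hypothetical, and the field names are taken from the sample response.

```python
import csv
import io

COLUMNS = ["invoice_number", "invoice_date", "seller", "currency", "grand_total"]


def invoice_to_row(data: dict) -> list:
    """Flatten the `data` object of an invoice extraction into one spreadsheet row."""
    metadata = data["metadata"]
    return [
        metadata["invoice_number"],
        metadata["invoice_date"],
        data["seller"]["name"],
        metadata["currency"],
        data["totals"]["grand_total"],
    ]


# Values from the sample response.
extracted = {
    "metadata": {"invoice_number": "INV-2024-00123", "invoice_date": "2024-03-15", "currency": "EUR"},
    "seller": {"name": "Acme S.p.A.", "vat_id": "IT01234567890"},
    "totals": {"net_total": 5000.00, "vat_total": 1100.00, "grand_total": 6100.00},
}

buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(COLUMNS)
writer.writerow(invoice_to_row(extracted))
print(buffer.getvalue())
```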
Point your agent framework's MCP client at
https://mcp.apify.com/?actors=opportunity-biz/document-to-json-mcp and the parsing
tools are exposed automatically.