docs/tools/extract-tables.md

---
title: Extract Tables
description: Detect and extract tables from PDF documents. Export as CSV, JSON, or Markdown with per-table granularity.
---

# Extract Tables

Detects all tables in a PDF and lets you export them in your choice of format: CSV, JSON, or Markdown. Each table is extracted individually with page and position metadata. Powered by PyMuPDF's table detection engine.

## How It Works

1. Upload a PDF by clicking the drop zone or dragging a file onto it.
2. Select your preferred **export format** -- CSV, JSON, or Markdown.
3. Click **Extract** to start processing.
4. If one table is found, it downloads as a single file. Multiple tables produce a ZIP archive with one file per table.

## Options

- **Export Format** -- choose between:
  - **CSV** -- comma-separated values with proper quoting for fields containing commas, quotes, or newlines. Suitable for spreadsheets and databases.
  - **JSON** -- array-of-arrays structure, pretty-printed with 2-space indentation. Suitable for programmatic consumption.
  - **Markdown** -- pipe-delimited table format rendered by PyMuPDF. Suitable for documentation and README files.

## Output Format

- **Single table**: `filename_table.csv` (or `.json` / `.md`)
- **Multiple tables**: `filename_tables.zip` containing files named `table_1_page3.csv`, `table_2_page5.csv`, etc.

## Use Cases

- Extracting specific tables from lengthy PDF reports without pulling the entire document's data.
- Getting tables in Markdown format for pasting directly into GitHub issues, wikis, or documentation.
- Pulling structured JSON data from PDF tables for API integrations or scripts.
- Comparing tables across different PDF versions by exporting both to CSV and diffing.

## Tips

- This tool gives you per-table control. If you just want all tables dumped into a single CSV, use [PDF to CSV](./pdf-to-csv). For a full Excel workbook, use [PDF to Excel](./pdf-to-excel).
- Tables that span multiple pages may be detected as separate tables per page. You may need to concatenate them manually.
- If no tables are detected, the PDF might be a scanned image. Run it through OCR first to add a text layer.

## Related Tools

- [PDF to CSV](./pdf-to-csv)
- [PDF to Excel](./pdf-to-excel)
- [PDF to Text](./pdf-to-text)
Add documentation for all PDF tools 2026-03-20 21:48:48 +05:30			`---`
			`title: Extract Tables`
			`description: Detect and extract tables from PDF documents. Export as CSV, JSON, or Markdown with per-table granularity.`
			`---`

			`# Extract Tables`

			`Detects all tables in a PDF and lets you export them in your choice of format: CSV, JSON, or Markdown. Each table is extracted individually with page and position metadata. Powered by PyMuPDF's table detection engine.`

			`## How It Works`

			`1. Upload a PDF by clicking the drop zone or dragging a file onto it.`
			`2. Select your preferred export format -- CSV, JSON, or Markdown.`
			`3. Click Extract to start processing.`
			`4. If one table is found, it downloads as a single file. Multiple tables produce a ZIP archive with one file per table.`

			`## Options`

			`- Export Format -- choose between:`
			`- CSV -- comma-separated values with proper quoting for fields containing commas, quotes, or newlines. Suitable for spreadsheets and databases.`
			`- JSON -- array-of-arrays structure, pretty-printed with 2-space indentation. Suitable for programmatic consumption.`
			`- Markdown -- pipe-delimited table format rendered by PyMuPDF. Suitable for documentation and README files.`

			`## Output Format`

			- Single table: `filename_table.csv` (or `.json` / `.md`)
			- Multiple tables: `filename_tables.zip` containing files named `table_1_page3.csv`, `table_2_page5.csv`, etc.

			`## Use Cases`

			`- Extracting specific tables from lengthy PDF reports without pulling the entire document's data.`
			`- Getting tables in Markdown format for pasting directly into GitHub issues, wikis, or documentation.`
			`- Pulling structured JSON data from PDF tables for API integrations or scripts.`
			`- Comparing tables across different PDF versions by exporting both to CSV and diffing.`

			`## Tips`

			`- This tool gives you per-table control. If you just want all tables dumped into a single CSV, use [PDF to CSV](./pdf-to-csv). For a full Excel workbook, use [PDF to Excel](./pdf-to-excel).`
			`- Tables that span multiple pages may be detected as separate tables per page. You may need to concatenate them manually.`
			`- If no tables are detected, the PDF might be a scanned image. Run it through OCR first to add a text layer.`

			`## Related Tools`

			`- [PDF to CSV](./pdf-to-csv)`
			`- [PDF to Excel](./pdf-to-excel)`
			`- [PDF to Text](./pdf-to-text)`