admin管理员组

文章数量:1221751

I want to get all text include table inside pdf file using pymupdf4llm

import pymupdf4llm
import pathlib

md_text = pymupdf4llm.to_markdown("my_file.pdf")
pathlib.Path("result.md").write_bytes(md_text.encode())

But in result found my table have 2 in markdown.

here my pdf

one with no table

Another one in table format

how to get result text with just 1 table format?

本文标签: pythonpymupdf4llm get 1 table formatStack Overflow