Python: Convert a PDF to Images

Created: 2024-01-02 17:26 | Author: 风波 | Views: 14 | Category: Python

Method 1

1. Install PyMuPDF

pip install pymupdf -i https://mirrors.aliyun.com/pypi/simple/  --trusted-host mirrors.aliyun.com

2. Code

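import sys
import fitz  # PyMuPDF

pdf_path = sys.argv[1]  # path to the input PDF
doc = fitz.open(pdf_path)
total = len(doc)

for i in range(total):
    # Show 1-based progress on a single line
    print(f"{i + 1}/{total}     ", end='\r')
    pix = doc[i].get_pixmap()  # render the page at the default resolution
    pix.save(f'page_{i + 1}.png')

doc.close()
print("")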
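The loop above renders each page at PyMuPDF's default scale, which corresponds to 72 dpi, so the output can look coarse for scans or small print. Below is a minimal sketch of rendering at a higher resolution by passing a scaling matrix to get_pixmap. The helper names zoom_for_dpi, page_filename and render_pdf, the out_dir parameter, and the 200 dpi default are illustrative assumptions and not part of the original post.

```python
import os


def zoom_for_dpi(dpi):
    """Return the scale factor relative to the 72 dpi base of PDF user space."""
    return dpi / 72.0


def page_filename(index, total):
    """Build a zero-padded, 1-based file name so pages sort correctly."""
    width = len(str(total))
    return f"page_{index + 1:0{width}d}.png"


def render_pdf(pdf_path, out_dir=".", dpi=200):
    """Render every page of pdf_path to PNG files in out_dir (hypothetical helper)."""
    import fitz  # PyMuPDF; imported here so the helpers above work without it

    os.makedirs(out_dir, exist_ok=True)
    zoom = zoom_for_dpi(dpi)
    matrix = fitz.Matrix(zoom, zoom)  # scale x and y by the same factor
    doc = fitz.open(pdf_path)
    try:
        total = len(doc)
        for i in range(total):
            pix = doc[i].get_pixmap(matrix=matrix)
            pix.save(os.path.join(out_dir, page_filename(i, total)))
    finally:
        doc.close()
```

For example, render_pdf("input.pdf", "out", dpi=144) would write the pages at twice the default scale into a directory named out. Both the file name and the directory name here are placeholders.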

Method 2

1. Install poppler-utils

apt update
apt install poppler-utils -y
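pdf2image works by calling Poppler's command-line tools, so it can help to confirm that they are on PATH before moving on. The snippet below is a small sketch of such a check. The function name missing_poppler_tools is a hypothetical helper, and the default tool list (pdftoppm and pdfinfo) is an assumption about which binaries matter for a typical setup.

```python
import shutil


def missing_poppler_tools(tools=("pdftoppm", "pdfinfo")):
    """Return the Poppler command-line tools that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_poppler_tools()
if missing:
    print("Missing Poppler tools:", ", ".join(missing))
else:
    print("Poppler tools found")
```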

2. Install pdf2image

pip install pdf2image -i https://mirrors.aliyun.com/pypi/simple/  --trusted-host mirrors.aliyun.com

3. Convert the PDF to images

import os
import sys
from pdf2image import convert_from_path

# Render every page of the PDF into a list of PIL images
images = convert_from_path(sys.argv[1])

# Make sure the output directory exists before saving
os.makedirs('img', exist_ok=True)

for i, image in enumerate(images):
    # Save each page as a JPEG file
    image.save(f'img/page{i}.jpg', 'JPEG')
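Calling convert_from_path without a page range keeps every rendered page in memory at once, which can be heavy for long documents. The sketch below converts a few pages at a time using the first_page and last_page parameters, with pdfinfo_from_path supplying the page count. The helper names page_batches and convert_in_batches, the batch size of 10, and the 200 dpi setting are illustrative assumptions, not part of the original post.

```python
import os


def page_batches(total_pages, batch_size):
    """Yield inclusive, 1-based (first_page, last_page) ranges covering all pages."""
    for first in range(1, total_pages + 1, batch_size):
        yield first, min(first + batch_size - 1, total_pages)


def convert_in_batches(pdf_path, out_dir="img", batch_size=10, dpi=200):
    """Convert pdf_path to JPEG files a few pages at a time (hypothetical helper)."""
    # Imported here so page_batches can be used without pdf2image installed
    from pdf2image import convert_from_path, pdfinfo_from_path

    os.makedirs(out_dir, exist_ok=True)
    total = pdfinfo_from_path(pdf_path)["Pages"]
    for first, last in page_batches(total, batch_size):
        images = convert_from_path(pdf_path, dpi=dpi, first_page=first, last_page=last)
        for offset, image in enumerate(images):
            # Name files by their 1-based page number in the PDF
            image.save(os.path.join(out_dir, f"page{first + offset}.jpg"), "JPEG")
```

Unlike the script above, this sketch numbers the output files from 1 so that each name matches the page number in the PDF.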