PyPDF2 模块抽取PDF文件部分页面另存新文档

使用第三方模块 PyPDF2

from PyPDF2 import PdfFileReader, PdfFileWriter


def get_pdf_pages(pdf_file_path, output_fname, start_page, end_page):
    """
    pdf_file_path: 字符串,表示pdf文件路径
    output_fname: outpuf file name
    start_page: 开始页
    end_page: 结束页
    """

    pdf_reader = PdfFileReader(open(pdf_file_path, "rb"))
    output = PdfFileWriter()
    num_pages = pdf_reader.getNumPages()

    for idx in range(start_page, end_page):
        output.addPage(pdf_reader.getPage(idx))

    out_file = open(output_fname, "wb")
    output.write(out_file)


if __name__ == '__main__':
    start_page = 0
    end_page = 719

    get_pdf_pages("test.pdf", "output.pdf", start_page, end_page)

相关文档:PyPDF 模块实现多个 PDF 文件合并

你可能感兴趣的:(Python,PyPDF2,PDF处理,Python库,文件操作,页面提取)