http://pybrary.net/pyPdf/
是純python寫的,可以利用python對pdf做一些事:
- extracting document information (title, author, ...),
- splitting documents page by page,
- merging documents page by page,
- cropping pages,
- merging multiple pages into a single page,
- encrypting and decrypting PDF files.
XHTML to PDF
http://pypi.python.org/pypi/pisa/
PDFminer
http://www.unixuser.org/~euske/python/pdfminer/programming.html
Other Python PDF library.
http://stackoverflow.com/questions/25665/python-module-for-converting-pdf-to-text
http://stackoverflow.com/questions/25665/python-module-for-converting-pdf-to-text
引自:http://quickteckiteasy.blogspot.com/2011/01/pythonpdf.html