用python操作PDF

http://pybrary.net/pyPdf/
是純python寫的,可以利用python對pdf做一些事:
  • extracting document information (title, author, ...),
  • splitting documents page by page,
  • merging documents page by page,
  • cropping pages,
  • merging multiple pages into a single page,
  • encrypting and decrypting PDF files.


XHTML to PDF
http://pypi.python.org/pypi/pisa/


PDFminer
http://www.unixuser.org/~euske/python/pdfminer/programming.html

Other Python PDF library.
http://stackoverflow.com/questions/25665/python-module-for-converting-pdf-to-text
http://stackoverflow.com/questions/25665/python-module-for-converting-pdf-to-text

引自:http://quickteckiteasy.blogspot.com/2011/01/pythonpdf.html

你可能感兴趣的:(python,XHTML)