Python Automation Series #9 : How to extract a page from a PDF file with Python ?

preview_player
Показать описание
PDF documents are binary files, which makes them much more complex than plaintext files.

In addition to text, they store font, color and layout information.

PyPDF2 does not have a way to extract images, charts or other media from PDF documents, but it can extract text and return it as a Python string.

Рекомендации по теме
Комментарии
Автор

I wanted to ask what if you want to get all the page numbers from the pdf file? What can you do to do?

georges_production