استخراج الصفحات من PDF في SDK بيthon
استخراج صفحات من وثيقة PDF باستخدام السحابة باستخدام SDK بيthon السحابة.
Get Startedكيفية الحصول على الصفحات من PDF عبر SDK بيthon السحابة
لتناول الصفحات من PDF، سنستخدم Aspose.PDF Cloud Python SDK. هذا SDK السحابة يساعد مبرقبي بيthon في تطوير تطبيقات السحابة الصحة PDF، المشرفين، Penny، والمحولين باستخدام لغة البرمجة بيthon عبر Aspose.PDF REST API. بدون خروجي عن حساب في Aspose for Cloud وامبسج الرسوم المعلومات. عندما يكون لديك Key،觉صول السندوت Aspose.PDF Cloud Python SDK. إذا كان حزمة بيthon موقوفة على Github، يمكنك التثبيت مباشرة من Github:
Installation from Github
pip install git+https://github.com/aspose-pdf-cloud/aspose-pdf-cloud-python.git
Package Manager Console Command
pip install asposepdfcloud
خطوات للحصول على صفحات من PDF عبر SDK البيثن
يمكن لم البرمجيين Aspose.PDF Cloud تحميل الصفحات من PDF lebenge linee من code.
- تثبيت Python SDK ر.1 arriba a PDF document to the Aspose Cloud server
- تتنزيل الوثيقة المعالجة PDF من مفوضية Aspose Cloud server 1.boden page معلومات الوثيقة PDF
Extract Pages from PDF using Python
import shutil
import json
import logging
from pathlib import Path
from asposepdfcloud import ApiClient, PdfApi, DocumentPagesResponse
# Configure logging
logging.basicConfig(level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s")
class Config:
"""Configuration parameters."""
CREDENTIALS_FILE = Path(r"C:\\Projects\\ASPOSE\\Pdf.Cloud\\Credentials\\credentials.json")
LOCAL_FOLDER = Path(r"C:\Samples")
PDF_DOCUMENT_NAME = "sample.pdf"
LOCAL_RESULT_DOCUMENT_NAME = "output_sample.png"
PAGE_NUMBER = 2
class PdfPages:
""" Class for managing PDF pages using Aspose PDF Cloud API. """
def __init__(self, credentials_file: Path = Config.CREDENTIALS_FILE):
self.pdf_api = None
self._init_api(credentials_file)
def _init_api(self, credentials_file: Path):
""" Initialize the API client. """
try:
with credentials_file.open("r", encoding="utf-8") as file:
credentials = json.load(file)
api_key, app_id = credentials.get("key"), credentials.get("id")
if not api_key or not app_id:
raise ValueError("init_api(): Error: Missing API keys in the credentials file.")
self.pdf_api = PdfApi(ApiClient(api_key, app_id))
except (FileNotFoundError, json.JSONDecodeError, ValueError) as e:
logging.error(f"init_api(): Failed to load credentials: {e}")
def upload_document(self):
""" Upload a PDF document to the Aspose Cloud server. """
if self.pdf_api:
file_path = Config.LOCAL_FOLDER / Config.PDF_DOCUMENT_NAME
try:
self.pdf_api.upload_file(Config.PDF_DOCUMENT_NAME, str(file_path))
logging.info(f"upload_document(): File {Config.PDF_DOCUMENT_NAME} uploaded successfully.")
except Exception as e:
logging.error(f"upload_document(): Failed to upload file: {e}")
def get_page_info(self):
""" Get page information of the PDF document. """
if self.pdf_api:
result_pages: DocumentPagesResponse = self.pdf_api.get_page(Config.PDF_DOCUMENT_NAME, Config.PAGE_NUMBER)
if result_pages.code == 200:
logging.info(f"Page #{Config.PAGE_NUMBER} information: {result_pages.page}")
else:
logging.error(f"Failed to get the page #{Config.PAGE_NUMBER}.")
def get_page_as_png(self):
""" Get page information of the PDF document. """
if self.pdf_api:
try:
result_pages = self.pdf_api.get_page_convert_to_png(Config.PDF_DOCUMENT_NAME, Config.PAGE_NUMBER)
local_path = Config.LOCAL_FOLDER / Config.LOCAL_RESULT_DOCUMENT_NAME
shutil.move(result_pages, str(local_path))
logging.info(f"download_result(): File successfully downloaded: {local_path}")
except Exception as e:
logging.error(f"download_result(): Failed to download file: {e}")
if __name__ == "__main__":
pdf_pages = PdfPages()
pdf_pages.upload_document()
pdf_pages.get_page_info()
pdf_pages.get_page_as_png()
Work with Pages in PDF
Extracting pages from a PDF is a common task that serves various purposes across different contexts. This process involves selecting specific pages from a larger document to create a new, separate PDF file. Understanding the reasons behind this practice can help in effectively managing and utilizing PDF documents. Large PDF files can be cumbersome to share or store. By extracting only the necessary pages, users can create smaller, more manageable files. This is particularly useful when only a portion of the document is relevant for a specific purpose. For instance, removing unnecessary pages can significantly decrease the file size, making it easier to handle and distribute. Extracting pages allows users to repurpose content for different applications. For example, one might extract pages from a comprehensive report to create a standalone summary or to isolate specific data for analysis. This enables the reuse of existing content without the need to recreate information from scratch. Extract the Pages from PDF documents with Aspose.PDF Cloud Python SDK.
With our Python library you can:
- Combine PDF documents.
- Split PDF Files.
- Convert PDF to other formats, and vice versa.
- Manipulate Annotations.
- Work with Images in PDF, etc.
- You can try out our free App to test the functionality online.