Compare Word or PDF files using Python SDK

Powerful Python library to compare files in PDF, Word, HTML, TXT, MD and other formats

Use our Python file difference checker, compare two documents and see the differences between them clearly shown in the output. You no longer need to compare your documents manually. Also note that the comparison result can be exported to a DOCX, PDF, DOC and others.

View code snippet

Comparing two files in Python

Our Python solution is designed to give you the best possible results when using a complex comparison algorithm. So, you no longer need to manually look up differences in documents – feel free to use our API to diff two files.

Our comparison engine looks for differences at the level of characters or whole words. And if only one character was changed, the whole word will be highlighted as changed. All found changes can be clearly viewed in the output and exported in a convenient format.

The most popular types of comparison are comparing PDF and Word files. Therefore, we will learn our Python diff tool and its comparison function using these examples.

Compare PDF in Python

Sometimes you need to make sure that your PDF document has not been modified and compare it with the original version. Or you already know that your PDF has really changed, and you need to look where and how. Then you can compare two versions of PDF files and see the differences. And to avoid doing it manually, just use our Python diff checker API.

With our Python solution, you will see even small changes that you could easily miss in a manual comparison.

Compare two Word documents using Python

Word documents are fairly easy to change, so it is important to compare Word files to make sure parts or the entire document have not changed.

To compare Word files, use our Python solution, just like with PDF. You can do this in the example below.

Сompare two documents

To see how our Python document compare works, load two files you want to compare and choose an export file format.

After the comparison is completed, the document with the diff will be automatically downloaded in the format you selected.

It is important that the documents being compared must not have any revisions before the comparison, so we need to accept all revisions before the compare starts. Do not worry, we have taken care of that in our example:

Upload the compared file
Run code
Upload the second file to compare
Select the target format from the list
import asposewordscloud

words_api = 
    WordsApi(client_id = '####-####-####-####-####', client_secret = '##################')
file_name1= 'Input1.docx'
file_name2= 'Input2.docx'
file_result= 'Output.docx'

#  Upload documents to cloud storage.
first_document_content = open(file_name1, 'rb')
upload_first_file_request = 
    asposewordscloud.models.requests.UploadFileRequest(
	file_content=first_document_content, path=file_name1)
words_api.upload_file(upload_first_file_request)

second_document_content = open(file_name2, 'rb')
upload_second_file_request = 
    asposewordscloud.models.requests.UploadFileRequest(
	file_content=second_document_content, path=file_name2)
words_api.upload_file(upload_second_file_request)

#  Compare documents in cloud.
compare_data_options = 
    asposewordscloud.CompareData(
	author='author', comparing_with_document=file_name2)
compare_document_request = 
    asposewordscloud.models.requests.CompareDocumentRequest(
	name=file_name1, compare_data=compare_data_options, dest_file_name=file_result)
words_api.compare_document(compare_document_request)

#  Download result document from cloud storage.
download_file_request = asposewordscloud.models.requests.DownloadFileRequest(path=file_result)
download_file_result = words_api.download_file(download_file_request)
open('compareResult.pdf','wb').write(download_file_result)
Run code
  
Copy Python code to the clipboard

How to Compare two versions of a file

  1. Install Aspose.Words Cloud for Python
  2. Add a library reference (import the library) to your Python project
  3. Load two documents to compare
  4. Call the compare_document() method to compare two docs
  5. Download the result document from the cloud storage

Python library to use compare

Install Aspose.Words Cloud SDK for Python using PyPi repository. Run pip install aspose-words-cloud to install the SDK, then import the package via import asposewordscloud. As an alternative, you can manually clone Aspose.Words Cloud SDK for Python source code from GitHub and use it in your project. Please follow these Instructions to quickly get the necessary security credentials and access our REST API.

System Requirements

Refer to the Repository Documentation to see the details.

Most popular formats for compare operations

5%

Subscribe to Aspose Product Updates

Get monthly newsletters and offers directly delivered to your mailbox.

© Aspose Pty Ltd 2001-2024. All Rights Reserved.