Parse PDF to Extract Text with the Go SDK
An API for parsing PDF documents and extracting text using the server-side Go SDK.
How to parse PDF documents to extract text using the Cloud Go SDK
To parse PDF documents and extract text, we'll use the Aspose.PDF Cloud Go SDK. This SDK helps Go programmers develop cloud-based PDF creator, annotator, editor, converter, and parser apps in Go via the Aspose.PDF REST API. Run the following command from your terminal to install it.
Install Command
go get -u github.com/aspose-pdf-cloud/aspose-pdf-cloud-go/v25
Steps to parse PDF to extract text using the Go SDK
With Aspose.PDF Cloud, developers can parse PDF documents and extract text in just a few lines of code.
- Create a new Configuration object with your Application Secret and Key
- Create an object to connect to the Cloud API
- Upload your document file
- Parse the PDF document in cloud storage and extract its text boxes using the GetDocumentTextBoxFields function
- Check the response and log the result
- Save the text box information locally as a JSON file if needed
This sample code shows how to parse a PDF document to extract text:
package main

import (
	"encoding/json"
	"fmt"
	"os"
	"path"
	"path/filepath"

	asposepdfcloud "github.com/aspose-pdf-cloud/aspose-pdf-cloud-go/v25"
)

// ParseExtractTextBoxes extracts the text boxes from the document
// and saves them to a local JSON file.
func ParseExtractTextBoxes(documentName string, localFolder string, remoteFolder string) {
	// Get your AppSecret and Key from https://dashboard.aspose.cloud (free registration required).
	// APP_SID and APP_KEY are expected to be defined elsewhere in this package.
	pdfApi := asposepdfcloud.NewPdfApiService(APP_SID, APP_KEY, "")
	args := map[string]interface{}{
		"folder": remoteFolder,
	}

	// Upload the local document to cloud storage.
	file, err := os.Open(filepath.Join(localFolder, documentName))
	if err != nil {
		fmt.Println(err.Error())
		return
	}
	defer file.Close()
	if _, _, err = pdfApi.UploadFile(path.Join(remoteFolder, documentName), file, args); err != nil {
		fmt.Println(err.Error())
		return
	}

	// Request the text box fields of the uploaded document.
	result, httpResponse, err := pdfApi.GetDocumentTextBoxFields(documentName, args)
	if err != nil {
		fmt.Println(err.Error())
		return
	}
	if httpResponse.StatusCode < 200 || httpResponse.StatusCode > 299 {
		fmt.Println("ParseExtractTextBoxes(): Failed to extract text boxes from the document.")
		return
	}
	if result.Fields == nil || len(result.Fields.List) == 0 {
		fmt.Println("ParseExtractTextBoxes(): Text boxes not found in the document.")
		return
	}
	for _, textBox := range result.Fields.List {
		fmt.Println("TextBox", textBox)
	}

	// Marshal the whole list in one call so the file holds a valid JSON array.
	resultJson, err := json.MarshalIndent(result.Fields.List, "", "  ")
	if err != nil {
		fmt.Println(err.Error())
		return
	}
	fileName := filepath.Join(localFolder, "parsed_text_boxes_output_go.json")
	if err := os.WriteFile(fileName, resultJson, 0644); err != nil {
		fmt.Println(err.Error())
		return
	}
	fmt.Println("File '" + fileName + "' successfully saved.")
}
Work with text parsing in PDF via the Go SDK
Parsing a PDF document returns its TextBox fields, whose content you can then review or modify as needed. The text keeps its position in the document, which saves time and reduces manual work. Parse PDF documents and extract text with the Aspose.PDF Cloud Go SDK.
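As a small illustration of working with extracted text boxes, the sketch below groups their values by page. It does not call the SDK: the textBox struct is a simplified, hypothetical stand-in with assumed field names (FullName, Value, PageIndex), so check the SDK's own model before adapting it.

```go
package main

import (
	"fmt"
	"sort"
)

// textBox is a simplified, hypothetical stand-in for an extracted text box.
// The field names are assumptions for this sketch.
type textBox struct {
	FullName  string
	Value     string
	PageIndex int
}

// valuesByPage groups text box values by the page they belong to,
// preserving the input order within each page.
func valuesByPage(boxes []textBox) map[int][]string {
	pages := make(map[int][]string)
	for _, b := range boxes {
		pages[b.PageIndex] = append(pages[b.PageIndex], b.Value)
	}
	return pages
}

func main() {
	boxes := []textBox{
		{FullName: "name", Value: "Jane Doe", PageIndex: 1},
		{FullName: "city", Value: "Oslo", PageIndex: 1},
		{FullName: "notes", Value: "None", PageIndex: 2},
	}
	pages := valuesByPage(boxes)

	// Sort the page numbers first, because map iteration order is not fixed.
	indexes := make([]int, 0, len(pages))
	for p := range pages {
		indexes = append(indexes, p)
	}
	sort.Ints(indexes)
	for _, p := range indexes {
		fmt.Println("page", p, pages[p])
	}
}
```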
With our Go SDK you can:
- Add headers & footers to PDF documents in text or image format.
- Add tables & text or image stamps to PDF documents.
- Append multiple PDF documents to an existing file.
- Work with PDF attachments, annotations, & form fields.
- Apply encryption or decryption to PDF documents & set a password.
- Delete all stamps & tables from a page or entire PDF document.
- Delete a specific stamp or table from the PDF document by its ID.
- Replace single or multiple instances of text on a PDF page or across the entire document.
- Convert PDF documents to various other file formats.
- Extract various elements of PDF files & optimize PDF documents.
You can try out our free App to test the functionality.