Get Doc Info Metadata
This guide will walk you through the example of getting XMP doc info meta data from a pdf file.
Before you begin
You should get your API key from your UniCloud account.
If this is your first time using UniPDF SDK, follow this guide to set up a local development environment.
Project setup
Clone the project repository
In your terminal, clone examples repository. It contains the Go code we will be using for this guide.
git clone https://github.com/unidoc/unipdf-examples.git
Navigate to the metadata
folder in the unipdf-examples directory.
cd unipdf-examples/metadata
Configure environment variables
Replace the UNIDOC_LICENSE_API_KEY
with your API credentials from your UniCloud account.
Linux/Mac
export UNIDOC_LICENSE_API_KEY=PUT_YOUR_API_KEY_HERE
Windows
set UNIDOC_LICENSE_API_KEY=PUT_YOUR_API_KEY_HERE
How it works
The following example code gets metadata from pdf file.
In the above code in lines 9-16
unipdf packages and other necessary libraries are imported.
The init
function in lines 18-25
loads your api key and authenticates your request.
The main
function is defined in lines 27-44
. In lines 28-32
the input files are read from the command line arguments. The loop in lines 35-43
iterates through each file and extracts the document info and prints it using printPdfDocInfo(inputPath)
.
In lines 46-119
the function printPdfDocInfo
which is used to print the document info is defined. In line 47-51
the file is opened using os.Open(inputPath)
.
In lines 53-61
a new pdfReader object is instantiated and the number of pages is retrieved using pdfReader.GetNumPages()
. The document info is extracted using pdfReader.GetPdfInfo()
in line 63
.
In lines 68-71
a new pdfDocInfo
object is instantiated using the following code.
di := pdfDocInfo{
Filename: path.Base(inputPath),
NumPages: numPages,
}
In lines 73-114
all the available information is extracted from the pdfInfo
and is set to the pdfDocInfo
. And in line 116
the information is printed using di.print()
.
In lines 122-136
the pdfDocInfo
type is defined. In lines 139-157
The print
method of pdfDocInfo
which is used to print the extracted pdf document information is defined.
Run the code
go run pdf_metadata_get_docinfo.go <input1.pdf> <input2.pdf> ...