The advanced, utilities workflow component, PDF Metadata to XML can help you to extract both metadata as well as key/values for form fields from a PDF file. The PDF must be loaded into the payload of the workflow before running this component. The component returns the result as XML into the payload.
The component has only a comment as parameter, which is optional and it has not effect on the component:
The resulting XML file can look like this:
The component can e.g. be used as shown in the workflow below:
In this example these components are used:
In this component we read PDF files (with extension .pdf) from a directory, that are to be processed in this workflow.
PDF Metadata to XML
This exacts the metadata as described above - without any parameters.
Here we save the resulting XML file:
If we would want to refer to the name of the input PDF file, then we can find it as one of the predefined variables for input files.