Pairing PDF: Extract Text to a Field and Images: Optical Character Recognition (OCR)
I'm trying to pair these these actions in this sequence:
- Files: Get file info
- PDF: Extract text to a field
- Images: Optical character recognition (OCR)
The OCR step should only start if:
- (The file extension is PDF and the text field from no. 2 is empty) OR
- The file type is image
Interestingly the PDF: Extract text to a field step seems to be returning two (2) line breaks \n\n, even over a .pdf file that was created from an image, i.e. the field is not empty, so the condition isn't met.
Is this expected output for that action? I've tried a few different electronically-created files and the result seems to be the same.