Pairing PDF: Extract Text to a Field and Images: Optical Character Recognition (OCR)

Thomas_937381 Posts: 196
I'm trying to pair these actions in this sequence:

  1. Files: Get file info
  2. PDF: Extract text to a field
  3. Images: Optical character recognition (OCR)

The OCR step should only start if:

  1. (The file extension is PDF and the text field from no. 2 is empty) OR
  2. The file type is image

Interestingly, the PDF: Extract text to a field step seems to return two line breaks (\n\n), even for a .pdf file that was created from an image. As a result, the field is not empty and the condition isn't met.
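
To make the intended condition concrete, here is a rough sketch of the logic in Python. This is not the platform's own condition syntax. The function name, the parameter names and the list of image extensions are assumptions made up for illustration. The one deliberate choice is that text containing only whitespace (such as \n\n) counts as empty.

```python
from typing import Optional

# Assumed set of image extensions; adjust to whatever file types are actually in use.
IMAGE_EXTENSIONS = {"jpg", "jpeg", "png", "tif", "tiff", "bmp", "gif"}


def should_run_ocr(extension: str, extracted_text: Optional[str]) -> bool:
    """Return True when the OCR step should start (hypothetical helper)."""
    # Normalize the extension: ".PDF" and "pdf" are treated the same.
    ext = extension.lower().lstrip(".")

    # Treat None, "" and whitespace-only text (e.g. "\n\n") as empty.
    text_is_blank = not (extracted_text or "").strip()

    # Condition 1: a PDF with no usable extracted text.
    if ext == "pdf" and text_is_blank:
        return True

    # Condition 2: the file is an image.
    return ext in IMAGE_EXTENSIONS
```

With this logic, a PDF whose extracted text is just "\n\n" would still go to OCR, because the check trims whitespace before testing for empty rather than comparing against an empty string.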

Is this the expected output for that action? I've tried a few different electronically-created files and the result seems to be the same.


