Friday, December 20, 2013

OCR: multi-page pdf

I just discovered the excellent answer at:

http://askubuntu.com/questions/271271/how-do-i-produce-a-multi-page-sandwich-pdf-with-hocr2pdf

Basically, one just need (from a sid/schroot):

$ tesseract input.png output.hocr -l fra

Et voila !