OCR your documents before index
Перейти к файлу
Andy Scherzinger 143715fdcc
Merge pull request #71 from nextcloud/docs/noid/reuse-compliance
Add reuse compliance
2024-11-01 15:02:06 +01:00
.github/workflows
LICENSES
appinfo
js
lib
templates
.gitignore
.scrutinizer.yml
AUTHORS.md
CHANGELOG.md
LICENSE
Makefile
README.md docs(readme): Add reuse status badge 2024-10-29 19:24:23 +01:00
REUSE.toml
composer.json
composer.lock

README.md

files_fulltextsearch_tesseract

REUSE status

OCR your documents before index

Installation / Setup

  • install Tesseract

  • download language files from: https://github.com/tesseract-ocr/tessdata

  • copy language files into /usr/share/tessdata/ (or /usr/share/tesseract-ocr/tessdata/, depends on our distribution)

  • configure this app in the Full text search Admin panel

  • report bugs

more

devblog about PDF and OCR: https://daita.github.io/files-fulltextsearch-tesseract-ocr-pdf/