|
Option | Description | Default |
---|---|---|
Automatic indexing of file content | Uses command line tools to extract the information from the files based on their MIME types. | Disabled |
Automatic indexing of emails stored as files | Parses message/rfc822 types of files (aka eml files) and stores individual email headers and content in search index. | Disabled |
Asynchronous indexing | Enabled | |
OCR Files | Extract and index text from supported file types. | Disabled |
OCR Every File | Attempt to OCR every supported file. | Disabled |
Allow file level OCR languages | Allow users to change the default languages that will be used to OCR a file. | Enabled |
OCR limit languages | Limit the number of languages one can select from this list. Auto detect languages | Afrikaans (Afrikaans) | Albanian (Shqip) | Amharic (á ááá) | Arabic | Arabic (اÙعربÙØ©) | Armenian | Armenian (ÕÕ¡ÕµÕ¥ÖÕ¥Õ¶) | Assamese (ঠসমà§à¦¯à¦¼à¦¾) | Azerbaijani (azÉrbaycan dili) | Azerbaijani (azÉrbaycan dili) (cyrl) | Basque (euskara, euskera) | Belarusian (белаÑÑÑÐºÐ°Ñ Ð¼Ð¾Ð²Ð°) | Bengali | Bengali (বাà¦à¦²à¦¾) | Bosnian (bos... |
None |
tesseract path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
/usr/bin/tesseract |
pdfimages path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
Pdfimages |
Option | Description | Default |
---|---|---|
Automatic indexing of file content | Uses command line tools to extract the information from the files based on their MIME types. | Disabled |
Automatic indexing of emails stored as files | Parses message/rfc822 types of files (aka eml files) and stores individual email headers and content in search index. | Disabled |
Asynchronous indexing | Enabled | |
OCR Files | Extract and index text from supported file types. | Disabled |
OCR Every File | Attempt to OCR every supported file. | Disabled |
Allow file level OCR languages | Allow users to change the default languages that will be used to OCR a file. | Enabled |
OCR limit languages | Limit the number of languages one can select from this list. Auto detect languages | Afrikaans (Afrikaans) | Albanian (Shqip) | Amharic (á ááá) | Arabic | Arabic (اÙعربÙØ©) | Armenian | Armenian (ÕÕ¡ÕµÕ¥ÖÕ¥Õ¶) | Assamese (ঠসমà§à¦¯à¦¼à¦¾) | Azerbaijani (azÉrbaycan dili) | Azerbaijani (azÉrbaycan dili) (cyrl) | Basque (euskara, euskera) | Belarusian (белаÑÑÑÐºÐ°Ñ Ð¼Ð¾Ð²Ð°) | Bengali | Bengali (বাà¦à¦²à¦¾) | Bosnian (bos... |
None |
tesseract path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
/usr/bin/tesseract |
pdfimages path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
Pdfimages |
Option | Description | Default |
---|---|---|
Automatic indexing of file content | Uses command line tools to extract the information from the files based on their MIME types. | Disabled |
Automatic indexing of emails stored as files | Parses message/rfc822 types of files (aka eml files) and stores individual email headers and content in search index. | Disabled |
Asynchronous indexing | Enabled | |
OCR Files | Extract and index text from supported file types. | Disabled |
OCR Every File | Attempt to OCR every supported file. | Disabled |
Allow file level OCR languages | Allow users to change the default languages that will be used to OCR a file. | Enabled |
OCR limit languages | Limit the number of languages one can select from this list. Auto detect languages | Afrikaans (Afrikaans) | Albanian (Shqip) | Amharic (á ááá) | Arabic | Arabic (اÙعربÙØ©) | Armenian | Armenian (ÕÕ¡ÕµÕ¥ÖÕ¥Õ¶) | Assamese (ঠসমà§à¦¯à¦¼à¦¾) | Azerbaijani (azÉrbaycan dili) | Azerbaijani (azÉrbaycan dili) (cyrl) | Basque (euskara, euskera) | Belarusian (белаÑÑÑÐºÐ°Ñ Ð¼Ð¾Ð²Ð°) | Bengali | Bengali (বাà¦à¦²à¦¾) | Bosnian (bos... |
None |
tesseract path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
/usr/bin/tesseract |
pdfimages path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
Pdfimages |
Option | Description | Default |
---|---|---|
Automatic indexing of file content | Uses command line tools to extract the information from the files based on their MIME types. | Disabled |
Automatic indexing of emails stored as files | Parses message/rfc822 types of files (aka eml files) and stores individual email headers and content in search index. | Disabled |
Asynchronous indexing | Enabled | |
OCR Files | Extract and index text from supported file types. | Disabled |
OCR Every File | Attempt to OCR every supported file. | Disabled |
Allow file level OCR languages | Allow users to change the default languages that will be used to OCR a file. | Enabled |
OCR limit languages | Limit the number of languages one can select from this list. Auto detect languages |
None |
tesseract path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
Tesseract |
pdfimages path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
/usr/bin/pdfimages |
Option | Description | Default |
---|---|---|
Automatic indexing of file content | Uses command line tools to extract the information from the files based on their MIME types. | Disabled |
Asynchronous indexing | Enabled | |
OCR Files | Extract and index text from supported file types. | Disabled |
OCR Every File | Attempt to OCR every supported file. | Disabled |
Allow file level OCR languages | Allow users to change the default languages that will be used to OCR a file. | Enabled |
OCR limit languages | Limit the number of languages one can select from this list. Auto detect languages |
None |
tesseract path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
sh: 1: where: not found |
pdfimages path | Path to the location of the binary. Defaults to the $PATH location. If blank, the $PATH will be used, but will likely fail with scheduler. |
sh: 1: where: not found |