Skip to main content
All CollectionsUploadCommon Upload Questions
How is Document Metadata Extracted and Displayed?
How is Document Metadata Extracted and Displayed?

Get a deep-dive view of document metadata throughout the application.

John OHara avatar
Written by John OHara
Updated over a week ago

Extracting and Indexing Metadata

Document Date Metadata

Logikcull uses the following sequence of metadata fields (depending on the field's availability in the metadata), in order of priority, to determine a document's date:

File and Cloud Uploads

  1. Email Date Sent

  2. Email Date Received

  3. Email Date Modified

  4. Email Date Created

  5. File Date Modified

  6. File Date Created

  7. System Date Modified

  8. System Date Created

Database Uploads

  1. “import_document_date”, (ie mapping the “document date” field)

  2. “slack_document_date”,

  3. “static_document_date”,

  4. “email_time_sent”,

  5. “email_time_received”,

  6. “email_time_modified”,

  7. “email_time_created”,

  8. “file_time_modified”,

  9. “file_time_created”,

  10. “mtime”,

  11. “ctime”,

  12. “created_at”

Document Language

Logikcull will label a document with the language that is most predominant. If more than one language exists in a document, Logikcull will decide which to call primary based on a sampling of text throughout the record. Translation services are not offered at this time.

Using extracted text, Logikcull detects the following languages:

Albanian
Arabic
Bengali
Bulgarian
Croatian
Czech
Danish
Dutch
English
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Indonesian
Italian
Japanese
Korean
Macedonian
Norwegian
Persian
Polish
Portuguese
PRC Chinese [Simplified Chinese Characters]
Punjabi
Romanian
Russian
Slovakian
Spanish
Swedish
Taiwanese Chinese [Traditional Chinese Characters]
Thai
Turkish
Ukrainian
Vietnamese

Searching by languages in Logikcull

The search builder or the the filter carousel can be utilized to search for a documents' primary language.

Metadata Filters

Logikcull extracts and indexes document metadata into powerfully simple filter facets. View all the available filters here. 🔗

Characters Not Indexed

Certain characters are not indexed and thus are treated as a space when running searches. Logikcull does not index the following characters:

. ? ! , ; : - ( ) [ ] { } ' " < >
​
This also includes unicode characters u+2013 (en dash), u+2014 (em dash), and u+2026 (ellipses): – — …

Displaying Metadata

From the Search Results page

Hover over the vertical dots on the document card and select "Document Info"

From the document viewer

Click the “Show More” button in the document viewer.

From the toolbar

Did this answer your question?