QC stands for Quality Control. That’s the basic idea behind them: we want to alert you to things in your data that might’ve traditionally gone unnoticed and that may or may not be important to your matter.
The QC Tag tells you things like:
How many potentially privileged emails were detected?
How many documents were from a database upload?
How many documents have a duplicate?
What if a person was sending privileged emails with PDF attachments to a law firm that the attachments themselves contained non-searchable attachments, like photos or scanned documents? And what if those non-searchable attachments were critical documents to your case?
With Logikcull, you’d be able to answer that question. This is just one example out of thousands where you’d want to leverage QC Tags to know more about your data.
If you're curious what each QC Tag means, simply hover over the name in the filter and the tooltip box will tell you.
ℹ️ QC Tags with an * next to their name indicate Subscription Only features.
Documents that are not email attachments, but have come from within another document. E.g., a .PPT within a .DOC is an embedded document
Containers that failed to explode any files
Documents that have been imported from Box
Documents imported from a database (i.e. production, load file, etc.)
Documents that are part of a Slack archive
Emails that contain BCC (Blind Carbon Copy) metadata
Has Deep Text
PDFs with additional searchable text that is found after running DTR (Deep Text Recognition). This indicates that the PDF has an embedded image that contains text.
Documents that are duplicates of other documents
Has Embedded Files
Documents that are not email, but contain embedded files as attachments. For instance, a .DOC that contains an embedded .PPT file
Has Hidden Comments
MS Excel documents containing hidden comments
PDFs containing comments or “sticky notes"
ℹ️ Please note, Logikcull does not render hidden comments in the document viewer. Depending on whether the document was uploaded with a text layer that notes the comments, Hidden Comments may be viewable in the Text View.
Has Hidden Worksheets
Documents that contain MS Excel hidden worksheets.
ℹ️ Please note, Logikcull will attempt to render hidden worksheets in the document viewer.
Has MS Office Macros
Documents that contain MS Office embedded macros
Has No Native
Imported documents that have no Native File
Has No Text
Documents without any extracted or OCRed text
Documents that contain MS Word revisions or document comments
Has Speaker Notes
Documents that contain MS PowerPoint speaker notes
Documents that are part of an Email thread
Documents that have been detected to contain a virus. These documents are quarantined during processing and can not be downloaded
Is a Copy
This document is a copy of a document from another project
Overlay(s) applied to document
Email that is the last message of an email thread or is a message without a thread. When part of a thread, this tag indicates the end of a particular thread and not the inclusiveness of the thread's contents within this email.
ℹ️ More information on this tag can be found in THIS ARTICLE.
Documents with incorrect or missing file extensions. E.g. a .DOC file that is actually a .PPT file but with an incorrect extension in the filename metadata
Documents identified as being part of the NSRL database of known computer files
Use this QC tag to find documents that contain zero QC Tags
Documents that were not rendered to PDF during processing
Document where OCR (Optical Character Recognition) was attempted but failed
Documents that were OCRed (Optical Character Recognition) so they can be searched.
Emails that have a law firm email address in the From, To, CC, or BCC fields. They are considered to be potentially privileged. Suggest a new domain name by clicking the Get Support link in your Account drop down menu at the top of the screen.
Documents that are password protected
The document's text was used to render the document to PDF. This happens if all other means to render the document fail
The transfer of this file from Box failed or was corrupted
Truncated Email Metadata
Documents whose To, CC, or BCC fields exceed system capacity for indexing
Documents whose text length exceeds system capacity for indexing
This document was copied to another project
Documents that have a file size of zero bytes. These documents contain no content.
From Google Vault*
Documents imported from Google Vault
Documents imported from MS365
Has Slack Deleted Messages*
Includes messages that were deleted in Slack
Has Slack Edited Messages*
Includes messages that were edited in Slack
PDF documents that have been split into smaller PDF files.
Emails that include all unique content of a thread.
Is Slack 1:1 DM*
Is Slack 1:1 Direct Message between two parties
Is Slack Multi-Party DM*
Is Slack Multi-Party Direct Messages
Is Slack Thread*
Is Slack Thread
Rendered From Import
Imported documents that have been rendered from native
Split From PDF
Documents that were created by splitting a large PDF into smaller PDF files.
Audio content was transcribed and is text-searchable
We were unable to transcribe audio content. This file is not text-searchable.
Personal Identity Information (PII) is detected with a 75% or greater confidence level
PII Detection Failed
PII Detection Failed in the Document
PII Detection Skipped
PII Detection was skipped