OntologyConcept: Collection

Collection

Collections are containers for Documents, where you can add, ingest, view, search, edit and manage Documents.

You can have one to many Collections within a Project.

Collections and Networks

The information extracted from Documents can be consolidated and used to populate a Network, which allows you to visualise and analyse the information.

The information contained in a Collection is used to populate a Network(s).

You can have one to many Collections populating a Network.

A single Collection can populate zero to many Networks.

Default Ingestion Configuration

Every Collection has a default Ingestion configuration assigned.

When you Add Documents to a Collection, the default Ingestion Configuration is selected.

Changing the Default

If you select a different Ingestion configuration when loading data, the selected Ingestion configuration becomes the default.

Warning

If you change the Ingestion Configuration when Documents have already been added into a Collection, you may get inconsistent results, especially when the information is consolidated into a Network.

For example, if the different Ingestion configurations are extracting different information from the Documents and/or applying a different Ontology.

Recommended Practice

If you need to use different Ingestion configurations, create a separate Collection for each configuration. These Collections can still be populating a single Network.

Have a single Ontology for your Project, to ensure consistent results in Networks.

For example, you may have a separate Collection for:

  • structured data from a database

  • information harvested from the Internet

  • reports ingested in PDF format.

all feeding extracted information into a single Network.

Collections and Networks
Add, Manage and View Documents

You can Add (Ingest) Documents from the Add Documents tab or use the Harvest Documents feature.

Once documents are added, they are automatically ingested and stored in a Collection.

The documents can be viewed from the Documents tab, where you can Manage Documents or select a document to view in the Documents pane.

Open a Collection

When you select the Collections tab, the Manage Collections pane is displayed. Clicking on a Collection Name opens that Collection, displaying a list of the documents stored in that Collection.

On the Manage Collections pane, you can also create, copy, import, export, rename, reprocess and delete Collections. See Manage Collections.

Once a Collection has been selected, four panes are available.

  • Once a Collection has been opened, the Document Collections pane is collapsed. This pane allows you to Manage Collections, including opening, creating, renaming, importing, exporting and deleting collections.

  • The Manage Documents pane allows you to manage documents, including adding, opening, reprocessing, renaming, exporting and deleting documents. See Manage Documents.

  • The Document pane allows you to view and edit a selected document. See Documents.

  • Ontology Section pane lists the Ontology currently applied to the documents.

Collections tabs

Below is a summary of each of the tabs displayed:

Collections Toolbar

The Collections Toolbar enables you to modify settings that affect an entire collection Once source data has been ingested, the resulting Documents are stored in a Collection. Once Documents are stored in a Collection, you can browse, search and edit Documents. Only text-based Documents is stored in Collections, not the original source files..

 

Task

Action

Rename a collection

  1. Select Rename Collection .
  2. Enter new name for your Collection.
  3. Select Rename.

Copy a Collection

  1. Select Copy Collection .
  2. Enter new name for your Collection.
  3. Select Copy.

Export a Collection

  1. Select Download Collection .
  2. Select Download.

A zip file containing the Collection documents in xml format will be downloaded to your local file system.

Delete a collection

Select Delete Collection .

All documents within the Collection will be completely erased from Sintelix. You will not be able to retrieve them if you did not export a backup.

Add Documents

Open the Add Documents Panel

Add (Ingest) Documents

Reprocess

Reprocess Documents in a Collection

Option 1 (Basic):
  1. Select Reprocess.
  2. In the Ingestion Configuration list, select an Ingestion Configuration.
  3. Select Reprocess.
Option 2 (Advanced):
  1. Select Reprocess.
  2. In the Ingestion Configuration list, select an Ingestion Configuration.
  3. Select the Exceptions check box.
  4. Select the check boxes for content you want include for reprocessing.
  5. Clear the check boxes for content you want to exclude from reprocessing.
  6. Select Reprocess.
Delete Documents

Delete all documents in a Collection

  1. Select Delete Documents.
  2. Select Delete All.

Network

Open Networks generated from Collection

 

Search

Open Collection in Sintelix Search

 

Text Reference Refiner

Open collection in text reference refinement

 
Ontology pane