Data

Data

Data Model

The data model functions as a structured tabular representation of bioinformatics datasets, enabling systematic organization, integration, and visualization of complex biological information. It establishes the computational foundation for vectorized operations in large-scale workflow executions and, at the same time, provides a unified framework for representing both workflow inputs and outputs.

Create Data Model

  1. Selecting the blue “+” control adjacent to the entity data model invokes the Import Entity Table dialog.

  1. At this stage, users may obtain the standardized data structure by selecting “Download CSV Template” and editing the file locally. The revised CSV must contain at least one valid entity record to ensure the integrity of the data model. Subsequently, the file can be imported by utilizing the upload function or by dragging and dropping it into the designated area, thereby completing the data registration process.

  2. Click “Import Table” to complete the creation of the data model.

Create Data Model

An entity set is constructed by aggregating ≥2 data records into a new collection, eliminating the need for manual array construction. Within the Entity Data Model view, select the target samples (n≥2) and invoke Generate Entity Set. The supplied Entity Set Name becomes the table identifier; if omitted, the identifier is auto-assigned as <EntitySetName>_set--YYYY-MM-DD-HH-mm-ss.

Workspace Data Model

Workspace-level data enables unified administration of shared resources utilized across multiple workflows within a Workspace. By adopting this approach, it is no longer necessary to redundantly associate common resources—such as reference datasets or container image addresses—with each individual record in the entity table.

Typically, you may associate resources like a reference genome with the Workspace Data Model. To import, click Import, then drag and drop a prepared CSV file into the dialog. You may also download or delete previously imported files.

NAS File Storage Data

The Workspace provides support for mounting Network Attached Storage (NAS).

  1. SSH remote connection to the cluster management node

  1. Select the node root directory

  1. Select the NFS directory

  1. Enter the bioos-storage directory and locate the file path mounted by the Workspace

  1. Use scp to upload data to the NAS mount path corresponding to the Workspace

  1. Use the NAS addresses of the reference data and sample data to create the data model

Last updated