Feature variables represent variables that store values that are associated with a feature in a specific sample. The describe e.g., the absolute or relative abundance, the chromatographic retention time, the peak shape, or especially when also processed data are imported, also properties like measurement accuracy.
The feature variables can be accessed in MiDAR functions by either
their internal name, which always starts with feature_
(e.g., feature_intensity
) or by their short names, such as
conc
, intensity
, norm_intesity
,
and rt
.
Key Feature Variables
Following feature variables are essential for organizing the data
processing flow in MiDAR
. The intensity
variable corresponds to the raw signal (e.g. peak area) of the analysis,
which are retrieved from one of the original feature variables (see
below) All feature variables downstream are the result of data
processing, including those available in the MiDAR workflow. If raw,
partial or fully post-processed data is imported into MiDAR, e.g. via
import_data_csv()
, the imported values must be assigned or
match these variables names.
Many data processing and plotting functions in MiDAR use of these
variables as input variable. All of these variable are stored in the
dataset
table present in the MidarExperiment
object.
Variable Short Name | Internal Name | Description | |
---|---|---|---|
intensity | feature_intensity | Raw signal intensity. A copy of either area ,
height , response , or intensity
from the imported raw data. See below. |
|
nom_intensity | fe ature_norm_intensity | Internal Standard-normalized raw signal intensity | |
pmol_total | feature_pmol_total | Total feature (analyte) amount in the measured sample (pmoles/sample) | |
conc | feature_conc | Feature (analyte) concentration | |
conc_normalized | fea ture_conc_normalized | Normalized feature (analyte) concentration normalized by e.g. reference sample or other normalization methods | |
conc_bias | feature_conc_ratio | The ratio of measured and |
expected/known feature | | concentration defined a | | reference sample.
The variable | | is generated by the function | |
calibrate_by_reference() in | | case of absolute
calibration. | |
Backup Feature Variables
Feature variables will be overwritten in some processing by re-calculated values, i.e., feature area after interference correction, concentrations after drift/batch correction or reference sample-based re-calibration. In these cases, the original feature values are stored in a new ‘backup’ feature variable to keep a record and allow exploring the variable at a later stage. Furthermore, specific variables are created by some processing functions, e.g. during drift correction the values of the curve (model fit) data points.
Also these variables are stored in the dataset
table.
The feature variable name is based on the initial name with following
postfixes:
Postfix of Variable Name | Examples | Description |
---|---|---|
_orig | feature _intensity_orig | Currently, only used for the backup before any interference
correction was applied. Corresponds always to the raw intensity values
that were imported into the MidarExperiment object. |
_raw |
f eature_conc_raw featur e_intensity_raw |
Corresponds to the raw (= uncorrected) calculated features values, i.e., before a new correction step was applied, such as drift/batch correction. |
_before |
feat ure_conc_before feature_i ntensity_before |
Corresponds to the last calculated value before a new correction step was applied, i.e., drift/batch correction. For example, adjusted concentration after a drift correction, before batch correction was applied. |
_beforecal | feature _conc_beforecal | The non-calibrated raw or adjusted concentration before `calibrate_by_reference()` was applied. Only applies to the concentration variable |
_fit | conc_before_fit | Model fit data points calculated by drift and batch correction
functions. Use by runscatter() to plot the trends. |
Raw Feature Variables
The supported feature variables listed below characterize the
features in additional aspects. Import is that only of variables can be
is used as “feature raw intensity” for all data processing steps. This
variable is then copied to intensity
after importing the
data, by default area
if available, or then
height
, response
or intensity
in
this order. The feature variable use for intensity
can also
manually be set via `set_intensity_var()`
These variables are stored in the dataset_orig
table,
and are not being modified by any MiDAR function. They are typically not
available in MiDAR’s data processing functions, but plotting functions
support some of these. The support will be extended in upcoming
versions.
Variable Short Name =================== area |
Internal Name ======================= feature_area |
Description ================================ Peak area |
|
height | height_height | Peak height | |
response | height_response | Feature response (vendor specific) | |
intensity | feature_intensity | Feature intensity (vendor specific) | |
rt | feature_rt | Retention time | |
fwhm | feature_fwhm | Full Width at Half Maximum Peak Height | |
width | feature_width | Peak Width | |
int_start | feature_int_start | Peak integration Start Time | |
int_end | feature_int_end | Peak Integration End Time | |
symmetry | feature_symmetry | Peak symmetry | |||
sn_ratio | feature_sn_ratio | Signal-to-Noise ratio | |
accuracy | feature_accuracy | Accuracy (often in reference to target value) | |
ionratio | feature_ionratio | Ion ratio |
QC type is represented with a consistent color scheme (both fill and line colors) and specific point shapes in all plots generated by the MiDAR package. This visual coding allows consistent identification and comparison of different QC types across various visualizations.