Metadata for the model. It is a column in the model catalog,
see Section 3.14.1.1, “The Model Catalog Table”, and a
parameter in ML_MODEL_IMPORT
.
The default value for model_metadata
is
NULL
.
model_metadata
has several fields that
replace deprecated columns in the model catalog, and fields
that support ONNX model import, see:
Section 3.14.2, “ONNX Model Import”.
model_metadata
contains the following
metadata as key-value pairs in JSON format:
-
task:
string
The task type specified in the
ML_TRAIN
query. The default isclassification
when used withML_MODEL_IMPORT
. -
build_timestamp:
number
A timestamp indicating when the model was created, in UNIX epoch time. A model is created when the
ML_TRAIN
routine finishes executing. -
target_column_name:
string
The name of the column in the training table that was specified as the target column.
-
train_table_name:
string
The name of the input table specified in the
ML_TRAIN
query. -
column_names:
JSON array
The feature columns used to train the model.
-
model_explanation:
JSON object literal
The model explanation generated during training. See Section 3.14.7, “Model Explanations”.
-
notes:
string
The
notes
specified in theML_TRAIN
query. It also records any error messages that occur during model training. -
format:
string
The model serialization format.
HWMLv1.0
for a HeatWave AutoML model orONNX
for a ONNX model. The default isONNX
when used withML_MODEL_IMPORT
. -
status:
string
The status of the model. The default is
Ready
when used withML_MODEL_IMPORT
.-
Creating
The model is still being created.
-
Ready
The model is trained and active.
-
Error
Either training was canceled or an error occurred during training. Any error message appears in the
notes
column. The error message also appears inmodel_metadata
notes
.
-
-
model_quality:
string
The quality of the model object. Either
low
orhigh
. -
training_time:
number
The time in seconds taken to train the model.
-
algorithm_name:
string
The name of the chosen algorithm.
-
training_score:
number
The cross-validation score achieved for the model by training.
-
n_rows:
number
The number of rows in the training table.
-
n_columns:
number
The number of columns in the training table.
-
n_selected_rows:
number
The number of rows selected by adaptive sampling.
-
n_selected_columns:
number
The number of columns selected by feature selection.
-
optimization_metric:
string
The optimization metric used for training.
-
selected_column_names:
JSON array
The names of the columns selected by feature selection.
-
contamination:
number
The contamination factor for a model.
-
options:
JSON object literal
The
options
specified in theML_TRAIN
query. -
training_params:
JSON object literal
Internal task dependent parameters used during
ML_TRAIN
. -
onnx_inputs_info:
JSON object literal
Information about the format of the ONNX model inputs. This only applies to ONNX models. See Section 3.14.2, “ONNX Model Import”.
Do not provide
onnx_inputs_info
if the model is not ONNX format. This will cause an error.-
data_types_map:
JSON object literal
This maps the data type of each column to an ONNX model data type. The default value is:
JSON_OBJECT("tensor(int64)": "int64", "tensor(float)": "float32", "tensor(string)": "str_")
-
-
onnx_outputs_info:
JSON object literal
Information about the format of the ONNX model outputs. This only applies to ONNX models. See Section 3.14.2, “ONNX Model Import”.
Do not provide
onnx_outputs_info
if the model is not ONNX format, or iftask
isNULL
. This will cause an error.-
predictions_name:
string
This name determines which of the ONNX model outputs is associated with predictions.
-
prediction_probabilities_name:
string
This name determines which of the ONNX model outputs is associated with prediction probabilities.
-
labels_map:
JSON object literal
This maps prediction probabilities to predictions, known as labels.
-
-
training_drift_metric:
JSON object literal
Contains data drift information about the training data, see: Section 3.14.11, “Data Drift Detection”. This only applies to classification and regression models.
-
mean:
number
The mean value of drift metrics of all the training data. ≥
0
. -
variance:
number
The variance value of drift metrics of all the training data. ≥
0
.
Both
mean
andvariance
should be low. To avoid divide by zero, the lowest value for both is1e-10
. -
-
chunks:
number
The total number of chunks that the model has been split into. This was added in MySQL 9.0.0.