An overview of the Feature and Tabular Analysis toolset
The Feature and Tabular Analysis toolset contains tools for applying machine learning and deep learning algorithms to feature or tabular data.
The Train Using AutoML tool uses automated machine learning (AutoML) to train and fine-tune machine learning models given training data and available compute resources. The trained models can be used in the Predict Using AutoML for predicting both categorical variables (classification) and continuous variables (regression).
Training machine learning (ML) models has traditionally been a complex process that required specialized knowledge of different types of models and how their parameters (known as hyperparameters) can be fine-tuned to get the best results. This is an iterative process that requires several experiments before the most accurate model and its appropriate hyperparameters can be identified. The AutoML tools automate this process without code. While doing so, they provide visibility into the performance and hyperparameters of the trained models, as well as insight into which features have the highest impact on the model results.
The Predict Missing Values Using AI Model replaces missing values (nulls) in a feature class or table by automatically training a machine learning or deep learning model on available data and applying it to estimate missing numerical values. It is designed for data preprocessing and is commonly used prior to training workflows such as Train Using AutoML to improve model performance and stability.
Tools in the Feature and Tabular Analysis toolset
|
Tool |
Description |
|---|---|
|
Replaces missing values (nulls) with estimated feature values. Uses machine learning and deep learning models trained on patterns in the dataset to ensure statistical consistency. |
|
|
Predicts continuous variables (regression) or categorical variables (classification) on unseen compatible datasets using a trained |
|
|
Trains a deep learning model by building training pipelines and automating much of the training process. This includes exploratory data analysis, feature selection, feature engineering, model selection, hyperparameter tuning, and model training. Its outputs include performance metrics of the best model on the training data, as well as the trained deep learning model package ( |