Professional Documents
Culture Documents
or
A B C
apply: Apply a function
h2o.splitFrame: Split an existing H2O column (cbind) or rows (rbind). over an H2O parsed data
h2o.cumprod: Vector of the cumulative By Rows By Cols object (an array) margins.
dataset according to user-specified ratios. h2o.merge: Merges 2 H2OFrames. products of the elements of the argument.
GROUP BY AGGREGATION
MISSING DATA HANDLING h2o.arrange: Sorts an H2OFrame h2o.cumsum: Vector of the cumulative h2o.group_by: Apply an
h2o.impute: Impute a column of data using by columns. sums of the elements of the argument. aggregate function to each
the mean, median, or mode. PRECISION group of an H2O dataset.
ELEMENT INDEX SELECTION
h2o.insertMissingValues: Replaces a user- h2o.which: True Condition’s Row Numbers h2o.round: Round values to the specified TABULATION
specified fraction of entries in an H2O number of decimal places. The default is 0. h2o.table: Use the cross-classifying
CONDITIONAL VALUE SELECTION
dataset with missing values. h2o.signif: Round values to the specified
A B C D
h2o.ifelse: Apply conditional statements to * * * * factors to build a table of counts at
h2o.na_omit: Remove Rows With NAs. numeric vectors in an H2O Frame. number of significant digits. each combination of factor levels.
H2O.ai. • CC BY SA Juan Telleria Ruiz de Aguirre • jtelleria.rproject@gmail.com • http://docs.h2o.ai/ • Learn more at H2O.ai documentation webpage • package version 3.18.0.11 • Updated: 2018-06
h2o: : CHEAT SHEET
Data Modeling Data Munging Cluster Operations
MODEL TRAINING: SUPERVISED LEARNING GRID SEARCH GENERAL COLUMN MANIPULATION H2O KEY VALUE STORE ACCESS
h2o.deeplearning: Perform Deep Learning h2o.grid: Efficient method to build multiple is.na: Display missing elements. h2o.assign: Assign H2O hex.keys to R objects.
Neural Networks on an H2OFrame. models with different hyperparameters.
FACTOR LEVEL MANIPULATIONS h2o.getFrame: Get H2O dataset Reference.
h2o.getGrid: Get a grid object from H2O
h2o.gbm: Build Gradient Boosted Regression distributed K/V store. h2o.levels: Display a list of the unique h2o.getModel: Get H2O model reference.
Trees or Classification Trees. values found in a categorical data column. h2o.ls: Display a list of object keys in the
MODEL SCORING
h2o.glm: Fit a Generalized Linear Model, h2o.predict: Obtain predictions from various h2o.relevel: Reorders levels of an H2O running instance of H2O.
specified by a response variable, a set of fitted H2O model objects. factor, similarly to standard R's relevel. h2o.rm: Remove specified H2O Objects from
predictors, and the error distribution. the H2O server, but not from the R environment.
h2o.scoreHistory: Get Model Score History. h2o.setLevels: Set Levels of H2O Factor.
h2o.naiveBayes: Compute Naive Bayes MODEL METRICS h2o.removeAll: Remove All H2O Objects from
classification probabilities on an H2O Frame. NUMERIC COLUMN MANIPULATIONS
h2o.make_metrics: Given predicted values the H2O server, but not from the R environment.
h2o.cut: Convert H2O Numeric Data to
h2o.randomForest: Perform Random Forest (target for regression, class-1 probabilities, or H2O MODEL IMPORT / EXPORT
Factor by breaking it into Intervals.
Classification on an H2O Frame. binomial or per-class probabilities for h2o.loadModel: Load H2OModel from disk.
h2o.xgboost: Build an Extreme Gradient multinomial), compute a model metrics object. CHARACTER COLUMN MANIPULATIONS h2o.saveModel: Save H2OModel object to disk.
Boosted Model using the XGBoost backend. GENERAL MODEL HELPER h2o.strsplit: “String Split”: Splits the given
h2o.download_pojo: Download the Scoring
h2o.performance: Evaluate the predictive factor column on the input split.
h2o.stackedEnsemble: Build a stacked POJO (Plain Old Java Object) of an H2O Model.
ensemble (aka. Super Learner) using the performance of a Supervised Learning h2o.tolower: Convert the characters of a
h2o.download_mojo: Download the model in
specified H2O base learning algorithms. Regression or Classification Model via various character vector to lower case.
MOJO format.
metrics. Set xval = TRUE for retrieving the h2o.toupper: Convert the characters of a
h2o.automl: Automates the Supervised training cross-validation metrics. H2O CLUSTER CONNECTION
Machine Learning Model Training Process: character vector to upper case.
h2o.init: Connect to a running H2O instance
Automatically Trains and Cross-validates a set REGRESSION MODEL HELPER h2o.trim: “Trim spaces”: Remove leading using all CPUs on the host.
of Models, and trains a Stacked Ensemble. h2o.mse: Display the mean squared error and trailing white space.
calculated from “Predicted Responses” and h2o.shutdown: Shut down the specified H2O
MODEL TRAINING: UNSUPERVISED h2o.gsub: Match a pattern & replace all instance. All data on the server will be lost!
“Actual (Reference) Responses”. Set xval =
LEARNING instances (occurrences) of the matched
TRUE for retrieving the cross-validation MSE. H2O CLUSTER INFORMATION
h2o.prcomp: Perform Principal Components pattern with the replacement string globally.
Analysis on the given H2O Frame. CLASSIFICATION MODEL HELPERS h2o.clusterInfo: Display the name, version,
h2o.accuracy: Get Model Accuracy metric. h2o.sub: Match a pattern & replace the uptime, total nodes, total memory, total cores
h2o.kmeans: Perform k-means Clustering on first instance (occurrence) of the matched and health of a cluster running H2O.
the given H2O Frame. h2o.auc: Retrieve the AUC (area under ROC pattern with the replacement string.
curve). Set xval = TRUE for retrieving the h2o.clusterStatus: Retrieve information on the
h2o.anomaly: Detect anomalies in a H2O cross-validation AUC. DATE MANIPULATIONS status of the cluster running H2O.
Frame using a H2O Deep Learning Model h2o.month: Convert Milliseconds to H2O LOGGING
with Auto-Encoding. h2o.confusionMatrix: Display prediction
Months in H2O Datasets (Scale: 0 to 11). h2o.clearLog: Clear all H2O R command and
errors for classification data (“Predicted” vs
h2o.deepfeatures: Extract the non-linear “Reference : Real Values”). h2o.year: Convert Milliseconds to Years in error response logs from the local disk.
features from a H2O Frame using a H2O H2O Datasets, indexed starting from 1900. h2o.downloadAllLogs: Download all H2O log
Deep Learning Model. h2o.hit_ratio_table: Retrieve the Hit Ratios.
h2o.day: Convert Milliseconds to Day of files to the local disk.
Set xval = TRUE for retrieving the cross-
h2o.glrm: Builds a Generalized Low Rank Month in H2O Datasets (Scale: 1 to 31). h2o.logAndEcho: Write a message to the H2O
validation Hit Ratio.
Decomposition of an H2O Frame. Java log file and echo it back.
CLUSTERING MODEL HELPER h2o.hour: Convert Milliseconds to Hour of
h2o.openLog: Open existing logs of H2O R
h2o.svd: Singular value decomposition of an h2o.betweenss: Get the between cluster Day in H2O Datasets (Scale: 0 to 23). POST commands and error responses on disk.
H2O Frame using the power method. Sum of Squares. h2o.dayOfWeek: Convert Milliseconds to h2o.getLogPath: Get the file path for the H2O
h2o.word2vec: Trains a word2vec model on h2o.centers: Retrieve the Model Centers. Day of Week in a H2OFrame (Scale: 0 to 6) R command and error response logs.
a String column of an H2O data frame. PREDICTOR VARIABLE IMPORTANCE h2o.startLogging: Begin logging H2O R POST
MATRIX OPERATIONS
SURVIVAL MODELS: TIME-TO-EVENT h2o.varimp: Retrieve the variable importance commands and error responses.
%∗%: Multiply two conformable matrices.
h2o.coxph: Trains a Cox Proportional h2o.stopLogging: Stop logging H2O R POST
h2o.varimp_plot: Plot Variable Importances. t: Returns the transpose of an H2OFrame.
Hazards Model (CoxPH) on an H2O Frame. commands and error responses.
H2O.ai. • CC BY SA Juan Telleria Ruiz de Aguirre • jtelleria.rproject@gmail.com • http://docs.h2o.ai/ • Learn more at H2O.ai documentation webpage • package version 3.18.0.11 • Updated: 2018-06