Commit 8b57ea4a authored by wm624@hotmail.com, committed by Felix Cheung

[SPARK-19639][SPARKR][EXAMPLE] Add spark.svmLinear example and update vignettes

## What changes were proposed in this pull request?

We recently added the spark.svmLinear API for SparkR. This change adds an example and updates the vignettes.

## How was this patch tested?

Ran the example manually.

Author: wm624@hotmail.com <wm624@hotmail.com>

Closes #16969 from wangmiao1981/example.
parent 15b144d2
@@ -469,6 +469,8 @@ SparkR supports the following machine learning models and algorithms.
#### Classification
* Linear Support Vector Machine (SVM) Classifier
* Logistic Regression
* Multilayer Perceptron (MLP)
@@ -532,6 +534,26 @@ head(carsDF_test)
### Models and Algorithms
#### Linear Support Vector Machine (SVM) Classifier
A [Linear Support Vector Machine (SVM)](https://en.wikipedia.org/wiki/Support_vector_machine#Linear_SVM) classifier is an SVM classifier with a linear kernel.
It is a binary classifier. The following simple example shows how to use `spark.svmLinear`
for binary classification.
```{r}
# load training data and create a DataFrame
t <- as.data.frame(Titanic)
training <- createDataFrame(t)
# fit a Linear SVM classifier model
model <- spark.svmLinear(training, Survived ~ ., regParam = 0.01, maxIter = 10)
summary(model)
```
Predict values on the training data.
```{r}
prediction <- predict(model, training)
```
#### Logistic Regression
[Logistic regression](https://en.wikipedia.org/wiki/Logistic_regression) is a widely-used model when the response is categorical. It can be seen as a special case of the [Generalized Linear Predictive Model](https://en.wikipedia.org/wiki/Generalized_linear_model).
......
@@ -43,3 +43,4 @@ head(aftPredictions)
# $example off$
sparkR.session.stop()
#
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#
# To run this example use
# ./bin/spark-submit examples/src/main/r/ml/svmLinear.R
# Load SparkR library into your R session
library(SparkR)
# Initialize SparkSession
sparkR.session(appName = "SparkR-ML-svmLinear-example")
# $example on$
# load training data
t <- as.data.frame(Titanic)
training <- createDataFrame(t)
# fit Linear SVM model
model <- spark.svmLinear(training, Survived ~ ., regParam = 0.01, maxIter = 10)
# Model summary
summary(model)
# Prediction
prediction <- predict(model, training)
showDF(prediction)
# $example off$
sparkR.session.stop()