[Scikit-learn] 1.4 Support Vector Classification

Date: 2019-11-18


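Ref: http://sklearn.lzjqsdd.com/modules/svm.html

Ref: CS229 Lecture notes - Support Vector Machines

Ref: Lecture 6 | Machine Learning (Stanford), YouTube

Ref: 支持向量机通俗导论（理解SVM的三层境界） (A Plain-Language Introduction to Support Vector Machines: Three Levels of Understanding SVM)

Ref: "Kernel Methods for Pattern Analysis"

Ref: SVM教程：支持向量机的直观理解 (SVM Tutorial: An Intuitive Understanding of Support Vector Machines) [the illustrations come from this link; it is well written]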

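Support Vector Machines

An SVM is essentially a maximum-margin classifier, with a soft margin and regularization.

For young people with time to spare, I recommend working through the derivation of SVM once. The derivation touches on most of the fundamentals of mathematical optimization, and SVM is a self-contained solution, so the exercise is very rewarding.

Figure: SVM tries to find the "separating line" shown in the right-hand plot.

Support vector machines (SVMs) are a set of supervised learning methods used for classification, regression and outlier detection.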

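Advantages

Effective in high dimensional spaces. [Works well in high dimensions.]

Still effective in cases where number of dimensions is greater than the number of samples. [High-dimensional data, e.g. language models; whether the results are good or bad is another matter.]

Uses a subset of training points in the decision function (called support vectors), so it is also memory efficient. [Decisions are made from a subset of the data.]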

Versatile: different Kernel functions can be specified for the decision function. Common kernels are provided, but it is also possible to specify custom kernels.
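
As a rough illustration of the custom-kernel point: scikit-learn's SVC accepts a Python callable as `kernel`, which receives two sample matrices and returns their Gram matrix. The sketch below uses a hypothetical helper named `my_linear_kernel` and made-up toy data (both are assumptions for illustration) and compares it with the built-in linear kernel.

```python
import numpy as np
from sklearn import svm


def my_linear_kernel(A, B):
    """Hypothetical custom kernel: plain dot products, i.e. a linear kernel."""
    return A @ B.T


# Toy data (assumed): label is 1 when the two coordinates sum to a positive number
rng = np.random.RandomState(0)
X = rng.randn(40, 2)
y = (X[:, 0] + X[:, 1] > 0).astype(int)

clf_custom = svm.SVC(kernel=my_linear_kernel).fit(X, y)
clf_builtin = svm.SVC(kernel="linear").fit(X, y)

# Fraction of training points on which both classifiers predict the same label
agreement = np.mean(clf_custom.predict(X) == clf_builtin.predict(X))
print("agreement between custom and built-in linear kernel:", agreement)
```

Since both classifiers see the same Gram matrix, their predictions should essentially coincide; a genuinely custom kernel would simply return a different matrix.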

Disadvantages

If the number of features is much greater than the number of samples, the method is likely to give poor performance.

SVMs do not directly provide probability estimates; these are calculated using an expensive five-fold cross-validation.
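
A minimal sketch of the last point, assuming scikit-learn's SVC and made-up toy data: by default the classifier only gives signed decision values, and probability estimates become available when it is constructed with `probability=True`, at the cost of the extra internal cross-validation during `fit`.

```python
import numpy as np
from sklearn import svm

# Toy data (assumed): label depends only on the sign of the first feature
rng = np.random.RandomState(0)
X = rng.randn(100, 2)
y = (X[:, 0] > 0).astype(int)

# probability=True enables predict_proba; fitting becomes slower
clf = svm.SVC(kernel="rbf", probability=True, random_state=0).fit(X, y)

scores = clf.decision_function(X[:3])  # signed distances, not probabilities
proba = clf.predict_proba(X[:3])       # one row per sample, one column per class

print("decision values:", scores)
print("probability estimates:")
print(proba)
```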

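What is a support vector

Support vectors: the data points closest to the separating line.

Margin: the region along the separating line that is defined by the support vectors.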
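
To make the two definitions concrete, here is a small sketch using scikit-learn's linear SVC on made-up, well-separated clusters (the data and the large `C` value are assumptions chosen so that the margin is effectively hard). The fitted model exposes the support vectors directly, and the margin width follows from the weight vector as 2 / ||w||.

```python
import numpy as np
from sklearn import svm

rng = np.random.RandomState(0)
# Two well-separated toy clusters (assumed data, for illustration only)
X = np.vstack([rng.randn(20, 2) * 0.5 - 2, rng.randn(20, 2) * 0.5 + 2])
y = np.array([-1] * 20 + [1] * 20)

# A large C approximates a hard margin on separable data
clf = svm.SVC(kernel="linear", C=1000.0).fit(X, y)

sv = clf.support_vectors_                       # the points closest to the separating line
f_sv = clf.decision_function(sv)                # decision values at those points
margin_width = 2.0 / np.linalg.norm(clf.coef_)  # distance between the two margin edges

print("number of training points:", len(X))
print("number of support vectors:", len(sv))
print("decision values at the support vectors:", np.round(f_sv, 2))
print("margin width:", round(float(margin_width), 2))
```

Only a few of the 40 points end up as support vectors, and their decision values sit at roughly +1 or -1, which is where the margin edges lie.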

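The margin affects how much error appears in the classification result.

SVM lets us specify, through the parameter C, how much error we are willing to accept, that is, the trade-off between the following two goals:

- A wider margin.
- Correctly classifying the training data.

A higher value of C means that fewer errors are tolerated on the training data.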
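
The trade-off can be observed directly. The following sketch, on made-up overlapping clusters (an assumption for illustration), fits a linear SVC for several values of `C` and reports the margin width 2 / ||w||, the number of support vectors, and the training accuracy.

```python
import numpy as np
from sklearn import svm

rng = np.random.RandomState(0)
# Two overlapping Gaussian blobs (toy data, assumed for illustration)
X = np.vstack([rng.randn(50, 2) - 1, rng.randn(50, 2) + 1])
y = np.array([0] * 50 + [1] * 50)

results = {}
for C in (0.01, 1.0, 100.0):
    clf = svm.SVC(kernel="linear", C=C).fit(X, y)
    margin_width = 2.0 / np.linalg.norm(clf.coef_)
    results[C] = (margin_width, clf.support_.size, clf.score(X, y))
    print(f"C={C}: margin width={margin_width:.2f}, "
          f"support vectors={clf.support_.size}, "
          f"train accuracy={results[C][2]:.2f}")
```

A small C favours a wide margin and tolerates more training errors; a large C narrows the margin in order to classify more training points correctly.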

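What is a kernel

Raising the dimension to make the data separable

In general, it is hard to find such a specific projection.

However, thanks to Cover's theorem, we do know that data is more likely to be linearly separable after being projected into a higher-dimensional space.

Who performs the high-dimensional projection

SVM performs the projection using something called kernels, and this is quite fast.

Higher dimension, yet efficient

How many operations does this take? In the two-dimensional case, computing the inner product takes 2 multiplications and 1 addition, and squaring the result takes 1 more multiplication. That is 4 operations in total, only about 31% of the 13 operations needed when projecting first and computing afterwards.

So computing the required inner product with a kernel function is much faster. In this example the gain may not look like much: 4 operations versus 13. However, when the data points have many dimensions and the projected space has even more, the computation saved by the kernel function accumulates rapidly on large data sets. This is the great advantage of kernel functions.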
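
The operation count above can be checked numerically. The sketch below assumes the usual degree-2 polynomial feature map φ(x) = (x1², √2·x1·x2, x2²), which the text does not spell out; the function names `phi` and `poly2_kernel` are hypothetical. It compares "project first, then take the inner product" with the kernel shortcut "take the inner product in 2-D, then square".

```python
import numpy as np


def phi(v):
    """Assumed explicit feature map of the degree-2 polynomial kernel (2-D -> 3-D)."""
    return np.array([v[0] ** 2, np.sqrt(2) * v[0] * v[1], v[1] ** 2])


def poly2_kernel(a, b):
    """Kernel shortcut: inner product in the original 2-D space, then square."""
    return np.dot(a, b) ** 2


a = np.array([1.0, 2.0])
b = np.array([3.0, 4.0])

explicit = np.dot(phi(a), phi(b))  # project both points, then inner product in 3-D
shortcut = poly2_kernel(a, b)      # never leaves the 2-D space

print("explicit projection:", explicit)
print("kernel shortcut:    ", shortcut)
```

Both routes give the same value up to floating-point rounding, but the shortcut never constructs the 3-D vectors.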

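Most SVM libraries ship with popular kernel functions built in, such as polynomial, radial basis function (RBF), and sigmoid. When we do not project at all (as in the first example of this article), we compute dot products directly in the original space; this is called using a linear kernel.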
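
For reference, scikit-learn exposes these kernels as standalone functions in `sklearn.metrics.pairwise`. The sketch below evaluates each one on three made-up points (the data and the parameter values are assumptions for illustration) and shows that the linear kernel is nothing more than the matrix of dot products.

```python
import numpy as np
from sklearn.metrics.pairwise import (
    linear_kernel,
    polynomial_kernel,
    rbf_kernel,
    sigmoid_kernel,
)

# Three toy points in 2-D (assumed data)
X = np.array([[0.0, 1.0], [1.0, 1.0], [2.0, 0.0]])

K_linear = linear_kernel(X)                                    # <x, y>
K_poly = polynomial_kernel(X, degree=2, gamma=1.0, coef0=0.0)  # (gamma * <x, y> + coef0) ** degree
K_rbf = rbf_kernel(X, gamma=0.5)                               # exp(-gamma * ||x - y||^2)
K_sigmoid = sigmoid_kernel(X, gamma=0.5, coef0=0.0)            # tanh(gamma * <x, y> + coef0)

print("linear kernel equals plain dot products:", np.allclose(K_linear, X @ X.T))
print(K_rbf)
```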

Radial basis function (RBF)

Example: RBF kernel [A radial basis function (RBF) is a scalar function that is symmetric along the radial direction; it is also the default kernel.]
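
Before the full example, a small sketch of what "radially symmetric" means. It assumes the Gaussian form exp(-gamma * ||x - c||²), which is the form scikit-learn's 'rbf' kernel uses; the helper name `rbf` and the sample points are hypothetical.

```python
import numpy as np


def rbf(x, center, gamma=1.0):
    """Gaussian RBF: exp(-gamma * ||x - center||^2)."""
    diff = np.asarray(x, dtype=float) - np.asarray(center, dtype=float)
    return np.exp(-gamma * np.dot(diff, diff))


center = np.array([0.0, 0.0])
p1 = np.array([1.0, 0.0])   # distance 1 from the center
p2 = np.array([0.0, -1.0])  # also distance 1, but in a different direction
p3 = np.array([2.0, 0.0])   # distance 2 from the center

v1, v2, v3 = rbf(p1, center), rbf(p2, center), rbf(p3, center)
print("same distance, different direction:", v1, v2)
print("larger distance gives a smaller value:", v3)
```

The value depends only on the distance to the center, not on the direction, and it decays as the distance grows.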

"""
==============
Non-linear SVM
==============

Perform binary classification using non-linear SVC
with RBF kernel. The target to predict is a XOR of the
inputs.

The color map illustrates the decision function learned by the SVC.
"""
print(__doc__)

import numpy as np
import matplotlib.pyplot as plt
from sklearn import svm

# Generate a regular grid of points covering the plotting area
xx, yy = np.meshgrid(np.linspace(-3, 3, 500),
                     np.linspace(-3, 3, 500))

# Random 2-D samples, labelled with the XOR of the signs of the two features
np.random.seed(0)
X = np.random.randn(300, 2)
Y = np.logical_xor(X[:, 0] > 0, X[:, 1] > 0)

# fit the model
clf = svm.NuSVC()
clf.fit(X, Y)

# plot the decision function for each datapoint on the grid
Z = clf.decision_function(np.c_[xx.ravel(), yy.ravel()])
Z = Z.reshape(xx.shape)

plt.imshow(Z, interpolation='nearest',
           extent=(xx.min(), xx.max(), yy.min(), yy.max()), aspect='auto',
           origin='lower', cmap=plt.cm.PuOr_r)
# the keyword is linestyles (not linetypes)
contours = plt.contour(xx, yy, Z, levels=[0], linewidths=2,
                       linestyles='--')
plt.scatter(X[:, 0], X[:, 1], s=30, c=Y, cmap=plt.cm.Paired)
plt.xticks(())
plt.yticks(())
plt.axis([-3, 3, -3, 3])
plt.show()

Variables: xx, yy

xx
Out[140]: 
array([[-3.        , -2.98797595, -2.9759519 , ...,  2.9759519 ,
         2.98797595,  3.        ],
       [-3.        , -2.98797595, -2.9759519 , ...,  2.9759519 ,
         2.98797595,  3.        ],
       [-3.        , -2.98797595, -2.9759519 , ...,  2.9759519 ,
         2.98797595,  3.        ],
       ..., 
       [-3.        , -2.98797595, -2.9759519 , ...,  2.9759519 ,
         2.98797595,  3.        ],
       [-3.        , -2.98797595, -2.9759519 , ...,  2.9759519 ,
         2.98797595,  3.        ],
       [-3.        , -2.98797595, -2.9759519 , ...,  2.9759519 ,
         2.98797595,  3.        ]])

yy
Out[141]: 
array([[-3.        , -3.        , -3.        , ..., -3.        ,
        -3.        , -3.        ],
       [-2.98797595, -2.98797595, -2.98797595, ..., -2.98797595,
        -2.98797595, -2.98797595],
       [-2.9759519 , -2.9759519 , -2.9759519 , ..., -2.9759519 ,
        -2.9759519 , -2.9759519 ],
       ..., 
       [ 2.9759519 ,  2.9759519 ,  2.9759519 , ...,  2.9759519 ,
         2.9759519 ,  2.9759519 ],
       [ 2.98797595,  2.98797595,  2.98797595, ...,  2.98797595,
         2.98797595,  2.98797595],
       [ 3.        ,  3.        ,  3.        , ...,  3.        ,
         3.        ,  3.        ]])


np.c_: recombining the flattened (raveled) arrays column by column

np.c_[np.array([1,2,3]), np.array([4,5,6])]
Out[142]: 
array([[1, 4],
       [2, 5],
       [3, 6]])

np.c_[np.array([[1,2,3]]), 0, 0, np.array([[4,5,6]])]
Out[143]: array([[1, 2, 3, 0, 0, 4, 5, 6]])
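
Putting the two pieces together, the sketch below repeats the pattern used in the plotting code on a tiny grid (the 4 x 3 grid and the stand-in score are assumptions for illustration): ravel the meshgrid, stack the coordinates with `np.c_`, evaluate something per point, and reshape back to the grid.

```python
import numpy as np

# A small 3 x 4 grid instead of the 500 x 500 grid used for plotting
xx, yy = np.meshgrid(np.linspace(-3, 3, 4), np.linspace(-3, 3, 3))

# One (x, y) row per grid node: shape (12, 2)
points = np.c_[xx.ravel(), yy.ravel()]

# Stand-in for clf.decision_function(points): any per-point score works here
values = points[:, 0] + points[:, 1]

# Back to the grid layout so that it can be drawn with imshow or contour
Z = values.reshape(xx.shape)

print(points.shape, Z.shape)
```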


Various kernels

As can be seen, the RBF kernel produces a finer-grained separation.

"""
================================
SVM Exercise
================================

A tutorial exercise for using different SVM kernels.

This exercise is used in the :ref:`using_kernels_tut` part of the
:ref:`supervised_learning_tut` section of the :ref:`stat_learn_tut_index`.
"""
print(__doc__)

import numpy as np
import matplotlib.pyplot as plt
from sklearn import datasets, svm

iris = datasets.load_iris()
X = iris.data
y = iris.target

# Keep two classes (drop class 0) and only the first two features
X = X[y != 0, :2]
y = y[y != 0]

n_sample = len(X)

# shuffle
np.random.seed(0)
order = np.random.permutation(n_sample)
X = X[order]
y = y[order].astype(float)  # np.float has been removed from NumPy

# 90% / 10% train/test split; slice indices must be integers
n_train = int(.9 * n_sample)
X_train = X[:n_train]
y_train = y[:n_train]
X_test = X[n_train:]
y_test = y[n_train:]

# fit the model
for fig_num, kernel in enumerate(('linear', 'rbf', 'poly')):
    clf = svm.SVC(kernel=kernel, gamma=10)
    clf.fit(X_train, y_train)

    plt.figure(fig_num)
    plt.clf()
    plt.scatter(X[:, 0], X[:, 1], c=y, zorder=10, cmap=plt.cm.Paired)

    # Circle out the test data (edge color set so the hollow markers are visible)
    plt.scatter(X_test[:, 0], X_test[:, 1], s=80, facecolors='none',
                edgecolors='k', zorder=10)

    plt.axis('tight')
    x_min = X[:, 0].min()
    x_max = X[:, 0].max()
    y_min = X[:, 1].min()
    y_max = X[:, 1].max()

    XX, YY = np.mgrid[x_min:x_max:200j, y_min:y_max:200j]
    Z = clf.decision_function(np.c_[XX.ravel(), YY.ravel()])

    # Put the result into a color plot
    Z = Z.reshape(XX.shape)
    plt.pcolormesh(XX, YY, Z > 0, cmap=plt.cm.Paired)
    plt.contour(XX, YY, Z, colors=['k', 'k', 'k'],
                linestyles=['--', '-', '--'], levels=[-.5, 0, .5])

    plt.title(kernel)
plt.show()

Result:

End.