利用WOA算法优化libsvm中SVDD算法的参数

Smartk · 发表于 2020-2-24 15:53

EDA365欢迎您登录！

您需要登录才可以下载或查看，没有帐号？注册

x

一、前言
! X& S$ P1 X2 l9 c* ^+ r    支持向量数据描述（Support Vector Data Description，SVDD）是一种单值分类算法，能够实现目标样本和非目标样本的区分，算法的具体描述可以参考以下文献：
（1）Tax D M J, Duin R P W. Support vector domain description[J]. Pattern recognition letters, 1999, 20(11-13): 1191-1199.
（2）Tax D M J, Duin R P W. Support vector data description[J]. Machine learning, 2004, 54(1): 45-66.

   台湾大学林智仁 (Lin Chih-Jen) 教授等开发设计的 libsvm 工具箱提供了SVDD算法的MATLAB接口，其中两个关键参数 c 和 g 直接影响SVDD的单值分类结果。笔者在此基础上，通过引入鲸鱼优化算法（Whale Optimization Algorithm，WOA），实现对 libsvm 工具箱中的SVDD算法的参数优化。
WOA的具体描述可以参考以下文献：
（1）Mirjalili S, Lewis A. The whale optimization algorithm[J]. Advances in engineering software, 2016, 95: 51-67.

该算法的提出者已经把代码开源在mathworks。

注：（1）笔者已把 libsvm工具箱的svmtrain和svmpredict函数的名字分别改为libsvmtrain和libsvmpredict。
   （2）WOA算法和其他群智能优化算法一样，容易陷入局部最优，若寻优结果出现异常，可以尝试多运行几次。

二、例子1 （libsvm 工具箱提供的heart_scale data）

1.  数据说明
   该数据集共有13个属性，270个样本，包括120个正样本和150个负样本。在该例子中，把正样本作为训练集，标签为1；负样本作为测试集，标签为-1。

2. 主程序代码

clc
clear all
close all
addpath(genpath(pwd))
global traindata trainlabel
% heart_scale data
[traindata, testdata, trainlabel, testlabel] = prepareData;
% Parameter setting of WOA
agent = 10; % Number of search agents
iteration = 20; % Maximum numbef of iterations
lb = [10^-3,2^-4]; % Lower bound of 'c' and 'g'
ub = [10^0,2^4]; % Upper bound of 'c' and 'g'
dim = 2; % Number of Parameter
fobj = @woa_obj; % Objective function
% Parameter optimization using WOA
[Best_score,Best_pos,~] = WOA(agent,iteration,lb,ub,dim,fobj);
% Train SVDD hypersphere using the optimal parameters
cmd = ['-s 5 -t 2 ','-c ',num2str(Best_pos(1,1)),' -g ', ...
num2str(Best_pos(1,2)),' -q'];
model = libsvmtrain(trainlabel, traindata, cmd);
% Test
[predictlabel,accuracy,~] = libsvmpredict(testlabel, testdata, model);, e/ @# M( {' e

最后一次迭代的结果以及最终的分类结果：

ans =
19.0000 0.0667
Accuracy = 80% (96/120) (classification)
Accuracy = 66.6667% (80/120) (classification)
Accuracy = 60% (72/120) (classification)
Accuracy = 80% (96/120) (classification)
Accuracy = 53.3333% (64/120) (classification)
Accuracy = 54.1667% (65/120) (classification)
Accuracy = 42.5% (51/120) (classification)
Accuracy = 35% (42/120) (classification)
Accuracy = 80% (96/120) (classification)
Accuracy = 35% (42/120) (classification)
ans =
20.0000 0.0667
Accuracy = 100% (150/150) (classification), ?+ s# S" Q" i' J$ N

可以看出，利用优化后的参数建立的SVDD模型，训练集的正确率为93.33%，测试集的正确率为100%。
5 \, ?3 S. t. A4 C' m+ Q# Q- Y- Y
三、例子2 （工业过程数据）

1. 数据说明
采用某工业过程数据，该数据集共有10个属性，训练集有400个正样本，测试集有80个样本（前40个样本为正样本，后40个样本为负样本）。

2. 主程序代码

clc
clear all
addpath(genpath(pwd))
global traindata trainlabel
% Industrial process data
load ('.\data\data_2.mat')
% Parameter setting of WOA
agent = 10; % Number of search agents
iteration = 30; % Maximum numbef of iterations
lb = [10^-3,2^-7]; % Lower bound of 'c' and 'g'
ub = [10^0,2^7]; % Upper bound of 'c' and 'g'
dim = 2; % Number of Parameter
fobj = @woa_obj; % Objective function
% Parameter optimization using WOA
[Best_score,Best_pos,~] = WOA(agent,iteration,lb,ub,dim,fobj);
% Train SVDD hypersphere using the optimal parameters
cmd = ['-s 5 -t 2 ','-c ',num2str(Best_pos(1,1)),' -g ', ...
num2str(Best_pos(1,2)),' -q'];
model = libsvmtrain(trainlabel, traindata, cmd);
% Test
[predictlabel,accuracy,~] = libsvmpredict(testlabel, testdata, model);
% Visualize the results
plotResult(testlabel,predictlabel)
~# n/ N" m; U" k& x4 {

最后一次迭代的结果以及最终的分类结果：

Accuracy = 99.5% (398/400) (classification)
Accuracy = 99.25% (397/400) (classification)
Accuracy = 99.75% (399/400) (classification)
Accuracy = 99.75% (399/400) (classification)
Accuracy = 99.5% (398/400) (classification)
Accuracy = 99.25% (397/400) (classification)
Accuracy = 99.75% (399/400) (classification)
Accuracy = 99.75% (399/400) (classification)
Accuracy = 99.5% (398/400) (classification)
Accuracy = 99.5% (398/400) (classification)
ans =
30.0000 0.0025
Accuracy = 93.75% (75/80) (classification)% C5 O5 a5 l& i5 |# L

可以看出，利用优化后的参数建立的SVDD模型，训练集的正确率为99.75%，测试集的正确率为93.75%。
可视化结果如下：

CCxiaom · 发表于 2020-2-26 17:55

利用WOA算法优化libsvm中SVDD算法的参数

帐号		自动登录	找回密码
密码			注册

利用WOA算法优化libsvm中SVDD算法的参数

EDA365欢迎您登录！

浏览过的版块

推荐内容 /1