 Research
 Open Access
Dynamic artificial bee colony algorithm for multi-parameters optimization of support vector machine-based soft-margin classifier
EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 160 (2012)
Abstract
This article proposes a ‘dynamic’ artificial bee colony (DABC) algorithm for solving optimization problems. It overcomes the poor performance of the artificial bee colony (ABC) algorithm when applied to multi-parameter optimization. A dynamic ‘activity’ factor is introduced into the DABC algorithm to speed up convergence and improve the quality of solutions. The DABC algorithm is then employed for multi-parameter optimization of a support vector machine (SVM)-based soft-margin classifier. Parameter optimization is significant for improving the classification performance of an SVM-based classifier. Classification accuracy is defined as the objective function, and the parameters, including the ‘kernel parameter’, the ‘cost factor’, etc., form a solution vector to be optimized. Experiments demonstrate that the DABC algorithm outperforms traditional methods on this optimization problem, obtaining better SVM parameters that lead to higher classification accuracy.
Introduction
The artificial bee colony (ABC) algorithm was first proposed by Karaboga in 2005 [1]. It has many advantages over earlier swarm intelligence algorithms, especially for constrained optimization problems.
A constrained optimization problem (1) is defined as finding the solution $\overrightarrow{\text{x}}$ that minimizes an objective function $f\left(\overrightarrow{\text{x}}\right)$ subject to inequality and/or equality constraints [2]:
$$\begin{array}{l}\text{minimize}\phantom{\rule{0.5em}{0ex}}f\left(\overrightarrow{x}\right),\phantom{\rule{1em}{0ex}}\overrightarrow{x}=\left({x}_{1},{x}_{2},\dots ,{x}_{D}\right)\\ \text{subject to}\phantom{\rule{0.5em}{0ex}}{g}_{j}\left(\overrightarrow{x}\right)\le 0,\phantom{\rule{1em}{0ex}}{h}_{j}\left(\overrightarrow{x}\right)=0\end{array}$$(1)
When D is large and each element of $\overrightarrow{\text{x}}$ represents a specific parameter, it is a multi-parameter optimization problem.
Simulating the foraging behavior of a honey bee swarm, the ABC algorithm treats the solution $\overrightarrow{\text{x}}$ as the coordinate of a nectar source in D-dimensional space, and defines an objective function $f\left(\overrightarrow{\text{x}}\right)$ that reflects the quality of the nectar source. A small value of the objective function indicates a better nectar source. As the bee swarm continually searches for better nectar sources, the algorithm can find the best solution $\overrightarrow{\text{x}}$.
However, the ABC algorithm is criticized for its poor convergence rate and susceptibility to local optima [3–6], and many modified methods have been proposed. The early view of many researchers attributed the poor performance to the ‘roulette wheel’ selection mechanism introduced in the onlooker phase of the original ABC algorithm. Boltzmann selection was employed instead of roulette wheel selection by Haijun and Qingxian [7]. Interactive ABC, proposed by Tsai et al. [8], introduced the Newtonian law of universal gravitation, also to modify the original selection mechanism. Akbari et al. [9] proposed modified formulas for the different phases of the ABC algorithm. In fact, according to extensive experiments, these modifications improve the original algorithm only when D is not too large. Our findings provide evidence that it is the ‘randomly single element modification’ (RSEM) process that principally leads the ABC algorithm to poor performance. In the traditional ABC algorithm, for each memorized solution $\overrightarrow{\text{x}}$, the modifying operation acts on a single element x_{ k } (k∈[1,D]) of $\overrightarrow{\text{x}}$ in each cycle, so the solution $\overrightarrow{\text{x}}$ changes little after each modification. Moreover, the element x_{ k } is randomly selected, and it is uncertain whether modifying x_{ k } improves the solution $\overrightarrow{\text{x}}$, particularly when D is large. Consequently, more cycles are needed to search for the best solution, and the algorithm is relatively inefficient. Although Karaboga and Akay [2] introduced the modification rate (MR) factor to randomly modify more elements of the solution vector in each cycle, the robustness of that algorithm is not satisfactory. Furthermore, in the ABC algorithm, optimization is hierarchical (from global to local), implemented mainly by the operations of employed bees and onlooker bees, respectively.
However, the RSEM process is used in both of these phases simultaneously, which cannot effectively guarantee hierarchical optimization. Therefore, the RSEM process is abandoned in our DABC algorithm. A dynamic ‘activity’ factor is introduced to modify an appropriate number of elements of the solution $\overrightarrow{\text{x}}$ and achieve hierarchical optimization. In different optimization stages, the activity level of the bees is set appropriately: more active bees modify more elements of $\overrightarrow{\text{x}}$, and bees with different divisions of labor are given different ‘activity’ factors. Thus, hierarchical optimization can be implemented.
Based on the structural risk minimization principle, the support vector machine (SVM) was first proposed by Cortes and Vapnik [10] in the 1990s. It has many advantages for classification, but multiple parameters have to be properly selected, and many studies have addressed this topic. For a specific set of training samples, once classification accuracy is employed as the objective function $f\left(\overrightarrow{\text{x}}\right)$ and the solution vector $\overrightarrow{\text{x}}$ is formed by the parameters of the SVM, training the SVM classifier can be transformed into a multi-parameter optimization problem. Traditionally, most methods for SVM parameter optimization are based on grid search and the genetic algorithm (GA) [11]. The recent focus is on swarm intelligence methods, such as the ant colony algorithm and particle swarm optimization (PSO) [12]. The ABC algorithm was introduced for SVM parameter optimization by Hsieh and Yeh [13]. Since multiple parameters of the SVM-based soft-margin classifier need to be optimized, our DABC algorithm is highly suited to this purpose. Especially for multiclass classification problems, the length D of $\overrightarrow{\text{x}}$ is larger, and parameters including the ‘cost factor’ of each class and the kernel parameter need to be optimized. The performance of the classifier is evaluated by the average classification accuracy after k-fold cross-validation. Experiments demonstrate that, compared with earlier ABC algorithms, our method greatly improves the convergence rate and obtains better parameters, which lead to higher classification accuracy.
The main contributions of this article are: (1) a modified ABC algorithm, named the DABC algorithm, is proposed; (2) the DABC algorithm is applied to multi-parameter optimization of the SVM soft-margin classifier. The article is organized as follows. In the following section, we introduce the traditional ABC algorithm and several modified versions along with their drawbacks, and then present the DABC algorithm. Multi-parameter optimization of the SVM by the DABC algorithm is illustrated in Section “Multi-parameters optimization of SVM-based soft-margin classifier”, together with the experimental settings and analysis. Finally, the last section concludes this study.
Methodology
Traditional ABC algorithm
The ABC algorithm is inspired by the foraging behavior of a real bee colony. The objective of a bee colony is to maximize the nectar amount stored in the hive. This mission is carried out by all members of the colony through efficient division of labor and role transformation. Each bee performs one of the following three roles: employed bee (EB), onlooker bee (OB), or scout bee (SB). Bees can transform from one role to another in different phases of foraging. The flow of nectar collection is as follows:

1.
In the initial phase, there are only SB and OB in the colony. SB are sent out to search for potential nectar sources, and OB wait near the hive to be recruited. If an SB finds a nectar source, it transforms into an EB.

2.
EB collect some nectar and return to the hive, then dance in different forms to share information about the source with OB. Diverse forms of dance represent different qualities of nectar source.

3.
Each OB estimates the quality of the nectar sources found by all EB, then follows one EB to the corresponding source. OB choose EB probabilistically: better sources (more nectar) are more attractive (have a larger probability of being selected).

4.
Once a source is exhausted, the corresponding EB abandons it, transforms into an SB, and searches for a new source.
In this way, the bee colony assigns more members to collect from the better sources and fewer members to collect from the ordinary ones. Thus, nectar collection is more effective.
Analogously, in the ABC algorithm, the position of a nectar source is represented by a coordinate in D-dimensional space. It is the solution vector $\overrightarrow{\text{x}}$ of some specific problem, and the quality of the nectar source is represented by the objective function $f\left(\overrightarrow{\text{x}}\right)$ of this problem. Accordingly, optimization of the problem is implemented by simulating the behaviors of the three kinds of bees. The flowchart of the original ABC algorithm is shown in Figure 1. The main steps are as follows.

1.
Parameter initialization of the ABC algorithm. Population number (PN) and the scout bee triggering threshold (Limit) are the key parameters of the ABC algorithm. A maximum cycle number (MCN) or an ideal fitness threshold (IFT) can be set to terminate the algorithm. As stated in formula (1), all variables to be optimized form a D-dimensional vector $\overrightarrow{\text{x}}$. Each variable is restricted by an upper bound (UB) and a lower bound (LB).

2.
Bee colony initialization. In the ABC algorithm, since SB transform into EB, they are not counted in PN. Generally, the initial nectar sources are found by PN/2 SB, which then all transform into EB. The other PN/2 bees are OB. The initial PN/2 solutions are generated by formula (2) in principle; specified initial values can be used if needed. All further modifications are based on these PN/2 solutions, which correspond to the PN/2 EB.
$${x}_{i}^{\left(j\right)}=L{B}^{\left(j\right)}+{\varphi}_{i}^{\left(j\right)}\left(U{B}^{\left(j\right)}-L{B}^{\left(j\right)}\right),\phantom{\rule{1em}{0ex}}i=1,2,\dots ,PN/2,\phantom{\rule{1em}{0ex}}j=1,2,\dots ,D$$(2)
where ${x}_{i}^{\left(j\right)}$ is the j-th element of the i-th solution, and ${\varphi}_{i}^{\left(j\right)}$ is a uniformly distributed random real number in the range [0, 1]. The objective function $f\left(\overrightarrow{\text{x}}\right)$ is introduced to estimate the fitness of each solution $\overrightarrow{\text{x}}$. For parameter optimization of the SVM classifier, $f\left(\overrightarrow{\text{x}}\right)$ could be the minimum classification error or the maximum classification accuracy. The vector Failure is a counter of length PN/2, initialized to zero, which counts the optimization failures of each EB.
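As an illustrative sketch (not the authors' MATLAB code), the initialization of formula (2) can be written in Python; the bounds `lb` and `ub` below are hypothetical placeholders:

```python
import numpy as np

def init_colony(pn, d, lb, ub, seed=None):
    """Generate the PN/2 initial solutions of formula (2):
    x_i^(j) = LB^(j) + phi_i^(j) * (UB^(j) - LB^(j)), phi ~ U[0, 1]."""
    rng = np.random.default_rng(seed)
    phi = rng.random((pn // 2, d))      # uniform random in [0, 1]
    return lb + phi * (ub - lb)         # broadcast over the D bounds

# Hypothetical bounds for a 3-dimensional solution vector
lb = np.array([0.01, 0.001, 0.1])
ub = np.array([100.0, 10.0, 10.0])
solutions = init_colony(pn=20, d=3, lb=lb, ub=ub, seed=0)
failure = np.zeros(len(solutions), dtype=int)   # Failure counters, length PN/2
```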

3.
Each cycle includes following phases:

1)
Employed bee: Each EB randomly modifies a single element ${x}_{i}^{\left(j\right)}$ of source i by formula (3). The fitness of the two solutions (before and after modification) is then estimated. The greedy selection criterion chooses the one with better fitness, and the reserved one becomes the new solution of this EB. If the fitness of the EB is not improved by the modification, the corresponding Failure counter is incremented by 1.
$${\overline{x}}_{i}^{\left(j\right)}={x}_{i}^{\left(j\right)}+{\lambda}_{i}^{\left(j\right)}\left({x}_{i}^{\left(j\right)}-{x}_{k}^{\left(j\right)}\right)$$(3)
where ${x}_{i}^{\left(j\right)}$ is defined as in formula (2), and ${\overline{x}}_{i}^{\left(j\right)}$ is the corresponding new element of the solution after modification. ${\lambda}_{i}^{\left(j\right)}$ is a uniformly distributed random real number in the range [−1, 1], and ${x}_{k}^{\left(j\right)}$ is the j-th element of ${\overrightarrow{x}}_{k}$. Note that k ≠ i.
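A minimal Python sketch of this RSEM employed-bee pass (assuming a minimization objective `f`, which is our illustrative choice, not the authors' code) shows formula (3) together with the greedy selection and the Failure counter:

```python
import numpy as np

def employed_bee_step(solutions, f, failure, rng):
    """One RSEM employed-bee pass: each solution i perturbs a single random
    element j toward/away from a random neighbour k (formula (3)) and keeps
    the change only if the objective improves (greedy selection)."""
    pn2, d = solutions.shape
    for i in range(pn2):
        j = int(rng.integers(d))            # randomly selected element
        k = int(rng.integers(pn2))
        while k == i:                       # neighbour index k != i
            k = int(rng.integers(pn2))
        candidate = solutions[i].copy()
        lam = rng.uniform(-1.0, 1.0)        # lambda in [-1, 1]
        candidate[j] += lam * (solutions[i, j] - solutions[k, j])
        if f(candidate) < f(solutions[i]):  # greedy: keep the better solution
            solutions[i] = candidate
        else:
            failure[i] += 1                 # stagnation counter for the scout phase
    return solutions, failure

# Toy usage on the sphere function f(x) = sum(x^2)
rng = np.random.default_rng(1)
sols = rng.uniform(-5.0, 5.0, (6, 4))
before = [float(np.sum(s**2)) for s in sols]
sols, fails = employed_bee_step(sols, lambda x: float(np.sum(x**2)),
                                np.zeros(6, dtype=int), rng)
after = [float(np.sum(s**2)) for s in sols]
```

Because of the greedy criterion, each solution's objective value can only stay the same or decrease in a pass.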

2)
Estimate the recruiting probability. By formula (4), the fitness and recruiting probability of each EB are calculated.
$$\{\begin{array}{c}\hfill \text{Fitness(i)}=\frac{1}{1+f\left({\overrightarrow{\text{x}}}_{\text{i}}\right)}\hfill \\ \hfill \text{prob(i)}=\frac{\text{Fitness(i)}}{\sum _{i=1}^{\text{PN/}2}\text{Fitness(i)}}\hfill \end{array}$$(4) 
3)
Onlooker bee: the ‘roulette wheel’ selection mechanism is introduced. It makes each OB follow one of the EB according to the recruiting probability. Since better solutions correspond to larger recruiting probabilities, they get more chances to be optimized. Each solution is then modified again by its followers (OB), using the same steps as in the employed bee phase.
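Formula (4) and the roulette wheel draw can be sketched as follows (a sketch, with NumPy's weighted `choice` standing in for the wheel):

```python
import numpy as np

def recruiting_probabilities(objective_values):
    """Formula (4): Fitness(i) = 1 / (1 + f(x_i)); prob(i) normalizes
    fitness over the PN/2 employed bees (assumes f(x) >= 0)."""
    fitness = 1.0 / (1.0 + np.asarray(objective_values, dtype=float))
    return fitness / fitness.sum()

def roulette_select(prob, n_onlookers, seed=None):
    """Each onlooker follows one employed bee, drawn with probability prob."""
    rng = np.random.default_rng(seed)
    return rng.choice(len(prob), size=n_onlookers, p=prob)

objective = [0.5, 2.0, 0.1]                  # lower f = better nectar source
prob = recruiting_probabilities(objective)
followers = roulette_select(prob, n_onlookers=10, seed=0)
```

The best source (smallest objective value) receives the largest recruiting probability, so it attracts the most onlookers on average.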

4)
Record the best solution. All PN/2 solutions after modification are ranked by fitness, and the best solution of the current cycle is reserved. The termination conditions are then checked: the algorithm stops when the cycle counter reaches MCN or an ideal solution is found (IFT is reached).

5)
Scout bee: If the Failure counter of any solution exceeds Limit, the corresponding solution is abandoned and a scout bee is triggered. For example, if the l-th solution is abandoned, a new solution is generated to replace it using formula (2) with i = l.

By the above operations, the ABC algorithm performs optimization. Nevertheless, in both the EB and OB phases, the algorithm modifies only a single element of the solution in each cycle. If the length D of the solution vector is large, the improvement in each cycle is inefficient. In [2], MR is proposed, a real-valued factor in [0, 1]. For element ${x}_{i}^{\left(j\right)}$ of solution i, a uniformly distributed random real number ($0\le {R}_{i}^{\left(j\right)}\le 1$) is produced. If ${R}_{i}^{\left(j\right)}\le MR$, element ${x}_{i}^{\left(j\right)}$ is modified; otherwise it is left unchanged. Moreover, if all ${R}_{i}^{\left(j\right)}$ are larger than MR, at least one element is still modified, as in the original algorithm. Although this MR-ABC algorithm improves the convergence rate of the basic algorithm to some extent, extensive experiments show that its robustness is not ideal.
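The MR scheme of [2] amounts to a per-element mask; the fallback branch below implements the ‘at least one element’ rule just described (an illustrative sketch, not code from [2]):

```python
import numpy as np

def mr_mask(d, mr, seed=None):
    """Element j is marked for modification iff R_j <= MR; if no element
    qualifies, one random element is still modified, as in the original
    ABC algorithm."""
    rng = np.random.default_rng(seed)
    mask = rng.random(d) <= mr          # R_j ~ U[0, 1], compared with MR
    if not mask.any():
        mask[rng.integers(d)] = True    # guarantee at least one modification
    return mask

mask_all = mr_mask(5, 1.0, seed=0)      # MR = 1: every element is modified
mask_one = mr_mask(5, 0.0, seed=0)      # MR = 0: fallback keeps one element
```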
DABC algorithm
The original idea of the ABC algorithm is to perform hierarchical optimization: overall, global searching is performed by EB and local searching by OB. However, this idea is not prominent in traditional ABC algorithms, because the modifying extents of EB and OB are similar and relatively fixed. A dynamic modifying extent is more reasonable. To achieve more effective optimization, the activity of the bees must be dynamic in different stages of the algorithm. Our idea is that global searching should be dominant in early cycles and local searching should be primary in later cycles. This is more consistent with the actions of real bees: EB are the main force initially, then more and more OB follow and play the major role afterwards. Specifically, in the early stages of optimization, the EB phase audaciously modifies more elements of $\overrightarrow{\text{x}}$, which moves the bees toward better solutions with greater probability. Furthermore, OB become active in later stages and modify more elements of $\overrightarrow{\text{x}}$, which provides more opportunities to jump out of local optima.
Consequently, we propose a dynamic ‘activity’ factor and introduce it into the modification operations of the EB and OB phases to adjust the number of modified elements of the solution vector in each cycle. The ‘activity’ factor δ can be defined in the following two forms, by formulas (5) and (6), alternatively:
where C_{ c } is the current cycle number, D is the length of the solution, and F_{ c } is the current best fitness. The choice between the two definitions depends on the termination condition of the algorithm: if MCN is used to terminate the optimization, δ is defined as in formula (5); if IFT is employed, δ is defined as in formula (6). δ_{EB} and δ_{OB} are the ‘activity’ factors of EB and OB, respectively. Taking τ as the progress rate of the optimization, δ_{EB} and δ_{OB} are subject to: (1) δ_{EB} decreases as τ grows, when τ is within the first half of the total progress, with ${\delta}_{\mathit{EB}}\in \left[0,1\right]$; (2) δ_{OB} grows with τ, when τ is beyond half of the total progress, with ${\delta}_{\mathit{OB}}\in \left[0,1\right]$. Explicit formulas can be determined according to the specific problem. In this article, the following scheme is suggested when MCN is utilized as the termination condition of the algorithm.

(1)
In the early stages, for the EB phase, δ_{EB} is defined by formula (7); it decreases as C_{ c } increases, and N_{EB} elements are randomly picked to be modified. For the OB phase, the MR method is recommended. Thus audacious global modification and conservative local modification are implemented.
$$\begin{array}{c}\hfill EB\to {N}_{\mathit{EB}}=\left[{\delta}_{\mathit{EB}}\cdot D\right]=\left[\left(1-\frac{{C}_{c}}{MCN}\right)\cdot D\right]\hfill \\ \hfill OB\to \text{MR}\hfill \end{array}\}\phantom{\rule{1em}{0ex}}\text{if}\phantom{\rule{0.3em}{0ex}}{C}_{c}\le \frac{MCN}{2}$$(7)
(2)
In the later stages, for the EB phase, the MR method is reused; for the OB phase, δ_{OB} is defined by formula (8). It increases as C_{ c } grows, and N_{OB} elements are randomly picked to be modified. Thus conservative global modification and audacious local modification are implemented.
$$\begin{array}{c}\hfill EB\to \text{MR}\hfill \\ \hfill OB\to {N}_{\mathit{OB}}=\left[{\delta}_{\mathit{OB}}\cdot D\right]=\left[\frac{{C}_{c}}{MCN}\cdot D\right]\hfill \end{array}\}\phantom{\rule{1em}{0ex}}\text{if}\phantom{\rule{0.3em}{0ex}}{C}_{c}>\frac{MCN}{2}$$(8)
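The suggested schedule of formulas (7) and (8) can be sketched as a single function; reading the bracket [·] as rounding is our assumption about the notation:

```python
def dabc_schedule(c, mcn, d):
    """Return which phase uses the dynamic activity factor in cycle c and
    how many elements it modifies; the other phase falls back to MR.
    First half:  EB modify N_EB = [(1 - c/MCN) * D] elements (formula (7)).
    Second half: OB modify N_OB = [(c/MCN) * D] elements (formula (8))."""
    if c <= mcn / 2:
        return "EB", max(1, round((1 - c / mcn) * d))   # audacious global search
    return "OB", max(1, round((c / mcn) * d))           # audacious local search

early = dabc_schedule(10, 100, 20)    # early cycle: EB modify many elements
late = dabc_schedule(90, 100, 20)     # late cycle: OB modify many elements
```

The schedule is symmetric: EB activity decays from near D toward D/2 over the first half, and OB activity grows from D/2 toward D over the second half.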
Furthermore, the behavior of the DABC algorithm is closely tied to the length D of the solution vector. When D is small, there is practically no difference between the original ABC algorithm and the DABC algorithm; for larger D, the advantages of the DABC algorithm in convergence rate and solution quality are prominent.
Multi-parameters optimization of SVM-based soft-margin classifier
Introduction to SVM parameter optimization
As is well known, training a soft-margin classifier is the constrained optimization problem of formula (9), where l is the number of samples, x_{ i } is the i-th sample, and y_{ i } is the label of sample i:
$$\begin{array}{l}\underset{w,b,\zeta }{min}\phantom{\rule{0.5em}{0ex}}\frac{1}{2}{\Vert w\Vert}^{2}+C\sum _{i=1}^{l}{\zeta}_{i}\\ \text{s.t.}\phantom{\rule{0.5em}{0ex}}{y}_{i}\left(w\cdot {x}_{i}+b\right)\ge 1-{\zeta}_{i},\phantom{\rule{1em}{0ex}}{\zeta}_{i}\ge 0,\phantom{\rule{1em}{0ex}}i=1,\dots ,l\end{array}$$(9)
It is a quadratic programming problem that maximizes the margin ($2/\Vert w\Vert $) while restricting the classification error.
To handle the unbalanced training-sample problem, the ‘slack variable’ (ζ) and ‘cost’ factor (C) are introduced to process outlier samples and compromise the position of the optimal separating hyperplane. A large C attaches more importance to the loss on outliers of the corresponding class. The SVM needs to assign a different C to each class; if these cost factors are not properly set, poor classification results are obtained, and experience-based settings are not robust. As a result, a multi-parameter optimization problem must be solved, and the number of parameters to be optimized grows with the number of classes.
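To make the role of per-class cost factors concrete, here is a small numerical sketch of the weighted soft-margin primal objective (a hypothetical two-class case; the per-class costs play the role of the q_j·C scheme used later in this article):

```python
import numpy as np

def weighted_soft_margin_objective(w, b, X, y, cost_per_class):
    """0.5*||w||^2 + sum_i C_{y_i} * zeta_i, where zeta_i is the hinge slack
    max(0, 1 - y_i * (w . x_i + b)) and C_{y_i} is the cost factor of the
    class of sample i."""
    slack = np.maximum(0.0, 1.0 - y * (X @ w + b))          # zeta_i
    cost = np.array([cost_per_class[label] for label in y])  # per-class C
    return 0.5 * float(w @ w) + float(np.sum(cost * slack))

# Hand-checkable example: only the third sample violates the margin
w, b = np.array([1.0, 0.0]), 0.0
X = np.array([[2.0, 0.0], [-2.0, 0.0], [0.5, 0.0]])
y = np.array([1, -1, 1])
obj = weighted_soft_margin_objective(w, b, X, y, {1: 1.0, -1: 2.0})
```

Raising the cost of one class makes slack on that class's samples more expensive, which is exactly how the optimizer is pushed to move the hyperplane toward the cheaper class.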
Additionally, the parameters of the kernel function of the SVM need to be optimized. Solving (9) with the Lagrange multiplier method, the separating classification function can be obtained as formulas (10) and (11), where α_{ i } is the Lagrange multiplier.
When samples are linearly inseparable, the SVM handles the nonlinear problem as linear classification in a high-dimensional space, which is performed by a kernel function as in formula (12). Both the number and type of parameters to be optimized are determined by the kernel function.
All of the above parameters to be optimized compose a vector $\overrightarrow{\text{x}}$, and the multi-parameter optimization problem is defined as (13):
$$\underset{\overrightarrow{x}}{max}\phantom{\rule{0.5em}{0ex}}f\left(\overrightarrow{x}\right),\phantom{\rule{1em}{0ex}}\overrightarrow{x}=\left\{C,\gamma ,{q}_{1},{q}_{2},\dots ,{q}_{n}\right\}$$(13)
where C is the cost factor, γ is the kernel parameter, and n is the number of classes; ${q}_{j}\phantom{\rule{0.2em}{0ex}}(j=1,2,\dots ,n)$ are the weight parameters of each class, which set the cost factor of class j to ${q}_{j}\cdot C$. Moreover, to obtain a credible classification accuracy, ‘k-fold’ cross-validation is utilized to test the performance of the SVM classifier; in our experiments, k is set to 10. The objective function $f\left(\overrightarrow{x}\right)$ is defined as the average classification accuracy of the ‘10-fold’ cross-validation, as in formula (14):
$$f\left(\overrightarrow{x}\right)=\frac{1}{10}\sum _{i=1}^{10}\text{Accurac}{\text{y}}_{i}$$(14)
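The cross-validation objective of formula (14) can be sketched as follows; `train_and_score`, which stands in for one LIBSVM train/test round, is a placeholder assumption, not the authors' code:

```python
import numpy as np

def cv_accuracy(params, X, y, train_and_score, k=10, seed=None):
    """Objective f(x) of formula (14): the average classification accuracy
    over k cross-validation folds for one candidate parameter vector
    x = (C, gamma, q_1, ..., q_n)."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))               # shuffle before splitting
    folds = np.array_split(idx, k)
    accs = []
    for i in range(k):
        test = folds[i]                         # fold i held out for testing
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        accs.append(train_and_score(params, X[train], y[train],
                                    X[test], y[test]))
    return float(np.mean(accs))

# Toy usage with a constant scorer standing in for SVM train/test
X = np.zeros((20, 2))
y = np.arange(20) % 2
constant_scorer = lambda params, Xtr, ytr, Xte, yte: 0.9
score = cv_accuracy(None, X, y, constant_scorer, k=10, seed=0)
```

Each call to this objective trains k SVMs, which is why objective evaluations dominate the running time of the optimization, as discussed in the Experiments section.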
Consequently, this problem can be solved by an optimization algorithm. Since multiple parameters need to be optimized, our DABC algorithm is more suitable than the traditional ABC algorithm. The flowchart of DABC-based multi-parameter optimization is shown in Figure 2.
For a set of training samples, the DABC algorithm modifies the parameter vector $\overrightarrow{x}=\left\{C,\gamma ,{q}_{1},{q}_{2},\dots ,{q}_{n}\right\}$ cycle by cycle, searching for the best $\overrightarrow{x}$ that maximizes the classification accuracy.
Experiments
In this article, the multiclass SVM-based soft-margin classifier is implemented with the C-support vector classification (C-SVC) tool from the LIBSVM toolbox supplied by Chang and Lin [14]. The toolbox supplies several typical kernel functions; the radial basis function is employed as the kernel in our experiments, so the kernel parameter γ needs to be optimized. For n-class classification, all parameters to be optimized and their ranges are presented in Table 1. Obviously, the length of the vector $\overrightarrow{x}$ is D = n + 2. The datasets utilized for SVM training are shown in Table 2. ‘Wine’ and ‘Image Segment’ are two typical testing datasets widely used for testing SVM-based classifiers. The two ‘building’ datasets were collected by us especially for the multi-parameter optimization problem.
The performances of the PSO algorithm, the original ABC algorithm, the MR-ABC algorithm, and the DABC algorithm are compared on this optimization problem. All algorithms are coded in MATLAB 2011b. The main hardware configuration of our computer is: Intel® Core(TM)2 Duo CPU P8400 @ 2.26 GHz, 2.00 GB RAM.
According to the principle of fair comparison: (1) the corresponding initialization parameters are set identically in these algorithms, as Table 3 shows; the settings follow [2]; (2) the same starting search points are used to initialize the colony, as shown in Table 4 and the Appendix. Mean values over 20 runs of the different algorithms are collected as the final results for the four datasets, shown in Figures 3, 4, 5, and 6 and Table 4. In particular, to verify the robustness of the different algorithms, the standard deviations over the 20 runs are given in Figures 7 and 8 for the two high-dimensional datasets.
Note that we measure the convergence rate in cycles for the following reasons. Generally, the computational time of calling the objective function is much larger than that of the other parts of the ABC algorithm, particularly when the objective function includes multiple rounds of SVM training. The SVM training is time-consuming, and in each cycle the objective function is called many times (the same number of times per cycle for the different algorithms, determined by the parameter PN). Objective function calls occupy more than 90% of the computational time of both the original ABC algorithm and the modified ones (for instance, on dataset 4, each cycle costs about 22.3 s for the DABC algorithm, 21.7 s for the MR-ABC algorithm, and 20.5 s for the original ABC algorithm; obviously the objective function costs most of the time in each cycle). Moreover, every time the objective function is called, the solution is modified. Therefore, it is more reasonable to measure the convergence rate in cycles than in time. On the contrary, if the convergence rate were measured over a specified computational time, each solution might be modified a different number of times, which could be unfair.
As shown in Figure 3, for dataset 1, the DABC algorithm rapidly finds a solution $\overrightarrow{\text{x}}$ with which the SVM best fits the training data, obtaining a classification accuracy of 98.88%, while the original ABC algorithm and MR-ABC reach that solution slowly. Though the PSO algorithm has a good convergence rate, it cannot get an ideal solution. As shown in Figure 4, for dataset 2, compared with the original ABC algorithm and MR-ABC, the DABC algorithm similarly yields a better convergence rate and higher classification accuracy, and the improvement is more obvious than on dataset 1. The PSO algorithm still converges fast but to an unsatisfactory solution; by contrast, the DABC algorithm obtains a classification accuracy of 92.38%.
The results above demonstrate the advantages of the DABC algorithm for lower-dimensional parameter vectors such as datasets 1 and 2. Furthermore, datasets 3 and 4 were collected for testing the optimization of higher-dimensional $\overrightarrow{\text{x}}$. As shown in Figures 5 and 6 and Table 5, similar conclusions can be drawn: the DABC algorithm has clear advantages over the other algorithms, with greater improvement in convergence rate and solution quality, especially when D is larger. Moreover, the standard deviations (over 20 runs) for the two datasets are shown in Figures 7 and 8, respectively. The curves illustrate the standard deviations of the objective function in different optimization cycles; relatively lower standard deviations are obtained by the DABC algorithm for both datasets 3 and 4, which indicates that the DABC algorithm has good robustness.
Conclusion and discussion
In this article, two parts of work have been presented. First, the DABC algorithm is introduced to remedy the disadvantages of traditional ABC algorithms: poor convergence rate and susceptibility to local optima. Second, the DABC algorithm is utilized for multi-parameter optimization of the SVM classifier. Experimental results demonstrate that the DABC algorithm is in many ways superior to traditional ABC algorithms, effectively improving the convergence rate and mitigating local optima. Typically, for multi-parameter optimization with a larger length D of the vector to be optimized, our study provides substantial evidence for the advantages of the DABC algorithm in solution quality and convergence rate. When the DABC algorithm is employed for optimizing the multiple parameters of the SVM-based soft-margin classifier, great improvement in classifier performance is obtained, and the robustness of the DABC algorithm is verified. Furthermore, the idea of the DABC algorithm could be combined with other modified ABC algorithms whose modifications target other phases of the original ABC algorithm, which might further improve the traditional ABC algorithm in future work.
Abbreviations
 ABC:

Artificial bee colony
 DABC:

Dynamic artificial bee colony
 EB:

Employed bees
 PSO:

Particle swarm optimization
 RSEM:

Randomly single element modification
 IFT:

Ideal fitness threshold
 LB:

Lower bound
 MCN:

Maximum cycle number
 MR:

Modification rate
 MR-ABC:

Modification rate artificial bee colony
 OB:

Onlooker bees
 PN:

Population number
 SB:

Scout bees
 SVM:

Support vector machine
 UB:

Upper bound.
References
 1.
Karaboga D: An Idea Based on Honey Bee Swarm for Numerical Optimization. Technical Report, TR06. Erciyes University Press, Erciyes; 2005.
 2.
Karaboga D, Akay B: A modified artificial bee colony (ABC) algorithm for constrained optimization problems. Appl. Soft Comput. 2010, 11(3):3021–3031. 10.1016/j.asoc.2010.12.001
 3.
Akay B, Karaboga D: A modified artificial bee colony algorithm for real-parameter optimization. Inf. Sci. 2012, 192: 120–142. 10.1016/j.ins.2010.07.015
 4.
Singh A: An artificial bee colony algorithm for the leaf-constrained minimum spanning tree problem. Appl. Soft Comput. 2010, 9(2):625–631. 10.1016/j.asoc.2008.09.001
 5.
Bao L, Zeng J: Comparison and analysis of the selection mechanism in the artificial bee colony algorithm. Paper presented at Ninth International Conference on Hybrid Intelligent Systems (HIS '09), Shenyang, LiaoNing, China; 2009:411–416.
 6.
Parpinelli RS, Benitez CMV, Lopes HS: Parallel approaches for the artificial bee colony algorithm. Handbook of Swarm Intelligence: Concepts, Principles and Applications 2011, 8: 329.
 7.
Haijun D, Qingxian F: Artificial bee colony algorithm based on Boltzmann selection policy. Comput. Eng. Appl. 2009, 45(31):53–55.
 8.
Tsai P, Pan J, Liao B, Chu S: Interactive artificial bee colony (IABC) optimization. ISI2008, Taipei Taiwan; 2008.
 9.
Akbari R, Mohammadi A, Ziarati K: A novel bee swarm optimization algorithm for numerical function optimization. Commun. Nonlinear Sci. Numer. Simul. 2010, 15: 3142–3155. 10.1016/j.cnsns.2009.11.003
 10.
Cortes C, Vapnik V: Support-vector networks. Mach. Learn. 1995, 20: 273–297. 10.1007/BF00994018
 11.
Samadzadegan F, Soleymani A, Abbaspour RA: Evaluation of genetic algorithms for tuning SVM parameters in multi-class problems. Paper presented at 11th International Symposium on Computational Intelligence and Informatics (CINTI), Budapest; 2010:323–328.
 12.
Yang L, Wang HT: Classification based on particle swarm optimization for least square support vector machines training. Paper presented at Third International Symposium on Intelligent Information Technology and Security Informatics (IITSI), Jinggangshan; 2010:246–249.
 13.
Hsieh TJ, Yeh WC: Knowledge discovery employing grid scheme least squares support vector machines based on orthogonal design bee colony algorithm. IEEE Trans. Syst. Man Cybern. B: Cybernetics 2011, 41(5):115. 10.1109/TSMCB.2011.2116007
 14.
Chang CC, Lin CJ: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2011, 2(3):27. 10.1145/1961189.1961199
Additional information
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Yan, Y., Zhang, Y. & Gao, F. Dynamic artificial bee colony algorithm for multi-parameters optimization of support vector machine-based soft-margin classifier. EURASIP J. Adv. Signal Process. 2012, 160 (2012). https://doi.org/10.1186/1687-6180-2012-160
Keywords
 Dynamic artificial bee colony algorithm
 Multi-parameters optimization
 Support vector machine
 Soft-margin classifier