Mitigating Metaphors: A Comprehensible Guide to Recent Nature-Inspired Algorithms
Citation: Lones, M. A., Mitigating Metaphors: A Comprehensible Guide to Recent Nature-Inspired Algorithms, SN Computer Science 1, 49 (2020). This is the author’s own version. The published version is available at DOI: 10.1007/s42979-019-0050-8.
Abstract
In recent years, a plethora of new metaheuristic algorithms have explored different sources of inspiration within the biological and natural worlds. This nature-inspired approach to algorithm design has been widely criticised. A notable issue is the tendency for authors to use terminology that is derived from the domain of inspiration, rather than the broader domains of metaheuristics and optimisation. This makes it difficult to both comprehend how these algorithms work and understand their relationships to other metaheuristics. This paper attempts to address this issue, at least to some extent, by providing accessible descriptions of the most cited nature-inspired algorithms published in the last twenty years. It also discusses commonalities between these algorithms and more classical nature-inspired metaheuristics such as evolutionary algorithms and particle swarm optimisation, and finishes with a discussion of future directions for the field.
1 Introduction
This paper is intended to be an objective guide to the most popular nature-inspired optimisation algorithms published since the year 2000, measured by citation count. It is not the first paper to review this area [67, 15, 65], but it is arguably the first to present these algorithms in terms that will be familiar to the broader optimisation, metaheuristics, evolutionary computation, and swarm computing communities. Unlike some previous reviews, it does not aim to advocate for this area of research or provide support for the idea of designing algorithms based upon observations of natural systems. It only aims to report and summarise what already exists in more accessible terms.
The aim of this paper is not to explicitly criticise these approaches; other authors have already done this for nature-inspired metaheuristics in general [55] and for specific nature-inspired algorithms [8, 62]. However, it is important to be aware of one point of criticism that was raised by [55]. This is the tendency for authors to present their algorithm from the perspective of, and using the terminology of, the domain of inspiration. Often nature-inspired algorithm papers begin with an initial review of a natural domain, then abstract this into a model of the domain, and this leads to an algorithmic description that contains terms from the domain. In many cases, this includes the introduction of new terms to describe well-established concepts from metaheuristics and optimisation. The consequence of this is that it can take considerable time and effort to understand how these algorithms work, even if the reader has a background in metaheuristics.
Well over a hundred nature-inspired algorithms have been published since 2000. For instance, the review book by Xing and Gao [65] names 134 of these, and the Evolutionary Computation Bestiary [2] currently lists over 200. The premise for developing new algorithms is often based solely on the desire to capture a behaviour observed in nature, with the assumption (rightly or wrongly) that it will also be relevant within an optimisation context. In more recent papers, it has become common to mention the No Free Lunch theorem [64, 22] as a motivation. This theorem states that no optimiser is better than any other when considered across all possible optimisation problems, which can be interpreted as suggesting a need for diverse optimisers in order to solve diverse problems. Whether this is a valid assumption for the range of real world problems that optimisers are applied to in practice is unclear. Nevertheless, different optimisers are known to perform well on different problems, so there is some value to this argument.
These algorithms have gained a significant uptake. This can be seen in their citation counts: the 32 algorithms reviewed in this paper each have more than 200 citations; a third of them have more than 1000 citations. Given that most computer science papers achieve only a handful of citations per year, this is quite an achievement for a group of papers with an average age of around nine years. However, this combination of high uptake and opaque descriptions has led to fragmentation between the nature-inspired optimisation community and the wider metaheuristics community. To raise an observation that should be familiar to these communities: a certain amount of diversification is generally a good thing, but diversification without intensification tends to be ineffective. Applying this observation to the design of optimisation algorithms suggests that focusing on variants of a single nature-inspired algorithm (and most algorithms discussed in this paper do have a significant number of variants) is likely to be a suboptimal approach and a potential waste of time and effort. This, in turn, suggests a need to tie back together these different threads of search. This paper aims to contribute towards this goal.
Section 2 presents the approach used to describe algorithms in this paper. Section 3 then uses this approach to describe the most widely-cited recent nature-inspired algorithms; the intent is for this to be used as a resource where someone can look up a particular algorithm and quickly gain an understanding of its main characteristics. Section 4 then discusses the novelty of these algorithms in terms of both metaheuristic frameworks and broader metaheuristic concepts. Section 5 delves further into the specific overlaps between these algorithms and particle swarm optimisation and its variants. Section 6 discusses some of the broader issues, and offers guidance on how research carried out in this area could be improved. Section 7 concludes.
2 Descriptions and Terminology
This paper attempts to describe algorithms using standard terms. However, this is not as straightforward as it may seem, since different parts of the metaheuristics community use different terminology. For example, those who work with local search algorithms refer to the transition between two points in the search space as a move, and the result of evaluating a point is known as its objective value. In the EA community, where much of the terminology derives from biological roots, these would be called mutation and fitness. In practice, both sets of terms are widely used. However, since the aim of this guide is to divorce nature-inspired algorithms from the terminology of their domain, generic terms will be used wherever possible, i.e. move rather than mutation.
When describing population-based algorithms, a further difficulty is that some algorithms are more naturally described using process-centric terms and others using population-centric terms. Particle swarm optimisation (PSO), for example, is essentially a distributed algorithm, and is easiest to present in terms of interactions between search processes. Genetic algorithms (GA), on the other hand, involve population-level operations such as selection; although these could be described as interactions between search processes, this would be convoluted and would make the algorithm harder to understand. Hence, in this paper a mixture of process-centric and population-centric terminology is used, depending on whether the algorithm is most appropriately described as the former or the latter.
In general, an attempt has been made to keep descriptions succinct and generic, whilst avoiding the definition of new terms. Little or no reference is made to an algorithm’s source of inspiration from nature, unless this is required to understand the algorithm. Descriptions are intended to be sufficient to indicate the general characteristics of the algorithm, and to allow the reader to draw out similarities with other algorithms. They are not intended to be exhaustive, and hence some of the less important, or less specific, details are omitted. For example, consider the following description of PSO:
Particle Swarm Optimisation (PSO)
Eberhart and Kennedy, 1995, [11], $>$50000 citations
Each search process has a velocity within the search space, and carries out moves by adding this to its current position at each iteration. The velocity is initially random. Then, at each iteration, each search process modifies its velocity by adding weighted terms based on the vector difference between its current point of search and the best points seen by both itself and by a subset of other search processes. This causes intensification of search by moving towards regions of the search space known to contain points of relatively high objective value. Diversification is provided by moving through the region between the current point and these target regions, and by overshooting these regions due to the momentum gained by maintaining a proportion of the existing velocity at each update.
Unlike other presentations of this algorithm, this description does not use the terms particle or informant, since both of these can be described using generic terms. It does not go into detail about the exact form of each term in the velocity update equation, or how informants are allocated, since these details are not required to understand how the algorithm works, or how it relates to other algorithms. They are also subject to wide variation between implementations. The term velocity, however, is used, since it is a well-defined concept within a vector space, and helps to understand the behaviour of the algorithm. The description also highlights algorithmic features which are expected to promote intensification and diversification of search.
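To make the PSO description concrete, the following is a minimal sketch of one possible instantiation, not a reference implementation. It assumes maximisation, a real-valued box-bounded search space, and that every search process is informed by the whole population; the function name `pso`, the parameter names, and the coefficient values are illustrative choices rather than anything specified in [11].

```python
import random


def pso(objective, dim, bounds, n_processes=20, iterations=100,
        inertia=0.7, c_self=1.5, c_others=1.5, seed=0):
    """Maximise `objective` over a box; an illustrative sketch only."""
    rng = random.Random(seed)
    lo, hi = bounds
    span = hi - lo
    # Each search process has a position, a velocity and a memory of its best point.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_processes)]
    vel = [[rng.uniform(-span, span) for _ in range(dim)] for _ in range(n_processes)]
    best_pos = [p[:] for p in pos]
    best_val = [objective(p) for p in pos]
    for _ in range(iterations):
        # Assumption: the informing subset is the whole population.
        leader = max(range(n_processes), key=lambda i: best_val[i])
        target = best_pos[leader][:]
        for i in range(n_processes):
            for d in range(dim):
                # Keep part of the old velocity (momentum), then add weighted
                # vector differences towards the two remembered best points.
                vel[i][d] = (inertia * vel[i][d]
                             + c_self * rng.random() * (best_pos[i][d] - pos[i][d])
                             + c_others * rng.random() * (target[d] - pos[i][d]))
                # The move: add the velocity to the position, kept inside the box.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            value = objective(pos[i])
            if value > best_val[i]:
                best_val[i], best_pos[i] = value, pos[i][:]
    leader = max(range(n_processes), key=lambda i: best_val[i])
    return best_pos[leader], best_val[leader]


def neg_sphere(x):
    """Toy objective with its maximum (zero) at the origin."""
    return -sum(v * v for v in x)


best_point, best_value = pso(neg_sphere, dim=2, bounds=(-5.0, 5.0))
print(best_point, best_value)
```

The `inertia` term is what produces the overshooting mentioned in the description; setting it to zero would leave only the two intensifying terms.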
A GA can be described as follows:
Genetic Algorithm (GA)
Holland, 1975, [21], $>$60000 citations
At each iteration, search points with relatively high objective value are selected from the existing population. These are organised into pairs, and new search points are then created by exchanging solution components within pairs. This tends to sample the search space between existing search points, thereby both intensifying and diversifying search. Further diversification is provided by carrying out a random move away from the resulting points. Search points with relatively low objective value are removed from the population at each iteration, further intensifying search.
This description is intentionally generic, since the exact details of how selection, recombination, mutation and solution replacement are implemented vary considerably. Domain-derived terms like crossover and mutation are not used, though select is, since this has a clear non-domain meaning.
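As an illustration of the GA description, the sketch below fills in the unspecified details with common but arbitrary choices: binary solutions, binary tournament selection, a single exchange point within each pair, a per-component random flip as the random move, and full replacement of the population apart from one preserved best point. All names and parameter values are assumptions made for the example.

```python
import random


def genetic_algorithm(objective, n_bits, pop_size=30, iterations=60,
                      flip_prob=None, seed=0):
    """Maximise `objective` over bit lists; an illustrative sketch only."""
    rng = random.Random(seed)
    flip_prob = 1.0 / n_bits if flip_prob is None else flip_prob
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]

    def pick(scored):
        # Selection: the better of two randomly chosen search points.
        a, b = rng.choice(scored), rng.choice(scored)
        return a[1] if a[0] >= b[0] else b[1]

    for _ in range(iterations):
        scored = [(objective(s), s) for s in pop]
        best = max(scored, key=lambda t: t[0])[1]
        new_pop = [best[:]]  # assumption: the current best point is preserved
        while len(new_pop) < pop_size:
            p1, p2 = pick(scored), pick(scored)
            # Exchange solution components within the pair at a random cut point.
            cut = rng.randrange(1, n_bits)
            child = p1[:cut] + p2[cut:]
            # Random move away from the resulting point, for diversification.
            child = [1 - b if rng.random() < flip_prob else b for b in child]
            new_pop.append(child)
        # Points that were not selected are removed with the old population.
        pop = new_pop
    return max(pop, key=objective)


# Toy usage: maximise the number of ones in a 20-bit string.
best = genetic_algorithm(sum, n_bits=20)
print(best, sum(best))
```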
3 Algorithms from A–Z
Given the opaque nature of a lot of these papers, it would be a challenging task to read through and understand all the nature-inspired optimisation algorithms that have been published in recent years. Perhaps in reflection of this, previous reviews have generally described these algorithms using their original authors’ words, or have focused on the sources of inspiration rather than trying to understand and present their underlying metaheuristic mechanisms. By comparison, the presentation in this paper aims to be comprehensible rather than comprehensive. Consequently, it focuses on the more popular of these algorithms. Popularity is measured in terms of citation count; this is not, of course, a robust measure of uptake, but it gives some indication of whether the algorithm has been used in practice. To bring the list of algorithms down to a manageable level, this review only covers those which have at least 200 citations, as measured by Google Scholar (citation counts were collected in October 2019). By comparison, the seminal genetic algorithm (GA) work has $\sim$60000 citations, particle swarm optimization (PSO) has $\sim$50000 citations, ant colony optimization (ACO) has $\sim$10000 citations, and evolution strategies (ES) have $\sim$5000. It is notable that a number of algorithms in the list have citation counts approaching that of ES and collectively they have $\sim$30000 citations, roughly halfway between the citation counts of ACO and PSO. So, even taking into account the limitations of citation counts, they are clearly having an impact within the scientific record, and this alone should justify efforts to document and understand them. Fig. 1 plots the approximate number of citations against the year that an algorithm’s seminal paper was published. It can be seen from the trend line that the average citation count per year is $\sim$100.
The glossary below gives an overview of these 32 algorithms. Unless indicated otherwise, it is assumed that each algorithm is a population-based optimiser which updates the population synchronously over a period of iterations and begins with a population that is uniformly sampled from the search space.
Ant Lion Optimizer (ALO)
Mirjalili, 2015b, [38], $>$300 citations
At each iteration, search points with relatively high objective value are selected from the existing population. Search processes with relatively low fitness are then restarted within hyperspherical regions centred around the selected points. Over time, the radius of the hyperspheres is reduced, further intensifying search. The restarted search processes carry out random walks within their hyperspherical regions; this is the main source of diversification within the algorithm.
Artificial Bee Colony Algorithm (ABC)
Karaboga, 2005, [23], $>$4500 citations
Each search process generates local moves in the direction of the current position of another, randomly selected, search process. Only improving moves are accepted. The number of local moves generated by a particular search process is determined by the relative objective value of its current search point. Move sizes are probabilistic and are progressively reduced over time, leading to increased intensification. Diversification is promoted by restarting, at a randomly sampled location, any search process that has not made progress within a certain number of moves.
Bacterial Foraging Optimization (BFO)
Passino, 2002, [42], $>$2500 citations
Each search process carries out sequential moves in the same direction until this no longer leads to improvement. When the current direction of search is no longer productive, a random change in direction occurs. After each iteration, the search processes with relatively low fitness are restarted at the current positions of search processes with relatively high fitness, intensifying search. The objective values of search points are adjusted by a crowding term, whose effect is to draw the search processes towards one another, further intensifying search.
Bat Algorithm (BA)
Yang and Gandomi, 2012, [70], $>$600 citations
Search processes move towards the best solution within the population at different velocities, the magnitude of which is varied randomly at each iteration. There is also a probability of intensifying search by changing each search process’s current position to a random point near the best solution within the population, with the likelihood of doing this decreasing each time a search process makes an improvement. The new search point is then accepted probabilistically, with a likelihood that increases each time a search process makes an improvement.
Bees Algorithm (BeA)
Pham et al., 2006, [44], $>$1000 citations
At each iteration, solutions are randomly sampled within a fixed radius of the best solutions within the existing population. This radius reduces progressively over time and only improved solutions are accepted. Random restarts are used to maintain diversity. This is similar to an ES with a time-dependent mutation strategy.
Big Bang-Big Crunch (BB-BC)
Erol and Eksin, 2006, [13], $>$600 citations
At the start of each iteration, a point representing the objective value-weighted average of the previous population is calculated and a new population is created by sampling from a normal distribution centred around this point. The width of the distribution is reduced at each iteration, intensifying search. This can be seen as an estimation of distribution algorithm (EDA) with a simple generative model.
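The two steps in this description (weighted averaging, then resampling from a shrinking normal distribution) can be sketched as follows. This is a rough illustration under stated assumptions rather than the formulation in [13]: it maximises, it makes the weights non-negative by shifting objective values so that the worst point has a weight close to zero, and it shrinks the distribution width in proportion to 1/k at iteration k. The function name and all parameter values are hypothetical.

```python
import random


def big_bang_big_crunch(objective, dim, bounds, pop_size=30, iterations=50, seed=0):
    """Maximise `objective` over a box; an illustrative sketch only."""
    rng = random.Random(seed)
    lo, hi = bounds
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    best_point, best_value = None, float("-inf")

    def remember_best(points, values):
        nonlocal best_point, best_value
        for p, v in zip(points, values):
            if v > best_value:
                best_point, best_value = p[:], v

    for k in range(1, iterations + 1):
        values = [objective(p) for p in pop]
        remember_best(pop, values)
        # Weighted average of the population. Assumption: weights are objective
        # values shifted so that the worst point gets (almost) zero weight.
        worst = min(values)
        weights = [v - worst + 1e-12 for v in values]
        total = sum(weights)
        centre = [sum(w * p[d] for w, p in zip(weights, pop)) / total
                  for d in range(dim)]
        # New population: normal samples around the centre, narrower each iteration.
        sigma = (hi - lo) / (2.0 * k)
        pop = [[min(hi, max(lo, rng.gauss(centre[d], sigma))) for d in range(dim)]
               for _ in range(pop_size)]
    remember_best(pop, [objective(p) for p in pop])
    return best_point, best_value


def neg_sphere(x):
    """Toy objective with its maximum (zero) at the origin."""
    return -sum(v * v for v in x)


print(big_bang_big_crunch(neg_sphere, dim=2, bounds=(-5.0, 5.0)))
```

Viewed as an EDA, the generative model here is a single isotropic normal distribution whose mean is re-estimated each iteration and whose width follows a fixed schedule.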
Biogeography-Based Optimizer (BBO)
Simon, 2008, [54], $\sim$2000 citations
At each iteration, for each solution in the population, components are replaced by copying them from other solutions; the likelihood of this is proportional to the solution’s objective value, and the likelihood of choosing another solution as a source is proportional to its objective value. To promote diversity, a local move is carried out away from each resulting solution, with the probability of doing so inversely proportional to its objective value. This has similarities to a multi-parent GA.
Brain Storm Optimization (BSO)
Shi, 2011, [52], $>$300 citations
At each iteration, the population is clustered using k-means clustering, and the best solution in each cluster is identified. Each solution in the population is then considered for replacement by comparing its objective value against that of a new search point and then keeping the best. Most of the time, this new search point is generated by either a local move from an existing solution, or by recombining two existing solutions in a GA-like manner. In either case, selection of the existing solution(s) is biased towards the cluster bests. To diversify the population, there is also a mechanism to sample random solutions during this process.
Cat Swarm Optimization (CSO)
Chu et al., 2006, [7], $\sim$300 citations
At each iteration, each search process either carries out a local search, or moves in the direction of the best solution in the population. When carrying out a local search, a specified number of points are sampled in the vicinity of the current position and the best one is kept.
Charged System Search (CSS)
Kaveh and Talatahari, 2010, [25], $\sim$600 citations
All search processes carry out moves towards the current positions of other search processes. Search processes have velocities, and the speed with which a search process moves towards a given search process is calculated using an inverse-square law weighted by the objective value of that process’s current point of search. Adaptive parameter changes allow the degree of attraction to vary over time, and the best solutions within the population are always preserved. Note that this algorithm is similar to FA and GSA.
Chemical Reaction Optimization (CRO)
Lam and Li, 2010, [30], $>$300 citations
Search processes carry out either local search or a more disruptive global search using operators such as the GA recombination operator. The balance between local and global search, and the likelihood of accepting non-improving solutions, are both based on the history of the population member: if no improvement has been made for a while, global search replaces local search; if solutions with lower objective values were previously accepted, then they are less likely to be accepted in the future. This algorithm has similarities to memetic algorithms and simulated annealing.
Cuckoo Optimization Algorithm (COA)
Rajabioun, 2011, [46], $\sim$500 citations
At each iteration, new search points are sampled within a radius of each existing search point, and only the best points are kept. The resulting population is then clustered using k-means clustering, and the cluster with the highest mean objective value is identified. Search points in the other clusters are then moved towards the fittest cluster.
Cuckoo Search (CS)
Yang and Deb, 2009, [69], $\sim$3000 citations
Uses a small population of solutions. At each iteration, a search process with a relatively low objective value is restarted, either at a randomly sampled location, or by applying a ‘Lévy flight’ to another, randomly selected, solution. Lévy flights are a kind of random walk with step sizes generated from a heavy-tailed probability distribution.
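To show what a heavy-tailed step looks like in practice, the sketch below draws step sizes using Mantegna's method, a widely used approximation for sampling symmetric Lévy-stable steps. It is offered as one plausible way of generating such steps, not necessarily the procedure used in [69]; the function names, the stability index `beta = 1.5` and the `scale` factor are assumptions made for the example.

```python
import math
import random


def levy_sigma(beta=1.5):
    """Scale of the numerator in Mantegna's method for stability index beta."""
    num = math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
    den = math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2)
    return (num / den) ** (1 / beta)


def levy_step(rng, beta=1.5):
    """Draw one heavy-tailed step: mostly small, occasionally very large."""
    u = rng.gauss(0.0, levy_sigma(beta))
    v = rng.gauss(0.0, 1.0)
    while v == 0.0:  # guard against division by zero
        v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1 / beta)


def levy_move(point, rng, scale=0.01, beta=1.5):
    """Apply an independent scaled Lévy step to each component of a point."""
    return [x + scale * levy_step(rng, beta) for x in point]


rng = random.Random(1)
steps = [levy_step(rng) for _ in range(10000)]
# Steps this large are vanishingly rare under a unit normal distribution.
print(sum(abs(s) > 5 for s in steps), "of", len(steps), "steps exceed 5 in magnitude")
```

The occasional long jump is what distinguishes this from a Gaussian random walk: most restarts land near the chosen solution, while a few land far away.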
Firefly Algorithm (FA)
Yang, 2009, [66], $>$2000 citations
At each iteration, all search processes carry out moves towards the current positions of other search processes. The degree of movement towards each point is calculated using an inverse-square law weighted by its objective value, causing intensification of search towards points with higher objective values. Note that this algorithm is similar to CSS and GSA.
Firework Algorithm (FWA)
Tan and Zhu, 2010, [61], $>$300 citations
At each iteration, solutions are sampled in a neighbourhood around the best solutions in the population. Only improving moves are accepted. Neighbourhoods are sampled using a Gaussian distribution centred around the current point. The width of the distribution is inversely proportional to the objective value of the best solution, causing increased intensification as search progresses.
Flower Pollination Algorithm (FPA)
Yang, 2012, [68], $>$500 citations
At each iteration, search processes carry out moves either towards the best solution in the population or the current position of a randomly selected search process. In the former case, the step size is determined by sampling a Lévy distribution (see CS).
Fruit Fly Optimization Algorithm (FOA)
Pan, 2012, [41], $>$600 citations
All search processes carry out moves towards the best solution in the population. However, how this is achieved is unclear from the description.
Glowworm Swarm Optimization (GwSO)
Krishnanand and Ghose, 2005, [28], $>$600 citations
Each search process maintains a numerical value that summarises its recent search progress, increasing this when it finds an improving search point and decreasing it gradually when it makes no progress. At each iteration, each search process carries out moves towards another search process located within a hyperspherical region centred around its current point; a search process with a high search progress value is more likely to be chosen as a target, and the radius of this region shrinks when there are many search processes nearby. Citation count includes [29].
Gravitational Search Algorithm (GSA)
Rashedi et al., 2009, [48], $>$2500 citations
All search processes carry out moves towards the current positions of other search processes. Search processes have velocities, and the speed with which a search process moves towards a given search process is calculated using an inverse-square law weighted by the objective value of that process’s current point of search. Note that this algorithm is similar to CSS and FA.
Grey Wolf Optimizer (GWO)
Mirjalili et al., 2014, [40], $>$1000 citations
At each iteration, each search process carries out moves around the edges of a hypercube centred around a target search point. The target point is selected from a region bounded by the three current best search points within the population. Hypercubes become gradually smaller at each iteration in order to intensify search, and there is a random component in the update equation to inject diversity.
Group Search Optimizer (GSO)
He et al., 2009, [20], $>$500 citations
The search processes with the current best solutions carry out local moves, using a mathematical model of animal vision to delimit the region they explore at a particular time. The majority of the other search processes carry out moves towards the search processes with the current best solutions. The remaining search processes generate diversity within the population by carrying out random walks.
Harmony Search (HS)
Geem et al., 2001, [19], $\sim$4000 citations
At each iteration, a single new solution is created from a randomly selected existing solution. For each of its decision variables, a new value is chosen either at random or by copying and slightly modifying the value from another randomly selected solution. If the new solution has a higher objective value than the worst solution in the population, it replaces it. Note that this algorithm has been proven equivalent to a form of ES [62].
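The per-variable construction rule in this description can be sketched as below. This is a simplified illustration that assumes maximisation over a real-valued box; for brevity it draws every variable independently from a randomly chosen population member rather than starting from one base solution, and the names `copy_prob`, `adjust_prob` and `bandwidth`, along with their values, are hypothetical.

```python
import random


def harmony_search(objective, dim, bounds, memory_size=10, iterations=2000,
                   copy_prob=0.9, adjust_prob=0.3, bandwidth=0.1, seed=0):
    """Maximise `objective` over a box; an illustrative sketch only."""
    rng = random.Random(seed)
    lo, hi = bounds
    memory = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(memory_size)]
    values = [objective(s) for s in memory]
    for _ in range(iterations):
        new = []
        for d in range(dim):
            if rng.random() < copy_prob:
                # Copy this variable from a randomly selected existing solution...
                x = rng.choice(memory)[d]
                if rng.random() < adjust_prob:
                    # ...and sometimes modify it slightly.
                    x += rng.uniform(-bandwidth, bandwidth)
            else:
                # Otherwise choose the value at random.
                x = rng.uniform(lo, hi)
            new.append(min(hi, max(lo, x)))
        new_value = objective(new)
        # Replace the worst solution if the new one is better.
        worst = min(range(memory_size), key=lambda i: values[i])
        if new_value > values[worst]:
            memory[worst], values[worst] = new, new_value
    best = max(range(memory_size), key=lambda i: values[i])
    return memory[best], values[best]


def neg_sphere(x):
    """Toy objective with its maximum (zero) at the origin."""
    return -sum(v * v for v in x)


print(harmony_search(neg_sphere, dim=2, bounds=(-5.0, 5.0)))
```

Written this way, the resemblance to an ES is easy to see: one offspring per iteration, built by component-wise recombination plus a small random move, competing with the worst member of the population.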
Imperialist Competitive Algorithm (ICA)
Atashpaz-Gargari and Lucas, 2007, [3], $\sim$1500 citations
A population is randomly initialised and the best solutions are selected. For each of these solutions, a subpopulation is created with size proportional to its objective value and is filled randomly using the remaining search points within the population. At each iteration, the solutions in the subpopulation are moved towards the best solution within the subpopulation, with some noise added to inject diversity. Then, each subpopulation is given a value based mainly upon the objective value of its best solution, and solutions in subpopulations with low values are reallocated to subpopulations with high values. The algorithm terminates when there is a single non-empty subpopulation.
Invasive Weed Optimization (IWO)
Mehrabian and Lucas, 2006, [35], $>$750 citations
At each iteration, search processes sample a number of local moves from their current position. The number of moves is proportional to the relative objective value of their current position, and the size of moves decreases nonlinearly over time. Once the population size reaches an upper bound, search processes with relatively poor solutions are ended, and the solutions they generated are removed from the population.
Krill Herd (KH)
Gandomi and Alavi, 2012, [17], $>$600 citations
Search processes carry out moves towards the population best, their historical best, and the objective value-weighted average of the population. They also carry out moves towards or away from search processes within a given radius, based upon their objective value. Search processes also have a component of random movement. The weighting of components is time-dependent, with less random motion and more movement towards the population best as time proceeds. On top of this, GA-like operations are carried out within the population.
Marriage in Honey Bees Optimization (MBO)
Abbass, 2001, [1], $\sim$400 citations
At each iteration, a number of random walks are carried out, starting from the locations of the best solutions in the population. New solutions are created using an operator that recombines the existing (start of walk) solution with solutions encountered during the walk. The likelihood of this occurring at each step of the walk is based on objective value, and also reduces over the course of the walk. Move sizes progressively decrease during the walk. Local search is used to improve solutions at each iteration of the algorithm; the operator used for this is chosen probabilistically based on its past success rate.
Moth-Flame Optimization (MFO)
Mirjalili, 2015a, [37], $\sim$250 citations
Search processes carry out moves in a spiral path towards a target point. The target points are the historical best solutions of other search processes. Initially, all historical bests are used as targets, with the search process that has the highest current objective value moving towards the highest historical best, and the search process with the lowest current objective value moving towards the lowest historical best. Over time, fewer targets are followed.
Shuffled Frog Leaping Algorithm (SFLA)
Eusuff and Lansey, 2003, [14], $>$1000 citations
At each iteration, the population is split into subpopulations, each with a broad objective value spread. Each subpopulation is then repeatedly subsampled by objective value, and the worst solution in each subsample is moved towards the best solution in the subsample (or alternatively the population). In each case, if this does not lead to improvement, the solution is replaced by a random search point. After each subpopulation has been processed, the subpopulations are merged, and the procedure is repeated.
Society and Civilisation Algorithm (SCA)
Ray and Liew, 2003, [50], $>$300 citations
At each iteration, the population is clustered. In each cluster, the best solutions are selected. The remaining solutions in the cluster are then moved towards the selected solutions. A similar procedure is then carried out for the selected solutions from all clusters, with the worst solutions amongst these moved towards the best solutions. The algorithm also takes into account constraint satisfaction.
Teaching-Learning-Based Optimization (TLBO)
Rao et al., 2011, [47], $>$1000 citations
At each iteration, the mean position of the population is calculated and subtracted from the population’s best search point. Moves are then carried out by adding a fraction of the resulting vector to each population member (this is similar to differential evolution). Only improving moves are accepted. Each population member is then compared to another randomly selected member; if that member has a higher objective value, the population member is moved towards it; otherwise, it is moved away from it. Again, only improving moves are accepted.
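The two move types in this description can be sketched as follows. This is a simplified illustration that follows the wording above rather than the exact update equations in [47]: it assumes maximisation over a real-valued box, uses a uniformly random fraction in [0, 1) for every move, and omits any further scaling factors. The function and variable names are hypothetical.

```python
import random


def tlbo(objective, dim, bounds, pop_size=20, iterations=100, seed=0):
    """Maximise `objective` over a box; an illustrative sketch only."""
    rng = random.Random(seed)
    lo, hi = bounds
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    values = [objective(p) for p in pop]

    def try_move(i, candidate):
        # Only improving moves are accepted.
        candidate = [min(hi, max(lo, x)) for x in candidate]
        value = objective(candidate)
        if value > values[i]:
            pop[i], values[i] = candidate, value

    for _ in range(iterations):
        mean = [sum(p[d] for p in pop) / pop_size for d in range(dim)]
        leader = max(range(pop_size), key=lambda i: values[i])
        # Vector from the population mean to the population's best point.
        diff = [pop[leader][d] - mean[d] for d in range(dim)]
        for i in range(pop_size):
            # First move: add a random fraction of the difference vector.
            try_move(i, [pop[i][d] + rng.random() * diff[d] for d in range(dim)])
            # Second move: towards a better random member, away from a worse one.
            j = rng.choice([k for k in range(pop_size) if k != i])
            sign = 1.0 if values[j] > values[i] else -1.0
            try_move(i, [pop[i][d] + sign * rng.random() * (pop[j][d] - pop[i][d])
                         for d in range(dim)])
    leader = max(range(pop_size), key=lambda i: values[i])
    return pop[leader], values[leader]


def neg_sphere(x):
    """Toy objective with its maximum (zero) at the origin."""
    return -sum(v * v for v in x)


print(tlbo(neg_sphere, dim=2, bounds=(-5.0, 5.0)))
```

Both moves add a scaled difference between two points in the population to a third point, which is the sense in which the description compares the algorithm to differential evolution.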
Water Cycle Algorithm (WCA)
Shah-Hosseini, 2009, [51], $\sim$250 citations
At each iteration, population members with relatively high objective value (but not the population best) are moved closer to the population best by a random amount. The remaining population members are each moved closer to one of these relatively high objective value solutions by a random amount, with proportionally more of them moving towards the best solutions. Random local moves are also applied to maintain diversity.
Whale Optimization Algorithm (WOA)
Mirjalili and Lewis, 2016, [39], $\sim$250 citations
Each search process carries out moves in a hypercube around a target search point and iteratively moves towards this target either by shrinking the hypercube or through a spiral motion. Target choice is affected by a timedependent parameter; initially this causes random members of the population to be followed; later all search points follow the population best.
4 Commonalities
Recent nature-inspired metaheuristics have sometimes been criticised for a lack of novelty. Before discussing this in more detail, it is first useful to consider the meaning of the term metaheuristic. Many authors who develop nature-inspired algorithms use this term as a synonym for “optimisation algorithm”, but this is not the original meaning of the term, which is more akin to a generative model that can be used to guide the development of a particular algorithm. Sörensen et al. [56] address this disparity by distinguishing metaheuristic algorithms (i.e. particular implementations of a metaheuristic idea) from metaheuristic frameworks (i.e. the more general models from which these algorithms are derived). This distinction is important when talking about novelty, because whilst there is considerable scope for designing a novel metaheuristic algorithm, there is much less scope for developing a novel metaheuristic framework. For instance, you can create a novel metaheuristic algorithm by modifying the mutation operator used by a GA, or by hybridising a GA with an operator from PSO, but in both cases there is no novel metaheuristic framework being created. It is worth noting that hybridisation, in particular, introduces combinatorial scope for generating algorithms that are technically novel, yet which introduce no novel algorithmic features.
Whilst metaheuristic frameworks are a useful concept for narrowing the definition of novelty, it can also be useful to talk about recurring ideas that appear within multiple frameworks. For instance, EAs and PSO are probably good candidates for being called metaheuristic frameworks, but there are clearly common concepts that occur within both of these; for example, the way in which both techniques have mechanisms for exploring search points that are intermediate to existing ones. In a previous paper [34], an attempt was made to identify and describe some of these more general metaheuristic approaches; an abridged listing of these is reproduced in Table 1.
Table 1: General metaheuristic concepts (abridged from [34]).

| Concept | Description | Examples |
|---|---|---|
| Hill Climbing | Follow a sequence of local improvements to reach a locally optimal solution. | Steepest ascent, stochastic hill climbing |
| Accepting Negative Moves | Allow moves to worse solutions. | Threshold accepting, simulated annealing |
| Restarts | Restart the search process in a different region once it has converged at a local optimum. | Random-restart hill climbing, iterated local search |
| Adaptive Memory Programming | Use memory of past search experience to guide future search. | Tabu search, EAs, PSO |
| Population-Based Search | Multiple cooperating search processes that run in parallel. | EAs, PSO, scatter search |
| Intermediate Search | Explore the region between two or more previously visited search points. | Crossover, PSO, path relinking |
| Directional Search | Identify productive directions within the search space, and carry out moves accordingly. | Gradient ascent, CMA-ES, PSO |
| Variable Neighbourhood Search | Search different neighbourhoods around the location of a known local optimum. | PSO, variable neighbourhood search |
| Search Space Mapping | Construct a map to guide search processes that are traversing the search space. | ACO, guided local search, DIRECT |
Technically, almost all the algorithms described in the previous section meet the definition of a novel metaheuristic algorithm, since they differ from standard metaheuristic algorithms such as ESs, GAs and standard PSO. However, it is difficult to argue that any of them are novel metaheuristic frameworks, since most of them clearly borrow (or perhaps rediscover) concepts that are also central to conventional metaheuristic frameworks. Referring to the metaheuristic concepts listed in Table 1, all of the algorithms described in the previous section implement a combination of hill climbing, adaptive memory programming and population-based search, and this is also true of EAs and PSO. The majority also implement some form of intermediate search, most commonly using either a PSO-like operator that picks a point geometrically between two existing points or an EA-like crossover operator that recombines solution components. Those which use PSO-like operators also carry out directional search in a similar manner to PSO. Many of the algorithms use restarts (ABC, BFO, BeA, CS, SFLA), which are also commonly used in local search algorithms. Many also have strategies for accepting negative moves: some of these resemble simulated annealing (BA, CRO); however, the most common approach involves random walks (ALO, BFO, CS, GSO, KH, MBO), which might be considered a degenerate form of threshold acceptance, but is otherwise a relatively novel idea. Several algorithms use search trajectories that follow a spiral-like path around local optima (GWO, MFO, WOA), and this could be considered a form of variable neighbourhood search.
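The two most common forms of intermediate search mentioned above can be sketched as follows. This is a minimal illustration rather than code from any of the reviewed papers: the function names are hypothetical, and the geometric move assumes a single random coefficient, so the new point lies on the line segment between the two parents (PSO itself typically draws one coefficient per dimension).

```python
import random


def geometric_intermediate(x, y, rng):
    """PSO-like move: a random point on the line segment from x to y."""
    r = rng.random()
    return [xi + r * (yi - xi) for xi, yi in zip(x, y)]


def uniform_crossover(x, y, rng):
    """EA-like move: each component is copied from one parent at random."""
    return [xi if rng.random() < 0.5 else yi for xi, yi in zip(x, y)]
```

Both operators generate points in the region spanned by existing search points, which is why they fall under the same concept in Table 1 despite their different origins.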
In terms of resemblance to existing metaheuristic frameworks, a large proportion of the algorithms have a clear resemblance to PSO in that a population of search processes move towards each other using vector-based operations (ABC, BeA, BA, COA, CSO, CSS, FA, FOA, FPA, GSA, GSO, GWO, GwSO, KH, MFO, TLBO, WCA, WOA). A few algorithms might be considered variants of ES (BeA, HS, IWO), and a number of algorithms are broadly EA-like (BBO, BSO, COA, ICA, SFLA, SCA), with a number of these hybridising PSO-like operators (COA, ICA, SFLA, SCA). Some algorithms have notable degrees of self-similarity: for instance, CSS, FA and GSA all use inverse-square laws to calculate the attraction between search processes.
5 Commonalities with PSO
PSO has clearly been an influence on many of the nature-inspired algorithms reviewed in Section 3, and consequently it is important to dig down further to understand how ideas explored within this group of algorithms intersect with those explored in the PSO literature.
The majority of PSO-style algorithms listed in Section 3 have similar basic mechanics to PSO, in that search processes move towards other search processes using vector operations. A major difference is that the majority of these algorithms (all except KH and MFO) do not use historical bests, i.e. the best point of search seen by a particular search process. This means that search processes are influenced only by the current locations of both themselves and other search processes. The metaheuristic motivation for this is unclear, since it appears to reduce the amount of information available to guide search. Nevertheless, it should be noted that the idea of “social only” interactions (i.e. ignoring a search process’s own search experience) has been explored in PSO and in both [26] and [43] was found to have no significant effect upon the algorithm’s performance when applied to certain problems; however, this is not the same as not recording historical bests, since search processes are still influenced by the historical bests of other search processes.
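To make the role of historical bests concrete, the following sketch shows a textbook-style PSO update for a single search process. The function name and the coefficient values (`w`, `c1`, `c2`) are illustrative assumptions, not settings taken from the cited papers. Setting `c1=0` gives the "social only" variant discussed above: the process ignores its own historical best but is still pulled towards the historical best of its informant.

```python
import random


def pso_step(x, v, own_best, informant_best, rng, w=0.7, c1=1.5, c2=1.5):
    """Return (new_position, new_velocity) for one search process."""
    new_v = []
    for xi, vi, pi, gi in zip(x, v, own_best, informant_best):
        r1, r2 = rng.random(), rng.random()
        cognitive = c1 * r1 * (pi - xi)  # pull towards own historical best
        social = c2 * r2 * (gi - xi)     # pull towards informant's historical best
        new_v.append(w * vi + cognitive + social)
    new_x = [xi + vi for xi, vi in zip(x, new_v)]
    return new_x, new_v
```

An algorithm that records no historical bests at all would instead pass the current positions of other search processes as the attraction targets.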
Another major difference from standard PSO is that most of the algorithms have no direct analogue of velocity or momentum; rather, move sizes are determined using simpler rules, including time-dependent move sizes (ABC, ALO, BBBC, CSS, IWO, MBO), distance-dependent move sizes (CSS, FA, GSA) and region-based sampling (ALO, BA, BeA, BBBC, FWA, GWO, WOA). Time-dependent move sizes have also been explored in variants of PSO [53, 49]. Region-based sampling involves directly sampling from a region of search space that is shaped or bounded by one or more search points, rather than applying vector-based operations. This approach was used earlier in Bare Bones PSO [27], where it was introduced as a means of simplifying the dynamics of PSO and making it more tractable for analysis. Distance-dependent move sizes are notable: usually in PSO, search processes move faster towards informants that are further away, meaning that move size increases with distance. In CSS, FA and GSA, on the other hand, the search processes are less influenced by distant search processes, so move size reduces with distance. This causes interactions between search processes to become geographically localised, which could be useful for multimodal landscapes; however, it is unclear whether the resulting behaviour is more effective than other mechanisms introduced to PSO to handle these kinds of landscapes, such as multi-swarm approaches [4].
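The contrast between these move-size rules can be illustrated with a small sketch. The decay formula below, with attraction weight 1 / (1 + gamma * d^2), is a generic stand-in chosen for illustration and is not the exact rule of CSS, FA or GSA; `gamma` and all function names are assumptions. The sampling function follows the usual description of Bare Bones PSO, in which each coordinate is drawn from a Gaussian defined by two reference points.

```python
import random


def pso_like_step_length(distance, r):
    """PSO-style pull: step length grows in proportion to distance."""
    return r * distance


def decayed_step_length(distance, gamma=1.0):
    """Illustrative distance-decayed pull: weight 1 / (1 + gamma * d^2) times d."""
    return distance / (1.0 + gamma * distance ** 2)


def bare_bones_sample(own_best, informant_best, rng):
    """Region-based sampling in the style of Bare Bones PSO.

    Each coordinate is drawn from a Gaussian centred midway between the two
    points, with standard deviation equal to their separation in that coordinate.
    """
    return [rng.gauss((p + g) / 2.0, abs(p - g))
            for p, g in zip(own_best, informant_best)]
```

With the decayed rule, the step length peaks at a distance of 1 / sqrt(gamma) and shrinks beyond it, so far-away search processes exert little pull and interactions become localised.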
A consequence of using simpler update rules is that the dynamics of many of these algorithms are much simpler than in standard PSO. A benefit of this is that it potentially makes their behaviour easier to understand. However, by removing exploratory dynamics like overshooting and oscillation, there is a danger that they will only explore the regions between existing search points and suffer premature convergence as a result. To address this, most include one or more mechanisms to promote diversification. These include hybridisation with local search (CSO, CRO, COA, FWA, IWO, MBO, WCA), random restarts (ABC, BFO, BeA, CS, SFLA), random walks (ALO, BFO, CS, GSO, KH, MBO) and spiral-like movements (GWO, MFO, WOA). The latter, in particular, may lead to search trajectories that resemble those seen in PSO (and it should be noted that a similar approach is used in spiral optimisation [60]). Hybridisation with local search is also fairly common in PSO, e.g. [6], where restarts have also been used [24]. Random walks are arguably one of the more interesting mechanisms explored in recent nature-inspired optimisation algorithms, particularly those that build upon biological knowledge in this area, e.g. CS and BFO, and there is no real analogue in the PSO literature.
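As an illustration of a biologically motivated random walk, the sketch below draws heavy-tailed step lengths of the kind associated with the Lévy flights used in CS. It uses Mantegna's method, which is one common way of generating such steps and is an assumption here rather than something specified in this review; the exponent `beta`, the `scale` factor and the function names are likewise illustrative.

```python
import math
import random


def mantegna_sigma(beta):
    """Scale of the numerator Gaussian in Mantegna's method."""
    num = math.gamma(1.0 + beta) * math.sin(math.pi * beta / 2.0)
    den = math.gamma((1.0 + beta) / 2.0) * beta * 2.0 ** ((beta - 1.0) / 2.0)
    return (num / den) ** (1.0 / beta)


def levy_step(rng, beta=1.5):
    """Draw one heavy-tailed step length (mostly small, occasionally large)."""
    u = rng.gauss(0.0, mantegna_sigma(beta))
    v = rng.gauss(0.0, 1.0)
    while v == 0.0:  # guard against division by zero
        v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1.0 / beta)


def random_walk_move(x, rng, scale=0.01, beta=1.5):
    """Perturb each coordinate of x by an independent scaled Levy step."""
    return [xi + scale * levy_step(rng, beta) for xi in x]
```

Because most steps are small but a few are very large, such a walk mixes local refinement with occasional long jumps, which is how it promotes diversification.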
When carrying out moves towards other search processes, the manner of choosing target search points varies widely amongst the algorithms in the list. Some (BA, CSO, FOA) only use the population best, relying on other mechanisms (e.g. restarts) to maintain diversity. Several algorithms (GwSO, WCA, GWO, COA) choose targets in a fitness-informed manner, either probabilistically, by selecting the top n solutions in the population, or in the case of COA, by clustering and picking the cluster with the highest mean fitness. These approaches are somewhat related to variants of PSO that use dynamic allocation of informants, e.g. [10]. A number of algorithms have mechanisms that cause particles to be more influenced by nearby search processes. This includes those that relate move size to distance (see above). It also includes SCA, which dynamically clusters the population based on distance. Distance-based selection of informants has also been used in PSO, e.g. [31]. A number of algorithms use all other search processes as targets, either directly (CSS, FA, GSA), or indirectly by summarising information about them (KH, BBBC). Similar ideas have been investigated in variants of PSO, such as fully-informed PSO [36] and quantum PSO [58]. Some algorithms use time-varying rules for choosing targets, notably those that move from randomly chosen targets towards the population best over time (MFO, WOA). The idea of dynamically varying the number of informants over time has also been explored in the PSO literature [57].
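Three of the target-selection rules described above can be sketched in one function. This is a generic illustration and does not reproduce the selection rule of any specific algorithm; the strategy names, the `progress` parameter (the fraction of the run completed) and the assumption that higher fitness is better are all choices made for the example.

```python
import random


def choose_target(population, fitness, rng, strategy="best", n=3, progress=0.0):
    """Pick the search point that another search process will move towards."""
    # Indices ordered from highest to lowest fitness.
    ranked = sorted(range(len(population)), key=lambda i: fitness[i], reverse=True)
    if strategy == "best":
        # Always follow the population best.
        return population[ranked[0]]
    if strategy == "top_n":
        # Fitness-informed: pick uniformly among the n best solutions.
        return population[rng.choice(ranked[:n])]
    if strategy == "time_varying":
        # Random targets early in the run, population best late in the run.
        if rng.random() < progress:
            return population[ranked[0]]
        return population[rng.choice(ranked)]
    raise ValueError("unknown strategy: " + strategy)
```

Rules that use all other search processes as targets would instead loop over the whole population and combine the resulting moves.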
6 Discussion
Are recent nature-inspired algorithms novel? Yes and no. On the one hand, most (but certainly not all) of the algorithms reviewed in this paper are distinct from existing optimisation algorithms, and given a particular search space, they would likely follow different trajectories to existing algorithms. On the other hand, many of these algorithms use variants of well-established metaheuristic concepts that are also found in existing metaheuristic frameworks such as PSO, EAs and local search. Furthermore, the analysis of PSO-style algorithms shows that many of their underlying ideas have also been explored by the more mainstream PSO community. However, chronologically, this hasn’t always been in one direction. Sometimes the PSO community has explored these ideas earlier, sometimes later, and sometimes in parallel to recent nature-inspired algorithms. Either way, it shows how the fragmentation of the nature-inspired computing community has led to duplicated effort.
Are recent nature-inspired algorithms competitive? This is less clear. Most of the cited papers include a performance evaluation. The results are not reported here, because almost all show the algorithm to perform better than the algorithms they were compared against. Even without taking No Free Lunch theorems [64] into account, it is implausible to believe that this is true for all of them. This is not to say that the results are incorrect, but it does reflect the difficulty of designing fair comparative studies [45, 16, 9, 18]. We can speculate that all of these algorithms will sometimes perform better on some problems when compared against other algorithms, since problem landscapes are diverse, and small differences in the topography of a landscape can favour different approaches.
However, given a specific problem, it is difficult to know which algorithm will work well. The field of metalearning [32] has been studying this issue for some time, but progress on understanding how problems can be characterised, categorised and mapped to specific optimisers has so far been limited. This means that performance on one problem currently tells us little about potential performance on another problem, and consequently that practitioners usually have to try out a range of different optimisers to determine which one works well on their problem. In a sense, the recent developments in nature-inspired algorithms have increased the number of optimisers available to try out. This may sometimes be beneficial, but it also makes it harder for a practitioner to identify a suitable optimiser that is well-understood and has community support. Given the vast scope for creating variants and hybrids of existing algorithms, this situation is only likely to get worse.
An alternative, and arguably more promising, direction of travel can be seen in the hyper-heuristics [5, 12] and broader machine learning communities [33, 63]. Both treat the problem of choosing an optimiser as an optimisation problem in its own right, using a machine learning algorithm to identify an optimiser that is good at solving a specific task. In the case of hyper-heuristics, the optimiser, which is usually an evolutionary algorithm, can be used to construct new optimisation algorithms. This can be done either by specialising an existing algorithm (for example, evolving a new mutation operator for an EA) or by assembling existing metaheuristic components in a novel way. In effect, the latter is an automated version of the many manual attempts to hybridise metaheuristics that can be found in the literature. However, this automated approach is currently limited by a lack of standardised interfaces [59], which in turn arguably stems from the tendency of the community to think of metaheuristics in terms of algorithms rather than reusable components.
This focus on algorithms rather than components is a particular issue for the nature-inspired algorithm community, where the objective of domain modelling is almost always the generation of a single algorithm that captures all pertinent behaviours present within the domain of inspiration. As a consequence, any interesting, novel, components extracted from the domain tend to become conflated with other, less interesting, and sometimes arbitrary, components. This makes it hard to understand the relevance and contribution of individual components within the optimisation setting. Arguably a better approach would be to identify any component of the domain that is particularly interesting, and integrate this individually within one or more existing metaheuristic frameworks. Even better would be to make the code available in reusable form: it could then be used by other algorithm developers, or even used as a new building block within hyper-heuristic frameworks.
An important barrier that stands in the way of this kind of integration is the success of previous authors who have not followed this path. This can be seen in the large citation counts amongst recent nature-inspired optimisation papers, and the initial career boost that this may provide to their authors. It is perhaps less apparent that association with a part of the field that is seen as less rigorous may result in a career penalty in the long run. Many of the citations to these papers come from researchers who work in applied optimisation. This brings up another important factor in the success of nature-inspired optimisers, the false assumption that new means better, which leads to inexperienced practitioners using the most recent, rather than the most appropriate, metaheuristic to solve a particular optimisation problem. This is a difficult problem to address, because it is arguably caused by sociological rather than technological factors, and (due to the wide reach of optimisation) spans a broad range of academic communities. Nevertheless, efforts such as this one to tie together the loose ends of the community may contribute towards a solution.
7 Conclusions
Numerous papers describing new nature-inspired optimisation algorithms have been written over the past 20 years. Unfortunately, it has become common practice for these papers to describe algorithms using non-standard terminology derived from their domain of inspiration, resulting in papers that are often very difficult to read and understand. In this paper, an attempt has been made to describe the most widely cited of these algorithms using standard metaheuristic terminology. It is hoped that the resulting descriptions will make it easier for readers to gain a quick understanding of how these algorithms work, without having to read the original papers. As a result, this should make it more straightforward for metaheuristics practitioners to read, review and understand work that uses these algorithms.
This paper also makes an attempt to analyse the commonalities between algorithms. The existing literature has raised particular cases where there is a strong similarity between different nature-inspired optimisation algorithms, but opaque terminology makes it hard to recognise these similarities in general. The standardised descriptions in this paper make this process easier, and this has been demonstrated by relating each of the algorithms to existing metaheuristic concepts. The resulting analysis suggests that few of the algorithms introduced in the last 20 years introduce fundamentally new concepts; rather, they mostly reassemble existing concepts in new ways. Since many of the algorithms are swarm-like, a closer look was taken at their commonalities with particle swarm optimisation and its variants. This revealed few points of absolute novelty, suggesting that the two communities have largely been following the same tracks. Perhaps surprisingly, it was noted that particle swarm optimisation did not always get there first.
This paper also emphasises the need to bring together the different threads of the metaheuristic community, with the aim of reducing redundancy, making research results more accessible, and developing new approaches that integrate the diverse work that is being done. Some important work has already started in this area, including, for example, efforts to standardise interfaces between metaheuristic components. However, arguably a lot more effort is required if we are to reduce the fragmentation of the field and leverage the diverse talents of the metaheuristics community in useful ways.
References
 Abbass [2001] Hussein A Abbass. MBO: Marriage in honey bees optimization—a haplometrosis polygynous swarming approach. In Proceedings of the 2001 Congress on Evolutionary Computation (CEC 2001), volume 1, pages 207–214. IEEE, 2001.
 Aranha and Campelo [2019] Claus Aranha and Felipe Campelo. Evolutionary computation bestiary, 2019. URL https://github.com/fcampelo/ECBestiary. [Online; accessed 9-October-2019].
 Atashpaz-Gargari and Lucas [2007] Esmaeil Atashpaz-Gargari and Caro Lucas. Imperialist competitive algorithm: an algorithm for optimization inspired by imperialistic competition. In Proceedings of the 2007 Congress on Evolutionary Computation (CEC 2007), pages 4661–4667. IEEE, 2007.
 Blackwell and Branke [2004] Tim Blackwell and Jürgen Branke. Multi-swarm optimization in dynamic environments. In Workshops on Applications of Evolutionary Computation, pages 489–500. Springer, 2004.
 Burke et al. [2013] Edmund K Burke, Michel Gendreau, Matthew Hyde, Graham Kendall, Gabriela Ochoa, Ender Özcan, and Rong Qu. Hyper-heuristics: A survey of the state of the art. Journal of the Operational Research Society, 64(12):1695–1724, 2013.
 Chen et al. [2005] Junying Chen, Zheng Qin, Yu Liu, and Jiang Lu. Particle swarm optimization with local search. In International Conference on Neural Networks and Brain (ICNN&B’05), volume 1, pages 481–484. IEEE, 2005.
 Chu et al. [2006] Shu-Chuan Chu, Pei-Wei Tsai, and Jeng-Shyang Pan. Cat swarm optimization. In Pacific Rim International Conference on Artificial Intelligence, pages 854–858. Springer, 2006.
 Črepinšek et al. [2012] Matej Črepinšek, Shih-Hsi Liu, and Luka Mernik. A note on teaching–learning-based optimization algorithm. Information Sciences, 212:79–93, 2012.
 Črepinšek et al. [2016] Matej Črepinšek, Shih-Hsi Liu, Luka Mernik, and Marjan Mernik. Is a comparison of results meaningful from the inexact replications of computational experiments? Soft Computing, 20(1):223–235, 2016.
 Du et al. [2015] Wen-Bo Du, Yang Gao, Chen Liu, Zheng Zheng, and Zhen Wang. Adequate is better: particle swarm optimization with limited-information. Applied Mathematics and Computation, 268:832–838, 2015.
 Eberhart and Kennedy [1995] Russell Eberhart and James Kennedy. Particle swarm optimization. In Proceedings of the IEEE International Conference on Neural Networks, volume 4, pages 1942–1948, 1995.
 Epitropakis and Burke [2018] Michael G Epitropakis and Edmund K Burke. Hyper-heuristics. Handbook of Heuristics, pages 1–57, 2018.
 Erol and Eksin [2006] Osman K Erol and Ibrahim Eksin. A new optimization method: big bang–big crunch. Advances in Engineering Software, 37(2):106–111, 2006.
 Eusuff and Lansey [2003] Muzaffar M Eusuff and Kevin E Lansey. Optimization of water distribution network design using the shuffled frog leaping algorithm. Journal of Water Resources Planning and Management, 129(3):210–225, 2003.
 Fister Jr et al. [2013] Iztok Fister Jr, Xin-She Yang, Iztok Fister, Janez Brest, and Dušan Fister. A brief review of nature-inspired algorithms for optimization. Elektrotehniški vestnik, 80(3), 2013.
 Fong et al. [2016] Simon Fong, Xi Wang, Qiwen Xu, Raymond Wong, Jinan Fiaidhi, and Sabah Mohammed. Recent advances in metaheuristic algorithms: Does the Makara dragon exist? The Journal of Supercomputing, 72(10):3764–3786, 2016.
 Gandomi and Alavi [2012] Amir Hossein Gandomi and Amir Hossein Alavi. Krill herd: a new bio-inspired optimization algorithm. Communications in Nonlinear Science and Numerical Simulation, 17(12):4831–4845, 2012.
 García-Martínez et al. [2017] Carlos García-Martínez, Pablo D Gutiérrez, Daniel Molina, Manuel Lozano, and Francisco Herrera. Since CEC 2005 competition on real-parameter optimisation: a decade of research, progress and comparative analysis’s weakness. Soft Computing, 21(19):5573–5583, 2017.
 Geem et al. [2001] Zong Woo Geem, Joong Hoon Kim, and Gobichettipalayam Vasudevan Loganathan. A new heuristic optimization algorithm: harmony search. Simulation, 76(2):60–68, 2001.
 He et al. [2009] Shan He, Q Henry Wu, and JR Saunders. Group search optimizer: an optimization algorithm inspired by animal searching behavior. IEEE Transactions on Evolutionary Computation, 13(5):973–990, 2009.
 Holland [1975] John H. Holland. Adaptation in Natural and Artificial Systems. University of Michigan Press, Ann Arbor, MI, 1975.
 Joyce and Herrmann [2018] Thomas Joyce and J Michael Herrmann. A review of no free lunch theorems, and their implications for metaheuristic optimisation. In Nature-Inspired Algorithms and Applied Optimization, pages 27–51. Springer, 2018.
 Karaboga [2005] Dervis Karaboga. An idea based on honey bee swarm for numerical optimization. Technical Report TR06, Erciyes University, Engineering Faculty, Computer Engineering Department, 2005.
 Kaucic [2013] Massimiliano Kaucic. A multi-start opposition-based particle swarm optimization algorithm with adaptive velocity for bound constrained global optimization. Journal of Global Optimization, 55(1):165–188, 2013.
 Kaveh and Talatahari [2010] A Kaveh and S Talatahari. A novel heuristic optimization method: charged system search. Acta Mechanica, 213(3–4):267–289, 2010.
 Kennedy [1997] James Kennedy. The particle swarm: social adaptation of knowledge. In IEEE International Conference on Evolutionary Computation, pages 303–308. IEEE, 1997.
 Kennedy [2003] James Kennedy. Bare bones particle swarms. In Proceedings of the 2003 IEEE Swarm Intelligence Symposium (SIS’03), pages 80–87. IEEE, 2003.
 Krishnanand and Ghose [2005] KN Krishnanand and Debasish Ghose. Detection of multiple source locations using a glowworm metaphor with applications to collective robotics. In Proceedings 2005 IEEE Swarm Intelligence Symposium (SIS 2005), pages 84–91. IEEE, 2005.
 Krishnanand and Ghose [2009] KN Krishnanand and Debasish Ghose. Glowworm swarm optimization for simultaneous capture of multiple local optima of multimodal functions. Swarm Intelligence, 3(2):87–124, 2009.
 Lam and Li [2010] Albert YS Lam and Victor OK Li. Chemical-reaction-inspired metaheuristic for optimization. IEEE Transactions on Evolutionary Computation, 14(3):381–399, 2010.
 Lane et al. [2008] James Lane, Andries Engelbrecht, and James Gain. Particle swarm optimization with spatially meaningful neighbours. In Proceedings 2008 IEEE Swarm Intelligence Symposium (SIS 2008), pages 1–8. IEEE, 2008.
 Lemke et al. [2015] Christiane Lemke, Marcin Budka, and Bogdan Gabrys. Metalearning: a survey of trends and technologies. Artificial Intelligence Review, 44(1):117–130, 2015.
 Li and Malik [2017] Ke Li and Jitendra Malik. Learning to optimize. In 5th International Conference on Learning Representations, 2017.
 Lones [2014] Michael A Lones. Metaheuristics in nature-inspired algorithms. In Proceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation, pages 1419–1422. ACM, 2014.
 Mehrabian and Lucas [2006] Ali Reza Mehrabian and Caro Lucas. A novel numerical optimization algorithm inspired from weed colonization. Ecological Informatics, 1(4):355–366, 2006.
 Mendes et al. [2004] Rui Mendes, James Kennedy, and José Neves. The fully informed particle swarm: simpler, maybe better. IEEE Transactions on Evolutionary Computation, 8(3):204–210, 2004.
 Mirjalili [2015a] Seyedali Mirjalili. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowledge-Based Systems, 89:228–249, 2015a.
 Mirjalili [2015b] Seyedali Mirjalili. The ant lion optimizer. Advances in Engineering Software, 83:80–98, 2015b.
 Mirjalili and Lewis [2016] Seyedali Mirjalili and Andrew Lewis. The whale optimization algorithm. Advances in Engineering Software, 95:51–67, 2016.
 Mirjalili et al. [2014] Seyedali Mirjalili, Seyed Mohammad Mirjalili, and Andrew Lewis. Grey wolf optimizer. Advances in Engineering Software, 69:46–61, 2014.
 Pan [2012] Wen-Tsao Pan. A new fruit fly optimization algorithm: taking the financial distress model as an example. Knowledge-Based Systems, 26:69–74, 2012.
 Passino [2002] Kevin M Passino. Biomimicry of bacterial foraging for distributed optimization and control. IEEE Control Systems, 22(3):52–67, 2002.
 Pedersen and Chipperfield [2010] Magnus Erik Hvass Pedersen and Andrew J Chipperfield. Simplifying particle swarm optimization. Applied Soft Computing, 10(2):618–628, 2010.
 Pham et al. [2006] Duc Truong Pham, Afshin Ghanbarzadeh, Ebubekir Koç, Sameh Otri, S Rahim, and Muhamad Zaidi. The bees algorithm—a novel tool for complex optimisation problems. In Intelligent Production Machines and Systems, pages 454–459. Elsevier, 2006.
 Piotrowski [2015] Adam P Piotrowski. Regarding the rankings of optimization heuristics based on artificially-constructed benchmark functions. Information Sciences, 297:191–201, 2015.
 Rajabioun [2011] Ramin Rajabioun. Cuckoo optimization algorithm. Applied Soft Computing, 11(8):5508–5518, 2011.
 Rao et al. [2011] Ravipudi V Rao, Vimal J Savsani, and DP Vakharia. Teaching–learning-based optimization: a novel method for constrained mechanical design optimization problems. Computer-Aided Design, 43(3):303–315, 2011.
 Rashedi et al. [2009] Esmat Rashedi, Hossein Nezamabadi-Pour, and Saeid Saryazdi. GSA: a gravitational search algorithm. Information Sciences, 179(13):2232–2248, 2009.
 Ratnaweera et al. [2004] Asanga Ratnaweera, Saman K Halgamuge, and Harry C Watson. Self-organizing hierarchical particle swarm optimizer with time-varying acceleration coefficients. IEEE Transactions on Evolutionary Computation, 8(3):240–255, 2004.
 Ray and Liew [2003] Tapabrata Ray and Kim Meow Liew. Society and civilization: An optimization algorithm based on the simulation of social behavior. IEEE Transactions on Evolutionary Computation, 7(4):386–396, 2003.
 Shah-Hosseini [2009] Hamed Shah-Hosseini. The intelligent water drops algorithm: a nature-inspired swarm-based optimization algorithm. International Journal of Bio-Inspired Computation, 1(1–2):71–79, 2009.
 Shi [2011] Yuhui Shi. Brain storm optimization algorithm. In International Conference in Swarm Intelligence, pages 303–309. Springer, 2011.
 Shi and Eberhart [1999] Yuhui Shi and Russell C Eberhart. Empirical study of particle swarm optimization. In Proceedings of the 1999 Congress on Evolutionary Computation (CEC 99), volume 3, pages 1945–1950. IEEE, 1999.
 Simon [2008] Dan Simon. Biogeography-based optimization. IEEE Transactions on Evolutionary Computation, 12(6):702–713, 2008.
 Sörensen [2015] Kenneth Sörensen. Metaheuristics—the metaphor exposed. International Transactions in Operational Research, 22(1):3–18, 2015.
 Sörensen et al. [2018] Kenneth Sörensen, Marc Sevaux, and Fred Glover. A history of metaheuristics. Handbook of Heuristics, pages 1–18, 2018.
 Suganthan [1999] Ponnuthurai N Suganthan. Particle swarm optimiser with neighbourhood operator. In Proceedings of the 1999 Congress on Evolutionary Computation (CEC 99), volume 3, pages 1958–1962. IEEE, 1999.
 Sun et al. [2004] Jun Sun, Wenbo Xu, and Bin Feng. A global search strategy of quantum-behaved particle swarm optimization. In IEEE Conference on Cybernetics and Intelligent Systems, 2004, volume 1, pages 111–116. IEEE, 2004.
 Swann and Hammond [2015] J. Swann and K. Hammond. Towards ‘metaheuristics in the large’. In Proceedings of 11th Metaheuristics International Conference (MIC 2015), 2015.
 Tamura and Yasuda [2011] Kenichi Tamura and Keiichiro Yasuda. Primary study of spiral dynamics inspired optimization. IEEJ Transactions on Electrical and Electronic Engineering, 6(S1), 2011.
 Tan and Zhu [2010] Ying Tan and Yuanchun Zhu. Fireworks algorithm for optimization. In International Conference in Swarm Intelligence, pages 355–364. Springer, 2010.
 Weyland [2015] Dennis Weyland. A critical analysis of the harmony search algorithm—how not to solve sudoku. Operations Research Perspectives, 2:97–105, 2015.
 Wichrowska et al. [2017] Olga Wichrowska, Niru Maheswaranathan, Matthew W. Hoffman, Misha Denil, Sergio Gómez Colmenarejo, Nando de Freitas, and Jascha Sohl-Dickstein. Learned optimizers that scale and generalize. In Proceedings of the 34th International Conference on Machine Learning, volume 70, 2017.
 Wolpert and Macready [1997] David H Wolpert and William G Macready. No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation, 1(1):67–82, 1997.
 Xing and Gao [2016] Bo Xing and Wen-Jing Gao. Innovative computational intelligence: a rough guide to 134 clever algorithms. Springer, 2016.
 Yang [2009] Xin-She Yang. Firefly algorithms for multimodal optimization. In International Symposium on Stochastic Algorithms, pages 169–178. Springer, 2009.
 Yang [2010] Xin-She Yang. Nature-inspired metaheuristic algorithms. Luniver Press, 2010.
 Yang [2012] Xin-She Yang. Flower pollination algorithm for global optimization. In International Conference on Unconventional Computing and Natural Computation, pages 240–249. Springer, 2012.
 Yang and Deb [2009] Xin-She Yang and Suash Deb. Cuckoo search via Lévy flights. In World Congress on Nature & Biologically Inspired Computing (NaBIC 2009), pages 210–214. IEEE, 2009.
 Yang and Gandomi [2012] Xin-She Yang and Amir Hossein Gandomi. Bat algorithm: a novel approach for global engineering optimization. Engineering Computations, 29(5):464–483, 2012.