.. AMPGO documentation master file, created by
   sphinx-quickstart on Wed Feb 06 20:51:59 2013.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

.. include:: headings.inc


|infinity| Global Optimization Benchmarks
=========================================

This set of HTML pages pretty much supersedes - and greatly expands - the previous analysis that you can find at this location: http://infinity77.net/global_optimization/ .

As far as I can see, no one has ever attempted a benchmark so comprehensive and multi-faceted, although hundreds of papers/websites/tools exist to carry out smaller, simpler benchmarks on Global Optimization algorithms. The approach I have taken this time is to select as many benchmark test suites as possible: most of them are characterized by test function *generators*, from which we can create an almost unlimited number of unique test problems.


|introduction| Introduction
---------------------------

These HTML pages contain a series of benchmarks to test a number of numerical Global Optimization algorithms; the algorithms are applied to multi-modal/difficult multi-dimensional test functions. Some of these benchmark test functions are taken from the literature, but quite a few of them are created with the help of test function generators.

I have attempted a broad categorization of the benchmarks in :ref:`The Benchmarks`. As a very high level comment, I can anticipate that the whole suite is made up of **6,825** test problems divided across **16** different test suites: most of these problems are of low dimensionality (2 to 6 variables), with a few benchmarks extending to 10+ variables.


|motivation| Motivation
-----------------------

This effort stems from the fact that I got fed up with the current attitude of most mathematicians/numerical optimization experts, who tend to demonstrate the advantages of an algorithm based on "elapsed time" or "CPU time" or similar meaningless performance indicators.

I deal with real-life optimization problems, as most of us do. Most real-life optimization problems require intensive and time-consuming simulations for every **function evaluation**; the time spent by the solver itself doing its calculations simply disappears in front of the real process simulation. I know it because our simulations take between 2 and 48 hours to run, so what's 300 seconds more or less in the solver calculations?


.. _the rules:

|rules| The Rules
-----------------

In order to be eligible for the competition, a Global Optimization algorithm **must** satisfy the following constraints:

1. It should run on my PCs, equipped with Windows 10 64 bit and Python 2.7 64 bit. This means that the algorithm should be coded in pure Python, or the developer should provide a Windows installer, or it should be easy to compile using standard Python tools plus Visual Studio/MinGW64, Intel Fortran/gfortran and `f2py `_. See the :ref:`The Great Excluded` section for a few examples of algorithms written by people living in their dream world.

2. No unconstrained optimization routines allowed: I mostly care about box-bounded problems, so the algorithm should support bounds on the independent variables as a minimum condition for eligibility.

The test suite is executed in the following manner:

1. Every optimization algorithm is fed with all the test functions:

   a. For the :ref:`SciPy Extended` benchmark, 100 different random starting points are used.
      For any test function, the starting point is the same for all the algorithms. Please see the description of the :ref:`SciPy Extended` benchmark, as it has been heavily modified compared to the one in the SciPy distribution. Benchmark repeatability is enforced through the use of the same random seed every time the test suite is run.

   b. For the :ref:`NIST` benchmark, solvers that accept an initial starting point are fed with starting point number 1, as suggested by `NIST `_. This is not exactly 100% fair to solvers that do not accept an initial guess (DE, MCS, LeapFrog, PSWARM, SHGO), but life is tough.

   c. For all other benchmark suites, only one initial starting point is provided, randomly generated in the function domain. For any test function, the starting point is the same for all the algorithms. This is not exactly 100% fair to solvers that are forced to use an initial guess (AMPGO, BasinHopping, BiteOpt, CMA-ES, CRS2, DIRECT, DualAnnealing, SCE), but life is tough. Benchmark repeatability is enforced through the use of the same random seed every time the test suite is run.

   Please see the :ref:`The Benchmarks` section and all the subsequent pages for a more in-depth explanation of the various benchmarks and the results.

2. No tuning of parameters is allowed: all the algorithms have been run with their default settings, irrespective of the dimensionality of the problem, the starting point or any other consideration.

3. I am now less fixated on the maximum number of function evaluations, but all benchmark suites have been run repeatedly, increasing the available budget of function evaluations each time, with 100, 200, 300, ..., 1900, 2000, 5000, 10000 as hard limits. During each of those runs, if the assigned limit is exceeded, the test is considered "failed". Otherwise the number of function evaluations reported by the algorithm is recorded for later statistical analysis.

4. All the test functions in almost all benchmark suites have known global optimum values: in the few cases where the global optimum value is unknown (or I was unable to calculate it analytically), its value has been estimated by enormous grid searches plus repeated minimizations. I then considered an algorithm to be successful if:

   .. math::

      \lvert F_{known \hspace{3pt} minimum} - F_{algorithm \hspace{3pt} minimum} \rvert \leq 10^{-6}

   while still respecting the limit on the number of function evaluations described at point (3) above (a minimal sketch of this scoring check is shown further down this page).

   All the other checks for tolerances - be they on the relative change of the objective function from one iteration to the next, or on too small variations of the variables in the function domain - have been disabled or mercilessly and brutally commented out in the code for each solver.


.. _the great excluded:

|excluded| The Great Excluded
-----------------------------

This section presents a (possibly incomplete) list of algorithms/libraries that have been excluded from the benchmark competition, as they do not comply with one or more of the rules spelled out in the :ref:`The Rules` section.

1. **ALGENCAN**: praised by many for its robustness and convergence qualities, you can get a copy of the source code from here:

   http://www.ime.usp.br/~egbirgin/tango/codes.php

   Unfortunately, no Windows installer is available and the code is not easily compilable on Windows. Fail.

2. **PyGMO**: based on the C++ code of PaGMO, it contains a lot of different optimizers and seems very promising:

   http://pagmo.sourceforge.net/pygmo/index.html

   Unfortunately, the Windows version is 32 bit-only, and compilation seems to be a `very problematic matter `_. Fail.
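Before moving on to the algorithms themselves, and to make the scoring procedure of :ref:`the rules` a bit more concrete, here is a minimal, simplified sketch of how a single benchmark run could be scored. The test function (SciPy's Rosenbrock), the bounds, the known minimum, the seed value and the evaluation budget are just placeholders for illustration; the real harness obviously loops over all solvers, test suites and budgets, and disables the solvers' internal tolerance-based stopping criteria.

.. code-block:: python

    # Illustrative sketch only: score one run of one solver on one test function.
    # The function, bounds, known minimum, seed and budget are placeholders.
    from scipy.optimize import differential_evolution, rosen

    TOLERANCE = 1.0e-6    # |F_known_minimum - F_algorithm_minimum| threshold
    MAX_NFEV = 2000       # one of the hard limits (100, 200, ..., 5000, 10000)

    bounds = [(-5.0, 5.0), (-5.0, 5.0)]   # box bounds only, as per the rules
    known_minimum = 0.0                   # global optimum of the 2D Rosenbrock

    # Default settings, fixed seed for repeatability.
    result = differential_evolution(rosen, bounds, seed=123456)

    success = (result.nfev <= MAX_NFEV and
               abs(known_minimum - result.fun) <= TOLERANCE)
    print('nfev = %d, best f = %g, success = %s' % (result.nfev, result.fun, success))

A run is thus recorded as successful only if the best objective value is within the tolerance of the known global optimum **and** the number of function evaluations does not exceed the assigned budget.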
|algorithms| Tested Algorithms
------------------------------

I have tried a number of global optimization algorithms on the entire set of benchmarks, considering **bound constrained problems only**, and specifically the following ones:

1. **AMPGO**: Adaptive Memory Programming for Global Optimization: this is my Python implementation of the algorithm described here:

   http://leeds-faculty.colorado.edu/glover/fred%20pubs/416%20-%20AMP%20(TS)%20for%20Constrained%20Global%20Opt%20w%20Lasdon%20et%20al%20.pdf

   I have added a few improvements here and there based on my Master Thesis work on the standard Tunnelling Algorithm of Levy, Montalvo and Gomez. After AMPGO was integrated in `lmfit `_, I have improved it even more - in my opinion.

2. **BasinHopping**: Basin hopping is a random algorithm which attempts to find the global minimum of a smooth scalar function of one or more variables. The algorithm was originally described by David Wales:

   http://www-wales.ch.cam.ac.uk/

   BasinHopping is now part of the standard SciPy distribution.

3. **BiteOpt**: BITmask Evolution OPTimization, based on the algorithm presented in this GitHub link:

   https://github.com/avaneev/biteopt

   I have converted the C++ code into Python and added a few, minor modifications.

4. **CMA-ES**: Covariance Matrix Adaptation Evolution Strategy, based on the following algorithm:

   http://www.lri.fr/~hansen/cmaesintro.html

   http://www.lri.fr/~hansen/cmaes_inmatlab.html#python (Python code for the algorithm)

5. **CRS2**: Controlled Random Search with Local Mutation, as implemented in the `NLOpt `_ package:

   http://ab-initio.mit.edu/wiki/index.php/NLopt_Algorithms#Controlled_Random_Search_.28CRS.29_with_local_mutation

6. **DE**: Differential Evolution, described in the following page:

   http://www1.icsi.berkeley.edu/~storn/code.html

   DE is now part of the standard SciPy distribution, and I have taken the implementation as it stands in SciPy:

   https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.differential_evolution.html#scipy.optimize.differential_evolution

7. **DIRECT**: the DIviding RECTangles procedure, described in:

   https://www.tol-project.org/export/2776/tolp/OfficialTolArchiveNetwork/NonLinGloOpt/doc/DIRECT_Lipschitzian%20optimization%20without%20the%20lipschitz%20constant.pdf

   http://ab-initio.mit.edu/wiki/index.php/NLopt_Algorithms#DIRECT_and_DIRECT-L (Python code for the algorithm)

8. **DualAnnealing**: the Dual Annealing algorithm, taken directly from the SciPy implementation:

   https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.dual_annealing.html#scipy.optimize.dual_annealing

9. **LeapFrog**: the Leap Frog procedure, which has been recommended to me, taken from:

   https://github.com/flythereddflagg/lpfgopt

10. **MCS**: Multilevel Coordinate Search, my Python translation of the original Matlab code from A. Neumaier and W. Huyer (which also gives you `GLS `_ and `MINQ `_ for free):

    https://www.mat.univie.ac.at/~neum/software/mcs/

    I have added a few, minor improvements compared to the original implementation. See the :ref:`MCS` section for a quick and dirty comparison between the Matlab code and my Python conversion.

11. **PSWARM**: Particle Swarm optimization, described in many online papers. I have used a compiled version of the C source code from:

    http://www.norg.uminho.pt/aivaz/pswarm/

12. **SCE**: Shuffled Complex Evolution, described in:

    Duan, Q., S. Sorooshian, and V. Gupta, Effective and efficient global optimization for conceptual rainfall-runoff models, Water Resour.
    Res., 28, 1015-1031, 1992.

    The version I used was graciously made available by Matthias Cuntz via a personal e-mail.

13. **SHGO**: Simplicial Homology Global Optimization, taken directly from the SciPy implementation:

    https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.shgo.html#scipy.optimize.shgo

|

.. note::

   Please feel free to contact me if you think I have missed one (or more) algorithm/library, or if you have a procedure you would like to benchmark against. Do send me an email at andrea.gavana@gmail.com: I'll integrate your contribution with due credit and I will re-run the algorithm comparison.

|

.. toctree::
   :maxdepth: 2
   :hidden:

   thebenchmarks
   new_algorithms
   scipy_test_functions


|indices| Indices and tables
=============================

* :ref:`genindex`
* :ref:`modindex`
* :ref:`search`