Parsing and evaluating problems¶

In DESDEO, the problem format (c.f., section The Problem Format) can be parsed into other formats and evaluated. The purpose of evaluating is to allow different solvers, discussed in section Solvers, to solve the modeled multiobjective optimization problem.

Here, we will first discuss the parsing and the parsers found in DESDEO in section Parsing and parsers. Then, we will discuss the evaluators available in DESDEO for evaluating multiobjective optimization problems in section Evaluating and evaluators. After these sections, the reader should have a basic understanding on how the problems defined in DESDEO are evaluated.

Parsing and parsers¶

From the problem format to other formats¶

The most central parsers in DESDEO are those that allow parsing the problem format from the MathJSON format discussed in section The MathJSON format to other formats that different solvers (i.e., optimizers) can understand. These kind of parsers can be found in the JSON parser module.

The MathParser handles parsing of the problem format into a polars format, a pyomo format, a sympy format, or a gurobipy format. In the polars and sympy formats, the problem is represented by polars and sympy expressions. Likewise, in the pyomo and gurobipy formats, the problem is represented as a pyomo or gurobipy model. This allows different solvers to handle the problems we have defined in DESDEO.

From other formats to the problem format¶

DESDEO has also parsers that allow other formats to be parsed into the MathJSON format representing multiobjective optimization problems. The most notable of these, and the currently only one present, is the infix notation parser found in the Infix parser module. This allows mathematical expression formatted in a typical infix format, i.e., how one would regularly write a mathematical expression down, e.g., x * 2 - y, to the MathJSON format:

["Add",
    ["Multiply", "x", 2],
    ["Negate", "y"]
]

This is useful when one wants to model a multiobjective optimization problem in DESDEO since it allows the function expression to be written down as they are, without requiring a manual conversion to the MathJSON format first. When adding function expression to the Problem model, if a func field is initialized with a string in an infix format, it is automatically converted to the JSON format utilizing the infix parser. The infix parser can also prove useful when defining various scalarization functions in a dynamic way (c.f., the section Scalarization).

Note

The infix parser introduces a little bit of overhead. If the parser is utilized to parse infix expressions into the MathJSON format consecutively multiple times, as can be the case with some navigation-based methods, then the overhead might accumulate slowing the method down considerably. In such cases, it is advised to provide the expression directly in the MathJSON format to save time.

Evaluating and evaluators¶

While parsers transform the problem format in DESDEO from one representation to another, evaluators enable the evaluating of a problem, usually by supplying one or more decision variable vectors. Evaluators are not something a user is expected to directly utilize, rather, evaluators are often tied to solvers and solver interfaces, which are discussed in the Solvers section.

It is important to understand in which order the various fields found in the problem format (c.f., The Problem schema and Problem) are evaluated. The first field to be evaluated is the constants field, which is made up of Constant models. Evaluating these fields amounts to replacing the symbol of the constant with its numerical value, or defining an internal variable with the same value.

Next, the extra_funcs field made up of ExtraFunction models is evaluated. This assumes that the extra functions might utilize constants in them, which requires the constants to be available prior to the evaluation. Naturally, evaluating any extra functions is skipped if they are not defined for the problem being evaluated.

After the extra functions, the Objective models, found in the field objectives of the problem, are evaluated. These can depend on constants and extra functions, which means that these must have been evaluated prior to evaluating any objectives.

Then, any Constraint models in the constraints field of the problem are evaluated. This assumes that prior to evaluating any constraints, any constants, extra functions, or objectives utilized in the constraint have been evaluated. Objectives might be part of a constraint in, e.g., the epsilon-constraint scalarization. If no constraints have been defined, then this step is skipped.

Lastly, any ScalarizationFunction present in the scalarizations_funcs field are evaluated. These can again depend on any of the constants, extra functions, constraints, or objectives defined for the problem. If no scalarization functions have been defined, then their evaluation is skipped.

The order of evaluation is important to understand to avoid bugs and unexpected behavior when defining problems in DESDEO. The order of evaluating the fields found in the problem model has been summarized in the list below:

Constants, if any.
Extra functions, if any.
Objective functions.
Constraint functions, if any.
Scalarization functions, if any.

The polars evaluator¶

The PolarsEvaluator is for evaluating problems to be solved with solvers that expect the problem to be defined as Python functions. It can be utilized in two modes: variables or discrete. In the first mode, the problem is expected to be evaluated with a given set of decision variable vectors. In the discrete mode, the problem is evaluated based on its discrete representation (c.f., DiscreteRepresentation).

If the variables mode is used, then the function expressions found in the problem are parsed to polars expressions. This allows any Python-based solver to evaluate the problem with given decision variable values. The polars evaluator currently supports scalar variables and one-dimensional TensorVariables (vectors). Higher dimensions on the decision variables are not supported.

If the discrete mode is used, then the problem is expected to be completely defined by decision and objective vector pairs. When the problem is then evaluated with a given set of decision variable vectors, the closest vectors are searched for in the problem's discrete representation, and the corresponding objective function values are then returned. The discrete evaluator only supports scalar valued variables, not TensorVariables, at the moment.

The polars evaluator is utilized by solvers that do not expect, or require, the exact formulation of the problem. That is, heuristics based and gradient-free solvers. At the moment, these solvers implemented in DESDEO are the Scipy solvers and the ProximalSolver.

Info

For more info about polars, see the polars documentation.

The pyomo evaluator¶

The PyomoEvaluator transforms a problem into a pyomo model. This enables the usage of many external solvers, such as the ones found in the COIN-OR project to be utilized in DESDEO. Unlike the polars evaluator, the pyomo evaluator does not expect decision variables, instead, it provides a pyomo model that external solvers can then utilize to solve the original problem. The pyomo solvers implemented in DESDEO can be found in the pyomo solver interfaces. The pyomo evaluator supports variables that are higher dimensional tensors (TensorVariables) as well.

Note

Currently, the pyomo evaluator utilized only concrete pyomo models (ConcreteModel).

Info

For more info about pyomo, see the pyomo documentation.

The sympy evaluator¶

The SympyEvaluator parses a problem into sympy expressions using MathParser. The parsed expressions are then compiled into callable Python functions via sympy.lambdify(), which allows for fast numerical evaluation.

The sympy evaluator evaluates the problem one decision variable vector at a time (single-sample evaluation). It only supports scalar variables — TensorVariables are not supported. Constants, extra functions, objectives, constraints, and scalarization functions are all handled by substituting the relevant sympy expressions into one another during initialization.

The sympy evaluator is currently used by the NevergradGenericSolver for gradient-free optimization. Beyond evaluation, the use of sympy opens potential for symbolic operations such as symbolic differentiation, simplification, and LaTeX export. Contributions are welcome!

Info

For more info about sympy, see the sympy documentation.

The gurobipy evaluator¶

The GurobipyEvaluator transforms a problem into a Gurobipy model. This model is then used in optimization through the GurobipySolver. The gurobipy evaluator supports variables that are higher dimensional tensors (TensorVariables) as well.

Info

For more info about gurobi, see the gurobi documentation.

The simulator evaluator¶

The SimulatorEvaluator adds support for simulator and surrogate based objectives, constraints and extra functions. This is done by collecting the simulators and surrogates in the evaluator and calling them by providing the decision variables and parameters needed. The connection to simulators happens via simulator files. The surrogate models are loaded from disk and then evaluated with the given decision variable values. This evaluator also handles analytical functions by calling the PolarsEvaluator.

Evaluating simulator and surrogate based problems¶

SimulatorEvaluator can be used to evaluate any problems with analytical, simulator and surrogate based objectives, constraints and extra functions. In what follows, it is explained, how the evaluator works. Here is an example of defining a simulator based problem in DESDEO.

First, we need to initialize the problem we are solving:

In [1]:

Copied!





import tempfile
import warnings
from pathlib import Path

import joblib
from sklearn.datasets import make_friedman2, make_gaussian_quantiles
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import DotProduct, WhiteKernel
from sklearn.linear_model import LogisticRegression

from desdeo.problem.testproblems import simulator_problem

# Create the test problem with analytical, simulator, and surrogate objectives
problem = simulator_problem("../../tests/data")

# Create temporary surrogate model files for the surrogate-based objectives
tmpdir = Path(tempfile.mkdtemp())

with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    X1, y1 = make_friedman2(n_samples=500, noise=0, random_state=0)
    gpr = GaussianProcessRegressor(kernel=DotProduct() + WhiteKernel(), random_state=0).fit(X1, y1)

surrogate_path_1 = tmpdir / "surr1" / "model.pkl"
surrogate_path_1.parent.mkdir()
joblib.dump(gpr, surrogate_path_1)

X2, y2 = make_gaussian_quantiles(n_samples=500, n_features=4, random_state=0)
lr = LogisticRegression(random_state=0).fit(X2, y2)
surrogate_path_2 = tmpdir / "surr2" / "model.pkl"
surrogate_path_2.parent.mkdir()
joblib.dump(lr, surrogate_path_2)
import tempfile
import warnings
from pathlib import Path

import joblib
from sklearn.datasets import make_friedman2, make_gaussian_quantiles
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import DotProduct, WhiteKernel
from sklearn.linear_model import LogisticRegression

from desdeo.problem.testproblems import simulator_problem

# Create the test problem with analytical, simulator, and surrogate objectives
problem = simulator_problem("../../tests/data")

# Create temporary surrogate model files for the surrogate-based objectives
tmpdir = Path(tempfile.mkdtemp())

with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    X1, y1 = make_friedman2(n_samples=500, noise=0, random_state=0)
    gpr = GaussianProcessRegressor(kernel=DotProduct() + WhiteKernel(), random_state=0).fit(X1, y1)

surrogate_path_1 = tmpdir / "surr1" / "model.pkl"
surrogate_path_1.parent.mkdir()
joblib.dump(gpr, surrogate_path_1)

X2, y2 = make_gaussian_quantiles(n_samples=500, n_features=4, random_state=0)
lr = LogisticRegression(random_state=0).fit(X2, y2)
surrogate_path_2 = tmpdir / "surr2" / "model.pkl"
surrogate_path_2.parent.mkdir()
joblib.dump(lr, surrogate_path_2)

Out[1]:

['/tmp/tmppgybqu8k/surr2/model.pkl']

The problem we defined has at least some simulator or surrogate based objectives, constraints or extra functions. Therefore, we should use SimulatorEvaluator to evaluate it. To use the evaluator, we first need to initialize it. When initializing the evaluator, we give the problem as an argument. We can also give the evaluator some optional additional arguments:

params: Some parameters for the simulators as a python dict of dicts. For example,

In [2]:

Copied!

{"s_1": {"alpha": 0.1, "beta": 0.2}, "s_2": {"epsilon": 10, "gamma": 20}}
{"s_1": {"alpha": 0.1, "beta": 0.2}, "s_2": {"epsilon": 10, "gamma": 20}}

Out[2]:

{'s_1': {'alpha': 0.1, 'beta': 0.2}, 's_2': {'epsilon': 10, 'gamma': 20}}

where s_1 and s_2 are symbols of two simulators and the corresponding values are dicts with some parameter values for the corresponding simulators. These parameter values are passed to the simulators.

surrogate_paths: Paths to surrogate files stored on disk. Given as a python dict with objective, constraint or extra function symbols as keys and the corresponding surrogate paths as values of the dict. The surrogate file path can be given as either a string of the path or a pathlib.Path object. This is an optional argument and if not defined, the evaluator uses surrogate models in the problem object, if available.

With these arguments we can then import and initialize the evaluator.

In [3]:

Copied!





from desdeo.problem import SimulatorEvaluator

evaluator = SimulatorEvaluator(
    problem=problem,
    params={
        "s_1": {"alpha": 0.1, "beta": 0.2},
        "s_2": {"epsilon": 10, "gamma": 20},
    },
    surrogate_paths={
        "f_5": surrogate_path_1,
        "f_6": surrogate_path_2,
        "g_3": surrogate_path_1,
        "e_3": surrogate_path_2,
    },
)
from desdeo.problem import SimulatorEvaluator

evaluator = SimulatorEvaluator(
    problem=problem,
    params={
        "s_1": {"alpha": 0.1, "beta": 0.2},
        "s_2": {"epsilon": 10, "gamma": 20},
    },
    surrogate_paths={
        "f_5": surrogate_path_1,
        "f_6": surrogate_path_2,
        "g_3": surrogate_path_1,
        "e_3": surrogate_path_2,
    },
)

As the evaluator is initialized, it calls the _load_surrogates method that loads the surrogate models from the disk. If no surrogate paths are passed to the evaluator, it takes the surrogate paths from the problem, if possible.

Now that the evaluator is initialized, we can call its evaluate method. The method takes one argument xs that is a dict that has the problem's decision variables and some values for them that we would like to evaluate the problem with. Say we have four decision variables and we would like to evaluate five samples. Then, the dict of decision variables would have the decision variable symbols as keys and the corresponding values would be lists of decision variable values with a single value for each sample (length of each list in this example would be five). Then we would define the decision variable dict and call the evaluate method as follows.

Note

All the decision variable value lists should be the same length, i.e., same number of samples given of each decision variable.

In [4]:

Copied!





xs = {
    "x_1": [0, 1, 2, 3, 4],
    "x_2": [4, 3, 2, 1, 0],
    "x_3": [0, 4, 1, 3, 2],
    "x_4": [3, 1, 3, 2, 3],
}

results = evaluator.evaluate(xs)
results
xs = {
    "x_1": [0, 1, 2, 3, 4],
    "x_2": [4, 3, 2, 1, 0],
    "x_3": [0, 4, 1, 3, 2],
    "x_4": [3, 1, 3, 2, 3],
}

results = evaluator.evaluate(xs)
results

Out[4]:

shape: (5, 26)

x_1	x_2	x_3	x_4	e_2	f_2	f_2_min	g_2	f_1	f_4	e_1	f_3	g_1	f_1_min	f_3_min	f_4_min	f_5	f_5_uncert	f_6	f_6_uncert	g_3	g_3_uncert	e_3	e_3_uncert	f_5_min	f_6_min
f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	f64	i64	f64	f64	f64	i64	f64	f64	i64
0.0	4.0	0.0	3.0	0.0	4.0	-4.0	-4.0	0.4	8.0	2.0	-4.5	8.2	0.4	4.5	8.0	0.873036	316.240953	2	NaN	0.873036	316.240953	2	NaN	0.873036	2
1.0	3.0	4.0	1.0	12.0	8.0	-8.0	-8.0	2.3	6.2	1.5	-1.5	5.2	2.3	1.5	6.2	3.083028	316.254497	0	NaN	3.083028	316.254497	0	NaN	3.083028	0
2.0	2.0	1.0	3.0	4.0	5.0	-5.0	-5.0	4.2	4.4	1.0	1.5	2.2	4.2	-1.5	4.4	0.889159	316.242774	2	NaN	0.889159	316.242774	2	NaN	0.889159	2
3.0	1.0	3.0	2.0	9.0	7.0	-7.0	-7.0	6.1	2.6	0.5	4.5	-0.8	6.1	-4.5	2.6	1.893414	316.249087	0	NaN	1.893414	316.249087	0	NaN	1.893414	0
4.0	0.0	2.0	3.0	0.0	6.0	-6.0	-6.0	8.0	0.8	0.0	7.5	-3.8	8.0	-7.5	0.8	0.905282	316.249614	0	NaN	0.905282	316.249614	0	NaN	0.905282	0

The evaluator returns a polars Dataframe with each decision variable, objective, constraint and extra function as their own column.

Where to go next?¶

You can keep studying the various parsers found in the modules JSON parser and Infix parser, and the evaluators found in the modules Polars evaluator, Pyomo evaluator, Sympy evaluator, Gurobipy evaluator, and Simulator evaluator. If you are interested in how to solve a multiobjective optimization problem, then the section Scalarization is a good place to check out.