2.1 Decision aiding
Before we pass to Multiple Criteria Decision Aiding (MCDA), we should stress the Decision Aiding (DA) aspect of the methodology. The aim of DA is to analyze the decision-making context by identifying the problem with its actors, possible alternatives and their consequences, as well as the stakes (Roy
2000). Among the actors, there are stakeholders (actors concerned by the decision), single or multiple decision makers (users), and an analyst (facilitator). For DA, it is important how the decision-making process unfolds, so as to increase both the consistency between the values underlying the objectives and the quality of recommendations based on results from models and computational procedures designed under some working hypotheses. Decision problems considered within DA involve a set
\({\mathcal {A}}\) of alternatives (actions, solutions, objects, etc.) evaluated from multiple points of view. The following three main problem statements are considered in DA:
- Choice of the best alternative, with multiobjective optimization as a particular case.
- Classification (also called sorting) of alternatives into pre-defined and preference-ordered classes.
- Ranking of alternatives from the best to the worst.
The above decision problems involve, in general, multiple evaluation criteria (multiple objectives). A decision-maker (DM) seldom has a single clear criterion in mind. Seldom is there a common unit for all scales of criteria, which are rather heterogeneous. That is why it may be very difficult to define
a priori a unique criterion able to take into account all relevant points of view. Formally, the
criterion is a real-valued function
\(g_i\) defined on
\({\mathcal {A}}\), reflecting a value of each alternative from a particular point of view, such that in order to compare any two alternatives
\(a,b \in \) \({\mathcal {A}}\) from this point of view, it is sufficient to compare two values:
\(g_i(a)\) and
\(g_i(b)\), called
performances. It is worth stressing that criteria may have
ordinal evaluation scales, where a difference of performance does not have the meaning of intensity, or
cardinal evaluation scales, where a difference of performance has the meaning of intensity, of either interval or ratio type. The type of criterion scale determines the allowed arithmetic operations on the performances. For example, additions of (weighted) performances on ordinal scales have no meaning. In the rest of this section, we assume by default that criteria are of the
gain-type, i.e., the greater the performance, the better, unless explicitly specified that they are of the
cost-type.
Building a family of criteria
\({\mathcal {G}}=\{g_1,\dots ,g_n\}\) for a decision problem at hand is the preliminary step of MCDA. According to Roy (
1996), the family
\({\mathcal {G}}\) should be exhaustive, monotonic, and non-redundant. With the exception of trivial cases, the evaluation criteria are in conflict, in the sense that improving an alternative on one criterion causes deterioration on other criteria; thus, in set
\({\mathcal {A}}\), there is no alternative being the best on all criteria. The only objective information that stems from a multiple criteria evaluation of alternatives is the dominance relation in set
\({\mathcal {A}}\). The meaning of the dominance relation is the following: alternative
a dominates alternative
b if
a is not worse than
b on all criteria and on at least one criterion, it is strictly better. It is clear that such a binary relation is a weak partial order. As such, the dominance relation leaves many alternatives incomparable. Incomparability prevents the formulation of an unambiguous recommendation in the best choice, classification, or ranking. This is why the main preoccupation of MCDA consists of aggregating the multiple criteria into a
preference model making the alternatives more comparable in light of users’ preferences.
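To make the dominance relation concrete, the following is a minimal Python sketch; the alternative names and performance vectors are illustrative assumptions, and all criteria are taken to be gain-type:

```python
def dominates(a, b):
    """True iff a is at least as good as b on every (gain-type) criterion
    and strictly better on at least one of them."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

# hypothetical performance vectors on two gain-type criteria
car1, car2, car3 = (8.0, 3.0), (5.0, 3.0), (2.0, 9.0)

dominates(car1, car2)   # True: strictly better on criterion 1, equal on criterion 2
dominates(car1, car3)   # False: car1 and car3 are left incomparable by dominance
```

The last call illustrates the point made above: dominance alone leaves many pairs of alternatives incomparable.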
2.5 Elicitation of preference information
Preference elicitation can be either direct or indirect. In the case of direct elicitation, the DM is expected to provide parameters related to a supposed form of the preference model. Experience indicates that direct elicitation of numerical values of model parameters by DMs demands much of their cognitive effort. For example, defining weights
\(k_i, {i=1,\ldots ,n}\), of a weighted sum model requires specification of substitution rates
\(k_i/ k_j\) between all pairs of criteria
\(\{g_i,g_j\}\subset {\mathcal {G}}\). These substitution rates must be, moreover, constant in the whole range of criteria scales. In the case of heterogeneous criteria, it is not easy to say how much gain in performance on one criterion compensates for a loss of performance on another criterion. The same difficulty concerns the determination of relative importance weights of criteria having different semantics than the weights of the weighted sum. Other preference model parameters, like indifference, preference, and veto thresholds on particular criteria are not easier to elicit directly. A long time ago, Fishburn (
1967) listed and classified twenty-four methods of estimating additive utilities. Such a large number of methods for estimating parameters of the additive value function proves the difficulty of direct elicitation.
For this reason, indirect preference elicitation through holistic judgments, i.e.,
decision examples, gained importance in MCDA. Decision examples are relatively “easy” preference information. Decisions can also be observed without the active participation of DMs. For example, the choices made by people on the Internet provide rich information about their preferences. Finally, psychologists confirm that DMs are more confident exercising their decisions than explaining them (Slovic et al.
1977; March
1978).
Decision examples may either be provided by the DM on a set of real or hypothetical alternatives, or may come from observation of the DM’s past decisions. Methods based on indirect preference information are considered more user-friendly than approaches based on direct elicitation of preferences. Moreover, the preference model constructed using decision examples aims to be
compatible with the preference information, i.e., to reproduce the holistic judgments observed in decision examples. For this, the DMs can verify if the preference model represents faithfully enough their preferences and what is the impact of the provided preference information on the recommendation produced by the method. This makes the whole decision-aiding process more transparent, increasing the DM’s confidence in the final recommendation. As noted by Stewart (
2005), MCDA includes a comprehensive process involving a rich interplay between human judgment, data analysis, and computational processes. Indeed, transparency and explainability of the decision-aiding process are necessary for an efficient interplay.
Decision examples elicited in the indirect approach may have different forms depending on the problem statement and the wishes of stakeholders. These may be: (i) pairwise comparisons of some real or fictitious alternatives, (ii) assignments of some alternatives to decision classes, as their typical examples, (iii) intensity of preference expressed on some quadruples of alternatives, e.g., a is preferred to b more than c is preferred to d, (iv) rank-related preferences, e.g., alternative f should be among 5% of the best ones.
The MCDA methods based on the indirect preference information having the form of decision examples are called
ordinal regression methods. They were first proposed for the additive value function preference model in the method called UTA (Jacquet-Lagrèze and Siskos
1982). In UTA, the preference information consists of a set of pairwise comparisons of so-called reference alternatives
\(A^R = \{ a^{*}, b^{*}, \ldots \}\). Usually,
\(A^R \subseteq {\mathcal {A}}\), and these alternatives are relatively well known to the DM, who is thus able to say, for some
\(a^*,b^*\in A^R\), that
\(a^*\succ b^*\) or
\(b^*\succ a^*\) or
\(a^*\sim b^*\). A
compatible additive value function, reproducing as far as possible the pairwise comparisons of reference alternatives, is found as a result of solving a linear programming (LP) problem where variables are marginal values in the breakpoints of piecewise-linear marginal value functions. While the UTA method and its many variants gained a high popularity witnessed by many applications (Siskos et al.
2005), it suffers from the problem of non-uniqueness of LP solutions. In other words, the LP solution identifies a single compatible value function, while there may exist many instances of compatible value functions that reproduce the pairwise comparisons of reference alternatives equally well. Different compatible instances may, however, compare non-reference alternatives differently, making the recommendation ambiguous.
The issue of ambiguity has been resolved by Greco et al. (
2008) using the approach called Robust Ordinal Regression (ROR) (Greco et al.
2010d). In ROR, the preference data are the same as in UTA, however, the ordinal regression finds the whole set of compatible instances of the value function. The reconstruction of the DM’s judgments by the ordinal regression in terms of compatible instances of the value function is a preference learning step. As ROR works in a loop with incremental elicitation of preferences, it is called
constructive learning (Corrente et al.
2013a). In constructive learning, the model aims to reconstruct as faithfully as possible the preference information of the DM, while the DM learns from the consequences of applying all compatible instances of the model on the whole set of alternatives. Constructive learning is ‘robust’ because it tells the truth about preferences based on partial, not exhaustive, preference information. For example, unlike the AHP method (Saaty
2005), it does not require information about all possible pairwise comparisons.
As ROR has the most in common with PL, we will characterize this approach in more detail in the next subsection.
2.7 ROR with value function preference model
Let us formulate the ordinal regression problem based on the above-mentioned preference information. To model the DM’s preference information, we use in this subsection a general additive value function:
$$\begin{aligned} U(a)= \sum _{i=1}^{n} u_i(g_i(a)) \end{aligned}$$
(1)
where indices of criteria,
\(i=1,\ldots n\), form set
\({{\mathcal {J}}}\), the marginal value functions
\(u_i\),
\({i \in {\mathcal {J}}}\), are monotone, non-decreasing and normalized so that the additive value (
1) is bounded within the interval
\({[0, 1]}\). Note that for simplicity of notation, we write
\({u_i(a)}\),
\({i \in {\mathcal {J}}}\), instead of
\({u_{i}(g_i(a))}\). Consequently, the basic set of constraints defining general additive value functions compatible with the preference information composed of pairwise comparisons and comparisons of intensities of preferences, has the following form:
$$\begin{aligned} \left. \begin{array}{l} \;U(a^{*}) \ge U(b^{*}) + \varepsilon , \; \hbox { if } \; a^{*} \succ b^{*}, \hbox { for } \{a^{*}, b^{*}\} \in A^R\\ \;U(a^{*}) = U(b^{*}), \; \hbox { if } \; a^{*} \sim b^{*}, \hbox { for } \{a^{*}, b^{*}\} \in A^R\\ \;U(a^*) - U(b^*) \!\ge \! U(c^*) - U(d^*) \!+\! \varepsilon , \; \hbox { if } \; (a^*, b^*) \succ ^{*} (c^*, d^*), \hbox { for } \{a^{*}, b^{*}, c^{*}, d^{*}\} \!\in \! A^R \\ \;U(a^*) - U(b^*) = U(c^*) - U(d^*), \; \hbox { if } \; (a^*, b^*) \sim ^{*} (c^*, d^*), \hbox { for } \{a^{*}, b^{*}, c^{*}, d^{*}\} \in A^R \\ \;u_i(x_i^{h})-u_i(x_i^{(h-1)})\ge 0, \;h=2,\ldots , m_i(A^R), \; i \in {\mathcal {J}},\\ \;u_i(x_i^{1})=0, \ \ \ \sum _{i=1}^n u_i(x_i^{m_i(A^R)})=1, \end{array} \right\} \; E^{A^R}_{BASE} \end{aligned}$$
where
\((a^*,b^*) \succ ^{*} (c^*, d^*)\) means
\(a^*\hbox { is preferred to } b^{*} \hbox { more than } c^{*} \hbox { is preferred to } d^{*}\),
\((a^{*},b^{*}) \sim ^{*} (c^*, d^*)\) means
\(a^*\hbox { is preferred to } b^{*} \hbox { as much as } c^*\hbox { is preferred to } d^{*}\),
\(m_i(A^R)\) is the number of different performances of reference alternatives on criterion
\(i \in {\mathcal {J}}\),
\(\ \ x_i^{1}, \ldots , x_i^{m_i(A^R)}\) are the ordered performances of reference alternatives on criterion
\(i \in {\mathcal {J}}\),
\(\ \ x_i^{h} < x_i^{h+1}\),
\(h=1, \ldots ,\) \(m_i(A^R)-1\). The first four constraints represent pairwise comparisons of reference alternatives, as well as comparisons of intensities of preference between pairs of reference alternatives, and the last two lines are monotonicity and normalization constraints.
General monotonic marginal value functions defined in this way do not involve any arbitrary or restrictive parametrization (Greco et al.
2010d). On the contrary, the majority of existing methods employ marginal value functions that are linear or piecewise linear (Siskos et al.
2005). Note that piecewise linear functions require specification of the number of characteristic points which is not easy for most DMs.
In order to verify that the set of value functions
\({{\mathcal {U}}_{A^R}}\) compatible with preference information provided by the DM is not empty, the following LP problem has to be solved:
$$\begin{aligned} Maximize: \varepsilon , \hbox { s.t. } \ E^{A^R}_{BASE}. \end{aligned}$$
(2)
Let us denote by
\(\varepsilon ^{*}\) the maximal value of
\(\varepsilon \) obtained from the solution of the above LP problem, i.e.,
\(\varepsilon ^{*} =\) max \(\varepsilon \), s.t.
\(E^{A^R}_{BASE}\). It corresponds to a margin of the representation error. One concludes that
\({{\mathcal {U}}_{A^R}}\) is not empty, if
\(E^{A^R}_{BASE}\) is feasible and
\(\varepsilon ^{*} > 0\). In such a case, all pieces of preference information can be properly reproduced by at least one instance of the value function. On the contrary, when
\(E^{A^R}_{BASE}\) is infeasible or the margin of the representation error is not greater than zero, some pieces of DM’s preference on the set of reference alternatives
\(A^R\) are conflicting, and thus, some pairwise comparisons or comparisons between pairs cannot be reproduced by any additive value function. In the latter case, the DM can choose one of three options.
The consequence of considering many instances of the preference model is a non-univocal recommendation. In the constructive learning perspective, where the aim is not to predict, but rather to construct the preferences from scratch, the user has an interest in investigating what are the consequences of using all instances from
\({{\mathcal {U}}_{A^R}}\) on the set of alternatives
\({{\mathcal {A}}}\). Obviously, the final recommendation in terms of ranking, best choice, or classification may vary substantially depending on which compatible instance is selected. Remark also that ROR does not consider a probability distribution on the set of all compatible instances of the value function, assigning to all of them the same credibility.
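The non-emptiness test (2) can be sketched in code. The following Python fragment is an illustrative implementation, not the authors' software: it introduces one variable \(u_i(x_i^h)\) per characteristic point of the general monotone marginal value functions, adds the pairwise-comparison, monotonicity, and normalization constraints of \(E^{A^R}_{BASE}\) for a tiny hypothetical data set (three reference alternatives on two gain-type criteria, with \(a \succ b \succ c\)), and maximizes \(\varepsilon \) with scipy's LP solver:

```python
import numpy as np
from scipy.optimize import linprog

# hypothetical performances g_i(.) of three reference alternatives, two criteria
perf = {"a": (8.0, 3.0), "b": (5.0, 5.0), "c": (2.0, 9.0)}
pairwise = [("a", "b"), ("b", "c")]      # DM: a strictly preferred to b, b to c

# characteristic points = distinct performances per criterion, sorted
pts = [sorted({p[i] for p in perf.values()}) for i in range(2)]
# one LP variable u_i(x_i^h) per characteristic point, plus epsilon (last)
idx, k = {}, 0
for i, xs in enumerate(pts):
    for x in xs:
        idx[(i, x)] = k
        k += 1
n_var = k + 1
EPS = n_var - 1

def U_row(alt, sign):
    """Row with +/-1 at the marginal values used by alternative `alt`."""
    row = np.zeros(n_var)
    for i in range(2):
        row[idx[(i, perf[alt][i])]] += sign
    return row

A_ub, b_ub = [], []
# U(a*) >= U(b*) + eps  rewritten as  -U(a*) + U(b*) + eps <= 0
for a, b in pairwise:
    row = U_row(a, -1.0) + U_row(b, 1.0)
    row[EPS] = 1.0
    A_ub.append(row); b_ub.append(0.0)
# monotonicity: u_i(x^{h-1}) - u_i(x^h) <= 0
for i, xs in enumerate(pts):
    for lo, hi in zip(xs, xs[1:]):
        row = np.zeros(n_var)
        row[idx[(i, lo)]], row[idx[(i, hi)]] = 1.0, -1.0
        A_ub.append(row); b_ub.append(0.0)

A_eq, b_eq = [], []
# normalization: u_i(x_i^1) = 0 and sum_i u_i(x_i^{m_i}) = 1
for i, xs in enumerate(pts):
    row = np.zeros(n_var); row[idx[(i, xs[0])]] = 1.0
    A_eq.append(row); b_eq.append(0.0)
row = np.zeros(n_var)
for i, xs in enumerate(pts):
    row[idx[(i, xs[-1])]] = 1.0
A_eq.append(row); b_eq.append(1.0)

c = np.zeros(n_var); c[EPS] = -1.0       # maximize eps == minimize -eps
res = linprog(c, A_ub=np.array(A_ub), b_ub=b_ub,
              A_eq=np.array(A_eq), b_eq=b_eq, bounds=[(0, 1)] * n_var)
eps_star = res.x[EPS]                    # eps* > 0  =>  U_{A^R} is non-empty
```

For this toy data the LP is feasible with a strictly positive margin, so at least one compatible general additive value function exists.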
When comparing a pair of alternatives
\((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\) in terms of the recommendation that is provided by any compatible (or non-perfectly compatible) value function from set
\({{\mathcal {U}}_{A^R}}\), it is reasonable to verify whether
a is weakly preferred to
b or vice versa, for all or at least one compatible instance of the value function. Answering these questions, ROR methods (Greco et al.
2008) produce two preference relations in the set of alternatives
\({\mathcal {A}}\):
- a necessary weak preference relation \(\succsim ^N\) holds for a pair of alternatives \((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\), in case \(U(a) \ge U(b)\) for all compatible instances of the value function,
- a possible weak preference relation \(\succsim ^P\) holds for a pair of alternatives \((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\), in case \(U(a) \ge U(b)\) for at least one compatible instance of the value function.
Thus defined, the necessary relations specify the most certain recommendation worked out on the basis of all compatible instances of the value function, while the possible relations identify a recommendation provided by at least one compatible instance of the value function. Consequently, the necessary outcomes can be considered robust regarding the available preference information, as they guarantee that a definite relation is the same whichever compatible instance of the preference model would be used. To verify the truth of the necessary and possible weak preference relations the following linear programs need to be solved (Corrente et al.
2013a):
$$\begin{aligned} {\textit{Maximize}: } \varepsilon \end{aligned}$$
s.t.
$$\begin{aligned} \left. \begin{array}{l} \; U(b) - U(a) \ge \varepsilon \\ \; E^{A^R}_{BASE} \end{array} \right\} E^N(a,b), \end{aligned}$$
and
$$\begin{aligned} {\textit{Maximize}: } \varepsilon \end{aligned}$$
s.t.
$$\begin{aligned} \left. \begin{array}{l} \; U(a) - U(b) \ge 0 \\ \; E^{A^R}_{BASE} \end{array} \right\} E^P(a,b). \end{aligned}$$
We conclude that
\(a \succsim ^N b\), if
\(E^N(a,b)\) is not feasible or
\(\varepsilon _*= max \ \varepsilon \), s.t.
\(E^N(a,b)\), is not greater than 0, and that
\(a \succsim ^P b\), if
\(E^P(a,b)\) is feasible and
\(\varepsilon ^{*} = max \ \varepsilon \), s.t.
\(E^P(a,b)\), is greater than 0.
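The two tests above can be sketched as small LPs. For brevity of illustration, the sketch below restricts the model to the special case of linear marginal values \(u_i = w_i\, g_i\) with \(w_i \ge 0\), \(\sum _i w_i = 1\), so that a compatible instance is simply a weight vector; the data, function names, and the single preference statement are hypothetical assumptions:

```python
import numpy as np
from scipy.optimize import linprog

# hypothetical performances on two gain-type criteria
perf = {"a": (8.0, 3.0), "b": (5.0, 5.0), "c": (2.0, 9.0), "d": (6.0, 6.0)}
pairwise = [("a", "b")]          # certain preference information: a > b
n = 2

def _base():
    """E_BASE rows: U(x*) >= U(y*) + eps  ->  (g(y) - g(x)) w + eps <= 0."""
    A, b = [], []
    for x, y in pairwise:
        row = np.array(perf[y]) - np.array(perf[x])
        A.append(np.append(row, 1.0)); b.append(0.0)
    return A, b

def _solve(extra_row):
    """Maximize eps subject to E_BASE plus one extra <=0 constraint row."""
    A_ub, b_ub = _base()
    A_ub.append(extra_row); b_ub.append(0.0)
    A_eq = [np.append(np.ones(n), 0.0)]          # sum_i w_i = 1
    c = np.zeros(n + 1); c[-1] = -1.0            # maximize eps
    return linprog(c, A_ub=np.array(A_ub), b_ub=b_ub,
                   A_eq=np.array(A_eq), b_eq=[1.0],
                   bounds=[(0, 1)] * (n + 1))

def necessary(a, b):
    """a >=^N b: max eps s.t. U(b) - U(a) >= eps is infeasible or eps* <= 0."""
    row = np.append(np.array(perf[a]) - np.array(perf[b]), 1.0)
    res = _solve(row)
    return (res.status != 0) or (res.x[-1] <= 1e-6)

def possible(a, b):
    """a >=^P b: max eps s.t. U(a) - U(b) >= 0 is feasible with eps* > 0."""
    row = np.append(np.array(perf[b]) - np.array(perf[a]), 0.0)
    res = _solve(row)
    return (res.status == 0) and (res.x[-1] > 1e-6)
```

With this data, a and c are possibly preferred to each other in both directions, i.e., they stay incomparable with respect to the necessary relation, while d necessarily outperforms b because it dominates it.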
Let us remark that preference relations \(\succsim ^N\) and \(\succsim ^P\) are meaningful only if there exists at least one compatible instance of the value function. Observe also that in this case, for any \(a^{*},b^{*} \in A^R\), \(a^{*} \succsim b^{*} \Rightarrow a^{*} \succsim ^{N} b^{*}\) and \(a^{*} \succ b^{*} \Rightarrow \; \lnot (b^{*} \succsim ^P a^{*}).\) In fact, if the DM says for \(a^{*},b^*\in A^R\) that \(a^*\succsim b^*\), then for any compatible instance of the value function \(U(a^*) \ge U(b^*)\), and, therefore, \(a^*\succsim ^N b^*\). Moreover, if \(a^*\succ b^*\), then for any compatible instance of the value function \(U(a^*) > U(b^*)\), and, consequently, there is no compatible instance of the value function such that \(U(b^*) \ge U(a^*)\), which means \(\lnot (b^*\succsim ^P a^*)\).
The necessary weak preference relation \(\succsim ^N\) is a partial preorder, i.e., it is reflexive (\(a \succsim ^N a\), since for all \(a \in {\mathcal {A}}, \ U(a) = U(a)\)) and transitive (for all \(a, b, c \in {\mathcal {A}}\), if \(a \succsim ^N b\) and \(b \succsim ^N c\), then \(a \succsim ^N c\)). The possible weak preference relation \(\succsim ^P\) is a strongly complete binary relation (i.e., for all \(a, b \in {\mathcal {A}}, \ a \succsim ^P b\) or \(b \succsim ^P a\)) and negatively transitive (i.e., \(\forall a, b, c \in {\mathcal {A}},\) if \(\lnot (a \succsim ^P b)\) and \(\lnot (b \succsim ^P c)\), then \(\lnot (a \succsim ^P c)\)).
Moreover, for any pair \((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\), \( a \succsim ^N b \Rightarrow a \succsim ^P b\), i.e., \( \succsim ^N \subseteq \succsim ^P\), and for all \((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\), \(a \succsim ^N b\) or \(b \succsim ^P a\).
ROR with a general additive value function preference model has also been applied to multiple criteria classification problems (Greco et al.
2010c; Köksalan and Ozpeynirci
2009). Within ROR for multiple criteria classification one can also handle additional preference information about the intensity of preference, in a way similar to (Figueira et al.
2009).
Since different compatible instances of the value function may order a given alternative from set
\({\mathcal {A}}\) at different positions in the ranking, or classify this alternative to different classes, Kadziński et al. (
2012) proposed to identify the extreme ranks, or classes, for each alternative, as well as the interval of scores it can attain. This procedure, based on mixed-integer LP, is known in ROR as Extreme Ranking Analysis.
Although richer than the dominance relation, the necessary preference relation
\(\succsim ^N\) may still leave some pairs of alternatives incomparable, i.e.,
\(a \succsim ^P b\) and
\(b \succsim ^P a\). To get an estimate of the probability that a randomly selected compatible instance of the value function ranks alternative
a higher than
b, or vice versa, one can perform the Stochastic Multiobjective Acceptability Analysis (SMAA) (Kadziński and Tervonen
2013). It consists of using a
Hit & Run sampling of compatible instances of the value function within constraints
\(E^{A^R}_{BASE}\), and checking the frequency with which:
- \(a \succ b\), which gives the pairwise winning index \(p(a, b)\),
- \(a\) gets position \(h\) in the ranking, which gives the rank acceptability index \(r^h_a\).
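The two SMAA indices can be approximated by any sampler of the compatible region. The sketch below is illustrative only: it uses plain rejection sampling in place of Hit & Run, and the restrictive special case of linear marginal values (a weight vector per instance) with hypothetical data:

```python
import itertools, random
random.seed(0)

perf = {"a": (8.0, 3.0), "b": (5.0, 5.0), "c": (2.0, 9.0)}
pairwise = [("a", "b")]            # certain preference information: a > b
alts = sorted(perf)

def compatible(w):
    """E_BASE with a strictly positive margin for each stated preference."""
    return all(sum(wi * (perf[x][i] - perf[y][i]) for i, wi in enumerate(w)) > 0
               for x, y in pairwise)

samples = []                       # rejection sampling of compatible weights
while len(samples) < 2000:
    w1 = random.random()
    w = (w1, 1.0 - w1)             # uniform on the simplex w1 + w2 = 1
    if compatible(w):
        samples.append(w)

win = {p: 0 for p in itertools.permutations(alts, 2)}
rank = {(x, h): 0 for x in alts for h in range(1, len(alts) + 1)}
for w in samples:
    U = {x: sum(wi * perf[x][i] for i, wi in enumerate(w)) for x in alts}
    order = sorted(alts, key=U.get, reverse=True)
    for h, x in enumerate(order, start=1):
        rank[(x, h)] += 1
    for x, y in win:
        if U[x] > U[y]:
            win[(x, y)] += 1

m = len(samples)
p_ab = win[("a", "b")] / m         # pairwise winning index p(a, b)
r1_a = rank[("a", 1)] / m          # rank acceptability index r^1_a
```

Here \(p(a,b) = 1\), since a is preferred to b for every compatible instance, while a and c share the first rank with complementary acceptability indices.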
In SMAA, all compatible instances of the value function are equiprobable. In (Corrente et al.
2016a), it has been proposed to induce the probability distribution on compatible instances of the value function from additional
uncertain preference information of the type:
\(a \succ b\) is more credible than
\(b \succ a\). The procedure is composed of three steps:
- sampling a number (\(s_v\)) of instances of the value function \(U_t \in {\mathcal {U}}_{A^R},\ t=1,\ldots ,s_v,\) compatible with the certain part of preference information provided by the DM,
- inducing a probability distribution \(\left\{ w_t(U_t) \in [0, 1]: \sum _{t=1}^{s_v} w_t(U_t)=1 \right\} \) on the sample of instances generated in the previous step, taking into account the uncertain part of preference information via LP,
- sampling probability distributions compatible with the uncertain preference information, and taking a barycenter distribution as a representative one for the final ranking.
Yet another way of dealing with multiple compatible value functions in the UTA-like approach has been presented by Bous et al. (
2010) as ACUTA method. It is based on the computation of the analytic center of a polyhedron formed by compatible additive value functions for the selection of the representative value function.
2.9 ROR with outranking relation preference model
The value function model is a complete and transitive preference relation with a compensatory logic. In many real-life decision situations, when modeling users’ preferences it is reasonable to consider: (i) incomparability between alternatives, (ii) intransitive indifference and intransitive preference, and (iii) non-compensatory multiple criteria aggregation.
MCDA methods based on a preference model in the form of a valued binary relation called
outranking relation, such as ELECTRE and PROMETHEE, answer these needs (Greco et al.
2016a). Outranking relation
S groups three basic preference relations: indifference (
\(\sim \)), weak preference (
\(\succ ^w\)) and strong preference (
\(\succ ^s\)).
aSb reads: “alternative
a is at least as good as alternative
b”. As this relation is constructed for all ordered pairs of alternatives
\((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\), one can conclude the following relations between
a and
b:
- \(aSb\ \wedge \ bSa \ \ \Leftrightarrow \ \ a \sim b\ \) (indifference),
- \(aSb\ \wedge \ \lnot (bSa) \ \ \Leftrightarrow \ \ a \succ ^w b\ \vee \ a \succ ^s b\ \) (preference in the broad sense),
- \(\lnot (aSb)\ \wedge \ \lnot (bSa) \ \ \Leftrightarrow \ \ a \ ?\ b \ \ \) (incomparability).
S is, in general, neither a complete nor a transitive relation on set
\({\mathcal {A}}\), constructed via concordance and discordance tests (Figueira et al.
2013). Outranking methods assume that evaluation criteria from family
\({\mathcal {G}}\) are
pseudo-criteria equipped with comparison thresholds. A pseudo-criterion is a real-valued function
\(g_i \in {\mathcal {G}}\) associated with two threshold functions,
\(q_i(\cdot )\) and
\(p_i(\cdot )\), whose arguments are performances of the alternatives. By convention, the thresholds are functions of the worst performance of the two alternatives being compared. They are called indifference and preference thresholds, respectively. The thresholds satisfy the following condition: for all ordered pairs of alternatives
\((a, b) \in {\mathcal {A}} \times {\mathcal {A}}\), such that
\(g_i(a) \ge g_i(b)\),
\(g_i(a) + p_i(g_i(b))\) and
\(g_i(a) + q_i(g_i(b))\) are non-decreasing monotone functions of
\(g_i(b)\), such that
\(p_i(g_i(b)) \ge q_i(g_i(b)) \ge 0\) for all
\(a, b \in {\mathcal {A}}\). In this definition, the thresholds are functions of the performance, but they can also be defined as constant values
\(p_i \ge q_i \ge 0, i \in {\mathcal {J}}\). Their meaning is the following: indifference threshold
\(q_i\) is the maximum difference of performances of the compared alternatives allowing them to be considered equivalent with respect to criterion
\(g_i\); preference threshold
\(p_i\) is the minimum difference of performances of the compared alternatives switching the DM’s preference with respect to criterion
\(g_i\) from weak to strict.
The concordance test checks if the coalition of criteria concordant with the hypothesis
aSb is strong enough. This strength is measured by the concordance index:
$$\begin{aligned} C(a,b)= \frac{\sum _{i=1}^n w_i \times C_i(a,b)}{\sum _{i=1}^n w_i}, \end{aligned}$$
where
\(w_i, i \in {\mathcal {J}}\), are weights of criteria which do not have the meaning of substitution rates, like weights
\(k_i, i \in {\mathcal {J}}\), of the weighted sum, because the former weights are not multiplied by performances on the corresponding criteria, but by individual concordance coefficients, valued in [0, 1],
\(C_i(a,b)\) defined as:
$$\begin{aligned} C_i(a,b) = \left\{ \begin{array}{ll} 1 &{} \text {if}\, \; g_i(a) - g_i(b) \ge - q_i,\\ \frac{ g_i(a) - g_i(b) + p_i}{p_i \; - \; q_i} &{} \text {if}\, \; -p_i< g_i(a) - g_i(b) < - q_i,\\ 0 &{} \text {if}\, \; g_i(a) - g_i(b) \le - p_i. \end{array} \right. \end{aligned}$$
Thus, the aggregation is not compensatory and weights
\(w_i, i \in {\mathcal {J}}\), indicate a voting power of
\(g_i \in {\mathcal {G}}\). The concordance test is positive if
\(C(a,b) \ge \lambda \), where
\(\lambda \in [0.5, 1]\) is a cutting level (concordance threshold).
The discordance test checks if, among the criteria discordant with the hypothesis aSb, there is a strong opposition against aSb. Criterion \(g_i \in {\mathcal {G}}\) is strongly opposed to the assertion aSb if \(g_i(b) - g_i(a) \ge v_i\), where \(v_i \ge p_i\) is called the veto threshold of \(g_i\). In this situation, criterion \(g_i\) puts a veto on the assertion aSb.
In conclusion, outranking relation aSb is true if and only if \(C(a,b)\ge \lambda \) and there is no criterion strongly opposed (making veto) to the hypothesis. In this way, for each ordered pair \((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\) one obtains relation S being true (1) or false (0), presented as a directed outranking graph where the nodes correspond to alternatives and there is an arc from node a to node b if aSb. To choose the best alternative of \({\mathcal {A}}\), one can exploit the outranking relation by searching for a kernel of the outranking graph. When the graph has no cycles, its kernel is unique and includes alternatives that do not outrank each other, while each alternative from outside the kernel is outranked by at least one alternative from the kernel. Thus, the kernel is composed of the best incomparable alternatives.
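The whole construction of S (concordance test plus veto test) and its exploitation by kernel search can be sketched in a few lines of Python. The performances, weights, thresholds, and cutting level below are illustrative assumptions, and the kernel is found by brute force over subsets (practical only for small graphs):

```python
from itertools import combinations

# hypothetical data: two gain-type criteria with constant thresholds
perf = {"a": (8.0, 3.0), "b": (5.0, 5.0), "c": (2.0, 9.0)}
w, q, p, v = (0.6, 0.4), (1.0, 1.0), (4.0, 3.0), (6.0, 5.0)
lam = 0.7                             # cutting level (concordance threshold)

def C_i(d, qi, pi):
    """Individual concordance coefficient as a function of d = g_i(a) - g_i(b)."""
    return 1.0 if d >= -qi else 0.0 if d <= -pi else (d + pi) / (pi - qi)

def S(x, y):
    """Outranking test: concordance C(x,y) >= lambda AND no criterion vetoes."""
    gx, gy = perf[x], perf[y]
    C = sum(wi * C_i(gx[i] - gy[i], q[i], p[i]) for i, wi in enumerate(w))
    veto = any(gy[i] - gx[i] >= v[i] for i in range(len(w)))
    return C / sum(w) >= lam and not veto

alts = sorted(perf)

def kernel():
    """Smallest independent and dominating set of the outranking graph."""
    for r in range(1, len(alts) + 1):
        for K in combinations(alts, r):
            indep = not any(S(x, y) for x in K for y in K if x != y)
            domin = all(any(S(x, o) for x in K) for o in alts if o not in K)
            if indep and domin:
                return set(K)
```

For this data only aSb holds (criterion 2 vetoes aSc, and all other concordance tests fail), so the kernel consists of the mutually incomparable alternatives a and c.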
The many parameters to be fixed by the DM in the outranking method can be avoided in the ROR approach. Then, the DM provides preference information in terms of holistic pairwise comparisons of some reference alternatives:
\(a^*S b^*\) or
\(a^*S^c b^*\) for some
\(a^*,b^*\in A^R \subseteq {\mathcal {A}}\), where
\(S^c\) is a negation of the outranking relation. Moreover, according to Greco et al. (
2011), where ROR was adapted to ELECTRE methods, the DM can specify
\([q_{i*}, q_i^*]\) and
\([p_{i*}, p_i^*]\), being the ranges of indifference and preference thresholds, respectively, allowed by the DM.
Assuming
\(\sum _{i=1}^n w_i = 1\), one can substitute
\(C(a,b)= \sum _{i=1}^n w_i C_i(a,b) \ \) by
\( \ C(a,b)= \sum _{i=1}^n \Psi _i(a,b)\), where marginal concordance functions
\(\Psi _i(a,b), i \in {\mathcal {J}},\) are non-decreasing functions of
\(g_i(a)-g_i(b)\):
$$\begin{aligned} \Psi _i(a,b) = \left\{ \begin{array}{ll} \Psi _i(\beta _i, \alpha _i) &{} \text {if}\, \; g_i(a) - g_i(b) \ge - q_i,\\ \frac{ g_i(a) - g_i(b) + p_i}{p_i \; - \; q_i}\, \Psi _i(\beta _i, \alpha _i) &{} \text {if}\, \; -p_i< g_i(a) - g_i(b) < - q_i,\\ 0 &{} \text {if}\, \; g_i(a) - g_i(b) \le - p_i, \end{array} \right. \end{aligned}$$
and
\(\alpha _i,\beta _i\) are, respectively, the worst and the best-observed performances on criterion
\(g_i, i=1,\ldots , n\).
The outranking model compatible with preference information provided by the DM is a set of marginal concordance functions \(\Psi _i(a,b)\), cutting levels \(\lambda \), indifference \(q_i\), preference \(p_i\), and veto thresholds \(v_i, i=1,\ldots , n\), reproducing the DM’s preference information concerning pairs \((a^*,b^*) \in A^R \times A^R\).
In this case, the ordinal regression compatibility constraints
\(E^{A^R}\) are:
$$\begin{aligned} \left. \begin{array}{l} \;\hbox {If } \; a^*S b^*, \hbox { for } (a^*, b^*) \in A^R \times A^R \hbox {:}\\ \;\hbox {(i) }\; C(a^*,b^*) = \sum _{i=1}^n \Psi _i(a^*,b^*) \ge \lambda ,\\ \;\hbox {(ii) }\; g_i(b^*) - g_i(a^*) +\varepsilon \le v_i, \; i=1,\ldots ,n. \\ \;\hbox {If } \; a^*S^c b^*, \hbox { for } (a^*, b^*) \in A^R \times A^R \hbox {:}\\ \;\hbox {(iii) }\; C(a^*,b^*) = \sum _{i=1}^n \Psi _i(a^*,b^*) + \varepsilon \le \lambda +M_0(a^*,b^*), \\ \;\hbox {(iv) }\; g_i(b^*) - g_i(a^*) \ge v_i - \delta M_i(a^*, b^*), \; i=1,\ldots ,n, \\ \;\hbox {(v) }\; M_i(a^*, b^*) \in \{0, 1\}, \; i=0,1,\ldots ,n, \\ \;\hbox {(vi) }\; \sum _{i=0}^n M_i(a^*, b^*) \le n. \\ \;\hbox {(vii) }\; 0.5 \le \lambda \le 1, \\ \;\hbox {(viii) }\; v_i \ge p_i^*+ \varepsilon , \; i=1,\ldots ,n, \\ \;\hbox {(ix) }\; \varepsilon \ge 0. \\ \end{array} \right\} \; E^{A^R}_{BASE} \end{aligned}$$
where
\(\delta \) is a sufficiently large positive constant. Here, constraints (i) and (ii) ensure positive concordance and non-discordance tests, respectively. For
\(a^*S^c b^*\), either the concordance test or the non-discordance test should be negative, which corresponds to constraints (iii) or (iv)–(vi), respectively. Remark that for
\(M_i=0\), the corresponding constraint (iii) or (iv) becomes binding, which means that either concordance
\(C(a^*,b^*)\) is below cutting level
\(\lambda \) or difference
\(g_i(b^*) - g_i(a^*)\) is exceeding veto threshold
\(v_i\).
Given a pair of alternatives
\((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\), alternative
a possibly outranks alternative
b (written as
\(aS^Pb\)) if and only if
\(\varepsilon ^*> 0\), where (Greco et al.
2011):
$$\begin{aligned} \varepsilon ^*= \hbox {max } \varepsilon \end{aligned}$$
s.t.
$$\begin{aligned} \left. \begin{array}{l} \; C(a,b) = \sum _{i=1}^n \Psi _i(a,b) \ge \lambda , \\ \; g_i(b) - g_i(a) +\varepsilon \le v_i, \; i=1,\ldots ,n,\\ \; E^{A^R}_{BASE}. \end{array} \right\} E^P(a,b). \end{aligned}$$
This means that if
\(\varepsilon ^*>0\) and constraints
\(E^P(a,b)\) are feasible (for assertion
aSb, concordance test does not fail and no criterion makes veto), then
a outranks
b for at least one compatible instance of the outranking model.
Moreover, alternative
a necessarily outranks alternative
b (written as
\(aS^Nb\)) if and only if
\(\varepsilon ^*\le 0\), where:
$$\begin{aligned} \varepsilon ^*= \hbox {max } \varepsilon \end{aligned}$$
s.t.
$$\begin{aligned} \left. \begin{array}{l} \; C(a,b) = \sum _{i=1}^n \Psi _i(a,b) + \varepsilon \le \lambda + M_0(a,b), \\ \; g_i(b) - g_i(a) \ge v_i - \delta M_i(a,b), \; i=1,\ldots ,n,\\ \; M_i(a,b) \in \{0, 1\}, \; i=0,\ldots ,n, \; \; \sum _{i=0}^n M_i(a,b) \le n,\\ \; E^{A^R}_{BASE}. \end{array} \right\} E^N(a,b). \end{aligned}$$
This means, in turn, that if
\(\varepsilon ^*\le 0\) or constraints
\(E^N(a,b)\) are infeasible (for assertion
\(aS^cb\) either concordance test fails or one of the criteria makes veto), then
a outranks
b for all compatible instances of the outranking model. In other words,
\(aS^Nb\) is true because
\(aS^cb\) is not true for all compatible instances of the outranking model.
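The two quantifiers behind \(S^P\) and \(S^N\) ("for at least one" vs. "for all compatible instances") can be illustrated by a brute-force check over a finite sample of model instances — a didactic sketch only, since the method above decides them exactly via the programs \(E^P(a,b)\) and \(E^N(a,b)\); the toy threshold instances below are hypothetical:

```python
# Possible vs. necessary outranking, sketched by enumeration over a finite
# sample of compatible outranking-model instances. Each 'instance' is a
# hypothetical predicate instance(a, b) -> True iff a outranks b under it.

def possibly_outranks(a, b, instances):
    # a S^P b: a outranks b for at least one compatible instance
    return any(inst(a, b) for inst in instances)

def necessarily_outranks(a, b, instances):
    # a S^N b: a outranks b for all compatible instances
    return all(inst(a, b) for inst in instances)

# Toy instances on numeric "alternatives": outranking as thresholded comparison
instances = [lambda a, b: a >= b, lambda a, b: a >= b - 1]
```

In the actual method the sample of instances is never materialized; the LPs quantify over the full (infinite) set of compatible models.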
For any pair of alternatives
\((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\):
$$\begin{aligned} aS^Nb \Leftrightarrow \lnot (aS^{cP}b),\; \hbox { as well as, }\; aS^{cP}b \Leftrightarrow \lnot (aS^{N}b), \\ aS^Pb \Leftrightarrow \lnot (aS^{cN}b),\; \hbox { as well as, }\; aS^{cN}b \Leftrightarrow \lnot (aS^{P}b), \end{aligned}$$
thus, there are two sources of information
\((S^N, S^P)\) about four relations
\((S^N, S^{cN}, S^P, S^{cP})\) in set
\({\mathcal {A}}\). They have the following properties:
$$\begin{aligned} aS^Nb\Rightarrow & {} aS^Pb \\ aS^Nb\Rightarrow & {} \lnot (aS^{cN}b) \; \hbox { as well as }\; aS^{cN}b \Rightarrow \lnot (aS^{N}b). \end{aligned}$$
Moreover, for given preference information
\(a^*S b^*\):
$$\begin{aligned} a^*S b^*\Rightarrow a^*S^N b^*\; \hbox { and }\; a^*S b^*\Rightarrow \lnot (b^*S^P a^*). \end{aligned}$$
To work out a recommendation in the choice problem, it is advised to find a kernel of the necessary outranking graph
\(S^N\). In case of the ranking problem, one can exploit the necessary outranking graph including
\(S^N\) and
\(S^{cN}\) relations, using the Net Flow Score procedure. For each alternative
\(a \in {\mathcal {A}}\), the score
\(NFS(a) = strength(a) - weakness(a)\), where
strength(
a) counts all relations (arcs) with other alternatives
z, such that
\(aS^Nz\) or
\(zS^{cN}a\), while
weakness(
a) counts all relations (arcs) with other alternatives
v, such that
\(aS^{cN}v\) or
\(vS^{N}a\). The resulting ranking is a complete preorder determined by
\(NFS(a)\), \(a \in {\mathcal {A}}\).
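The Net Flow Score exploitation described above can be sketched as follows — a minimal illustration assuming the relations \(S^N\) and \(S^{cN}\) are already given as sets of ordered pairs; the toy relations are hypothetical:

```python
# Net Flow Score exploitation of the necessary outranking graph: for each
# alternative a, strength(a) counts pairs with a S^N z or z S^cN a, and
# weakness(a) counts pairs with a S^cN v or v S^N a.

def net_flow_scores(alternatives, SN, ScN):
    """SN and ScN are sets of ordered pairs (a, z) meaning a S^N z / a S^cN z."""
    scores = {}
    for a in alternatives:
        others = [z for z in alternatives if z != a]
        strength = sum(1 for z in others if (a, z) in SN or (z, a) in ScN)
        weakness = sum(1 for z in others if (a, z) in ScN or (z, a) in SN)
        scores[a] = strength - weakness
    return scores

# Toy example (hypothetical relations):
alts = ["a", "b", "c"]
SN = {("a", "b"), ("a", "c")}
ScN = {("c", "b")}
scores = net_flow_scores(alts, SN, ScN)
ranking = sorted(alts, key=scores.get, reverse=True)  # complete preorder
```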
2.11 ROR with rule preference model
As mentioned before, a preference model built of “\(if \ldots , then \ldots \)” decision rules induced from examples of decisions has a double advantage in MCDA: it combines the relatively easy elicitation of indirect preference information with the transparency and explainability of decision rules.
The decision examples come either from observations of DM’s past decisions made in the same decision context or from conscious elicitation by the DM on demand of the analyst. In the latter case, they concern some reference alternatives, as in other ROR approaches. In both cases, decision examples may, however, be
inconsistent with the dominance principle commonly accepted for multiple criteria decision problems. Decisions are inconsistent with the dominance principle if:
-
In case of classification: alternative a has been assigned to a worse decision class than alternative b, although a is at least as good as b on all the considered criteria, i.e., a dominates b.
-
In case of choice and ranking: a pair of alternatives (a, b) has been assigned a degree of intensity of preference smaller than pair (c, d), although differences in evaluations between a and b on all the considered criteria are at least as large as the respective differences of evaluations between c and d, i.e., pair (a, b) dominates pair (c, d).
The inconsistency may come from many sources.
Handling these inconsistencies is important for preference learning. They cannot be simply considered as noise or error to be eliminated from data, or amalgamated with consistent data by some averaging operators. They should be identified and presented as uncertain patterns.
The concept of
rough set introduced by Pawlak (
1982) appeared to be useful for handling inconsistency in data; originally, however, it was limited to inconsistency with respect to the indiscernibility principle. Objects (alternatives) that have the same description are said to be
indiscernible. The indiscernibility relation thus generated in the universe of discourse
U constitutes the mathematical basis of rough set theory. It induces a partition of the universe into blocks of indiscernible objects, called
granules, which can be used to build knowledge about a real or abstract world. Any subset
X of the universe may be expressed in terms of these blocks either precisely (as a union of granules) or approximately. In the latter case, the subset
X may be characterized by two ordinary sets, called
lower approximation and
upper approximation of set
X. A rough set is defined by means of these two approximations, which coincide in the case of an ordinary set. The lower approximation of
X is composed of all the granules included in
X (whose elements, therefore, certainly belong to
X), while the upper approximation of
X consists of all the granules which have a non-empty intersection with
X (whose elements, therefore, may belong to
X). The difference between the upper and lower approximations constitutes the boundary region of the rough set, whose elements cannot be characterized with certainty as belonging or not to
X by using the available information. The information about objects from the boundary region is, therefore, inconsistent or ambiguous. The cardinality of the boundary region states, moreover, the extent to which it is possible to express
X in exact terms, on the basis of the available information. For this reason, this cardinality is used as a measure of the vagueness of information about
X. The lower and upper approximations of a partition of
U into decision classes prepare the ground for inducing, respectively,
certain and
possible knowledge patterns in the form of “
\(if \ldots , then \ldots \)” decision rules.
To handle inconsistencies with respect to the dominance principle, which is typical for preference data, Greco et al. (
2001) generalized the original rough set concept substituting the indiscernibility relation with a dominance relation in the rough approximation of decision classes. The resulting methodology was called
Dominance-based Rough Set Approach (DRSA). An important consequence of DRSA is the possibility of inferring the DM’s preference model in terms of decision rules induced either from lower approximations of decision classes (certain rules), or from upper approximations of decision classes (possible rules), or from the difference between upper and lower approximations (approximate rules supported by inconsistent examples).
In DRSA, the granules used for building approximations of decision classes are dominance cones in the evaluation space. DRSA accepts all scales of criteria because it exploits only the order of evaluations, which is a primitive feature of ordinal, interval, and ratio scales.
Let us first explain the application of DRSA to multiple criteria classification problems.
The finite universe of discourse
U is composed of classification examples concerning reference alternatives from set
\(A^R \subseteq {\mathcal {A}}\), such that, without loss of generality,
\(g_i:U \rightarrow {\mathbb {R}}\) for each
\(i=1,\ldots , n\), and, for all alternatives
\(a^*,b^*\in U\),
\(g_i(a^*) \ge g_i(b^*)\) means that “
\(a^*\) is at least as good as
\(b^*\) with respect to criterion
\(g_i\)”, which is denoted as
\(a^*\succeq _{i} b^*\). Therefore,
\(\succeq _{i}\) is a complete preorder, i.e., a strongly complete and transitive binary relation, defined on
U on the basis of evaluations
\(g_i(\cdot )\). Furthermore, there is a decision attribute
d which makes a partition of
U into a finite number of decision classes, called
classification,
Cl=
\({\{}Cl_{1}, \ldots ,Cl_{p}\}\), such that each
\(a^*\in U\) belongs to one and only one class
Cl\(_{t}\),
\(t=1, \ldots , p\). We suppose that the classes are preference ordered, i.e., for all
\(r,s=1, \ldots , p\), such that
\(r>s\), the alternatives from
Cl\(_{r}\) are preferred to the alternatives from
Cl\(_{s}\). More formally, if
\(\succeq \) is a
comprehensive weak preference relation on
U, i.e., if for all
\(a^*,b^*\in U\),
\(a^*{\succeq }b^*\) reads “
\(a^*\) is at least as good as
\(b^*\)”, then we suppose
$$\begin{aligned}{}[a^*{\in }Cl_{r}, \ b^*{\in } Cl_{s}, \ r{>}s] \Rightarrow a^*{\succ }b^*, \end{aligned}$$
where
\(a^*{\succ }b^*\) means
\(a^*{\succeq }b^*\hbox { and}\ {not}\ b^*{\succeq } a^*\). The above assumptions are typical for consideration of a multiple criteria classification problem (also called
ordinal classification with monotonicity constraints).
In DRSA, the explanation of the assignment of alternatives to preference-ordered decision classes is made on the basis of their evaluation with respect to a subset of criteria
\(P \subseteq {\mathcal {J}}\). This explanation is called
approximation of decision classes with respect to
P. Of course, most commonly,
\(P={\mathcal {J}}\). In order to take into account the order of decision classes, in DRSA the classes are not considered one by one but, instead, unions of classes are approximated:
upward union from class
\(Cl_t\) to class
\(Cl_p\), denoted by
\(Cl^\ge _t\), and
downward union from class
\(Cl_t\) to class
\(Cl_1\), denoted by
\(Cl^\le _t\), i.e.:
$$\begin{aligned} \mathop {Cl}\nolimits _t^ \ge = \bigcup \limits _{s \ge t} {\mathop {Cl}\nolimits _s }, \quad \mathop {Cl}\nolimits _t^ \le = \bigcup \limits _{s \le t} {\mathop {Cl}\nolimits _s },\quad t=1,\ldots ,p. \end{aligned}$$
The statement
\(a^*\in Cl_t^\ge \) reads “
\(a^*\) belongs to at least class
Cl\(_{t}\)”, while
\(a^*\in Cl_t^ \le \) reads “
\(a^*\) belongs to at most class
Cl\(_{t}\)”. Let us remark that
\(\mathop {Cl}\nolimits _1^ \ge =\mathop {Cl}\nolimits _p^ \le =U\),
\(\mathop {Cl}\nolimits _p^ \ge \)=
Cl\(_{p}\) and
\(\mathop {Cl}\nolimits _1^ \le \)=
Cl\(_{1}\). Furthermore, for
t=2,...,
p, we have:
$$\begin{aligned} \mathop {Cl}\nolimits _{t - 1}^ \le = U \setminus \mathop {Cl}\nolimits _t^ \ge \ \hbox { and } \ \mathop {Cl}\nolimits _t^ \ge =U \setminus \mathop {Cl}\nolimits _{t - 1}^ \le . \end{aligned}$$
The key idea of the rough set approach is an explanation (approximation) of knowledge generated by the decision attribute, using
granules of knowledge generated by condition attributes. In DRSA, where condition attributes are criteria and decision classes are preference ordered, the knowledge to be explained is the assignment of alternatives to
upward and
downward unions of classes, and the granules of knowledge are sets of alternatives contained in
dominance cones defined in the space of evaluation criteria.
Alternative \(a^*\) dominates alternative \(b^*\) with respect to \(P \subseteq {\mathcal {J}}\) (shortly, \(a^*\) P-dominates \(b^*\)), denoted by \(a^*D_{P} b^*\), if for every criterion \( i \in P\), \(g_i(a^*) \ge g_i(b^*)\). The relation of P-dominance is reflexive and transitive, i.e., it is a partial preorder.
Given a set of criteria
\( P \subseteq {\mathcal {J}}\) and
\(a^*\in U\), the granules of knowledge used for approximation in DRSA are:
-
a set of alternatives dominating \(a^*\), called P-dominating set, \(D_P^ + \left( a^*\right) \)={\(b^*\in U\): \(b^*D_{P} a^*\)},
-
a set of alternatives dominated by \(a^*\), called P-dominated set, \(D_P^ - \left( a^*\right) \)={\(b^*\in U\): \(a^*D_{P} b^*\)}.
In terms of multiple criteria evaluations, the above definitions correspond to:
$$\begin{aligned} D_P^ + \left( a^*\right) =\{b^*\in U: g_i(b^*) \ge g_i(a^*), \hbox {for all } i \in P \}, \\ D_P^ - \left( a^*\right) =\{b^*\in U: g_i(b^*) \le g_i(a^*), \hbox {for all } i \in P \}. \end{aligned}$$
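The two dominance cones can be computed directly from the definitions — a minimal sketch assuming gain-type criteria and a dict-based data layout (all names illustrative):

```python
# P-dominating and P-dominated sets: the granules of knowledge used for
# approximation in DRSA.

def dominates(a, b, P):
    """True iff a P-dominates b, i.e., g_i(a) >= g_i(b) for every i in P."""
    return all(a[i] >= b[i] for i in P)

def D_plus(a, U, P):
    """P-dominating set of a: all b in U with b D_P a."""
    return [b for b in U if dominates(b, a, P)]

def D_minus(a, U, P):
    """P-dominated set of a: all b in U with a D_P b."""
    return [b for b in U if dominates(a, b, P)]

U = [
    {"name": "a", "g1": 8, "g2": 6},
    {"name": "b", "g1": 7, "g2": 7},
    {"name": "c", "g1": 6, "g2": 5},
]
P = ["g1", "g2"]
```

Note that P-dominance is reflexive, so every alternative belongs to both of its own cones.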
Given
\(P \subseteq {\mathcal {J}}\), the inclusion of an alternative
\(a^*\in U\) to the upward union of classes
\(\mathop {Cl}_t^ \ge \),
\(t=2,\ldots ,p\), is
inconsistent with the dominance principle if one of the following conditions holds:
-
\(a^*\) belongs to class \(Cl_{t}\) or better but it is P-dominated by an alternative \(b^*\) belonging to a class worse than \(Cl_{t}\), i.e., \(a^*\in \mathop {Cl}\nolimits _t^ \ge \) but \(\mathop D_P^+ (a^*) \cap Cl_{t - 1}^ \le \ne \emptyset \),
-
\(a^*\) belongs to a worse class than \(Cl_{t}\) but it P-dominates an alternative \(b^*\) belonging to class \(Cl_{t}\) or better, i.e., \(a^*\notin \mathop {Cl}_t^ \ge \) but \(\mathop D_P^- (a^*) \cap \mathop {Cl}_t^ \ge \ne \emptyset \).
The above considerations bring us closer to the definition of rough approximations of the upward and downward unions of decision classes.
The
P-
lower approximation of
\(\mathop {Cl}_t^ \ge \), denoted by
\({\underline{P}}(\mathop {Cl}_t^ \ge ) \), and the
P-
upper approximation of
\(\mathop {Cl}_t^ \ge \), denoted by
\({\overline{P}}\hbox {(}\mathop {Cl}_t^ \ge \hbox {)}\), are defined as follows (
\(t=2,\ldots ,p\)):
$$\begin{aligned} {\underline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \ge \hbox {)}= & {} {\{}a^*\in U:\mathop D\nolimits _P^ + (a^*) \subseteq \mathop {Cl}\nolimits _t^ \ge {\}}, \\ {\overline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \ge \hbox {)}= & {} {\{}a^*\in U:\mathop D\nolimits _P^ - (a^*) \cap \mathop {Cl}\nolimits _t^ \ge \ne \emptyset {\}}. \end{aligned}$$
The definition of
P-
lower approximation and the
P-
upper approximation of
\(\mathop {Cl}_t^ \le \), is analogous (
\(t=1,\ldots ,p-1\)):
$$\begin{aligned} {\underline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \le \hbox {)}= & {} {\{}a^*\in U:\mathop D\nolimits _P^ - (a^*) \subseteq \mathop {Cl}\nolimits _t^ \le {\}}, \\ {\overline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \le \hbox {)}= & {} {\{}a^*\in U:\mathop D\nolimits _P^ + (a^*) \cap \mathop {Cl}\nolimits _t^ \le \ne \emptyset {\}}. \end{aligned}$$
The
P-
boundaries of
\(\mathop {Cl}_t^ \ge \) and
\(\mathop {Cl}_t^ \le \), denoted by
\(Bn_{P}(\mathop {Cl}_t^ \ge )\) (
\(t=2,\ldots ,p\)) and
\(Bn_{P}(\mathop {Cl}_t^ \le )\) (
\(t=1,\ldots ,p-1\)), respectively, are defined as follows:
\(Bn_{P}(\mathop {Cl}\nolimits _t^ \ge )={\overline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \ge \hbox {)} \setminus {\underline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \ge \hbox {)}\), \(Bn_{P}(\mathop {Cl}\nolimits _t^ \le )={\overline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \le \hbox {)} \setminus {\underline{P}}\hbox {(}\mathop {Cl}\nolimits _t^ \le \hbox {)}\).
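The approximations and boundaries above translate almost literally into code — a sketch for the upward unions, assuming each alternative carries its class index in a field `cls` (all names illustrative):

```python
# P-lower/P-upper approximations and the P-boundary of an upward union
# Cl_t^>=, computed straight from the definitions.

def dominates(a, b, P):
    return all(a[i] >= b[i] for i in P)

def approx_upward(U, P, t):
    """(P-lower, P-upper, P-boundary) of the union 'class at least t'."""
    # a is in the lower approximation iff D_P^+(a) is included in Cl_t^>=
    lower = [a for a in U
             if all(b["cls"] >= t for b in U if dominates(b, a, P))]
    # a is in the upper approximation iff D_P^-(a) intersects Cl_t^>=
    upper = [a for a in U
             if any(b["cls"] >= t for b in U if dominates(a, b, P))]
    boundary = [a for a in upper if a not in lower]
    return lower, upper, boundary

# One inconsistent example: b dominates a yet sits in a worse class
U = [
    {"name": "a", "g1": 8, "cls": 2},
    {"name": "b", "g1": 9, "cls": 1},
    {"name": "c", "g1": 5, "cls": 1},
]
lower, upper, boundary = approx_upward(U, ["g1"], 2)
```

On this toy data the inconsistency between a and b empties the lower approximation of \(Cl_2^\ge \) and puts both alternatives in the boundary.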
For every
\(P \subseteq {\mathcal {J}}\), the alternatives that are consistent in the sense of the dominance principle with all upward and downward unions of classes are
P-correctly classified. For every
\(P \subseteq {\mathcal {J}}\), the
quality of classification Cl by set of criteria
P is defined as the ratio of the number of alternatives
P-consistent with the dominance principle and the number of all alternatives in
U. Since the
P-consistent alternatives are those which do not belong to any
P-boundary
\(Bn_{P}(\mathop {Cl}_t^ \ge )\) or
\(Bn_{P}(\mathop {Cl}_t^ \le )\), and
\(Bn_{P}(\mathop {Cl}_t^ \ge ) = Bn_{P}(\mathop {Cl}_{t-1}^ \le )\) for
\(t = 2, \ldots , p\), the quality of classification
Cl by a set of criteria
\(P \subseteq {\mathcal {J}}\) can be written as
$$\begin{aligned} \gamma _{P}({{{\textbf {Cl}}}}) = \frac{\left| U \setminus \left( \bigcup \limits _{t \in \{2, \ldots ,p\} }Bn_{P}(Cl_{t}^{\ge })\right) \right| }{|U|} =\frac{\left| U \setminus \left( \bigcup \limits _{t \in \{1, \ldots ,p-1\} }Bn_{P}(Cl_{t}^{\le })\right) \right| }{|U|}. \end{aligned}$$
\(\gamma _{P} ( {{{\textbf {Cl}}}} )\) can be seen as a degree of consistency of the classification examples, where
P is the set of criteria and
Cl is the considered classification.
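Since the P-consistent alternatives are exactly those outside every P-boundary, \(\gamma _{P}({{{\textbf {Cl}}}})\) can be sketched by checking each alternative's consistency with the dominance principle directly (a minimal illustration; the field names are assumptions):

```python
# Quality of classification gamma_P(Cl): share of alternatives consistent
# with the dominance principle w.r.t. criteria P; 'cls' is the class index.

def dominates(a, b, P):
    return all(a[i] >= b[i] for i in P)

def gamma(U, P):
    def consistent(a):
        # every alternative dominating a is in a's class or better, and
        # every alternative a dominates is in a's class or worse
        return (all(b["cls"] >= a["cls"] for b in U if dominates(b, a, P))
                and all(a["cls"] >= b["cls"] for b in U if dominates(a, b, P)))
    return sum(consistent(a) for a in U) / len(U)

U = [
    {"g1": 8, "cls": 2},
    {"g1": 9, "cls": 1},   # dominates the first alternative, worse class
    {"g1": 5, "cls": 1},
]
```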
Each minimal (in the sense of inclusion) subset
\(P \subseteq {\mathcal {J}}\), such that
\(\gamma _{P} ({{{\textbf {Cl}}}} ) = \gamma _{{\mathcal {J}}} ({{{\textbf {Cl}}}} )\), is called a
reduct of classification
Cl, and is denoted by
\(RED_{{{\textbf {Cl}}}} \). Let us remark that for a given set of classification examples, one can have more than one reduct. The intersection of all reducts is called the
core and is denoted by
\(CORE_{{{\textbf {Cl}}}} \). Criteria in
\(CORE_{{{\textbf {Cl}}}} \) cannot be removed from consideration without deteriorating the quality of classification
Cl. This means that, in set
\({\mathcal {J}}\), there are three categories of criteria:
-
indispensable criteria included in the core,
-
exchangeable criteria included in some reducts, but not in the core,
-
redundant criteria, neither indispensable nor exchangeable, and thus not included in any reduct.
The dominance-based rough approximations of upward and downward unions of decision classes can serve to induce a model of classification decisions (preference model) in terms of “
\(if \ldots , then \ldots \)” decision rules. Given the preference information in terms of classification examples, it is meaningful to consider the following five types of decision rules intended for classifying an alternative
\(x \in {\mathcal {A}}\):
1)
certain D\(_{\ge } \)-decision rules, providing lower profiles of alternatives belonging to \({\underline{P}}({\mathop {Cl}_{t}^{ \ge }} )\): if \(g_{i_1}(x)\ge r_{i_1}\) and \( \ldots \) and \(g_{i_h}(x)\ge r_{i_h}\), then \(x \in Cl_{t}^{\ge }, t=2, \ldots , p,\ \) \(\{i_1, \ldots , i_h\} \subseteq P,\) \(r_{i_1}, \ldots , r_{i_h} \in {\mathbb {R}}\);
2)
possible D\(_{\ge } \)-decision rules, providing lower profiles of alternatives belonging to \({\overline{P}} ({\mathop {Cl}_{t}^{ \ge }} )\): if \(g_{i_1}(x)\ge r_{i_1}\) and \( \ldots \) and \(g_{i_h}(x)\ge r_{i_h}\), then x possibly belongs to \(Cl_{t}^{\ge }, t=2, \ldots , p,\,\{i_1, \ldots , i_h\} \subseteq P,\) \(r_{i_1}, \ldots , r_{i_h} \in {\mathbb {R}}\);
3)
certain D\(_{\le } \)-decision rules, providing upper profiles of alternatives belonging to \({\underline{P}} ({\mathop {Cl}_{t}^{ \le }} )\): if \(g_{i_1}(x)\le r_{i_1}\) and \( \ldots \) and \(g_{i_h}(x)\le r_{i_h}\), then \(x \in Cl_{t}^{\le }, t=1, \ldots , p-1,\, \(\{i_1, \ldots , i_h\} \subseteq P,\) \(r_{i_1}, \ldots , r_{i_h} \in {\mathbb {R}}\);
4)
possible D\(_{\le } \)-decision rules, providing upper profiles of alternatives belonging to \({\overline{P}} ({\mathop {Cl}_{t}^{ \le }} )\): if \(g_{i_1}(x)\le r_{i_1}\) and \( \ldots \) and \(g_{i_h}(x)\le r_{i_h}\), then x possibly belongs to \(Cl_{t}^{\le }, t=1, \ldots , p-1,\, \(\{i_1, \ldots , i_h\} \subseteq P,\) \(r_{i_1}, \ldots , r_{i_h} \in {\mathbb {R}}\);
5)
approximate D\(_{\ge \le } \)-decision rules, providing simultaneously lower and upper profiles of alternatives belonging to \(Cl_{s} \cup Cl_{s+1} \cup \ldots \cup Cl_{t}\), without the possibility of discerning to which class: if \(g_{i_1}(x)\ge r_{i_1}\) and \( \ldots \) and \(g_{i_k}(x)\ge r_{i_k}\) and \(g_{i_{k+1}}(x)\le r_{i_{k+1}}\) and \( \ldots \) and \(g_{i_h}(x)\le r_{i_h}\), then \(x \in Cl_{s} \cup Cl_{s+1} \cup \ldots \cup Cl_{t}, \{i_1, \ldots , i_h\}\subseteq P, \ \) \(s,t \in \{1, \ldots , p\}, \ \) \(s < t, \ \) \(r_{i_1}, \ldots , r_{i_h} \in {\mathbb {R}}\).
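As an illustration of how such rules can be encoded and matched, here is a minimal sketch for rules of type 1); the dict-based representation and all names are assumptions, and rule induction itself is not shown:

```python
# Encoding and matching a certain D>=-decision rule: a conjunction of
# conditions g_i(x) >= r_i concluding that x belongs to at least class t.

def matches_ge(rule, x):
    """Does x satisfy every condition g_i(x) >= r_i of the rule?"""
    return all(x[i] >= r for i, r in rule["conds"].items())

# "if g1(x) >= 7 and g2(x) >= 6, then x belongs to Cl_2^>="
rule = {"conds": {"g1": 7, "g2": 6}, "class_at_least": 2}
```

A matched rule lifts the lower bound on the class of x; an unmatched rule says nothing about x.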
The rules of type 1) and 3) represent certain preference patterns extracted from decision examples, while the rules of types 2) and 4) represent possible preference patterns. Rules of type 5) represent doubtful preference patterns. Depending on the classification examples supporting the induced rules, they are characterized by different values of adopted interestingness measures. In (Greco et al.
2016c), some rule interestingness measures with desirable properties are characterized in four perspectives of Bayesian and likelihoodist confirmation. The interestingness measures are used to classify new alternatives by decision rules in situations where these alternatives are matched by (i) no rule, (ii) exactly one rule, (iii) several rules (even contradictory). Such a classification scheme has been proposed by Błaszczyński et al. (
2007).
The above definitions of rough approximations are based on a strict application of the dominance principle. In practice, however, it is beneficial to accept a limited proportion of inconsistent examples in lower approximations, particularly for large data sets. Two parametric versions of DRSA have been proposed, relaxing in different ways the definition of rough approximations: Variable Precision DRSA (Inuiguchi et al.
2009) and Variable Consistency DRSA (Błaszczyński et al.
2009). Statistical interpretation of these two parametric DRSA from the viewpoint of empirical risk minimization typical for machine learning has been given by Kusunoki et al. (
2021). It is also worth mentioning the stochastic DRSA model presented in (Kotłowski and Słowiński
2008; Kotłowski et al.
2008).
Algorithms for induction of decision rules from rough approximations of upward and downward unions of decision classes follow various strategies. The most popular induction strategy, called minimal-cover, offers a minimal set of rules representing the decision examples in the most concise way (i.e., without any redundancy) (Błaszczyński et al.
2012). It is usually performed by greedy heuristics of sequential covering type, giving an approximately minimal-cover set of rules. Other strategies aim at discovering a set of decision rules satisfying some pre-defined user’s requirements, e.g., with respect to minimal support, maximal length, or maximal number of rules. Yet other strategies propose to induce an exhaustive set of rules, i.e., all rules compatible with the provided preference information, which form the most comprehensive knowledge base with respect to the analyzed data set. The recent trend consists of integrating several base classifiers into ensembles or committees of classifiers (Kotłowski and Słowiński
2009). Several methods have been proposed to get diverse base classifiers to be integrated within an ensemble of classifiers. The best known are bagging (Błaszczyński et al.
2010) and boosting (Dembczyński et al.
2010a), which modify the set of objects by sampling or weighting particular examples and use the same learning algorithm to create base classifiers.
ROR has been applied to the rule preference model by Kadziński et al. (
2015). It consists of taking into account all minimal-cover (MC) sets of decision rules compatible with the provided decision examples when computing recommendations for non-reference alternatives. An MC set of rules is a single base instance of the preference model. In order to generate all MC sets of rules, first an exhaustive set of rules is induced and then all compatible MC sets of rules are obtained by solving a series of integer LP problems which guarantee that each reference alternative is covered by at least one rule from each MC set. Such an ensemble of MC classifiers is applied to the set of all considered alternatives, producing two types of classification results. Alternative
a is necessarily assigned to class
\(Cl_h\) if and only if
a is assigned to
\(Cl_h\) for all MC sets of rules, and
a is possibly assigned to
\(Cl_h\), if and only if
a is assigned to
\(Cl_h\) for at least one MC set of rules. Since all compatible MC sets of rules are considered, it is also possible to compute class acceptability indices defined as the share of MC sets of rules assigning an alternative to a single class or a set of contiguous classes. The robustness analysis is continued in a twofold way. First, building on the class acceptability indices, one selects a representative MC set of rules that highlights the most stable part of the robust classification. Then, for each alternative, one can identify rules that are decisive in terms of recommendation in the greatest number of compatible MC sets. These rules directly influence the assignment suggested by different instances, being of particular interest to the DM willing to understand the obtained recommendation.
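Once the vote of each compatible MC set is available, the necessary/possible assignments and the class acceptability indices follow by simple counting — a simplified sketch (generating the compatible MC sets via the integer LPs is not shown, and all names are assumptions):

```python
# Robust classification from an ensemble of compatible MC rule sets:
# votes[a] lists the class assigned to alternative a by each MC set.

def robust_assignment(votes):
    necessary, possible, cai = {}, {}, {}
    for a, classes in votes.items():
        possible[a] = sorted(set(classes))          # at least one MC set
        necessary[a] = classes[0] if len(set(classes)) == 1 else None  # all
        # class acceptability index: share of MC sets voting for each class
        cai[a] = {c: classes.count(c) / len(classes) for c in set(classes)}
    return necessary, possible, cai

votes = {"a": [2, 2, 2], "b": [1, 2, 2, 2]}
necessary, possible, cai = robust_assignment(votes)
```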
Let us now move to the application of
DRSA to multiple criteria choice and ranking problems. While the classification of alternatives is based on their absolute evaluation on the considered criteria, choice and ranking refer to relative evaluation by means of pairwise comparisons of alternatives. Indeed, the position of an alternative in the ranking depends on the quality of other competing alternatives. This required an adaptation of DRSA to pairwise comparisons of alternatives instead of absolute multiple criteria evaluations considered in classification problems. Thus, in the case of choice and ranking problems, the rough approximation concerns a comprehensive preference relation in the set of reference alternatives, and the approximation is done using marginal preference relations on particular criteria for pairs of reference alternatives (Greco et al.
2001).
Technically, the modifications of DRSA necessary to approach the problems of choice and ranking are twofold:
1)
A pairwise comparison table (PCT) is considered instead of the simple classification table: PCT is a classification table whose rows represent pairs of reference alternatives for which multiple criteria evaluations and a comprehensive preference relation are known.
2)
A dominance principle is considered for comparisons of pairs of alternatives instead of simple alternatives: if alternative a is preferred to b at least as strongly as c is preferred to d on all the considered criteria, then a must be comprehensively preferred to b at least as strongly as c is comprehensively preferred to d.
The application of DRSA to the choice or ranking problems proceeds as follows. First, the DM gives examples of pairwise comparisons of reference alternatives from set
\(A^R \subseteq {\mathcal {A}}\). This can be a partial or complete ranking from the best to the worst of these alternatives. From this set of decision examples, a PCT is constructed with an ordered pair of reference alternatives in each row, pairs of evaluations of these alternatives on particular criteria in the columns corresponding to criteria, and comprehensive preference relation in the last column corresponding to the decision. The pairs of evaluations on particular criteria for a pair of reference alternatives
\((a^*, b^*) \in A^R \times A^R\) can be converted to graded differences of evaluations (intensities of preference) in case of criteria with interval or ratio scale. Remark that such a PCT with comprehensive preference relations in the decision is analogous to a classification table where instead of single alternatives there are pairs of alternatives and instead of decision classes there are comprehensive preference relations, e.g., outranking relation
\(a^*S b^*\) and non-outranking relation
\(a^*S^c b^*\) in the simplest case. Thus, all the DRSA methodology developed for the multiple criteria classification can be applied to PCT and yields lower and upper approximations of the comprehensive preference relations, e.g.,
S and
\(S^c\), composed of pairs of reference objects. These approximations prepare the ground for induction of certain, possible, and approximate “
if ...,
then...” decision rules. VC-DRSA methodology applies perfectly to this scheme (Szela̧g et al.
2014a). For example, in the case of comprehensive preference relations
S and
\(S^c\), decision rules induced from VC-DRSA lower approximations have the following form:
-
if \(g_{i_1}(x) - g_{i_1}(y) \ge \delta _{i_1}\) \(and \ldots and\) \(g_{i_h}(x) - g_{i_h}(y) \ge \delta _{i_h}\) and \((g_{i_{h+1}}(x) \ge r_{i_{h+1}}\) and \(g_{i_{h+1}}(y) \le s_{i_{h+1}})\) \(and \ldots and\) \((g_{i_{z}}(x) \ge r_{i_{z}}\) and \(g_{i_{z}}(y) \le s_{i_{z}}),\) \(\ then\) \( \ xSy\), where \(\{i_1, \ldots , i_h\}\subseteq P\) indicate criteria with an interval or ratio scale, \(\{i_{h+1}, \ldots , i_z\}\subseteq {P}\) indicate criteria with an ordinal scale, and \(\ \delta _{i_1}, \ldots , \delta _{i_h} \in {\mathbb {R}}\), as well as \(\ r_{i_{h+1}}, \ldots , r_{i_z}, s_{i_{h+1}}, \ldots , s_{i_z} \in {\mathbb {R}}\);
-
if \(g_{i_1}(x) - g_{i_1}(y) \le \gamma _{i_1}\) \(and \ldots and\) \(g_{i_k}(x) - g_{i_k}(y) \le \gamma _{i_k}\) and \((g_{i_{k+1}}(x) \le r_{i_{k+1}}\) and \(g_{i_{k+1}}(y) \ge s_{i_{k+1}})\) \(and \ldots and\) \((g_{i_{v}}(x) \le r_{i_{v}}\) and \(g_{i_{v}}(y) \ge s_{i_{v}}),\) \(\ then\) \( \ xS^c y\), where \(\{i_1, \ldots , i_k\}\subseteq {P}\) indicate criteria with an interval or ratio scale, \(\{i_{k+1}, \ldots , i_v\}\subseteq {P}\) indicate criteria with an ordinal scale, and \(\ \gamma _{i_1}, \ldots , \gamma _{i_k} \in {\mathbb {R}}\), as well as \(\ r_{i_{k+1}}, \ldots , r_{i_v}, s_{i_{k+1}}, \ldots , s_{i_v} \in {\mathbb {R}}\).
These rules constitute a preference model of the DM. They are applied to the set of alternatives
\({\mathcal {A}}\) with the intention of ranking them. Any pair of alternatives
\((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\) can match these rules in one of four ways:
-
aSb and \(\lnot (aS^c b),\) which is true outranking \(aS^T b\),
-
\(aS^c b\) and \(\lnot (aSb),\) which is false outranking \(aS^F b\),
-
aSb and \(aS^c b,\) which is contradictory outranking \(aS^K b\),
-
\(\lnot (aSb)\) and \(\lnot (aS^c b),\) which is unknown outranking \(aS^U b\).
This 4-valued outranking relation underlines the presence or the absence of positive or negative arguments for outranking. For any
\((a,b) \in {\mathcal {A}} \times {\mathcal {A}}\) it can be faithfully represented by 3-valued fuzzy relation
\(R_{3v}\):
$$\begin{aligned} R_{3v}(a,b)=\frac{[aSb] + (1-[aS^c b])}{2}, \end{aligned}$$
where
\([\ \cdot \ ]\) denotes the indicator function taking value 0 or 1. In order to obtain a final recommendation, relation
\(R_{3v}\) is exploited using a ranking method. In (Szela̧g et al.
2014a), several ranking methods were compared with respect to a set of desirable properties. It appeared that the best ranking method is the Net Flow Rule which builds a ranking of alternatives using a scoring function
NFS(
a) for all
\(a\in {\mathcal {A}}\):
$$\begin{aligned} NFS(a)= & {} \sum _{b\in {\mathcal {A}}\setminus \{a\}}\left( R_{3v}(a,b) - R_{3v}(b,a) \right) \\ {}= & {} \sum _{b\in {\mathcal {A}}\setminus \{a\}}\left( [aSb] - [bSa] - [aS^c b] +[bS^c a] \right) , \end{aligned}$$
where
\([\ \cdot \ ]\) denotes again a (0–1) indicator function. The recommendation for ranking is a complete preorder on set
\({\mathcal {A}}\) determined by
\(NFS( \cdot )\), and the best choice recommendation is alternative
\({\hat{a}} \in {\mathcal {A}}\), such that
\(NFS({\hat{a}})= \max \limits _{a\in {\mathcal {A}}} \{ NFS(a) \}\).
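The passage from the rule-based relations \(S\) and \(S^c\) to \(R_{3v}\) and the Net Flow Rule scores can be sketched as follows (relations given as sets of ordered pairs; the toy data are hypothetical):

```python
# 3-valued relation R_3v(a,b) = ([aSb] + (1 - [aS^c b])) / 2 and the
# Net Flow Rule scores derived from it; S and Sc are the sets of ordered
# pairs obtained by matching the induced rules on A x A.

def R3v(a, b, S, Sc):
    return (((a, b) in S) + (1 - ((a, b) in Sc))) / 2

def nfs(A, S, Sc):
    return {a: sum(R3v(a, b, S, Sc) - R3v(b, a, S, Sc)
                   for b in A if b != a)
            for a in A}

A = ["a", "b"]
S, Sc = {("a", "b")}, set()   # only positive argument: a S b
scores = nfs(A, S, Sc)
best = max(A, key=scores.get)  # best choice recommendation
```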
In (Dembczyński et al.
2010b), the DRSA for choice and ranking has been compared with statistical learning of a utility function for single objects, based on the boosting technique. Similar to the Net Flow Rule, the utility function also gives a linear ranking of objects. In conclusion of a computational experiment, the authors state that both analyzed approaches to preference learning share good properties of the decision rule preference model and have good performance in massive-data learning problems.
DRSA has been adapted to a large variety of multi-attribute decision problems (Słowiński et al.
2015). Let us list a few of them:
-
decision under uncertainty and time preference (Greco et al.
2010b),
-
robustness analysis for decision under uncertainty (Kadziński et al.
2016),
-
classification with missing data (Szela̧g et al.
2017),
-
classification with a hierarchical structure of evaluation criteria (Dembczyński et al.
2002),
-
classification with imprecise evaluations and assignments (Dembczyński et al.
2009),
-
interactive evolutionary multiobjective optimization (Corrente et al.
2024).
The DRSA methodology gained popularity as a useful tool for structuring preference data prior to the induction of the preference model in terms of decision rules. It is based on elementary concepts and mathematical tools (sets and set operations, binary relations), without recourse to any complex algebraic or analytical structures (Greco et al.
2010a); the decision rules have very natural interpretation and the key concept of dominance is rational and objective.