FIDE
FIDE
8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Abstract
The present paper aims to demonstrate the interest of fuzzy inference
systems in system modelling when human interaction is important. It dis-
cusses the originality of FIS and their capability to integrate expertise and
rule learning from data into a single framework, analyzing their place rela-
tively to concurrent approaches. An open source software implementation is
presented, with a focus on the useful features for modelling. Two real world
case studies are presented to illustrate the approach and the software utility.
Keywords: cooperation, rule learning, fuzzy logic, knowledge
formalization, real word applications, open source software
1. Introduction
Fuzzy logic, since the pioneer work by [59], has proved to be a powerful
interface between symbolic and numerical spaces, especially by the means of
the linguistic variable concept [60].
Forty years later, fuzzy inference systems (FIS) have become one of the
most famous applications of fuzzy logic. One of the reasons for this success is
the ability of fuzzy systems to incorporate human expert knowledge with its
nuances, as well as to express the behaviour of the system in an interpretable
way for humans.
Historically, the first kind of fuzzy rule based systems focused on the
ability of fuzzy logic to model natural language [42].
A second approach, proposed in the mid eighties [54], was based on au-
tomatic learning from data. As discussed by [34], fuzzy methods in machine
learning and data mining have been thoroughly investigated and proved use-
ful.
With these new developments, the age of expert knowledge-based fuzzy
systems was over, and the data driven rule generation methods played the
main role in fuzzy system design, moving away from the philosophy of fuzzy
set theory initially directed to bridge the gap between human reasoning and
machine processing, which can be summarized as Computing with words.
This drift has been clearly stated by [19]: ”Fuzzy controllers, and fuzzy
rule-based modelling which have become the most popular and visible side
of applied fuzzy set theory, are only the emerged part of the fuzzy iceberg,
and as time passes this technology seems to owe less and less to fuzzy set
theory itself, and mainly becomes a tool for approximating functions.”
As a reaction, fuzzy logic researchers developed a new field of research. As
shown in [29], the use of fuzzy formalism is not sufficient to ensure the inter-
pretability of a knowledge base. Three conditions have to be fulfilled. First,
semantic integrity should be respected within the partition. Secondly, the
number of rules should be small. The third condition is specific to complex
systems with a large number of input variables: rules must not systemat-
ically include all input variables, but only the important ones in the rule
context. This kind of rules is usually referred to as incomplete rules. As
the interpretability constraints may conflict with the numerical error mini-
mization objective of automatic learning methods, several works have been
carried out to propose a trade off between interpretability and accuracy [13].
Though these efforts allow to propose FIS as an integrated framework for
system modelling, to get the most out of expert knowledge and data, not
enough attention as deserved has been paid to the potential interest of this
approach. FIS are still mostly used as predictive models, their performance
being evaluated in terms of accuracy, and the behavior of their inference
engine is rarely discussed.
Operational, in a process/system context, denotes a working method or
a philosophy that focuses principally on cause and effect relationships (or
stimulus/response, behavior, etc.) of specific interest to a particular domain
at a particular point in time. In this sense, a rule system is a good way
to model a specific kind of knowledge, the so-called operational knowledge.
Fuzzy concepts, whose content, value, or boundaries of application can vary
according to context, operator and conditions, instead of being fixed once
2
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
and for all, arise naturally in the operational approach of system modelling,
hence the relevance of FIS for that matter.
The work presented here aims to be a contribution to this research trend,
both asserting the need for fuzzy logic in system modelling, and proposing
an advanced software for exploratory design and analysis of FIS, allowing to
understand the system operation and its sensitivity.
Its objectives are i)to discuss the interest of FIS as modelling tools with
imposed interpretability constraints in methods and software implementa-
tion, ii)to compare, using the case studies, the FIS with other models (e.g.
statistical multiple regression) and to analyze two inference mechanisms: im-
plicative rules and conjunctive rules. An open source software implementa-
tion of FIS design and optimization is proposed in FisPro 1 , which corresponds
to ten years of research and software development in the field of learning in-
terpretable FIS from data.
Detailed illustrations will be provided through two case studies of real
world problems in Agriculture and Environmental modelling, a field where
expert knowledge and data often complement each other.
The structure of the paper is as follows: Section 2 recalls FIS principles
and analyses the specific role of fuzzy logic in system modelling. Section 3
explains the ways of cooperation between knowledge and data in a modelling
approach based on FIS. A state of the art of fuzzy software and FisPro main
features are presented in Section 4. The case studies are presented in Section
5. The first one deals with pesticide loss modelling in agricultural spraying,
the second one illustrates an inference mechanism which, though little used,
allows to model logical constraints, and in this way, is closer to classical logic
than the inference mechanism used for fuzzy controllers. Finally a conclusion
is given with some perspectives in Section 6.
1
http://www.inra.fr/internet/Departements/MIA/M/fispro/
3
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
In this section we first recall the general FIS structure, then we examine
some concurrent approaches before explaining the originality of FIS.
4
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
where m is the number of fuzzy sets in the partition and µf (x) is the mem-
bership degree of x to the f th fuzzy set. Equation 1 means that any point
belongs at most to two fuzzy sets when the fuzzy sets are convex.
1 1 2 3 4 5
µ2 (x)
µ1 (x)
0
x
5
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
6
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
defined partitions, are the most popular ones2 . The statistical models are
often difficult to interpret: for instance it is problematic to give a meaning to
some regression coefficients or to transformed variables. [61] advocates that
most of the theories can be generalized using the fuzzy formalism. This also
applies to statistics. A large effort within the Fuzzy Logic research commu-
nity has been dedicated to this topic with more or less convincing results.
The fuzzy generalization of the well known k-means clustering algorithm,
called fuzzy c-means [9] is a valuable example.
Statistics can also be used in cooperation with other modelling techniques,
including FIS, especially in data pre-processing: univariate or multivariate
analysis, outlier detection, for instance.
Artificial Intelligence provides many knowledge representation tech-
niques (such as graphs or ontologies) or data mining methods (such as as-
sociation rules). How can FIS, used as a modelling framework integrating
knowledge and data, be compared with them?
According to [20]: ”An important part of the concern and research in
fuzzy logic and possibility theory does focus on issues such as knowledge rep-
resentation, approximate reasoning and reasoning under uncertainty, which
are central to artificial intelligence.”
Without going into detail, let us point out that knowledge representation
methods intrinsically handle symbolic knowledge, and data mining tools are
designed for coping with data. A few bridges exist between these two topics
of research [52, 1, 41].
2
Few papers in the literature uses this opportunity, in almost all cases prior distributions
are estimated from data and partition bounds are automatically generated.
7
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Other methods, particularly statistical models, are more fit to that purpose
than FIS. Generally speaking, semantic is needed when the human being has
to interact with the system. Knowledge formalization or training support
tools require transparent models: the user is willing to easily analyze the
underlying reasoning. Semantic is also needed when the system output is a
symbolic label (instead of a numerical value), such as in quality evaluation
or risk management.
As there is no underlying assumption about the data distribution, FIS
automatic modelling does not yield a confidence interval as regression or
other statistical methods do. Nevertheless, system sensitivity to input values
can be assessed by simulation using noisy data, bootstrap, . . .
FIS can manage expert knowledge and also be used for knowledge discov-
ery. To take advantage of both kinds of knowledge, one should use them in
their respective areas of excellence: the key idea is to keep the expert at the
linguistic level and to use the data to define the numerical characteristics of
the linguistic terms or to refine the rules.
8
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
• Design: Expert knowledge and data are likely to cooperate in each step
of the FIS design process, especially in input and output partitioning
[32] and rule definition.
Given the input/output partitioning, the rule base may include induced
as well as expert rules. As an example, expert knowledge may serve to
define linguistic terms or rules in areas where no data are available.
• FIS use: Either expert rule and model validation with data, or rule
automatic generation and expert assessment of induced knowledge, is
possible. In the first case, data are used to define the precise meaning,
in the numerical space, of an expert linguistic concept while in the
second one, experts give a linguistic meaning to rules induced from
data, thanks to system interpretability. A dialectic motion between
these two approaches may be beneficial in system modelling.
Figure 3 shows the outline of the proposed approach for modelling with FIS,
allowing to formalize and introduce expert knowledge at all steps: fuzzy par-
tition and rule design, where knowledge can complement automatic design,
by adding MF and rules in areas where no data are available, FIS parameter
optimization and system validation. In this last step, it is proposed not only
to check numerical accuracy, but also to analyze induced knowledge. The sys-
tem generalization ability can be studied through automatic cross-validation
procedures, and also analyzed by studying the areas of low performance and
the links between data items and inference rules.
As the use of the fuzzy formalism increases system complexity (more
parameters, operators . . . ), it has to be justified.
FIS based cooperation between expert knowledge and data is suitable
when there exists prior knowledge. Expertise often expresses trends in purely
linguistic terms, which have to be completed by data to tune the models.
The interpretability constraints restrict this approach to low dimensional
problems. From our experience, as in many cases there is not a unique
system able to model the input-output relationships, several human-machine
iterations are needed.
4. Software environments
Exploratory analysis is essential in a modelling approach associating ex-
pert knowledge and data. Fuzzy software does not always pay a lot of atten-
tion to that point.
9
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Design
MF parameters
Validation
# MF (Input data only)
(Graphical tools and indices)
k−means
Partitions Modal points Hfp Check numerical accuracy
Regular
Complement partition and rule base Optimization Analyze induced knowledge
by adding MF or rule in areas MF parameters Study generalization ability:
where there are no available data. Rule conclusion links between items and rules
Manual design Learning poor performance areas
Rules (few variables) Fuzzy Decision Tree cross validation or test
FPA
OLS
Wang & Mendel
In this section, we present a brief state of the art of available fuzzy soft-
ware, and then we introduce FisPro. We describe its most important fea-
tures, and put them in relation with the modelling approach proposed in the
previous section.
10
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
11
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Rule learning
When rule learning mechanisms are available, it is important to guarantee
the interpretability of the learnt rule bases. For that purpose, the same
linguistic concepts, therefore the same fuzzy sets, should be shared by all of
the rules whatever the rule induction method.
In FisPro, all of the rule induction methods use predefined linguistic
labels. None of them is allowed to tune the labels nor to add a new concept
(fuzzy set) to the partition.
In order to permit fuzzy set sharing and interpretability, some methods
well known in fuzzy learning such as [56] and Orthogonal least squares (OLS)
[14, 15, 55, 33] have been revisited. Fuzzy decision trees [12, 47, 57] are
available and meet the interpretability criteria.
The implementation of an interpretable fuzzy OLS is presented in [17].
The key idea, which is valid for all these revisited methods, is the use of
predefined strong fuzzy partitions for the rule generation, instead of the data
based Gaussian membership functions used in the original OLS. Besides the
usual learning passes, an extra one is proposed to reduce the number of rule
conclusions with a user defined loss of accuracy.
New methods, such as hierarchical fuzzy partitioning [30], have been im-
plemented with the same preoccupation of fuzzy set readability. They gen-
erate hierarchical embedded partitions and make FIS refinement possible.
12
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Inference mechanism
As discussed in Section 2.1.2, there exist different kinds of fuzzy rule
bases: conjunctive ones and implicative ones.
Let us consider an interesting property, called inferential independence.
Let RB be a consistent rule base. In Boolean logic, when a rule premise
is true for a given multidimensional input, the value inferred by RB is that
rule conclusion, independently of the presence and content of the other rules.
Although this is not usually pointed out, this property does not hold for a
fuzzy conjunctive rule base. Given a multidimensional fuzzy input strictly
identical to the fuzzy rule premise, overlapping membership functions trigger
more than one rule and the rule base inference result is computed as the union
of the rule conclusions weighted by the respective matching degrees. With
implicative rules and strong fuzzy partitions, the inferential independence
property holds.
Each kind of rule base: conjunctive and implicative, has its pros and
cons, see [38] for a detailed comparison. In FisPro, rule learning methods
are implemented using conjunctive inference, and implicative rule bases are
available for expert design. Graphical visualization may help to understand
the differences between the inference mechanisms, and specific tools have
been developed.
13
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
• Inference can be done manually or on the current data file, with eval-
uation criteria which take into account the numerical accuracy as well
as the significance of data items regarding the FIS.
Response surfaces are also available for an exploratory analysis of the
system behaviour.
5. Case study
This section aims to illustrate the potential of fuzzy inference systems to
deal with real world applications involving both data and expert knowledge.
The first study is a supervised learning case, and the second one is essen-
tially based on expertise modelling. First we present the evaluation indices,
that will be used in the case studies.
14
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
• T : Air temperature (o C)
In the present paper, the study is restricted to the Fine case. The sample
size is 32 experiments.
15
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
3
All the statistical analysis are done using the R environment: http://www.r-
project.org/
16
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Figure 5: The crisp regression tree induced from pesticide loss data
The tree is easy to interpret, but crisp regression trees as this one suffer
from well known drawbacks. The automatic binary variable partitioning for
a given split may or may not be meaningful to the user. The results are
highly sensitive to the split conditions and there is no interpolation between
rules due to the crisp thresholds. Indeed, the possible inferred values are
limited by the number of leaves, three in this case, giving poor convincing
results.
The [47] derived implementations of decision trees allow multiple non
binary splits learning using predefined partitions, but they do not interpolate
either.
These drawbacks are likely to be overcome by fuzzy decision trees (FDT).
17
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
µ(x)
µ
0 0
0.30 1.40 3.00 19.00 25.00
−1 Temperature (°C)
Wind speed (m s )
Low High Unstable Stable
1 1
µ(x)
0 µ(x) 0
5.00 10.00 −10.00 10.00
Wet bulb temperature depression (ΔT) Stability (z/L)−1
Figure 7: Fuzzy decision tree for pesticide loss data (fine spray)
spraying periods: avoid windy times but also, even if the wind velocity
18
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Optimization
FisPro includes an optimization module. The algorithm is based upon
the work by [53] and [25]. It is summarized in Algorithm 1.
As the algorithm memorizes the good directions in the research space,
only a set of connected parameters can be optimized within a given run. The
sequence proposed in FisPro consists in optimizing each of the selected input
variable partitions in turn, then if the output is fuzzy, the output partition,
and finally the rule conclusions.
19
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
20
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
This sequence is run to optimize the fuzzy decision tree equivalent FIS.
As the number of data items is small (32), the ten-fold cross validation is
done on a 85/15 % basis, i.e. 27 items for each learning set, and 5 for the
corresponding test set. Only the T and ∆T variables are optimized, as the
z/L one is absent from the rules, and the W (Wind speed) partition is an
expert one, which should be left untouched. The final FIS is computed as the
median FIS: each parameter is replaced by the median of the ten optimized
systems.
∆T T
3 7 11 16 20 25
Figure 9: Loss data FIS: the two modified partitions (in dashed lines)
The two modified partitions are displayed in Figure 9, together with the
initial ones, and the rule base conclusions are printed in Table 1 for the initial
and the final median FIS.
Table 1: Loss FIS: initial (I) and final (F) rule bases - Integers in the rule premise are MF
numbers
21
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Table 2: Loss FIS optimization results: MAE averaged over the test sets
Initial 0.65
Optim 0.62
Gain (%) 4.6
Final 0.54
Gain (%) 14.8
Considering the whole data set, the final R2 corresponding to the median
FIS is 0.79, i.e. a 12% relative gain.
The optimization procedure allows a significant improvement of the FIS
accuracy without modifying the FIS structure. The fuzzy partitions count
with the same number of membership functions, the modifications lie in the
numerical space and do not affect the linguistic interpretation. The same
occurs for the rules: as the premises remain unchanged the rule base analysis
is still valid.
This case study illustrates various ways of cooperation between expert
knowledge and data. First the fuzzy partitions are expert designed to guar-
antee a high level of semantics, especially for the wind speed variable. Then,
the data are used for the rule learning. Thanks to the system interpretabil-
ity, the expert domain is able to analyze the system behavior. Finally, data
is used again to optimize some system parameters while keeping the FIS
structure and its properties. The accuracy is improved and the semantics is
preserved.
22
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
low
high
1
MC
0
54 55 56 57
low high
1
DEE
0
0.2 0.5 0.7 0.8
Figure 10: Input partitions - Precise and imprecise values used for the inference are plotted
in dashed lines
The rule system presented here is a simplified system that does not take
into account the whole process complexity:
• If MC is high and DEE is low then cheese will be soft
23
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Implicative inference
A screenshot of FisPro interactive inference window, using the precise
data displayed in Figure 10, MC=54.5 and DEE=0.6, is shown in Figure 11.
The aggregation result is plotted in the first row of the Firmness column,
while each of the four following rows illustrate the inference mechanism for
the corresponding rule.
The Resher-Gaines implication, see [38] for details, is used. This operator
yields a crisp interval, which coincides with the core of the output obtained
by all other residuated implications.
The output partition displayed in Figure 11 (top right) and in Figure 12
(bottom) is not a Strong Fuzzy Partition. It is automatically built from a
two term Strong Fuzzy Partition, and it is called a Quasi Strong Partition
[38]. It ensures both the interpretability and the consistency required by the
conjunctive aggregation.
Figure 11: Firmness inference with implicative rules and a precise value
24
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
For crisp data, the FITA4 inference mechanism is used, i.e. the inference
is done rule by rule, and then the aggregation is performed.
The screenshot of FisPro interactive inference window, using the precise
data displayed in Figure 10, MC=54.5 and DEE=0.6,is shown in Figure 11.
Let us comment on this figure. The membership degrees within the rule
premises are combined with the min operator, then the Resher-Gaines impli-
cation operator is applied to the fuzzy set that appears in the rule conclusion,
resulting in a door shape possibility distribution. These intermediate possi-
bility distributions are intersected to give the final one, plotted in the top
right cell of the Firmness column: this is the aggregation step.
Thanks to the conjunctive aggregation, the output width is meaningful,
it contains all the values which satisfy the constraints expressed by the rules.
Note that an empty result would point out a rule base inconsistency [21].
If needed, a precise value can be inferred using a defuzzification step, for
instance, the mean of the output possibility distribution core. This value
(5.75) is displayed just below the Firmness field in Figure 11.
Dealing with imprecise data, the inference cannot be done rule by rule.
Only the FATI5 mechanism is correct, and it is much more difficult to carry
out in practice. In the FisPro implementation, the imprecise data are ap-
proximated by nested doors. Figure 12 shows the Fispro inference window
and the aggregation details, for the imprecise data plotted in Figure 10, and
with a three level nested door decomposition.
The input imprecision is respected in the output distribution.
As the output is a normalized possibility distribution, it can be inter-
preted as such, without the need for defuzzification. Thus the support and
kernel widths increase with the input imprecision. This appears in Figure 12,
where the core and support of the possibility distribution are wider than the
ones resulting from the implicative inference with the precise value, plotted
in Figure 11. This follows expectations, and it is an important point for de-
cision support systems handling imprecise data, as the imprecision remains
interpretable and meaningful. Furthermore, as input imprecision is taken
account of all through the implicative inference process, implicative FIS al-
low to design cascading FIS respectfully of approximate inputs, and thus to
design interpretable hierarchical fuzzy inference systems.
4
First Infer Then Aggregate
5
FATI means ”First Aggregate Then Infer”.
25
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Figure 12: Firmness inference with implicative rules and an imprecise value: FisPro win-
dow and detail, the output is the union of the three doors.
26
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
Conjunctive inference
To point out the differences with the well known Mamdani FIS, the infer-
ence results of the same rule base, using a conjunctive Mamdani rule base,
with the same data, are now presented.
The screenshot from FisPro is shown on top of Figure 13.
Figure 13: Firmness inference with a conjunctive rule system and a crisp value-FisPro
window and defuzzification detail
The inferred output overlaps the three output fuzzy sets. Consequently,
it is difficult to interpret this result without defuzzification. Centroid-like
defuzzification gives us a firmness equal to 6.3. Note that defuzzification
27
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
is highly sensitive not only to the defuzzification method, but also to the
membership function shapes.
The inference of imprecise data with the conjunctive rule base is similar
to what is done in the case of precise data, as the imprecise data are used
to compute a single matching degree with each of the fuzzy sets within the
partition.
This degree is then used in the same way than the membership degree of
the precise value. The result is not plotted here as, due to defuzzification, it
does not yield any interpretable result when examining the output possibility
distribution versus the input imprecision.
With conjunctive inference the inferred output is a subnormalized pos-
sibility distribution, with no kernel, and it always requires a defuzzification
step to yield a crisp output. Furthermore the support width is not related
to the input imprecision.
Data validation
A set of 103 representative cheese sample has been analyzed by an expert
panel. The final firmness result from the expert score aggregation. This
sample has been used to validate the implicative rule base.
The inference process yields a possibility distribution. For the Resher-
Gaines implication operator, this distribution reduces to an interval, as all
the values have a possibility degree equal to 1 or 0. It is, of course, possible, to
defuzzify this distribution, by computing a real value using this interval, for
instance the mean, minimum or maximum. But, this step is not mandatory
with an implicative rule base. Moreover, it would hide some interesting
properties of the distribution, such as its range.
To take advantage of this inference mechanism, a more complex label is
proposed to characterize the inferred output distribution with respect to the
target value. The label is composed of two distinct integers:
• Precision: this value depends on the output interval width. To avoid
the use of a threshold value, it is defined as the number of distinct MF
in which the membership degrees of any point within the interval are
the highest.
• Accuracy: this value is the prediction error, quantified as a number
of MF. It is computed as the smallest difference between the indexes
of the MF the target value mainly belongs to and the MF previously
defined in the precision label.
28
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
1111
0000 11111
00000
soft normal hard
I1 I2
1
0000
1111 00000
11111
0000
1111 00000
11111
0000
1111
0000
1111 00000
11111
00000
11111
0
0000
1111
0000
1111 00000
11111
00000
11111
5 6.5 Firmness
x 3.5 y
Table 3: Labels according to the target (row) and estimated (column) values
I1 I2
x Correct and precise Incorrect
y Correct Correct but imprecise
We now present the implicative FIS inference results, obtained on the ex-
perimental data set, and labelled in terms of precision and accuracy. Results
are summarized in Table 4.
The use of precision and accuracy labels makes it possible to analyze the
results with regard to these two criteria, contrary to what is usually done
with Mamdani or Sugeno conjunctive FIS, where only numerical accuracy is
discussed, without specifically considering the impact of input data impreci-
sion. First of all, Bad predictions, due to examples that are in contradiction
with the expert rules, show that these rules are not as general as it could be
thought. Nevertheless, though simple, the rule base, made up of only 4 rules,
29
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
is able to reflect some of the main data trends: 70 % of the predictions are
good. A significant part of these good predictions are still imprecise, and this
imprecision can be reduced by refining the rules with new variables. Indeed,
with an implicative rule base, refinement leads to more constraints applied
onto the data.
6. Conclusion
Cooperation between knowledge and data is still an open challenge in
system modelling. In this paper, we show how fuzzy inference systems can
be used for that purpose, by providing a powerful interface between the
symbolic space used for knowledge representation and the numerical one.
We present the main steps of an integrated approach for modelling with FIS
and discuss its practical application.
Some ways of cooperation are illustrated through two case studies. In the
first one, a study of pesticide loss during air spray, data play the central role in
the modelling process, even though expert knowledge, when available, is also
involved, for instance to base the wind velocity linguistic variable definition
on the Beaufort scale. Valuable operational information can be derived from
the rule base, for making recommendations about pesticide spraying.
The second case study discusses how expert knowledge about a cheese
making process is formalized into an implicative rule base. Thanks to the
conjunctive aggregation of implicative rules, the output distribution width
is meaningful. When the input values are not precise, the imprecision is
respected in the output distribution. From an engineering point of view,
implicative rules allow the incremental design of systems. Adding a new
variable to a given rule makes it more specific. There is no need to modify
the existing rules. It can be noted that additional work is needed to learn
and optimize implicative rule bases.
The approach framework is implemented in a free software, FisPro. Its
30
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
originality stems from rule interpretability and exploratory tools to study the
FIS behavior. Open, it welcomes contributions from scientists or engineers
to increase its functionality and its scope. Some characteristics are given
below.
• FisPro is open source, and to facilitate the inclusion in other programs,
its core is delivered as a C++ library. The interface is written in Java,
and available in several languages.
• Hierarchical FIS are available. This allows the reuse of a FIS output
as input to another FIS.
• An optimization module allows to tune all parts of Mamdani and
Sugeno FIS, regardless of the way they were built, by hand or through
automatic learning.
• Sample generation is available prior to learning, as well as learning
scripts including cross validation procedures.
Fuzzy rule base merging, for instance expert and induced rules, using
the HILK method [4] is implemented in the Generating Understandable and
Accurate fuzzy models in a Java Environment (GUAJE) software [2], which
is based on FisPro.
FisPro has been used for various modelling projects [48, 11, 3, 8], and we
hope that the approach presented in this paper will help in new modelling
tasks.
References
[1] Adomavicius, G., Tuzhilin, A., 2001. Expert-driven validation of rule-
based user models in personalization applications. Data Mining and
Knowledge Discovery 5 (1-2), 33–58.
[2] Alonso, J. M., 2010. Guaje: Generating understand-
able and accurate fuzzy models in a java environment.
http://www.softcomputing.es/guaje.
[3] Alonso, J. M., Magdalena, L., González-Rodrı́guez, G., 2009. Looking
for a good fuzzy system interpretability index: An experimental ap-
proach. International Journal of Approximate Reasoning 51 (1), 115 –
134.
31
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
[4] Alonso, J. M., Magdalena, L., Guillaume, S., 2008. Hilk:a new method-
ology for designing highly interpretable linguistic knowledge bases using
the fuzzy logic formalism. International Journal of Intelligent Systems
23 (7), 761–794.
[6] Babuška, R., 2003. Neuro-fuzzy methods for modeling and identification.
Physica-Verlag GmbH, Heidelberg, Germany.
[8] Bergasa, L. M., Nuevo, J., Sotelo, M. A., Barea, R., Lopez, M. E., 2006.
Real-time system for monitoring driver vigilance. IEEE Transactions on
Intelligent Transportation Systems 7 (1), 63–77.
[9] Bezdek, J. C., 1981. Pattern Recognition with Fuzzy Objective Func-
tions Algorithms. Plenum Press, New York.
[10] Borgelt, C., Gonzáles-Rodrı́guez, G., 2007. Frida - a free intelligent data
analysis toolbox. In: [35], pp. 1892–1896.
[11] Bossomaier, T., Standish, R. K., Harré, M., 2010. Simulation of trust in
client-wealth management adviser relationships. International Journal
of Simulation and Process Modelling 6 (1), 40–49.
[12] Breiman, L., Friedman, J. H., Olshen, R. A., Stone, C. J., 1984. Classifi-
cation and Regression Trees. Wadsworth International Group, Belmont
CA.
[13] Casillas, J., Cordón, O., Herrera, F., Magdalena, L., 2003. Interpretabil-
ity improvements to find the balance interpretability-accuracy in fuzzy
modeling: an overview. In: Interpretability Issues in Fuzzy Modeling.
Springer, pp. 3–22.
[14] Chen, S., Billings, S. A., Luo, W., 1989. Orthogonal least squares meth-
ods and their application to non-linear system identification. Int. J. Con-
trol 50, 1873–1896.
32
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
[15] Chen, S., Cowan, C. F. N., Grant, P. M., March 1991. Orthogonal
least squares learning algorithm for radial basis function networks. IEEE
Transactions on Neural Networks 2 (No 2), 302–309.
[17] Destercke, S., Guillaume, S., Charnomordic, B., 2007. Building an in-
terpretable fuzzy rule base from data using orthogonal least squares-
application to a depollution problem. Fuzzy Sets and Systems 158, 2078–
2094.
[18] Dubois, D., Prade, H., 1996. What are fuzzy rules and how to use them.
Fuzzy Sets and Systems 84(2), 169–186.
[19] Dubois, D., Prade, H., 1997. The three semantics of fuzzy sets. Fuzzy
Sets and Systems 90, 141–150.
[20] Dubois, D., Prade, H., 2003. Fuzzy set and possibility theory-based
methods in artificial intelligence. Artificial Intelligence 148, 1–9.
[21] Dubois, D., Prade, H., Ughetto, L., 1997. Checking the coherence and
redundancy of fuzzy knowledge bases. IEEE Trans. on Fuzzy Systems 5
(5), 398–417.
[22] Dubois, D., Prade, H., Ughetto, L., 2003. A new perspective on rea-
soning with fuzzy rules. International Journal of Intelligent Systems 18,
541–567.
[23] Foulloy, L., Galichet, S., Boukezzoula, R., 2005. Floulib [online].
http://www.polytech.univ-savoie.fr/index.php?id=859.
[24] Gil, Y., Sinfort, C., Guillaume, S., Brunet, Y., Palagos, B., 2008. Influ-
ence of micrometeorological factors on pesticide loss to the air during
vine spraying: Data analysis with statistical and fuzzy inference models.
Biosystems Engineering 100 (2), 184–197.
33
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
[29] Guillaume, S., 2001. Designing fuzzy inference systems from data: an
interpretability-oriented review. IEEE Transactions on Fuzzy Systems
9 (3), 426–443.
[32] Guillaume, S., Magdalena, L., 2006. Expert guided integration of in-
duced knowledge into a fuzzy knowledge base. Soft computing 10 (9),
773 – 784.
[33] Hohensohn, J., Mendel, J. M., 1994. Two pass orthogonal least-squares
algorithm to train and reduce fuzzy logic systems. In: Proc. IEEE Conf.
Fuzzy Syst. Orlando, Florida, pp. 696–700.
[34] Hullermeier, E., 2005. Fuzzy methods in machine learning and data
mining: Status and prospects. Fuzzy sets and Systems 156, 387–406.
[36] InfraLogic, T., 1995. The world’s source for fuzzy logic solutions [online].
http://www.ortech-engr.com/fuzzy/togai.html.
34
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
[38] Jones, H., Charnomordic, B., Dubois, D., Guillaume, S., 2009. Practical
inference with systems of gradual implicative rules. IEEE Transactions
on Fuzzy Systems 17 (1), 61–78.
[39] Kruse, R., 2007. Neural networks and fuzzy systems [online].
http://fuzzy.cs.uni-magdeburg.de/.
[40] Lee, C. C., 1990. Fuzzy logic in control systems: Fuzzy logic controller-
part ii. IEEE Trans. on Systems, Man, and Cybernetics 20 (2), 419–435.
[41] Ling, T., Kang, B. H., Johns, D. P., Walls, J., Bindoff, I., 2008. Expert-
driven knowledge discovery. In: Latifi, S. (Ed.), Proceedings of the fifth
international conference on information technology: new generations.
pp. 174–178.
[42] Mamdani, E. H., Assilian, S., 1975. An experiment in linguistic synthe-
sis with a fuzzy logic controller. International journal of Man-Machine
Studies 7, 1–13.
[43] Martin-Clouaire, R., 1989. Semantic and computation of the generalized
modus ponens: the long paper. International Journal of Approximate
Reasoning 3(2), 195–217.
[44] MathWorks, T., 2007. Fuzzy logic toolbox [online].
http://www.mathworks.com/products/fuzzylogic.
[45] Nauck, D. D., 2007. Gnu fuzzy. In: [35], pp. 1019–1024.
[46] Orchard, R. A., 2004. Fuzzy extension to the
clips expert system shell (fuzzyclips) [online].
http://www.nrc-cnrc.gc.ca/fra/licences/iti/non-commerciale9.html.
[47] Quinlan, J. R., 1986. Induction of decision trees. Machine Learning 1,
81–106.
[48] Rajaram, T., Das, A., 2010. Modeling of interactions among sustain-
ability components of an agro-ecosystem using local knowledge through
cognitive mapping and fuzzy inference system. Expert Systems with Ap-
plications 37 (2), 1734–1744.
[49] Research, W., 2007. Mathematica fuzzy logic [online].
http://www.wolfram.com/products/applications/fuzzylogic.
35
Author-produced version of the article published in Expert Systems with Applications, 2012, N°39, p. 8744- 8755
The original publication is available at http://dx.doi.org/10.1016/j.eswa.2012.01.206
Doi: 10.1016/j.eswa.2012.01.206
[51] Sazonov, E., 2002. Open source fuzzy inference engine for java [online].
http://people.clarkson.edu/ esazonov/FuzzyEngine.htm.
[52] Sester, M., 2000. Knowledge acquisition for the automatic interpreta-
tion of spatial data. International Journal of Geographical Information
Science 14 (1), 1–24.
[53] Solis, F. J., Wets, R. J., 1981. Minimization by random search tech-
niques. Mathematics of Operation Research 6, 19–30.
[54] Takagi, T., Sugeno, M., 1985. Fuzzy identification of systems and its
applications to modeling and control. IEEE Transactions on System
Man and Cybernetics 15, 116–132.
[55] Wang, L.-X., Mendel, J. M., 1992. Fuzzy basis functions, universal ap-
proximation, and orthogonal least squares learning. IEEE Transactions
on Neural Networks 3, 807–814.
[56] Wang, L.-X., Mendel, J. M., 1992. Generating fuzzy rules by learning
from examples. IEEE Transactions on Systems, Man and Cybernetics
22 (6), 1414–1427.
[57] Weber, R., 1992. Fuzzy-id3: A class of methods for automatic knowledge
acquisition. In: 2nd International conference on fuzzy logic and neural
networks. pp. 265–268.
[58] Weisbrod, J., 1998. A new approach to fuzzy reasoning. Soft Computing
2, 89–99.
[59] Zadeh, L. A., 1965. Fuzzy sets. Information and Control 8, 338–353.
[60] Zadeh, L. A., 1975. The concept of linguistic variable and its application
to approximate reasoning - parts i, ii and iii. Information Sciences 8-9,
199–249,301–357,43–80.
[61] Zadeh, L. A., 2008. Is there a need for fuzzy logic? Information Sciences
178 (13), 2751–2779.
36