ONE APPROACH TO FUZZY EXPERT SYSTEMS CONSTRUCTION. Dmitry A. Kropotov, Dmitry P. Vetrov. Russia, Moscow, 119991 Dorodnicyn Computing Centre of the Russian Academy of Sciences. FUZZY LOGIC IS… Fuzzy logic is based on the theory of fuzzy sets, proposed by Zadeh in 1965. The main idea of this theory is generalization of classical sets theory for the continuous case. This means that the objects may partly belong to the sets with different n degree of membership. In other words fuzzy set can be associated with their membership function : R [0,1]. It is easy to define the generalized logical operations, like, for example, conjunction and disjunction, over the fuzzy sets as minimum and maximum of membership functions. Fuzzy implication can be defined by different ways, but in present report, we use Mamdani scheme by taking minimum of membership functions from the sumption and using it as a belonging degree to the result set of the rule. Average temperature A ( x) 1 1 0 2 5 Normal Low High Crop x 0 Crisp set “Between 2 and 5” 1 Below average Average Above average 0 1 Fuzzy sets Qualitative relations 0 Knowledge base Feature values 5 2 x Good Rich centner per hectare mm 0 Mamdani’s implication scheme for the rule “IF Average temperature is Normal AND Precipitations are Average THEN Crop is Good” Fuzzy set “Approx. between 2 and 5” Fuzzy set Normal MIN Precipitations A ( x) Poor 1 t, C Forecast … A GOOD WAY OF CONSTRUCTING EXPERT SYSTEMS. The main advantage of fuzzy logic is its ability to operate in natural terms for a user. By defining linguistic variables, one may construct fuzzy rules. Fuzzy expert system works by interpreting input data as linguistic variables, implicating the Fuzzyficator Defuzzyficator Fuzzy rules necessary fuzzy rules and then defuzzyfying the result. But THERE APPEAR TWO BOTTLENECKS IN BUILDING FUZZY EXPERT SYSTEMS: Principal scheme of fuzzy expert system. HOW TO OBTAIN THE SHAPES OF FUZZY SETS… It is usually difficult to form the exact shapes of fuzzy sets for the expert. It is not obvious “how fuzzy” they should be. In order to solve this problem we use so-called (a, b)-parameterization. Let the shapes of membership functions belong to the parameterized family of isosceles trapeziums. The location of fuzzy set is defined by the approximate borders (ai , ai 1 ) which are part of the partition of numerical axis, that can be easily assigned by the user. In this case a–parameter can be interpreted as a crossing degree of the sets, and b–parameter shows the “fuzziness” of set. To reduce the number of coefficients to be optimized, we use an assumption that pair (a, b) is responsible for the properties of the whole feature rather than a single set. In this case we may find the coefficients by solving optimization task on the learning sample. ( x) 1 a2 a1 0 Representation - the rate of objects in the sumption of the rule. c1 d1 a1 a2 b1 0 c2 b2 d2 d 2 c2 a3 a2 a3 c3 b3 x d3 d 3 c3 a4 a3 a4 (a,b) – parameterization of fuzzy sets’ shapes Effectiveness - the rate of objects from the sumption that satisfy the rule. Each rule can be represented as a point in effectiveness/representation plane. SUCH SYSTEM CAN BE USED EITHER FOR FORECASTING… … AND HOW TO GET THE NECESSARY FUZZY RULES. In many areas one hasn’t enough knowledge about the process being researched in order to form linguistic rules. If there are some precedents with known forecasts, the necessary rules may be generated automatically. The proposed algorithm is based on the two notions: representation and effectiveness of the rule. The first shows the rate of objects being involved in the consideration, while the second shows the rate of objects that satisfy the rule. The more representation and effectiveness are, the better is the rule. In fact, we want to increase the effectiveness at least to some threshold, holding the representation above the predefined level. To do this, we fuse (restrict) several rules to one of higher order, by conjuncting their sumptions, until we exhaust the set of potential rules. During such process the representation becomes lower, but the effectiveness may become higher. All rules that have both representation and effectiveness higher than corresponding thresholds are accepted; rules that are not enough representative are rejected; and all other rules are used for further fusion To forecast continuous values, we find (a, b) coefficients by using least squares method. In fact we just minimize the sum of squared deviations from the correct answer. And to calculate the forecasted variable according to the given number of fuzzy rules, use the centre of gravity defuzzification method. This mode was used to predict the places of football teams in Russian Championship according to the tournament table (won scores were excluded) and for forecasting magnetic field oscillations in cavities of accelerating klystrons (DESY, Hamburg). The results received by described method (program ExSys) were compared with linear regression for football and Matlab fuzzy logic toolbox for cavities. It appeared that in some cases fuzzy logic works better than linear regression even for such simple and obviously linear tasks like finding the place of the team by the tournament table. As for Matlab toolbox, the system tends to overfit to the learning data even after the use of independent precedents, which had to prevent overtraining. Classification methods Melanoma (super small sample) Phoneme (big sample) Liver (average sample, many classes) MLP 65.6 78.2 77.5 LDF 59.4 77.4 77.5 TA 62.5 65.5 65.7 LM 50.0 77.2 79.3 SVM 56.3 76.4 83.1 QNN 62.5 84.7 80.3 ExSys 66.6 77.5 76.5 Russian football championship. Sum of squared deviations for linear regression is 38.056, for ExSys is 21.936. Melanoma – 48/32 objects, 33 features, 3 classes. Phoneme – 2200/1404 objects, 6 features, 2 classes. Liver – 170/150 objects, 8 features, 7 classes. Oscillations of magnetic field amplitude. Sum of squared deviations for MatLab is 0.405, for ExSys is 0.274. … OR CLASSIFICATION. If one needs to predict the value which belong to the finite set (i.e. classification task), the quality functional is just the rate of misclassified objects. And as defuzzification method, we use defuzzification by mode. In other words the object is classified to the set with the maximal value of membership function. The system in such mode was tested on several tasks and was compared with numerous recognition methods – linear Fisher discriminant (LDF), q-nearest neighbors (QNN), test algorithm (TA), committee of hyperplanes (LM), support vector machines (SVM) and multilayer perceptron (MLP). The results of work are presented in the table. Results of classifications THEORY OF STATISTICAL SOLUTIONS IS USED TO FIND THE NECESSARY THRESHOLDS AND PREVENT OVERFITTING. It’s clear that changing representation threshold, we may thus regulate the number of discovered fuzzy rules and hence the time of rule generation. Unfortunately we can’t do the same directly with the effectiveness threshold. Making it too low forces computer to generate a lot of “parasitic” rules, which contain no useful information. As a result the expert system suffers from overtraining and degrades greatly. To avoid this, the apparatus of mathematical statistics was applied. This allowed to define the effectiveness threshold by finding the upper bound of confidence interval after fixing the rate of “parasitic” rules we would like to exclude. By varying the level of confidence we get different thresholds and may regulate the number of discovered rules. To reduce the time of rule generation, the lower effectiveness bound is calculated. If the effectiveness of the rule if less than this bound, it cannot be raised to the demanded level without decreasing representation value too low (lower than corresponding threshold). With these remarks, the effectiveness/representation plane becomes as shown on the figure. Fuzzy Expert System Low and upper bounds for effectiveness THE PROPOSED SYSTEM CAN BE USED IN MULTIPLE MODES… Feature partition Fuzzy sets Fuzzy rules Optimization Auto partitioning Auto generation Manual definition Manual partitioning Manual input Depending on what the expert can do, the system described above, may work in different modes. When there is no prior information about the task, all necessary steps can be done automatically. The user only needs to input the number of fuzzy sets to be generated for each variable and to give them names. If needed, expert can define the approximate borders of sets himself. The rules are either generated according to the learning table or/and entered by the expert. He can also change the sumptions or weights of some rules if necessary. The shapes of sets are either set or found as a result of optimization procedure. The modes, that system support, may vary from the most autonomous way, to the case, when all fuzzy sets and rules are known for the expert. In the last case system acts as classical fuzzy controller. Different modes of ExSys … NOT ONLY FOR FORECASTING, BUT ALSO FOR UNDERSTANDING THE NATURE OF PROCESS. Using rule generation option, we may not only use the discovered rules for further forecasting, but also examine them for understanding the nature of the process being researched. Here are some rules extracted from the football tournament table. Such knowledge can be extremely useful if some of known features can be managed. Knowing the influence they have on the hidden value, we may effectively control the process by adjusting the features that are available for changing. IF Dropped goals are Not many AND Losses are Few THEN Rank is High IF Wins are Not few AND Scored goals are Many AND Draws are Some THEN Rank is Very high IF Scored goals are Few THEN Rank is Low IF Dropped goals are Not few AND Draws are Many THEN Rank is Medium Examples of generated fuzzy rules for football ranks predictions
© Copyright 2026 Paperzz