
The Data Analysis Procedure

Several approaches to analyzing the Gallup data were tried, but the following procedure gave the clearest results. Other attempts resulted in less definite group boundaries, larger variances within the neurons, or both. The phases are discussed in more detail in the case example.

  1. Fuzzification of the original opinions to class memberships (for example Agree, Don't Know, Disagree).
  2. Omission of the class(es) representing uncertainty.
  3. Training of the TS-SOM with the remaining class memberships.
  4. Visualization of the class membership values of the resulting neurons.
  5. Selection of groups of neurons that have large values in at least a couple of training variables in the same regions of the SOM.
  6. Visual or rule-based verification of all other variables against the located groups.
  7. Conclusions about what the groups represent.
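
Steps 1 and 2 can be illustrated with a minimal sketch. It assumes that each opinion is an answer on a numeric scale from 1 (disagree) to 5 (agree) and that the class memberships are triangular; neither the scale, the shape of the membership functions, nor the names fuzzify and training_vector come from the method itself. The actual fuzzification is described in the case example.

```python
CLASSES = ("disagree", "dont_know", "agree")


def fuzzify(answer, low=1.0, high=5.0):
    """Map one answer on [low, high] to memberships in the three classes.

    Assumption: triangular memberships that sum to 1 for every answer.
    """
    mid = (low + high) / 2.0
    half = (high - low) / 2.0
    # Position in [-1, 1]: -1 is full disagreement, +1 is full agreement.
    x = (answer - mid) / half
    return {
        "disagree": max(0.0, -x),
        "dont_know": 1.0 - abs(x),
        "agree": max(0.0, x),
    }


def training_vector(answers, omit=("dont_know",)):
    """Build one training vector, leaving out the uncertainty class(es)."""
    vector = []
    for answer in answers:
        memberships = fuzzify(answer)
        vector.extend(memberships[c] for c in CLASSES if c not in omit)
    return vector


# One respondent answering three questions with 1, 4 and 3.
print(training_vector([1, 4, 3]))
```

Each question contributes two values (Disagree, Agree) to the training vector. An answer in the middle of the scale contributes only zeros, so uncertain respondents do not pull the neurons toward either class.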

The grouping of data can be performed visually, but it is difficult for a human to decide clearly which variables should be included in each group and which should not. Data files, such as the one analyzed in the case example, usually contain many separate small groups. One good method is to use rules to verify which variables dominate in each region. However, the regions should be formed at the same time as the variables are chosen. Therefore, some kind of optimization method is needed to form the groups of neurons that indicate similar behavior.
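
A rule of the kind mentioned above could look like the following sketch. It assumes that each neuron is represented by a list of class membership values and that a variable dominates a region when its mean value over the neurons of the region reaches a threshold. The function name dominant_variables, the threshold of 0.6 and the example values are illustrative assumptions, not part of the analysis system.

```python
def dominant_variables(neurons, region, names, threshold=0.6):
    """Return the names of the variables whose mean value over the
    neurons in `region` (a list of neuron indices) is at least `threshold`."""
    inside = [neurons[i] for i in region]
    dominant = []
    for j, name in enumerate(names):
        mean_value = sum(weights[j] for weights in inside) / len(inside)
        if mean_value >= threshold:
            dominant.append(name)
    return dominant


# Four neurons with three class membership variables each (made-up values).
neurons = [
    [0.9, 0.1, 0.8],
    [0.8, 0.2, 0.7],
    [0.1, 0.9, 0.2],
    [0.2, 0.8, 0.1],
]
names = ["q1_agree", "q2_agree", "q3_agree"]

print(dominant_variables(neurons, [0, 1], names))
print(dominant_variables(neurons, [2, 3], names))
```

The rule only checks a region that has already been chosen. It does not solve the problem of choosing the region and the variables together.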

A completely automatic process could be devised to form the groups without human intervention. If there are many variables, it is not feasible to verify all possible combinations of them. A genetic algorithm with random initial groups and a well-constructed goodness criterion could therefore produce good results quickly. However, the operations for altering the population (rejection of individuals, crossbreeding and mutation) have to be designed carefully. Other algorithms could also be used to find the optimal groups. This area would need extensive research.
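
The following is one possible sketch of such a genetic algorithm, not a tested design. It assumes that an individual is a pair of bit masks (one over the neurons, one over the variables) and that the goodness criterion is the mean of the chosen variables inside the group minus their mean outside it, with at least two variables required. The encoding, the criterion, the parameter values and the names fitness and evolve are all assumptions made for illustration.

```python
import random


def fitness(individual, neurons):
    """Goodness of a (neuron mask, variable mask) pair: mean of the chosen
    variables inside the group minus their mean outside the group."""
    neuron_mask, var_mask = individual
    inside = [w for w, m in zip(neurons, neuron_mask) if m]
    outside = [w for w, m in zip(neurons, neuron_mask) if not m]
    variables = [j for j, m in enumerate(var_mask) if m]
    # Reject empty or all-covering groups and groups with fewer than two variables.
    if not inside or not outside or len(variables) < 2:
        return float("-inf")

    def mean(rows):
        total = sum(w[j] for w in rows for j in variables)
        return total / (len(rows) * len(variables))

    return mean(inside) - mean(outside)


def evolve(neurons, generations=60, population_size=30,
           mutation_rate=0.05, seed=0):
    """Search for a good group, starting from random initial groups."""
    rng = random.Random(seed)
    n_neurons, n_vars = len(neurons), len(neurons[0])
    length = n_neurons + n_vars

    def split(bits):
        return bits[:n_neurons], bits[n_neurons:]

    def score(bits):
        return fitness(split(bits), neurons)

    population = [[rng.random() < 0.5 for _ in range(length)]
                  for _ in range(population_size)]
    for _ in range(generations):
        # Rejection: keep the better half unchanged.
        population.sort(key=score, reverse=True)
        survivors = population[: population_size // 2]
        children = []
        while len(survivors) + len(children) < population_size:
            # Crossbreeding: one-point crossover of two surviving parents.
            a, b = rng.sample(survivors, 2)
            cut = rng.randrange(1, length)
            child = a[:cut] + b[cut:]
            # Mutation: flip each bit with a small probability.
            child = [(not bit) if rng.random() < mutation_rate else bit
                     for bit in child]
            children.append(child)
        population = survivors + children
    best = max(population, key=score)
    return split(best), score(best)


# Six neurons with three variables (made-up values): neurons 0-2 have
# large values in variables 0 and 1, the rest do not.
example = [
    [0.9, 0.9, 0.1], [0.9, 0.9, 0.1], [0.9, 0.9, 0.1],
    [0.1, 0.1, 0.1], [0.1, 0.1, 0.1], [0.1, 0.1, 0.1],
]
(neuron_mask, var_mask), goodness = evolve(example)
print(neuron_mask, var_mask, goodness)
```

Because the better half of the population is carried over unchanged, the best goodness value never decreases from one generation to the next. The sketch ignores the map topology: a more realistic criterion would probably also require the chosen neurons to be adjacent on the SOM.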



Anssi Lensu
Tue Nov 3 12:42:42 EET 1998