Coarse Classes
The Coarse Classes handler enables to solve the following problems:
- Conversion of the continuous and discrete input fields used for training of the models related to the binary classification by means of the binning based on totality-of-evidence approach or WoE analysis (weights of evidence, WoE).
- Reduction of data dimensionality by excluding the indicators with low significance, by decreasing variety of indicator values.
- Null data recovery when null data forms a separate binning range caption, or it is joined with the nearest one that is close by the WoE coefficient value.
- The struggle against outliers and extreme values is based on formation of the binning range captions during discretization of the continuous field or union of rare unique values into one category.
- Simplification of description of the objects under study.
The visualizer includes the following items:
Note: The Coarse classes visualizer can be used only with the Coarse classes node.
Interface
Operations
For the List of input columns Hide side bar operation is available. Pressing the button enables to hide the bar with the list of input columns, whereas the repeated pressing returns the bar.
Area of coarse classes can be represented in the table form - Show coarse class table (Alt+One) or in the form of charts - Show coarse class charts (Alt+Two).
List of Input Columns
- Field status:
- Unlocked shows whether this field can be used in the process of the coarse classes generation while model overfitting.
- Locked shows that this field will not be used while overfitting.
Note: It is not possible to change the field status. It only displays settings in the corresponding Coarse classes node.
- Column caption.
- Estimated significance level IV (refer to Figure 1).
Area of Fine Classes
The fine classes are displayed in the form of one display option - a chart.
When hovering the mouse cursor over the chart in the right upper corner, button appears. Its pressing enables to expand the chart over the whole visualizer area.
Fine classes chart can be as follows:
- Rate.
- WoE (weight of evidence).
- IV (information value).
Display selection is switched by buttons (refer to Figure 2) in the lower part of the chart.
When expanding the chart over the whole visualizer area, the drop-down list appears. It enables to select the field for which the chart will be generated (refer to Figure 3).
When hovering the mouse cursor over the chart column, the pop-up window appears. It displays the field value and WoE/IV/Rate value (refer to Figure 3).
Area of Coarse Classes
The coarse classes are displayed in the form of a table or WoE chart (refer to Figure 4) / chart IV. Chart IV can display as follows:
- Count (refer to Figure 5).
- Rate (refer to Figure 6).
Table
The table consists of the following fields:
Field | Description |
---|---|
№ | Class number |
Caption | A set of names of the fine classes included into the coarse class |
Lower | Lower bin bound |
Upper | Upper bin bound |
Events | One of the states of the binary target class variable |
Non-events | Opposite state of the binary target class variable |
Total | The sum of class events and non-events |
Rate | The coarse class rate based on the total volume of records |
Weight of evidence | WoE coefficient |
Inf. value | The value that defines the indicator significance in the binary classification model |
The sum is displayed in the lower part of the table:
- Events of all classes.
- Non-events of all classes.
- Events and non-events of all classes.
- Information values of all classes.
Chart
When hovering the mouse cursor over the chart in the right upper corner, button appears. It enables to expand the chart over the whole visualizer area.
buttons are used to switch Count and Rate charts.
When hovering mouse cursor over the chart column, the window appears (refer to. Figure 7) which displays as follows:
- For WoE chart:
- WoE value.
- For chart IV:
- value IV;
- events count;
- non-events count.