Column Union

The source data set is appended with the fields of the joined data sets by means of the Column Union handler. In this case, each record of the source data set is joined with the record of the additional data set with the same order number.

If the joined data sets have different number of records, the resulting data set can contain null values. The wizard enables to use different processing options in this situation. The joined data sets can be cut up to the number of records of the smaller data set, or they can be appended up to the largest one.

Variables can be added to the source data set. Each of the joined variables adds a new column to the source data set. In this case, according to the settings of Missing Terminal Rows Filling parameter, the value of the joined variable will be added:

  • to all rows of the new column;
  • only to the first row, and the null value will be displayed in other rows for the fields related to the string and variable types, for the fields related to other types - empty cell.

Input

  • Main table: the port for the input data set.
  • Add another port enables to create new input ports for the joined tables and variables. There are two types of the new ports:
    • Joined table [N] where N is an order number of the table.
    • Joined variables [N] where N is an order number of the variables port.

Output

  • Output data set Output data set: table with joined columns.

Wizard

The following parameters can be configured:

  • Missing Terminal Rows Filling: it is required to select one of the options of filling of the table records least in number:
    • Leave empty: filling with the records the fields of which will contain null values.
    • Repeat all rows: copies of the table records starting from the first one are repeated in the tables.
    • Repeat the last row: filling with the last row copies.
  • Row count complies: it is required to select one of the options to determine the number of records of the resulting data set:
    • With minimum data set.
    • With maximum data set.
    • Data set defined: when selecting this option, the Data set defining row count list becomes available. It is required to select a data set in it according to which the row count of the resulting data set will be defined.
Example:

Let's consider two tables as an example.

Main table:

Full name Year of birth
Clark 1972
Davis 1956
Evans 1978
Adams 1979
Allen 1980
Bell 1983
Anderson 1982
Baker 1963
Boucher 1984
Cameron 1965

Joined table:

Labor participation coefficient Cluster
> 0.8 1
> 0.8 1
0.5 - 0.8 2
0.5 - 0.8 2
0.2 - 0.5 3
< 0.2 4

Missing Terminal Rows Filling parameter is set to Leave empty value, Row count complies parameter — to With maximum data set value.

Resulting table:

Full name Year of birth Labor participation coefficient Cluster
Clark 1972 > 0.8 1
Davis 1956 > 0.8 1
Evans 1978 0.5 - 0.8 2
Adams 1979 0.5 - 0.8 2
Allen 1980 0.2 - 0.5 3
Bell 1983 < 0.2 4
Anderson 1982 <null> <null>
Baker 1963 <null> <null>
Boucher 1984 <null> <null>
Cameron 1965 <null> <null>

results matching ""

    No results matching ""