For a stratified sample, specify the following:
Column: Select the column for stratification.
Seed: The default seed is 12345; you can specify a different integer.
Case ID (optional): Select a Case ID from the drop-down list.
If you specify a seed and a Case ID, the sample is reproducible.
Distribution: Specify how the sample is to be created. There are three possibilities:
Original: The distribution of the selected column in the sample is the same as its distribution in the data source. For example, suppose that the column is GENDER with value M for 95% of the cases. In the sample, the value of GENDER is M for 95% of the cases.
Balanced: Each value of the column appears equally often in the sample, regardless of its distribution in the data source. For example, if the column is GENDER and GENDER has two values (M and F), the value of GENDER is M for 50% of the cases in the sample and F for the other 50%.
Custom: You define how the values of the column are distributed in the sample. You must run the node once before you define the custom distribution. Click Edit to launch the Custom Balance dialog.
The Stratified dialog displays a histogram of the values of the selected column at the bottom of the window. To see more details, click View to display the Custom Balance dialog.
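The settings above can be illustrated with a small Python sketch. It is not the product's implementation; the function `stratified_sample`, its parameters, and the in-memory list-of-dicts data format are all hypothetical and chosen only to mirror the dialog fields (Column, Seed, Case ID, Distribution). It assumes that sorting the rows by Case ID before drawing with a seeded random generator is what makes the sample reproducible.

```python
import random
from collections import Counter, defaultdict


def stratified_sample(rows, column, size, distribution="original",
                      seed=12345, case_id=None, custom=None):
    """Illustrative stratified sample over a list of dicts (hypothetical helper)."""
    # Assumption: a fixed row order (by Case ID) plus a fixed seed
    # makes the draw repeatable regardless of the incoming row order.
    if case_id is not None:
        rows = sorted(rows, key=lambda r: r[case_id])
    rng = random.Random(seed)

    # Group the rows by the value of the stratification column.
    strata = defaultdict(list)
    for row in rows:
        strata[row[column]].append(row)

    if distribution == "original":
        # Keep each value's share as it is in the data source.
        weights = {v: len(members) / len(rows) for v, members in strata.items()}
    elif distribution == "balanced":
        # Give every value an equal share.
        weights = {v: 1 / len(strata) for v in strata}
    elif distribution == "custom":
        # Caller supplies the share per value, e.g. {"M": 0.7, "F": 0.3}.
        weights = custom
    else:
        raise ValueError(f"unknown distribution: {distribution}")

    sample = []
    for value, members in strata.items():
        # A stratum cannot contribute more rows than it contains.
        n = min(round(size * weights.get(value, 0)), len(members))
        sample.extend(rng.sample(members, n))
    return sample


# Example data: 950 rows with GENDER = M and 50 rows with GENDER = F.
rows = [{"ID": i, "GENDER": "M" if i < 950 else "F"} for i in range(1000)]

original = stratified_sample(rows, "GENDER", 100, "original", case_id="ID")
balanced = stratified_sample(rows, "GENDER", 100, "balanced", case_id="ID")
original_counts = Counter(r["GENDER"] for r in original)
balanced_counts = Counter(r["GENDER"] for r in balanced)
```

With this example data and a sample size of 100, the original setting yields 95 M rows and 5 F rows, and the balanced setting yields 50 rows of each value. Calling the function again with the same seed and Case ID returns the same rows, even if the input rows arrive in a different order.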