A Sample node samples data in one of the following ways:
Random Sample: A sample in which every element of the data set has an equal chance of being selected.
Top N Sample: The default sample that selects the first N values.
Stratified Sample: A sample created by dividing the data set into disjoint subsets (strata) and then taking a random sample from each subset. This technique is used when the distribution of target values is heavily skewed. For example, the response to a marketing campaign may have a positive target value 1% of the time or less.
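The three sampling methods above can be illustrated with a minimal Python sketch. This is not how the Sample node is implemented; the function names, the `response` column, and the choice to draw the same number of rows from each stratum are assumptions made for illustration only.

```python
import random
from collections import defaultdict


def top_n_sample(rows, n):
    """Return the first n rows, in their original order."""
    return rows[:n]


def random_sample(rows, n, seed=None):
    """Return up to n rows drawn without replacement.

    Every row has the same chance of being selected.
    """
    rng = random.Random(seed)
    return rng.sample(rows, min(n, len(rows)))


def stratified_sample(rows, target, per_stratum, seed=None):
    """Split rows into strata by the value of `target`, then draw
    up to `per_stratum` rows at random from each stratum.

    Equal allocation per stratum is an assumption of this sketch;
    other allocation schemes (for example, proportional) also exist.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for row in rows:
        strata[row[target]].append(row)
    sample = []
    for members in strata.values():
        sample.extend(rng.sample(members, min(per_stratum, len(members))))
    return sample


# Hypothetical data: 1,000 rows, of which 1% have a positive response.
rows = [{"id": i, "response": 1 if i % 100 == 0 else 0} for i in range(1000)]

first_rows = top_n_sample(rows, 5)
random_rows = random_sample(rows, 50, seed=1)
balanced_rows = stratified_sample(rows, "response", 10, seed=1)
```

With the skewed data above, a random sample of 50 rows may contain few or no positive responses, while the stratified sample draws from the positive and negative strata separately, so both target values are represented.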
Sampling nested data is best done with a Case ID, as described in Sample Nested Data.
The following topics describe Sample nodes: