Sample / Manipulation Layer

Select a random subset of rows from the DataFrame. Similar to pandas' sample() or R's sample_n()/sample_frac().

Sampling methods:

Common applications:

Example: From 1000 rows, sample 100 (n=100) or 10% (fraction=0.1) for analysis.

Table

SampleType

oneof

Fraction

f64

0.1

Proportion of rows to sample (0.0 to 1.0). Common uses:

u32

Fixed number of rows to sample. Typical scenarios:

bool

false

Control sampling method:

false (default): Each row selected once at most
true: Same row may be selected multiple times Critical for: Bootstrap sampling, simulation studies, probability analysis

bool

false

Control output order:

false (default): Maintain relative order of selected rows
true: Randomize order of sampled rows Important for: Random batching, unbiased selection, order-sensitive analysis

u32

Random seed for reproducible sampling. Essential for: