Std / List Layer
Calculate the standard deviation of numeric lists. Similar to numpy.std() or R's sd(). Returns the sample (n-1) or population (n) standard deviation. Supports lists of different lengths.
Mathematical definition: For a list of values with mean :
Population standard deviation (ddof=0):
Sample standard deviation (ddof=1):
Example transformation:
lists | std (ddof=1) |
---|---|
[1, 2, 3, 4] | 1.291 |
[10, 10, 10] | 0.0 |
[1, null, 3] | 1.414 |
[2, 4, 6, 8, 10] | 3.162 |
[5] | null |
[] | null |
Common applications:
- Analyzing price volatility in financial data
- Measuring sensor reading stability in IoT
- Detecting anomalous patterns in metrics
- Computing confidence intervals in statistics
- Evaluating process consistency in manufacturing
- Analyzing performance variation in systems
Note: Only works with numeric lists. Lists with fewer than 2 elements return null. Null values are ignored in calculation. The choice between population (ddof=0) and sample (ddof=1) depends on whether the data represents an entire population or a sample.
Select
columnThe variable-length numeric list column to analyze. Examples:
- Price history: [10.5, 11.2, 10.8, 11.5]
- Temperature readings: [22.1, 22.3, 22.0]
- Performance scores: [85, 92, 88, 90]
- Mixed lengths: [1.1, 1.2], [2.1, 2.2, 2.3] Lists can have different lengths. Only numeric types supported.
Ddof
u8Delta Degrees of Freedom (DDOF
) affecting standard deviation calculation:
- 1 (default): Sample standard deviation ()
- 0: Population standard deviation () Use 1 for sample analysis (more common), 0 when data represents entire population.
AsColumn
nameName for the new column. If not provided, the system generates a unique name. If AsColumn
matches an existing column, the existing column is replaced. The name should follow valid column naming conventions.