Expand description
Multi-column and row-index reductions over a DataSet.
Aggregate semantics (nulls, all-null groups, casting) are documented in
docs/REDUCE_AGG_SEMANTICS.md at the repository root.
Structs§
- Feature
Mean Std - Per-column mean and standard deviation (square root of variance under
std_kind).
Functions§
- arg_
max_ row - Returns
Noneifcolumnis not in the schema. Otherwise [Some(None)] if there is no non-null comparable value, or [Some(Some((row_index, value)))] for the first row attaining the maximum (stable tie-break). - arg_
min_ row - Same as
arg_max_rowfor the minimum. - feature_
wise_ mean_ std - One pass over all rows: compute mean and std dev for each listed numeric column (
Int64/Float64). Nulls are ignored. If a column has no non-null values, both fields areValue::Null. Sample std dev is undefined for fewer than two values →Value::Null. - top_
k_ by_ frequency - Non-null value frequencies; returns the top
kpairs by count (desc), breaking ties by [value_sort_key] ascending.k == 0yields an empty vector.