DataFrameStats extension

Statistical extensions for DataFrame.

on
  • DataFrame

Methods

anovaOneWay(List<String> groupCols) HypothesisTestResult

Available on DataFrame, provided by the DataFrameStats extension

Performs One-way ANOVA on multiple columns (treating each column as a group).
autocorrelation(String col, int lag) num

Available on DataFrame, provided by the DataFrameStats extension

Calculates Autocorrelation for a column at a specific lag.
chiSquareTest(String observedCol, String expectedCol) HypothesisTestResult

Available on DataFrame, provided by the DataFrameStats extension

Chi-square goodness-of-fit wrapper using two columns (observed vs expected).
correlation(String col1, String col2) double

Available on DataFrame, provided by the DataFrameStats extension

Pearson Correlation Coefficient between two columns.
correlationMatrix(List<String> cols) → DataFrame

Available on DataFrame, provided by the DataFrameStats extension

Computes the Correlation Matrix for a list of columns. Returns a DataFrame.
covariance(String col1, String col2) double

Available on DataFrame, provided by the DataFrameStats extension

Covariance between two columns.
covarianceMatrix(List<String> cols) → DataFrame

Available on DataFrame, provided by the DataFrameStats extension

Computes the Covariance Matrix for a list of columns.
describeColumn(String col) Map<String, num>

Available on DataFrame, provided by the DataFrameStats extension

Descriptive statistics for a column.
exponentialSmoothing(String col, double alpha) List<num>

Available on DataFrame, provided by the DataFrameStats extension

Calculates exponential smoothing for a column.
kurtosisColumn(String col) double

Available on DataFrame, provided by the DataFrameStats extension

Calculates the kurtosis of a column.
linearRegression(String xCol, String yCol) RegressionResult

Available on DataFrame, provided by the DataFrameStats extension

Performs simple linear regression on two columns.
logisticRegression(List<String> xCols, String yCol) RegressionResult

Available on DataFrame, provided by the DataFrameStats extension

Performs logistic regression.
maxColumn(String col) num

Available on DataFrame, provided by the DataFrameStats extension

Maximum value in a column.
meanColumn(String col) double

Available on DataFrame, provided by the DataFrameStats extension

Calculates the mean of a column.
medianColumn(String col) num

Available on DataFrame, provided by the DataFrameStats extension

Calculates the median of a column.
minColumn(String col) num

Available on DataFrame, provided by the DataFrameStats extension

Minimum value in a column.
modeColumn(String col) List<num>

Available on DataFrame, provided by the DataFrameStats extension

Calculates the mode of a column.
movingAverage(String col, int window) List<num>

Available on DataFrame, provided by the DataFrameStats extension

Calculates simple moving average for a column.
multipleLinearRegression(List<String> xCols, String yCol) RegressionResult

Available on DataFrame, provided by the DataFrameStats extension

Performs multiple linear regression.
normalizeColumn(String col) List<num>

Available on DataFrame, provided by the DataFrameStats extension

Normalizes a column (Min-Max Scaling) to [0, 1]. Returns a new List<num>.
polynomialRegression(String xCol, String yCol, int degree) RegressionResult

Available on DataFrame, provided by the DataFrameStats extension

Performs polynomial regression.
quantileColumn(String col, double q) num

Available on DataFrame, provided by the DataFrameStats extension

Calculates the q-th quantile of a column.
seasonalDecompose(String col, int period, {String model = 'additive'}) SeasonalDecomposition

Available on DataFrame, provided by the DataFrameStats extension

Performs seasonal decomposition on a column.
skewnessColumn(String col) double

Available on DataFrame, provided by the DataFrameStats extension

Calculates the skewness of a column.
standardizeColumn(String col) List<num>

Available on DataFrame, provided by the DataFrameStats extension

Standardizes a column (Z-Score Scaling). Returns a new List<num>.
stdDevColumn(String col) double

Available on DataFrame, provided by the DataFrameStats extension

Calculates the standard deviation of a column.
sumColumn(String col) num

Available on DataFrame, provided by the DataFrameStats extension

Calculates the sum of a column.
tTestOneSample(String col, num populationMean) HypothesisTestResult

Available on DataFrame, provided by the DataFrameStats extension

Performs a one-sample t-test on a column against a population mean.
tTestPair(String col1, String col2, {bool equalVariance = true}) HypothesisTestResult

Available on DataFrame, provided by the DataFrameStats extension

Performs a two-sample t-test between two columns.
varianceColumn(String col) double

Available on DataFrame, provided by the DataFrameStats extension

Calculates the variance of a column.
zTest(String col, num mu0, num sigma) HypothesisTestResult

Available on DataFrame, provided by the DataFrameStats extension

Z-test wrapper.