the-stats-duck
A DuckDB extension that adds statistical functions to the SQL surface. The shared analytics layer that quietly powers both KoliLang and Bedevere.
What it does
DuckDB ships with a strong but deliberately compact statistics surface. the-stats-duck extends it with the additional
functions and distributions that clinical and tabular analytics actually need — quantile families, distribution CDFs and
inverses, hypothesis-test machinery, the supporting numerics — without forcing analysts to leave SQL for a host language.
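For orientation, here is a minimal sketch of the kind of host-language computation these SQL functions are meant to replace. It uses only Python's standard library and does not show the-stats-duck's own API; the sample values and the `two_sided_p` helper are hypothetical, chosen for illustration only.

```python
from statistics import NormalDist, quantiles

# Hypothetical sample, for illustration only.
values = [2.1, 3.4, 3.9, 4.4, 5.0, 5.8, 6.3, 7.7]

# Quantile family: the three quartile cut points of the sample.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")

# Distribution CDF and its inverse, here for the standard normal.
std_normal = NormalDist(mu=0.0, sigma=1.0)
p = std_normal.cdf(1.96)        # probability mass at or below z = 1.96
z = std_normal.inv_cdf(0.975)   # z value with 97.5% of the mass at or below it


def two_sided_p(z_stat: float) -> float:
    """Two-sided p-value for a z statistic under the standard normal."""
    return 2.0 * (1.0 - std_normal.cdf(abs(z_stat)))
```

Each of these steps requires pulling rows out of the database and into a host program. The extension's premise is that the same primitives should be callable from inside the query.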
Charts from SQL
Statistics are only half the story. the-stats-duck also adds a VISUALIZE directive to DuckDB’s SQL
surface, so a query can describe a chart and hand back a plot — the same SQL-native approach as the stats, and the charting
primitive Bedevere builds on.
Why it exists
Both KoliLang and Bedevere lean on DuckDB for the heavy lifting. When KoliLang needs a quantile or a distribution function for a
SAS proc, or Bedevere needs the same primitive for an interactive query, the answer should come from the same place. That place is
the-stats-duck.
Use it
The extension is open source. Build it from source, load it into any DuckDB session, and the new functions are available on the SQL surface alongside the built-ins. The repo is the source of truth for current function coverage and build instructions.
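As a rough sketch of the load step, the snippet below opens a DuckDB session from Python and loads a locally built extension file. It assumes the `duckdb` Python package is installed and that the build produces a `.duckdb_extension` file; the file path is supplied by the caller, and the helper names `load_statement` and `connect_with_extension` are hypothetical. The repository's build instructions remain the authority on the actual artifact name and location.

```python
def load_statement(extension_path: str) -> str:
    """Build the DuckDB LOAD statement for a local extension file."""
    # Double any single quotes so the path is a valid SQL string literal.
    escaped = extension_path.replace("'", "''")
    return f"LOAD '{escaped}'"


def connect_with_extension(extension_path: str):
    """Open an in-memory DuckDB session and load the extension into it."""
    import duckdb  # third-party package: pip install duckdb

    # Extensions built locally are typically unsigned, so the session
    # has to opt in when the connection is created.
    con = duckdb.connect(config={"allow_unsigned_extensions": "true"})
    con.execute(load_statement(extension_path))
    return con
```

Once the extension is loaded, the functions it adds can be called through `con.execute(...)` in the same way as DuckDB's built-ins.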
Repository: github.com/KoliStat/the-stats-duck