register login shopping cart shopping cart 0 items in cart

function libraries, tools and services for Microsoft SQL Server

 
 

SQLStatistics

Every kind of statistical test and metric
built right into the database layer

Buy now Download Free 30-day trial
 
Edit

SQLStatistics extends and enhances your Transact-SQL programming environment by adding 87 powerful new functions and aggregates that add up to the fastest, easiest to use and most comprehensive statistical analysis toolkit for Microsoft SQL Server available today. Functions include:

  • A portfolio of tests and metrics of correlation, similarity or dependence between datasets, including chi-squared tests of independence and goodness of fit; covariance metrics; F-tests to compare variances; paired and unpaired Student's-T tests to compare means; Pearson's product moment correlation coefficient and its square; the Fisher transformation; and the Z-test.
  • A broad range of descriptive statistics including geometric, harmonic and quadratic means; truncated and inter-quartile means and the tri-mean; modes of both continuous and discrete data; medians, quartiles, percentiles, percentile rank and absolute rank; skewness and kurtosis; weighted averages, variances, standard deviations and power means; range, mid-range, inter-quartile range, nth largest, nth smallest and more.
  • A comprehensive set of complete and partial beta and gamma functions and their inverses, complements and derivatives.
  • Many others including confidence tests using both normal and Student's-T distributions; functions for linear and exponential regression and extrapolation, including forecasting, error estimation, slope and intercept estimation; combinations and permutations; sum of squared deviations; normalization and more.


Beyond Excel

The functions available in SQLStatistics include numerically faithful equivalents of the familiar statistical functions found in Microsoft Excel and other popular packages, but also many others which - combined with the innate power and ability of SQL Server to rapidly crunch through huge volumes of data - gives you an analytical capability that completely blows away anything you could do with spreadsheets.


Native Look and Feel

All TotallySQL Transact-SQL extensions are implemented using familiar programming objects within SQL (scalar and table-valued functions, aggregate functions, user-defined types, stored procedures) , keeping the syntax clean and making them as fast and easy to use as the built-in native statistics functions such as SUM, AVG etc.


Outstanding Performance and Numerical Accuracy

The algorithms behind the SQLStatistics functions use industry-proven, best-in-class techniques devised by some of the world's leading numerical computing experts to maximise performance and accuracy even for fairly extreme input values. Careful heuristics and edge-case checks through the code ensure that problems caused by the limitations of floating-point arithmetic on digital computers (such as cancellation errors) are kept to an absolute mininum, and a broad range of special-purpose, high-performance numerical computing approaches are used in place of the slow, expensive computations that would result from 'naive' implementations of the underlying mathematics.


Multi-Valued Aggregates - even in SQL2005 !

Several of the SQLStatistics functions (for example weighted metrics for average, variance, standard deviation etc) are in the form of aggregate functions that take multiple input values. This works fine for SQL2008 but is commonly supposed to be impossible in SQL2005 - since you can supply only one input parameter to an aggregate function. We have, however, devised a very simple but smart way of doing this in SQL2005 using SQLTuples: you simply combine the input parameters (of any type) into a SQLTuples object on-the-fly as part of the function call. Only a very minor syntax change is needed and there is no measurable performance penalty even when used with truly enormous data sets.