Libraries.Compute.Statistics.Tests.CompareDistributions Documentation
This class implements two tests:
CompareDistributionToNormal is a Shapiro-Wilk test. It checks whether a sample is normally distributed. For more information: https://en.wikipedia.org/wiki/Shapiro%E2%80%93Wilk_test
________________ is a Kolmogorov-Smirnov test. It checks a sample's distribution against a known distribution. For more information: https://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test
It was partially adapted from similar models in Apache Commons and FORTRAN, but was expanded upon to simplify the library and add a variety of helper actions and calculations that were missing. More information about this class can be found on the documentation pages:
KolmogorovSmirnovTest: https://commons.apache.org/proper/commons-math/javadocs/api-3.6.1/index.html
FORTRAN: http://lib.stat.cmu.edu/apstat/R94
Example Code
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
frame:Calculate(compare)
output compare:GetFormalSummary()
Inherits from: Libraries.Compute.Statistics.DataFrameCalculation, Libraries.Compute.Statistics.Tests.StatisticalTest, Libraries.Language.Object, Libraries.Compute.Statistics.Inputs.ColumnInput, Libraries.Compute.Statistics.Inputs.FactorInput
Actions Documentation
AddColumn(integer column)
This action adds a value to the end of the input.
Parameters
- integer column
AddFactor(integer column)
This action adds a value to the end of the input.
Parameters
- integer column
Calculate(Libraries.Compute.Statistics.DataFrame frame)
Parameters
Compare(Libraries.Language.Object object)
This action compares the hash codes of two objects and returns an integer: -1 if this object's hash code is smaller than that of the object passed as a parameter, 0 if the two are equal, and 1 if it is larger. This action was changed in Quorum 7 to return an integer, instead of a CompareResult object, because the previous implementation was causing efficiency issues.
Parameters
- Libraries.Language.Object: The object to compare to.
Return
integer: The Compare result, Smaller, Equal, or Larger.
Example
Object o
Object t
integer result = o:Compare(t) //1 (larger), 0 (equal), or -1 (smaller)
CompareDistributionToNormal(Libraries.Compute.Statistics.DataFrame frame)
The Shapiro-Wilk test checks each sample for normality.
Null hypothesis: The sample may be normally distributed.
Alternative hypothesis: The sample is not normally distributed.
Parameters
Example
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
DataFrame frame
frame:Load("data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
compare:AddColumn(2)
compare:Calculate(frame)
output compare:GetFormalSummary()
EmptyColumns()
This action empties the list, clearing out all of the items contained within it.
EmptyFactors()
This action empties the list, clearing out all of the items contained within it.
Equals(Libraries.Language.Object object)
This action determines if two objects are equal based on their hash code values.
Parameters
- Libraries.Language.Object: The object to be compared.
Return
boolean: True if the hash codes are equal and false if they are not equal.
Example
use Libraries.Language.Object
use Libraries.Language.Types.Text
Object o
Text t
boolean result = o:Equals(t)
GetColumn(integer index)
This action gets the item at a given location in an array.
Parameters
- integer index
Return
integer: The item at the given location.
GetColumnIterator()
This action gets an iterator for the object and returns that iterator.
Return
Libraries.Containers.Iterator: Returns the iterator for an object.
GetColumnSize()
This action gets the size of the array.
Return
integer:
GetDegreesOfFreedom()
This returns the degrees of freedom if only one result exists.
Return
number: the Degrees of Freedom.
GetFactor(integer index)
This action gets the item at a given location in an array.
Parameters
- integer index
Return
integer: The item at the given location.
GetFactorIterator()
This action gets an iterator for the object and returns that iterator.
Return
Libraries.Containers.Iterator: Returns the iterator for an object.
GetFactorSize()
This action gets the size of the array.
Return
integer:
GetFormalSummary()
This action summarizes the results and places them into formal academic language, in APA format. For more information: https://apastyle.apa.org/instructional-aids/numbers-statistics-guide.pdf
Return
text: a condensed formal result of the test
Example
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
frame:Calculate(compare)
output compare:GetFormalSummary()
GetGroups(Libraries.Compute.Statistics.DataFrame frame)
Gets the fully factored samples/groups as an array of DataFrames. Using an array of DataFrames instead of a single DataFrame helps with multivariate cases.
Parameters
Return
GetHashCode()
This action gets the hash code for an object.
Return
integer: The integer hash code of the object.
Example
Object o
integer hash = o:GetHashCode()
GetProbabilityValue()
This returns the probability if only one result exists.
Return
number: the P-Value.
GetReport(Libraries.System.File file)
Parameters
Example
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
frame:Calculate(compare)
DataFrame result = compare:GetSummaryDataFrame()
result:Save("myresult.csv")
GetResult()
This returns a result if only one exists.
Return
Libraries.Compute.Statistics.Reporting.CompareDistributionsResult: the CompareDistributionsResult object
Example
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
use Libraries.Compute.Statistics.Reporting.CompareDistributionsResult
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
compare:Calculate(frame)
CompareDistributionsResult result = compare:GetResult()
GetResults()
Return
Libraries.Containers.Array: an array of CompareDistributionsResult objects
Example
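use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
use Libraries.Compute.Statistics.Reporting.CompareDistributionsResult
use Libraries.Containers.Array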
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
compare:AddColumn(2)
frame:Calculate(compare)
Array<CompareDistributionsResult> results = compare:GetResults()
GetSignificanceLevel()
Gets the significance level of the test (default is 0.05).
Return
number: the significance level between 0 and 1.
GetStatisticalFormatting()
GetSummary()
Return
text: a list of the important statistics of the test
Example
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
frame:Calculate(compare)
output compare:GetSummary()
GetSummaryDataFrame()
Return
Libraries.Compute.Statistics.DataFrame: a DataFrame of the important statistics of the test
Example
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions
DataFrame frame
frame:Load("Data/Data.csv")
CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(1)
frame:Calculate(compare)
DataFrame result = compare:GetSummaryDataFrame()
result:Save("myresult.csv")
GetTestStatistic()
This returns the test statistic if only one result exists.
Return
number: the test statistic.
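As a minimal sketch of reading a single result, the following uses only actions documented on this page. It assumes a hypothetical file Data/Data.csv whose column 0 is numeric, and that selecting one column yields exactly one result.

```quorum
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions

DataFrame frame
frame:Load("Data/Data.csv") // hypothetical file path

CompareDistributions compare
compare:AddColumn(0) // assumes column 0 holds numeric data
compare:Calculate(frame)

// These getters are documented as valid when only one result exists.
number statistic = compare:GetTestStatistic()
number probability = compare:GetProbabilityValue()
output "Test statistic: " + statistic
output "P-value: " + probability
```

When several columns are selected, GetResults() is likely the safer way to inspect each result individually.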
IsEmptyColumns()
This action returns a boolean value, true if the container is empty and false if it contains any items.
Return
boolean: Returns true when the container is empty and false when it is not.
IsEmptyFactors()
This action returns a boolean value, true if the container is empty and false if it contains any items.
Return
boolean: Returns true when the container is empty and false when it is not.
RemoveColumn(integer column)
This action removes the first occurrence of an item that is found in the Addable object.
Parameters
- integer column
Return
boolean: Returns true if the item was removed and false if it was not removed.
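The column-management actions can be combined as in the sketch below. It relies only on actions listed on this page; the column indices 0 and 2 are arbitrary illustrative values, and no data file is needed because nothing is calculated.

```quorum
use Libraries.Compute.Statistics.Tests.CompareDistributions

CompareDistributions compare
compare:AddColumn(0)
compare:AddColumn(2)
output compare:GetColumnSize() // number of selected columns

// RemoveColumn takes the column value to remove, not a position in the list.
boolean removed = compare:RemoveColumn(2)
output removed

compare:EmptyColumns() // clear the remaining selection
output compare:IsEmptyColumns()
```

The factor actions (AddFactor, RemoveFactor, EmptyFactors, and so on) mirror these signatures, so the same pattern should apply to them.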
RemoveColumnAt(integer index)
This action removes an item from an indexed object and returns that item.
Parameters
- integer index
RemoveFactor(integer column)
This action removes the first occurrence of an item that is found in the Addable object.
Parameters
- integer column
Return
boolean: Returns true if the item was removed and false if it was not removed.
RemoveFactorAt(integer index)
This action removes an item from an indexed object and returns that item.
Parameters
- integer index
SetSignificanceLevel(number significanceLevel)
Sets the significance level of the test (default is 0.05).
Parameters
- number significanceLevel: the significance level between 0 and 1.
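A hedged sketch of how the significance level might be used in a decision follows. It assumes a hypothetical file Data/Data.csv with numeric data in column 0 and a single result; the value 0.01 is only an illustration.

```quorum
use Libraries.Compute.Statistics.DataFrame
use Libraries.Compute.Statistics.Tests.CompareDistributions

DataFrame frame
frame:Load("Data/Data.csv") // hypothetical file path

CompareDistributions compare
compare:SetSignificanceLevel(0.01) // stricter than the 0.05 default
compare:AddColumn(0) // assumes column 0 holds numeric data
compare:Calculate(frame)

// Compare the p-value against the configured level.
if compare:GetProbabilityValue() < compare:GetSignificanceLevel()
    output "Reject the null hypothesis at the chosen significance level."
else
    output "Fail to reject the null hypothesis at the chosen significance level."
end
```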
SetStatisticalFormatting(Libraries.Compute.Statistics.Reporting.StatisticsFormatting formatting)
Sets the StatisticsFormatting object used to format the results of the test.
Parameters