Statistics package
Revision as of 22:46, 12 August 2021
The statistics package is part of the Octave Forge project.
Core package
We need to decide what to do with the functions in the existing Forge package when they are not implemented or have been removed from the corresponding Matlab Toolbox:
To be decided
function | decision/notes |
---|---|
anderson_darling_cdf | |
cl_multinom | |
dcov | |
gevfit_lmom | |
iwishpdf | |
jsucdf | |
jsupdf | |
monotone_smooth | |
normalise_distribution | |
princomp | Matlab says the function has been removed, but that is likely untrue. We should keep it too. |
qrandn | |
runstest | |
sigma_pts | not yet released |
violin | |
vmpdf | |
vmrnd | |
wishpdf |
Matlab incompatible
These functions have the same name as Matlab functions but have a different interface.
- boxplot
- gpcdf
- gpinv
- gppdf
- gprnd
Can be reused in other functions
forge function | matlab counterpart |
---|---|
anderson_darling_test | adtest |
bbscdf | BirnbaumSaundersDistribution class |
bbsinv | BirnbaumSaundersDistribution class |
bbspdf | BirnbaumSaundersDistribution class |
bbsrnd | BirnbaumSaundersDistribution class |
binotest | binofit |
burrcdf | BurrDistribution class |
burrinv | BurrDistribution class |
burrpdf | BurrDistribution class |
burrrnd | BurrDistribution class |
nakacdf | NakagamiDistribution class |
nakainv | NakagamiDistribution class |
nakapdf | NakagamiDistribution class |
nakarnd | NakagamiDistribution class (nakarnd should be used to implement it) |
regress_gp | RegressionGP class |
repanova | RepeatedMeasuresModel.ranova |
tricdf | TriangularDistribution class |
triinv | TriangularDistribution class |
tripdf | TriangularDistribution class |
trirnd | TriangularDistribution class |
logistic_cdf | LogisticDistribution class |
logistic_inv | LogisticDistribution class |
logistic_pdf | LogisticDistribution class |
logistic_rnd | LogisticDistribution class |
stdnormal_cdf | NormalDistribution class |
stdnormal_inv | NormalDistribution class |
stdnormal_pdf | NormalDistribution class |
stdnormal_rnd | NormalDistribution class |
anova | anova method in different *Model classes |
manova | manova methods in different *Model classes |
bartlett_test | barttest |
kolmogorov_smirnov_test | kstest |
kolmogorov_smirnov_test_2 | kstest2 |
kruskal_wallis_test | kruskalwallis |
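As an illustration of how the rows above could be read, the stdnormal_* functions are the special case of a normal distribution with mean 0 and standard deviation 1, so a general normal CDF can be built on the standard one by standardising its argument. The following is a minimal sketch in Python rather than Octave, for illustration only; `normal_cdf` and its parameter names are hypothetical and are not taken from the package or from Matlab's NormalDistribution class.

```python
import math


def stdnormal_cdf(x):
    """CDF of the standard normal distribution (mean 0, standard deviation 1)."""
    # Phi(x) = 0.5 * erfc(-x / sqrt(2))
    return 0.5 * math.erfc(-x / math.sqrt(2.0))


def normal_cdf(x, mu=0.0, sigma=1.0):
    """CDF of a general normal distribution, reusing the standard-normal helper.

    Hypothetical wrapper: it standardises x and delegates to stdnormal_cdf.
    """
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    return stdnormal_cdf((x - mu) / sigma)
```

The same pattern (standardise, then delegate) would apply to the pdf, inverse, and random-number variants, although each needs its own scaling; for example, the pdf must also be divided by sigma.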
Ready to go
These functions appear to be Matlab compatible.
- anovan
- betastat
- binostat
- binotest
- canoncorr
- caseread
- casewrite
- cdf
- chi2stat
- cmdscale
- combnk
- copulacdf
- copulapdf
- copularnd
- crossval
- @cvpartition
- dendrogram
- expstat
- ff2n
- fitgmdist
- fstat
- fullfact
- gamfit
- gamlike
- gamstat
- geomean
- geostat
- gevcdf
- gevfit
- gevinv
- gevlike
- gevpdf
- gevrnd
- gevstat
- gmdistribution
- grp2idx
- harmmean
- hist3
- histfit
- hmmestimate
- hmmgenerate
- hmmviterbi
- hygestat
- iwishrnd
- jackknife
- kmeans
- linkage
- lognstat
- mad
- mahal
- mnpdf
- mnrnd
- mvncdf
- mvnpdf
- mvnrnd
- mvtcdf
- mvtpdf
- mvtrnd
- nanmax
- nanmean
- nanmedian
- nanmin
- nanstd
- nansum
- nanvar
- nbinstat
- normplot
- normstat
- pcacov
- pcares
- pdist2
- pdist
- plsregress
- poisstat
- random
- randsample
- raylcdf
- raylinv
- raylpdf
- raylrnd
- raylstat
- regress
- signtest
- squareform
- stepwisefit
- tabulate
- tblread
- tblwrite
- trimmean
- tstat
- ttest2
- ttest
- unidstat
- unifstat
- vartest2
- vartest
- wblstat
- wishrnd
- ztest
- prctile
- qqplot
- betacdf
- betainv
- betapdf
- betarnd
- binocdf
- binoinv
- binopdf
- binornd
- chi2cdf
- chi2inv
- chi2pdf
- chi2rnd
- expcdf
- expinv
- exppdf
- exprnd
- fcdf
- finv
- fpdf
- frnd
- gamcdf
- gaminv
- gampdf
- gamrnd
- geocdf
- geoinv
- geopdf
- geornd
- hygecdf
- hygeinv
- hygepdf
- hygernd
- logncdf
- logninv
- lognpdf
- lognrnd
- nbincdf
- nbininv
- nbinpdf
- nbinrnd
- normcdf
- norminv
- normpdf
- normrnd
- poisscdf
- poissinv
- poisspdf
- poissrnd
- tcdf
- tinv
- tpdf
- trnd
- unidcdf
- unidinv
- unidpdf
- unidrnd
- unifcdf
- unifinv
- unifpdf
- unifrnd
- wblcdf
- wblinv
- wblpdf
- wblrnd
Development
The following is an incomplete list of what the statistics package is missing to be Matlab compatible. Bugs are not listed here; search for and report them on the bug tracker instead.
This entire section is about the current development version. If a Matlab function is missing from the list and does not appear in the current release of the package, confirm that it is also missing in the development sources before adding it.
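One way to check the development sources before editing the list is to scan a local checkout for matching function files. The sketch below is written in Python and makes several assumptions: the helper name `missing_in_sources` is hypothetical, the checkout path is whatever you pass in, and a function is treated as present only if a `name.m` file or an `@name` class directory exists somewhere under that path.

```python
from pathlib import Path


def missing_in_sources(names, source_root):
    """Return the names with no matching .m file or @class directory under source_root."""
    root = Path(source_root)
    found = set()
    for path in root.rglob("*"):
        if path.is_file() and path.suffix == ".m":
            # Function files are assumed to be named after the function.
            found.add(path.stem)
        elif path.is_dir() and path.name.startswith("@"):
            # Old-style class directories such as @cvpartition.
            found.add(path.name[1:])
    # Preserve the order of the input list.
    return [name for name in names if name not in found]
```

Functions implemented in compiled sources, or defined inside another file, would be reported as missing by this check, so treat its output as a list of candidates to verify by hand.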
Missing functions
- ClassificationBaggedEnsemble
- ClassificationDiscriminant
- ClassificationDiscriminant.fit
- ClassificationEnsemble
- ClassificationKNN
- ClassificationKNN.fit
- ClassificationPartitionedEnsemble
- ClassificationPartitionedModel
- ClassificationTree
- ClassificationTree.fit
- CompactClassificationDiscriminant
- CompactClassificationEnsemble
- CompactClassificationTree
- CompactRegressionEnsemble
- CompactRegressionTree
- CompactTreeBagger
- ExhaustiveSearcher
- GeneralizedLinearModel
- GeneralizedLinearModel.fit
- GeneralizedLinearModel.stepwise
- KDTreeSearcher
- LinearMixedModel
- LinearMixedModel.fit
- LinearMixedModel.fitmatrix
- LinearModel
- LinearModel.fit
- LinearModel.stepwise
- NaiveBayes
- NaiveBayes.fit
- NonLinearModel
- NonLinearModel.fit
- ProbDistUnivKernel
- ProbDistUnivParam
- RegressionBaggedEnsemble
- RegressionEnsemble
- RegressionPartitionedEnsemble
- RegressionPartitionedModel
- RegressionTree
- RegressionTree.fit
- TreeBagger
- addTerms
- addedvarplot
- addlevels
- adtest
- andrewsplot
- anova1
- anova2
- ansaribradley
- aoctool
- barttest
- bbdesign
- betafit
- betalike
- binofit
- biplot
- bootci
- bootstrp
- candexch
- candgen
- capability
- capaplot
- ccdesign
- cdfplot
- cell2dataset
- chi2gof
- cholcov
- classify
- classregtree
- cluster
- clusterdata
- clustering.evaluation.CalinskiHarabaszEvaluation
- clustering.evaluation.DaviesBouldinEvaluation
- clustering.evaluation.GapEvaluation
- clustering.evaluation.SilhouetteEvaluation
- coefCI
- coefTest
- compact
- compare
- confusionmat
- controlchart
- controlrules
- cophenet
- copulafit
- copulaparam
- copulastat
- cordexch
- corrcov
- covarianceParameters
- coxphfit
- createns
- crosstab
- datasample
- dataset
- dataset2cell
- dataset2struct
- dataset2table
- datasetfun
- daugment
- dcovary
- designMatrix
- devianceTest
- dfittool
- disttool
- droplevels
- dummyvar
- dwtest
- ecdf
- ecdfhist
- evalclusters
- evcdf
- evfit
- evinv
- evlike
- evpdf
- evrnd
- evstat
- export
- factoran
- fitdist
- fitensemble
- fitglm
- fitlm
- fitlme
- fitlmematrix
- fitnlm
- fitted
- fixedEffects
- fracfact
- fracfactgen
- friedman
- fsurfht
- gagerr
- getlabels
- getlevels
- gline
- glmfit
- glmval
- glyphplot
- gname
- gpcdf
- gpfit
- gpinv
- gplike
- gplotmatrix
- gppdf
- gprnd
- gpstat
- grpstats
- gscatter
- haltonset
- hmmdecode
- hmmtrain
- hougen
- icdf
- inconsistent
- interactionplot
- invpred
- islevel
- ismissing
- isundefined
- jbtest
- johnsrnd
- join
- knnsearch
- kruskalwallis
- ksdensity
- kstest
- kstest2
- labels
- lasso
- lassoPlot
- lassoglm
- levelcounts
- leverage
- lhsdesign
- lhsnorm
- lillietest
- linhyptest
- lognfit
- lognlike
- lsline
- mahal
- maineffectsplot
- makedist
- manova1
- manovacluster
- mat2dataset
- mdscale
- mergelevels
- mhsample
- mle
- mlecov
- mnrfit
- mnrval
- multcompare
- multivarichart
- mvregress
- mvregresslike
- nancov
- nbinfit
- ncfcdf
- ncfinv
- ncfpdf
- ncfrnd
- ncfstat
- nctcdf
- nctinv
- nctpdf
- nctrnd
- nctstat
- ncx2cdf
- ncx2inv
- ncx2pdf
- ncx2rnd
- ncx2stat
- negloglik
- nlinfit
- nlintool
- nlmefit
- nlmefitsa
- nlparci
- nlpredci
- nnmf
- nominal
- normfit
- normlike
- normspec
- optimalleaforder
- ordinal
- parallelcoords
- paramci
- paretotails
- partialcorr
- partialcorri
- pca
- pearsrnd
- perfcurve
- plotAdded
- plotAdjustedResponse
- plotDiagnostics
- plotEffects
- plotInteraction
- plotResiduals
- plotSlice
- poissfit
- polytool
- ppca
- predict
- prob.BetaDistribution
- prob.BinomialDistribution
- prob.BirnbaumSaundersDistribution
- prob.BurrDistribution
- prob.ExponentialDistribution
- prob.ExtremeValueDistribution
- prob.GammaDistribution
- prob.GeneralizedExtremeValueDistribution
- prob.GeneralizedParetoDistribution
- prob.InverseGaussianDistribution
- prob.KernelDistribution
- prob.LogisticDistribution
- prob.LoglogisticDistribution
- prob.LognormalDistribution
- prob.MultinomialDistribution
- prob.NakagamiDistribution
- prob.NegativeBinomialDistribution
- prob.NormalDistribution
- prob.PiecewiseLinearDistribution
- prob.PoissonDistribution
- prob.RayleighDistribution
- prob.RicianDistribution
- prob.TriangularDistribution
- prob.UniformDistribution
- prob.WeibullDistribution
- prob.tLocationScaleDistribution
- probplot
- procrustes
- proflik
- qrandset
- qrandstream
- randomEffects
- randtool
- rangesearch
- ranksum
- raylfit
- rcoplot
- refcurve
- refline
- regstats
- relieff
- removeTerms
- residuals
- response
- ridge
- robustdemo
- robustfit
- rotatefactors
- rowexch
- rsmdemo
- rstool
- sampsizepwr
- scatterhist
- sequentialfs
- setlabels
- signrank
- silhouette
- slicesample
- sobolset
- statget
- statset
- step
- stepwise
- stepwiseglm
- stepwiselm
- struct2dataset
- surfht
- svmclassify
- svmtrain
- table2dataset
- tabulate
- tdfread
- tiedrank
- truncate
- unifit
- vartestn
- wblfit
- wbllike
- wblplot
- x2fx
- xptread
Missing options
- explike: censoring and frequency aren't yet implemented