Category: R

Sep 17

Organism of the week #28 – Fractal art

By polypompholyx in Maths, Organism of the week, R

Flowers are essentially tarts. Prostitutes for the bees. (Uncle Monty, Withnail and I) Our tiny garden has only passing acquaintance with sunshine, so about the only plants that really thrive in its dingy clutches are shade-loving ferns. This Japanese painted fern is my current favourite: who needs flowers anyway, when leaves look like this? The colour is spectacular, but …

Athyrium, fern, fractal, Humata
1 comment

Aug 26

Statistical power

By polypompholyx in Misconceptions, R

Coin distribution overlaps for increasing sample size [CC-BY-SA-3.0 Steve Cook]

In a recent(ish) post, we saw that if a fair coin is flipped 30 times, the probability it will give us 10 or fewer heads is less than 5% (4.937% to be pointlessly precise). Fisher quantified this using the p value of a data set: the probability of obtaining data (or a test statistic based on those data) at …

Leave comment

Mar 03

Nonlinear regression

By polypompholyx in R

Species-area relationship for Caribbean herps [CC-BY-SA-3.0 Steve Cook]

Nonlinear regression is used to see whether one continuous variable is correlated with another continuous variable, but in a nonlinear way, i.e. when a set of x vs. y data you plan to collect do not form a straight line, but do fall on a curve that can be modelled in some sensible way by …

Leave comment

Feb 24

Analysis of variance: ANOVA (2 way)

By polypompholyx in R

The technique for a one-way ANOVA can be extended to situations where there is more than one factor, or – indeed – where there are several factors with several levels each, which may have synergistic or antagonistic effects on each other. In the models we have seen so far (linear regression, one-way ANOVA) all we …

Leave comment

Feb 24

Analysis of variance: ANOVA (1 way)

By polypompholyx in R

Analysis of variance is the technique to use when you might otherwise be considering a large number of pairwise F and t tests, i.e. where you want to know whether a factor with more than 2 levels is a useful predictor of a dependent variable. For example, cuckoo_eggs.csv contains data on the length of cuckoo eggs laid …

2 comments

Feb 17

Comparison of expected and observed count data: the χ² test

By polypompholyx in R

A χ2 test is used to measure the discrepancy between the observed and expected values of count data. The dependent data must – by definition – be count data. If there are independent variables, they must be categorical. The test statistic derived from the two data sets is called χ2, and it is defined as …

1 comment

Feb 10

Correlation of data: linear regression

By polypompholyx in R

Linear regression is used to see whether one continuous variable is correlated with another continuous variable in a linear way, i.e. can the dependent variable y be modelled with a straight-line response to changes in the independent covariate x: Here b is the estimated slope of the best-fit line (a.k.a. gradient, often written m), a …

Leave comment

Feb 03

Statistical testing

By polypompholyx in R

If you want a random yes/no answer to a question, like “who should kick-off this football match?” it’s very common to entrust the decision to the flip of a coin, on the assumption that the coin doesn’t care which side gets the advantage. But what if that trust is misplaced? What if the coin gives …

Leave comment

Feb 03

Comparison of means: the t test

By polypompholyx in R

A t test is used to compare the means of two data sets, and it relies on calculation of a test statistic called t. This statistic is derived from the two data sets and it is defined as the difference between the means of the two data sets, x̅1 and x̅2 (or the difference between a mean x̅ and …

Leave comment

Feb 03

Comparison of variances: the F test

By polypompholyx in R

An F test is used to compare the variances of two data sets: As it is used to compare variances, the dependent data must – by definition – be numeric. As it is used to compare two distinct sets of data, these sets represent the two levels of a factor. The test statistic we use …

Leave comment

Except where otherwise noted, content on this site is licensed under a Creative Commons License CC-BY-SA-3.0 Steve Cook

Made with by Graphene Themes.

Category: R

Organism of the week #28 – Fractal art

Statistical power

Nonlinear regression

Analysis of variance: ANOVA (2 way)

Analysis of variance: ANOVA (1 way)

Comparison of expected and observed count data: the χ² test

Correlation of data: linear regression

Statistical testing

Comparison of means: the t test

Comparison of variances: the F test

Recent Posts

Categories

Blogroll

Archives