Welch’s t-test, named for its creator, Bernard Lewis Welch, is an adaptation of Student’s t-test. Unlike Student’s t-test, it does not assume that the two populations have equal variances (Welch 1947). Like the chi-squared test and the log-likelihood ratio test, it is based on hypothesis testing, but in contrast to them it takes more than the overall frequency of a feature into account: the sample mean, the standard deviation and the sample size all enter the calculation of the t-value. For this reason, the measure deals better with words that are frequent overall but occur in only one text, or in only one part of a text, in a given collection. Such a concentrated distribution has a high standard deviation, which lowers the t-value.
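
To illustrate how the mean, standard deviation and sample size combine, the following minimal Python sketch computes Welch’s t-value together with the Welch-Satterthwaite degrees of freedom. The function name `welch_t` and the per-segment word counts are hypothetical and chosen only for illustration; they are not taken from any particular corpus or tool. The sketch assumes that each sample holds the counts of one word in equally sized text segments and that at least one of the two samples has a non-zero variance.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return Welch's t-value and the Welch-Satterthwaite degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error of each sample mean: s^2 / n
    # (statistics.variance is the sample variance, with n - 1 in the denominator).
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    # The variances are NOT pooled, which is what distinguishes Welch's test
    # from Student's t-test.
    t = (mean(sample_a) - mean(sample_b)) / sqrt(se2_a + se2_b)
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t, df


# Hypothetical counts of one word in five equally sized segments per corpus.
reference = [1, 0, 1, 0, 1]   # 3 occurrences in the comparison corpus
even = [4, 4, 4, 4, 4]        # 20 occurrences, spread evenly
bursty = [20, 0, 0, 0, 0]     # 20 occurrences, all in one segment

t_even, df_even = welch_t(even, reference)
t_bursty, df_bursty = welch_t(bursty, reference)

print(f"even:   t = {t_even:.2f}, df = {df_even:.2f}")
print(f"bursty: t = {t_bursty:.2f}, df = {df_bursty:.2f}")
```

Both target samples contain the same total of 20 occurrences, yet the evenly distributed word receives a t-value of roughly 13.9, while the word concentrated in a single segment receives only about 0.85, because its large variance inflates the denominator. A measure based on total frequency alone would not distinguish the two cases.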

