A mathematical technique for comparing large symbol sets suggests that less frequently used words are mainly responsible for the evolution of the English language over the past two centuries.
Similarity of Symbol Frequency Distributions with Heavy Tails
Martin Gerlach, Francesc Font-Clos, and Eduardo G. Altmann
Phys. Rev. X 6, 021009