Vocabulary Dispersion

Many frequency dictionaries include not only a word's frequency but also its dispersion. You can learn a lot about a word from its frequency and dispersion throughout a text.

Open the Report

Go to Analyze > Book Reports > Vocabulary Dispersion Report

Vocabulary Dispersion Report Icon

Absolute Frequency

Absolute frequency is the total number of times a word appears in a text. The more times it appears, the more important the word is. Language learners often use the most frequent words to guide their learning, while editors use the least frequent words to identify typos.

Statistics

Look for the Freq and SectF columns to see a word's absolute frequency. The values in SectF will be the same as the Freq values if you're not analyzing a specific section of the text.

Example of vocabulary dispersion data report

Normalized Frequency

When comparing a word's frequency between texts, the texts often consist of different sizes. The normalized frequency estimates how many times each word would occur if the book or corpus contained exactly one million words.

Statistics

Look for the RelF column to see a word's normalized frequency.

Comparing big book with small book

Even Dispersion

Imagine each text is divided into five buckets. If a word occurs 100 times in each bucket, its dispersion is evenly dispersed. From this, you'll know that the word is important throughout the entire text.

Marks

In WordCruncher, the words within these buckets are called marks. By default, a mark consists of 2,000 words. In the report, each vertical line represents a single mark.

Example of buckets being full

Uneven Dispersion

Some words only occur in one section, so even though one bucket is full, the rest may be empty. This means the word is important for only a section of your text.

Example of only one bucket being full

View Boundary

Within the View Boundary window, you can view a specific section of your text. Here, we have selected '2020.'

Below the View Boundary window, you can select subsections. Here, we have selected 'May.'

Along the top of the Vocabulary Dispersion data window, you can see a latticed section. This is the subsection of the text. In this example, the lattice represents 'May 2020.'

Example of vocabulary dispersion data report

Filter and Sort

Filter by word or frequency.

Sort by word, frequency, count, dispersion, and standard deviation.

Example of filter and sort options