Word Frequency 

Description

Word frequency is the number of occurrences of a word in a given text. 

Why to use

Textual Analysis – Pre Processing 

When to use

When you want to find how many times a particular word appears in a given text.

When not to use

On numerical data.

Prerequisites

The input should be textual data.

Input

It was the best of times it was the worst of times.
It was the age of wisdom it was the age of foolishness.


Output

It - 4
Was - 4
The - 4
Best - 1
Of - 4
Times - 2
Worst - 1
Age - 2
Wisdom - 1
Foolishness - 1
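
The counts in the Output above can be reproduced with a minimal Python sketch. It is an illustration only, not the product's implementation: it assumes that words are compared case-insensitively and that punctuation is ignored, which is what the sample output suggests ("It" and "it" are counted together, and "times." counts as "times"). The function name word_frequency is hypothetical.

```python
import re
from collections import Counter


def word_frequency(text):
    """Count how often each word occurs in text, ignoring case."""
    # Assumption: lowercase everything and treat each run of letters or
    # apostrophes as one word, so full stops are dropped.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


text = (
    "It was the best of times it was the worst of times.\n"
    "It was the age of wisdom it was the age of foolishness."
)

counts = word_frequency(text)

# Print each word with its count, in order of first appearance.
for word, count in counts.items():
    print(f"{word.capitalize()} - {count}")
```

Counter keeps words in the order they are first seen, so the printed list follows the same order as the sample output.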

Related algorithms

  • Case Convertor
  • Custom Words Remover
  • Frequent Words Remover
  • Lemmatizer
  • Punctuation Remover
  • Spelling Corrector
  • Stemmer
  • Advanced Entity Extraction
  • Word Correlation

Alternative algorithm


Statistical Methods used



Limitations

It cannot be used on numerical data.

Word Frequency is located under Textual Analysis in Pre Processing, in the task pane on the left. Drag and drop the algorithm onto the canvas to use it. Click the algorithm to view and select its properties for analysis.

Refer to Properties of Word Frequency.

Word frequency is the number of occurrences of a word in the textual data.

Properties of Word Frequency

The available properties of Word Frequency are shown in the figure given below.

The fields available in the properties of Word Frequency are described below.

Field: Task Name
Description: It displays the name of the selected task.
Remark: You can click the text field to edit the name of the task as required.

Field: Text
Description: It allows you to select the text field in which you want to count the frequency of words.
Remark:
  • Only one data field can be selected.
  • Textual data fields selected for the reader are visible.
  • Only a textual data field can be selected.

Field: Group By
Description: It allows you to select the field by which the results are grouped.
Remark:
  • Only one field can be selected.
  • It groups the results based on the selected field.

Advanced

Field: Term Type
Description: It allows you to select the type of word whose frequency you want to determine.
Remark: The available word types are:
  • Basic
  • Adjective
  • Verb
  • Noun

Field: Number of Words
Description: It allows you to select the number of words for which you want to count the frequency.
Remark: The default value is 25.
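
The effect of the Group By and Number of Words properties described above can be pictured with a short Python sketch. It is a rough illustration under stated assumptions, not the product's implementation: the function name word_frequency_by_group, the (group, text) row layout, and the sample rows are all hypothetical, and it assumes that the word limit is applied within each group. Term Type is not modelled here, because it would need a part-of-speech tagger.

```python
import re
from collections import Counter, defaultdict


def word_frequency_by_group(rows, number_of_words=25):
    """Return the most frequent words for each group.

    rows is an iterable of (group, text) pairs (an illustrative layout).
    number_of_words mirrors the Number of Words property (default 25).
    """
    counts = defaultdict(Counter)
    for group, text in rows:
        # Assumption: case-insensitive matching, punctuation ignored.
        counts[group].update(re.findall(r"[a-z']+", text.lower()))
    # Keep only the highest counts in each group.
    return {
        group: counter.most_common(number_of_words)
        for group, counter in counts.items()
    }


# Hypothetical sample data: one grouping field and one text field.
rows = [
    ("sports", "The team won the match"),
    ("sports", "The match was close"),
    ("tech", "Phone makers say the new phone beats the old phone"),
]

top_words = word_frequency_by_group(rows, number_of_words=2)
for group, terms in top_words.items():
    print(group, terms)
```

Leaving Group By unset corresponds to putting every row in a single group, so the counts are accumulated over all the texts in the column.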

Interpretation of Word Frequency

The figure given below shows the result of Word Frequency applied to Google News snippets.
In the figure, the Frequency column gives the number of times the word in the corresponding 'Term' column appears in the selected text column.
In the highlighted example, the word "for" appears 510 times across all the texts in the column.
