Word Frequency | |||||
Description | Word frequency is the number of occurrences of a word in a given text. | ||||
Why to use | Textual Analysis – Pre Processing | ||||
When to use | When you want to find the frequency of a word, that is, the number of times a particular word has appeared in a given text. | When not to use | On numerical data. | ||
Prerequisites | It should be textual data. | ||||
Input | It was the best of times it was the worst of times. | Output | it – 2 | ||
Related algorithms | | Alternative algorithm | | ||
Statistical Methods used | | Limitations | It cannot be used on numerical data. | ||
Word Frequency is located under Textual Analysis in Pre Processing, in the task pane on the left. Drag and drop the algorithm onto the canvas to use it. Click the algorithm to view and select the properties for analysis.
Refer to Properties of Word Frequency.
Word frequency is the number of occurrences of a word in the textual data.
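To make the definition concrete, here is a minimal Python sketch of counting word occurrences. It is only an illustration, not the product's implementation: the function name `word_frequency`, the case-insensitive matching, and the tokenization rule (runs of letters, digits, and apostrophes) are all assumptions.

```python
import re
from collections import Counter


def word_frequency(text):
    """Count how often each word occurs in text (case-insensitive, by assumption)."""
    # Lowercase the text, then treat every run of letters, digits,
    # or apostrophes as one word; punctuation and spaces are separators.
    words = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(words)


freq = word_frequency("It was the best of times it was the worst of times.")
print(freq["it"])     # 2
print(freq["times"])  # 2
print(freq["best"])   # 1
```

A case-sensitive count would treat "It" and "it" as different words, so the choice of normalization changes the result.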
Properties of Word Frequency
The available properties of Word Frequency are as shown in the figure given below.
The table given below describes the fields available in the properties of Word Frequency.
Field | Description | Remark |
---|---|---|
Task Name | It displays the name of the selected task. | You can click the text field to edit or modify the name of the task as required. |
Text | It allows you to select the text for which you want to count the frequency of words. | |
Group By | It allows you to select the field by which the results are grouped. | |
Advanced > Term Type | It allows you to select the type of word whose frequency you want to determine. | The available word types are, |
Advanced > Number of Words | It allows you to select the number of words for which you want to count the frequency. | The default value is 25. |
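The Text, Group By, and Number of Words properties can be pictured with a short Python sketch. This is one plausible reading of those fields, not the product's actual logic: the function `term_frequency_table`, its parameter names, the tokenization rule, and the sample rows are all hypothetical, and Term Type is not modeled because its options are not listed here.

```python
import re
from collections import Counter, defaultdict


def term_frequency_table(rows, text_field, group_by=None, number_of_words=25):
    """Return {group: [(term, frequency), ...]}, most frequent terms first.

    rows            -- list of dicts, one per record (hypothetical input shape)
    text_field      -- key holding the text to analyze (the Text property)
    group_by        -- optional key used to split the counts (the Group By property)
    number_of_words -- how many top terms to keep per group (default 25)
    """
    counts = defaultdict(Counter)
    for row in rows:
        # Without a group-by field, everything is counted in one group.
        group = row[group_by] if group_by else "all"
        words = re.findall(r"[a-z0-9']+", row[text_field].lower())
        counts[group].update(words)
    return {g: c.most_common(number_of_words) for g, c in counts.items()}


# Made-up sample rows, for illustration only.
rows = [
    {"source": "A", "snippet": "Markets rally for a second day"},
    {"source": "A", "snippet": "Plans for a new bridge"},
    {"source": "B", "snippet": "Search for a new coach continues"},
]

per_source = term_frequency_table(rows, "snippet", group_by="source", number_of_words=2)
overall = term_frequency_table(rows, "snippet")
print(per_source["A"])
print(dict(overall["all"])["for"])  # 3
```

Keeping only the top terms per group mirrors the idea of a Number of Words limit: the full counts are computed first, and the limit is applied when the result is reported.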
Interpretation of Word Frequency
The figure given below shows the result of Word Frequency applied to Google News snippets.
In the figure, the Frequency column gives the number of times the word in the corresponding 'Term' column appears in the given text column.
In the highlighted example, the word "for" appears 510 times across all the texts in the column.