Word Correlation 

Description

Word correlation refers to the association or relationship between two words in a text. It determines whether, and how strongly, pairs of words are related to each other.
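
The source does not state which statistic the algorithm uses. As a minimal sketch of one common way to score word association, the Python example below computes the phi coefficient (the Pearson correlation of two presence/absence indicators) between two words across a set of documents. The function name `phi_correlation`, the whitespace tokenization, and the sample documents are assumptions made for illustration, not the product's implementation.

```python
import math


def phi_correlation(docs, word_a, word_b):
    """Phi coefficient between the presence of two words across documents.

    Illustrative only: the product's actual scoring method is not documented.
    """
    n11 = n10 = n01 = n00 = 0
    for doc in docs:
        # Assumption: lowercase the text and split on whitespace.
        tokens = set(doc.lower().split())
        has_a, has_b = word_a in tokens, word_b in tokens
        if has_a and has_b:
            n11 += 1      # both words present
        elif has_a:
            n10 += 1      # only word_a present
        elif has_b:
            n01 += 1      # only word_b present
        else:
            n00 += 1      # neither word present
    denom = math.sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    if denom == 0:
        # One of the words appears in every document or in none of them.
        return 0.0
    return (n11 * n00 - n10 * n01) / denom


# Hypothetical sample documents.
sample_docs = [
    "stock market rises",
    "stock market falls",
    "rain expected today",
    "sunny weather today",
]

print(phi_correlation(sample_docs, "stock", "market"))
print(phi_correlation(sample_docs, "stock", "today"))
```

In this sample, "stock" and "market" always appear together, so the score is 1.0, while "stock" and "today" never share a document, so the score is -1.0.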

Why to use

Textual Analysis – Pre Processing 

When to use

On textual data.

When not to use

On numerical data.

Prerequisites

The input must be textual data.

Input


Output


Related algorithms

  • Case Convertor
  • Custom Words Remover
  • Frequent Words Remover
  • Lemmatizer
  • Punctuation Remover
  • Spelling Corrector
  • Stemmer
  • Advanced Entity Extraction
  • Word Frequency

Alternative algorithm

--

Statistical Methods used

-

Limitations

It cannot be used on numerical data.

Word Correlation is located under Textual Analysis in Pre Processing, in the task pane on the left. Drag and drop the algorithm onto the canvas to use it. Click the algorithm to view and select the properties for analysis.

Refer to Properties of Word Correlation.


Properties of Word Correlation

The available properties of Word Correlation are as shown in the figure given below.

The table given below describes different fields present on the properties of Word Correlation.

Field: Task Name
Description: It displays the name of the selected task.
Remark: You can click the text field to edit or modify the name of the task as required.

Field: Text
Description: It allows you to select the text field for which you want to find the correlation between words.
Remark:
  • Only one data field can be selected.
  • Only the textual data fields selected for the reader are visible.
  • Only a textual data field can be selected.

Field: Group By
Description: It allows you to select the field by which the results are grouped.
Remark:
  • Only one field can be selected.
  • It groups the results based on the selected field.

Advanced

Field: Correlation Type
Description: It allows you to select the type of word for which you want to find the correlation.
Remark: The available word types are:
  • Basic
  • Adjective
  • Verb
  • Noun

Field: Number of Terms
Description: It allows you to select the number of terms for which you want to find the correlation.
Remark: The default value is 15.
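
To make the roles of the Text, Group By, and Number of Terms properties more concrete, the following Python sketch shows one way they could interact: rows are grouped by a field, the most frequent terms in each group are kept, and a correlation score is computed for every pair of kept terms. This is a sketch under stated assumptions, not the product's implementation. The function names `pearson` and `correlate_top_terms`, the field names `text` and `source`, the whitespace tokenization, and the use of Pearson correlation on presence indicators are all hypothetical. Correlation Type is left out because filtering by adjective, verb, or noun would need a part-of-speech tagger.

```python
import math
from collections import Counter, defaultdict
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists (0.0 if constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def correlate_top_terms(rows, text_field, group_field=None, n_terms=15):
    """Return (group, term1, term2, score) tuples for the top terms per group.

    n_terms defaults to 15 to mirror the documented default; everything
    else here is an assumption made for illustration.
    """
    groups = defaultdict(list)
    for row in rows:
        key = row[group_field] if group_field else "all"
        # Assumption: lowercase the text and split on whitespace.
        groups[key].append(set(row[text_field].lower().split()))

    results = []
    for key, token_sets in groups.items():
        # Count the number of documents in the group that contain each term.
        counts = Counter(tok for toks in token_sets for tok in toks)
        top = [term for term, _ in counts.most_common(n_terms)]
        for t1, t2 in combinations(top, 2):
            xs = [int(t1 in toks) for toks in token_sets]
            ys = [int(t2 in toks) for toks in token_sets]
            results.append((key, t1, t2, round(pearson(xs, ys), 3)))
    return results


# Hypothetical input rows with a text field and a grouping field.
sample_rows = [
    {"text": "stock market rises", "source": "finance"},
    {"text": "stock market falls", "source": "finance"},
    {"text": "bond yields climb", "source": "finance"},
    {"text": "rain today", "source": "weather"},
    {"text": "sun today", "source": "weather"},
    {"text": "rain tomorrow", "source": "weather"},
]

for record in correlate_top_terms(sample_rows, "text", "source", n_terms=2):
    print(record)
```

With `n_terms=2`, each group yields a single pair. A larger value produces more pairs, because every combination of the kept terms is scored.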

Interpretation of Word Correlation

The figure given below represents the result of Word Correlation applied to Google News snippets.
In the figure, the Correlation Score column gives the correlation between the two terms, Term 1 and Term 2, in a given news snippet.
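
As a rough sketch of how such a result table might be read, the Python example below keeps only the term pairs whose score is large in absolute value and lists the strongest first. It assumes the score behaves like a Pearson-style coefficient between -1 and 1, which the source does not confirm. The function name `strongest_pairs`, the threshold, and the rows are illustrative values, not output taken from the figure.

```python
def strongest_pairs(rows, min_abs_score=0.5):
    """Keep pairs whose absolute score meets the threshold, strongest first."""
    kept = [row for row in rows if abs(row["score"]) >= min_abs_score]
    return sorted(kept, key=lambda row: abs(row["score"]), reverse=True)


# Hypothetical rows shaped like the Term 1 / Term 2 / Correlation Score output.
example_rows = [
    {"term1": "election", "term2": "weather", "score": 0.05},
    {"term1": "rain", "term2": "drought", "score": -0.61},
    {"term1": "election", "term2": "vote", "score": 0.82},
]

for row in strongest_pairs(example_rows):
    print(row["term1"], row["term2"], row["score"])
```

Under that assumption, a score close to 1 would suggest the two terms tend to appear together, a score close to 0 would suggest little association, and a negative score would suggest that one term tends to be absent when the other is present.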

