In one of my applications I am trying to do some data mining: I assign scores to certain words to measure how much bad language is used in comments stored in a long text column.
For that I have a table listing the words with their respective scores, and an expression gives me the score for each word that has been used.
The problem is that the measurement is not exact: the expression matches words character by character, so if a word has a comma, period, or semicolon directly before or after it, the expression does not recognize it as that word.
It would be nice to have a function or expression that does NOT ALLOW non-letter characters directly before or after a word.
E.g.: DO NOT ALLOW “word.”, “.Word”, “, word”, “word,”, etc.
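
To make the idea concrete, here is a minimal sketch in Python of the behaviour I have in mind. It is only an illustration, not something the platform provides: the names `WORD_SCORES`, `tokenize`, and `score_comment` are hypothetical, and the score table is made-up sample data standing in for my words table. It keeps only runs of letters, so any punctuation attached to a word is dropped before the lookup.

```python
import re

# Hypothetical sample data standing in for the word/score table.
WORD_SCORES = {"darn": 2, "heck": 1}

# Matches runs of letters only: word characters minus digits and underscore.
LETTERS_ONLY = re.compile(r"[^\W\d_]+")

def tokenize(text):
    """Return the lowercase words in text, with punctuation and digits dropped."""
    return [word.lower() for word in LETTERS_ONLY.findall(text)]

def score_comment(text, scores=WORD_SCORES):
    """Sum the scores of all listed words found in text (whole words only)."""
    return sum(scores.get(word, 0) for word in tokenize(text))

print(tokenize("word., .Word"))                         # ['word', 'word']
print(score_comment("Well, darn. Heck; .Darn, heck!"))  # 6
```

One limitation of this sketch: an apostrophe also counts as a non-letter, so a word like "don't" is split into "don" and "t".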