How to remove top phrases from a word cloud

7 visualizzazioni (ultimi 30 giorni)
I have created a set of word clouds, but want to remove some common two-word phrases that appear in each.
I tried using removeWords, but it did not appear to work!
Is there a way to see what the top phrases are so that I could check if there are any extra spaces, for instance?

Risposta accettata

Pat Canny
Pat Canny il 2 Lug 2020
The removeWords function only removes specific words. It won't remove phrases like you might expect.
The word clouds are showing n-grams, therefore you are looking remove common n-grams. To do so, use the removeNgrams function. Each phrase should be a 1x2 string array (see the "thou art" and "thou dost" example in the removeNgrams Documentation)
If you are interested in the top n-grams in a bag-of-n-grams model (which you used to create the word cloud), you can use the topkngrams function.
Alternatively (and this is not the preferred approach), you can extract the WordData from a word cloud object:
wc = wordcloud(bag);
wordData = wc.WordData;

Più risposte (0)

Categorie

Scopri di più su Labels and Annotations in Help Center e File Exchange

Prodotti


Release

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by