Help CenterHelp Center
Length of documents in document array
N = doclength(documents)
N = doclength(documents) returns the number of tokens in each document in documents.
N
documents
example
collapse all
Find the number of words in an array of tokenized documents. Erase the punctuation characters so they do not get counted as words.
str = [ ... "An example of a short sentence." "A second short sentence."]; documents = tokenizedDocument(str)
documents = 2x1 tokenizedDocument: 7 tokens: An example of a short sentence . 5 tokens: A second short sentence .
documents = erasePunctuation(documents)
documents = 2x1 tokenizedDocument: 6 tokens: An example of a short sentence 4 tokens: A second short sentence
N = 2×1 6 4
tokenizedDocument
Input documents, specified as a tokenizedDocument array.
Document lengths, returned as a vector of nonnegative integers. The size of N is the same as the size of documents.
Introduced in R2017b
context | doc2cell | joinWords | string | tokenizedDocument
context
doc2cell
joinWords
string
Hai fatto clic su un collegamento che corrisponde a questo comando MATLAB:
Esegui il comando inserendolo nella finestra di comando MATLAB. I browser web non supportano i comandi MATLAB.
Select a Web Site
Choose a web site to get translated content where available and see local events and offers. Based on your location, we recommend that you select: .
You can also select a web site from the following list:
Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.
Contact your local office