Splitting Characters in A Cell Array

3 visualizzazioni (ultimi 30 giorni)
Aldrich To
Aldrich To il 11 Nov 2015
Commentato: Guillaume il 13 Nov 2015
Hi All,
I am trying to split some content in a cell array into separate portions. I've tried converting to a string and using strsplit, but I am not getting the results I want because of the datatype syntax.
Came across the cellfun command, but not really sure how to implement it.
Here is what I have
'P245/65R17 105S'
'P265/70R16 111S'
'P275/55R20 111H'
'285/60R18 120H'
'P235/70R17 108S'
What I need:
'P245/' '65' 'R' '17' '105' 'S'
'P265/' '70' 'R' '16' '111' 'S'
'P275/' '55' 'R' '20' '111' 'H'
'285/' '60' 'R' '18' '120' 'H'
'P235/' '70' 'R' '17' '108' 'S'
Thanks in advance!

Risposta accettata

Jan
Jan il 11 Nov 2015
Data = {'P245/65R17 105S'; ...
'P265/70R16 111S'; ...
'P275/55R20 111H'; ...
'285/60R18 120H'; ...
'P235/70R17 108S'};
n = numel(Data);
Result = cell(n, 6);
for k = 1:n
S = Data{k};
p = strfind(S, '/');
% 'P245/65R17 105S'
% 'P245/' '65' 'R' '17' '105' 'S'
Result(k, :) = {S(1:p), S(p+1:p+2), S(p+3), S(p+4:p+5), S(p+7:p+9), S(p+10)};
end
Does this help already? Or do strings appear, which do not match this pattern? If so, you can search for the space also, use the length of the strings or whatever.
  5 Commenti
Aldrich To
Aldrich To il 12 Nov 2015
Thanks everyone for all the useful input! Guillaume, regexp actually gave me my desired output. Made an edit that gave me the following:
split = regexp(size,'([P-T])(\d+)(\D+)(\d+)([A-Z])(\d+) (\d+)([A-Z])', 'tokens', 'once');
% 'P' '265' '/' '70' 'R' '16' '111' 'S'
However, it ends up skipping the rows that do not follow that exact character configuration (i.e. skips something like 185/65R15 or LT245/70R17 128/112R) and continues iterating.
The end result I would like is the following:
'P' '265' '/' '70' 'R' '16' '111' 'S'
'275' '/' '50' 'R' '17' 'ST'
'LT' '245' '/' '70' 'R' '17' '128/112' 'R'
Jan
Jan il 13 Nov 2015
@Aldrich: The shown result cannot be represented in Matlab. If it is stored as a cell string, the missing elements must be at least [], because an array must have the same number of elements per row.

Accedi per commentare.

Più risposte (1)

Guillaume
Guillaume il 11 Nov 2015
Modificato: Guillaume il 11 Nov 2015
Use the power of Regular Expressions. It's a daunting language at first but it's very powerful:
data = {'P245/65R17 105S';
'P265/70R16 111S';
'P275/55R20 111H';
'285/60R18 120H';
'P235/70R17 108S'};
splitdata = regexp(data, '(.+/)(\d+)([A-Z])(\d+) (\d+)([A-Z])', 'tokens', 'once');
splitdata = vertcat(splitdata{:})
The regular expression is divided into tokens (the () in the regex)
  • the 1st token is one or more (the +) character (the .) followed by '/'
  • the 2nd token is one or more (the +) digit (the \d)
  • the 3rd token is a single character between A and Z (the [A-Z])
  • 4th token, see 2nd
  • it then matches a space which is not part of any token
  • 5th token, see 2nd
  • 6th token, see 3rd
  3 Commenti
Guillaume
Guillaume il 13 Nov 2015
A regex that would most likely work with all your cases would be
regexp(data, '([A-Z]*)(\d+)(/)(\d+)([A-Z])(\d+) (\d+(/\d+)?)?([A-Z])', 'tokens')

Accedi per commentare.

Categorie

Scopri di più su Characters and Strings in Help Center e File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by