Extracting specific repeating lines of text after a heading using fgetl and textscan
1 visualizzazione (ultimi 30 giorni)
Mostra commenti meno recenti
Vincent Scalfani
il 19 Lug 2016
Commentato: Vincent Scalfani
il 21 Lug 2016
Here is an example of the data I am working with. I would like to extract the line directly following each KEY tag. The files have many thousands of these, so I need to create a loop with textscan or something similar.
> <NAME>
mary
> <AGE>
30
> <KEY>
RDHQFKQIGNG
> <NAME>
john
> <AGE>
56
> <KEY>
JFJNNFNFKFNN
Desired result:
RDHQFKQIGNG
JFJNNFNFKFNN
Here is where I am at (adapted from a similar question in the past), the code does not seem to be moving the cursor, and instead works for the first one, and then grabs all data after it, instead of just the data following the KEY line.
f = fopen('data.txt', 'rt');
tline = fgetl(f);
while isempty(strfind(tline, '> <KEY>'))
if tline == -1
break;
end
line = fgetl(f);
end
if tline ~= -1
data = textscan(f,'%s','Delimiter','\r\n');
else
disp('not found');
end
fclose(f);
Thanks!
0 Commenti
Risposta accettata
Stephen23
il 19 Lug 2016
>> str = fileread('temp1.txt');
>> C = regexp(str,'(?<=> <KEY>\s+)\S+','match')
C =
'RDHQFKQIGNG' 'JFJNNFNFKFNN'
Tested on this file:
3 Commenti
Stephen23
il 20 Lug 2016
Try this:
E = regexp(str,'^> <KEY>\s+\S+','match','lineanchors');
E = strtrim(strrep(E,'> <KEY>',''));
And have a play with this script:
Più risposte (0)
Vedere anche
Categorie
Scopri di più su Data Import and Export in Help Center e File Exchange
Prodotti
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!