The number of occurrences of each character of one string in another

Question

hiva on 28 Dec 2014

Direct link to this question:

https://it.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another

Edited: Luuk van Oosten on 24 Jan 2015

I have a string of more than 100 characters (a protein sequence in FASTA format), for example

'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'

(shortened here for simplicity), and I want to find out whether or not it is hydrophobic. To do that, I have to count the occurrences of each character of the set 'A C F I L M P V W Y' (the hydrophobic amino acids) in my FASTA string. Given how long FASTA strings can be, is there an easy way to do this with MATLAB string functions?

Answer 1

Azzi Abdelmalek on 28 Dec 2014

Direct link to this answer:

https://it.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163456

Edited: Azzi Abdelmalek on 28 Dec 2014

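str = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
% Column cell array of the hydrophobic amino acid codes
p = {'A' 'C' 'F' 'I' 'L' 'M' 'P' 'V' 'W' 'Y'}';
% Count the matches of each code in str and show codes and counts side by side
out = [p cellfun(@(x) nnz(ismember(str,x)), p, 'UniformOutput', false)]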

2 Comments

hiva on 29 Dec 2014

Thanks a lot. I guess this works well for many similar cases in my code that are supposed to work the same way (since it is feature extraction and there are lots of features). It also tells me how much I don't know about MATLAB. Thanks.

Stephen23 on 30 Dec 2014

Edited: Stephen23 on 30 Dec 2014

This could be simplified and sped up by using arrayfun instead of cellfun and removing the ismember call:

>> t = 'ACFILMPVWY';
>> arrayfun(@(x)sum(str==x), t)
ans =
     6     2     4     6    13     2     7     7     1     7
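
One way to check the speed claim on your own machine is sketched below. It assumes a release that provides timeit; the handle names viaCellfun and viaArrayfun are made up for this illustration, and the cellfun variant here returns a numeric vector rather than the cell array of the answer above. No timings are quoted because they depend on the machine and the MATLAB release.

```matlab
str = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
t = 'ACFILMPVWY';
p = num2cell(t);   % one cell per letter: {'A','C',...}

% Wrap both approaches in zero-argument handles (hypothetical names)
viaCellfun  = @() cellfun(@(x) nnz(ismember(str, x)), p);
viaArrayfun = @() arrayfun(@(x) sum(str == x), t);

% Both approaches should return identical counts
assert(isequal(viaCellfun(), viaArrayfun()))

% timeit calls each handle repeatedly and returns a typical run time in seconds
tCell  = timeit(viaCellfun);
tArray = timeit(viaArrayfun);
fprintf('cellfun: %.3g s, arrayfun: %.3g s\n', tCell, tArray);
```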

Answer 2

Peter Perkins on 29 Dec 2014

Direct link to this answer:

https://it.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163537

Another possibility:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> n = hist(double(s),1:90);
>> n(t)
ans =
     6     2     4     6    13     2     7     7     1     7

1 Comment

Jan on 30 Dec 2014

This is a histogram problem, so histc is an efficient and direct solution.
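
Following up on the histogram idea: newer MATLAB documentation marks hist and histc as not recommended and suggests histcounts instead. A minimal sketch, assuming histcounts is available (R2014b or later) and reusing the s and t from the answer above; the variable names edges and counts are chosen here for illustration only:

```matlab
s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
t = 'ACFILMPVWY';

% One bin per character code 1..90 ('Z' is 90); edges lie halfway between integers
edges = 0.5:1:90.5;
n = histcounts(double(s), edges);

% Index the histogram with the character codes of the letters of interest
counts = n(double(t))
```

If histcounts is not available, accumarray(double(s(:)), 1, [90 1]) should give the same per-code counts as a column vector.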

Answer 3

Luuk van Oosten on 24 Jan 2015

Direct link to this answer:

https://it.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_165835

Edited: Luuk van Oosten on 24 Jan 2015

I reckon you are using the Bioinformatics Toolbox. In that case you can probably use:

All = aacount(SEQ)

where SEQ is of course your sequence of interest (MEQNGLDHDSRSSIDTTINDTQKTFLEF....). aacount returns a structure with one field per amino acid, so using

nr_A = All.A
nr_C = All.C
nr_F = All.F

etc. (you get the idea), you get the counts of your hydrophobic residues. Sum these and you have your hydrophobic score. You might want to 'normalize' this score by dividing it by the total number of amino acids in the sequence.

Of course you can write a loop for this and calculate the hydrophobic score for all your sequences in your FASTA file.
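
The loop described above could look roughly like the sketch below. It uses only base MATLAB (ismember instead of aacount), so it is a stand-in for the toolbox approach rather than the same method. The cell array seqs, the toy sequence 'AAGG', and the variable names are assumptions made for illustration.

```matlab
s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';

% Hypothetical input: one cell per sequence, e.g. as collected from a FASTA file
seqs = {s, 'AAGG'};          % 'AAGG' is a made-up toy sequence
hydrophobic = 'ACFILMPVWY';

score = zeros(1, numel(seqs));
for k = 1:numel(seqs)
    seq = seqs{k};
    % ismember flags every residue that belongs to the hydrophobic set;
    % dividing by the sequence length gives the normalized score
    score(k) = sum(ismember(seq, hydrophobic)) / numel(seq);
end
score
```

With the Bioinformatics Toolbox, fastaread should return a struct array whose Sequence field could be used to fill seqs, but that step is not shown here.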

Answer 4

Shoaibur Rahman on 28 Dec 2014

Direct link to this answer:

https://it.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163455

s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
numA = sum(s=='A')
numC = sum(s=='C')
numF = sum(s=='F')
numI = sum(s=='I')
numL = sum(s=='L')
numM = sum(s=='M')
numP = sum(s=='P')
numV = sum(s=='V')
numW = sum(s=='W')
numY = sum(s=='Y')

1 Comment

hiva on 29 Dec 2014

Very simple and elegant. Thank you very much.

Answer 5

Stephen23 on 30 Dec 2014

Direct link to this answer:

https://it.mathworks.com/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163616

Edited: Stephen23 on 30 Dec 2014

A neat solution using bsxfun:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> sum(bsxfun(@eq,s.',t))
ans =
     6     2     4     6    13     2     7     7     1     7
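
In MATLAB R2016b and later, implicit expansion lets == compare the column s.' against the row t directly, so the bsxfun call should no longer be needed. A minimal sketch under that assumption (the names hits and counts are illustrative):

```matlab
s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
t = 'ACFILMPVWY';

% s.' is 130-by-1 and t is 1-by-10, so the comparison expands
% to a 130-by-10 logical matrix (one column per letter of t)
hits = (s.' == t);

% Sum down each column: one count per letter of t
counts = sum(hits, 1)
```

Passing the dimension to sum explicitly keeps the result a per-letter row vector even if s happens to contain a single character.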

1 Comment

hiva on 30 Dec 2014

Edited: hiva on 30 Dec 2014

Wow! Just wonderful. It works pretty well. Thanks a lot.
