gethmmprof
Retrieve hidden Markov model (HMM) profile from PFAM database
Syntax
HMMStruct = gethmmprof(PFAMName)
HMMStruct = gethmmprof(PFAMNumber)
HMMStruct = gethmmprof(...,
'ToFile', ToFileValue, ...)
HMMStruct = gethmmprof(...,
'Mode', ModeValue, ...)
HMMStruct = gethmmprof(...,
'TimeOut', TimeOutValue, ...)
Input Arguments
PFAMName | Character vector specifying a protein family name (unique identifier)
of an HMM profile record in the PFAM database. For example, '7tm_2'. |
PFAMNumber | Integer specifying a protein family number of an HMM profile
record in the PFAM database. For example, 2 is
the protein family number for the protein family 'PF00002'. |
ToFileValue | Character vector specifying a file name or a path and file name for saving the data. If you specify only a file name, that file will be saved in the MATLAB® Current Folder. |
ModeValue | Character vector that specifies the returned alignment mode. Choices are:
|
TimeOutValue | Connection timeout in seconds, specified as a positive scalar. The default value is 5. For details, see here. |
Output Arguments
HMMStruct | MATLAB structure containing information for an HMM profile retrieved from the PFAM database. |
Description
Note
gethmmprof retrieves information from PFAM-HMM
profiles, from file format version HMMER2.0 to HMMER3/f.
searches
the PFAM database (http://pfam.xfam.org/) for the record represented
by HMMStruct = gethmmprof(PFAMName)PFAMName (a protein family name), retrieves
the HMM profile information, and stores it in HMMStruct,
a MATLAB structure containing the following fields corresponding
to parameters of an HMM profile.
| Field | Description |
|---|---|
Name | The protein family name (unique identifier) of the HMM profile record in the PFAM database. |
PfamAccessionNumber | The protein family accession number of the HMM profile record in the PFAM database. |
ModelDescription | Description of the HMM profile. |
ModelLength | The length of the profile (number of MATCH states). |
Alphabet | The alphabet used in the model, 'AA' or 'NT'. Note
|
MatchEmission | Symbol emission probabilities in the MATCH states. The
format is a matrix of size |
InsertEmission | Symbol emission probabilities in the INSERT state. The
format is a matrix of size |
NullEmission | Symbol emission probabilities in the MATCH and INSERT states for the NULL model. The format is a 1-by- Note NULL probabilities are also known as the background probabilities. |
BeginX | BEGIN state transition probabilities. Format
is a 1-by- [B->D1 B->M1 B->M2 B->M3 .... B->Mend] |
MatchX | MATCH state transition probabilities. Format
is a 4-by- [M1->M2 M2->M3 ... M[end-1]->Mend; M1->I1 M2->I2 ... M[end-1]->I[end-1]; M1->D2 M2->D3 ... M[end-1]->Dend; M1->E M2->E ... M[end-1]->E ] |
InsertX | INSERT state transition probabilities. Format
is a 2-by- [ I1->M2 I2->M3 ... I[end-1]->Mend; I1->I1 I2->I2 ... I[end-1]->I[end-1] ] |
DeleteX | DELETE state transition probabilities. Format
is a 2-by- [ D1->M2 D2->M3 ... D[end-1]->Mend ; D1->D2 D2->D3 ... D[end-1]->Dend ] |
FlankingInsertX | Flanking insert states (N and C) used for LOCAL profile alignment. Format is a 2-by-2 matrix: [N->B C->T ; N->N C->C] |
LoopX | Loop states transition probabilities used for multiple hits alignment. Format is a 2-by-2 matrix: [E->C J->B ; E->J J->J] |
NullX | Null transition probabilities used to provide scores with log-odds values also for state transitions. Format is a 2-by-1 column vector: [G->F ; G->G] |
determines
a protein family accession number from HMMStruct = gethmmprof(PFAMNumber)PFAMNumber (an
integer), searches the PFAM database for the associated record, retrieves
the HMM profile information, and stores it in HMMStruct,
a MATLAB structure.
calls HMMStruct = gethmmprof(...,
'PropertyName', PropertyValue,
...)gethmmprof with optional properties
that use property name/property value pairs. You can specify one or
more properties in any order. Each PropertyName must
be enclosed in single quotation marks and is case insensitive. These
property name/property value pairs are as follows:
saves
the data returned from the PFAM database in a file specified by HMMStruct = gethmmprof(...,
'ToFile', ToFileValue, ...)ToFileValue.
Note
You can read an HMM-formatted file back into the MATLAB software
using the pfamhmmread function.
specifies
the returned alignment mode. Choices are:HMMStruct = gethmmprof(...,
'Mode', ModeValue, ...)
'ls'(default) — Global alignment mode.'fs'— Local alignment mode.
For more information on HMM profile models, see HMM Profile Model.
sets the connection timeout (in seconds) to retrieve data the PFAM database.HMMStruct = gethmmprof(...,
'TimeOut', TimeOutValue, ...)
Examples
To retrieve a hidden Markov model (HMM) profile for the global alignment of the 7-transmembrane receptor protein in the secretin family, enter:
hmm = gethmmprof('7tm_2')
hmm =
struct with fields:
Name: '7tm_2'
PfamAccessionNumber: 'PF00002.21'
ModelDescription: '7 transmembrane receptor (Secretin family)'
ModelLength: 241
Alphabet: 'AA'
MatchEmission: [241×20 double]
InsertEmission: [241×20 double]
NullEmission: [1×20 double]
BeginX: [242×1 double]
MatchX: [240×4 double]
InsertX: [240×2 double]
DeleteX: [240×2 double]
FlankingInsertX: [2×2 double]
LoopX: [2×2 double]
NullX: [2×1 double]Version History
Introduced before R2006a
See Also
gethmmalignment | hmmprofalign | hmmprofstruct | pfamhmmread | showhmmprof