
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0201.8
(841 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BM885356 55 2e-07
BI317050 45 2e-04
TC220401 37 0.047
TC219576 similar to UP|Q6ZFI5 (Q6ZFI5) Parathymosin-like, partia... 37 0.047
NP595172 polyprotein [Glycine max] 35 0.14
TC222980 similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%) 34 0.24
TC231423 similar to UP|Q9LID4 (Q9LID4) Similarity to disease res... 34 0.31
BQ742762 PIR|T06782|T0 extensin - soybean, partial (32%) 34 0.31
TC214142 similar to PIR|T49142|T49142 CCR4-associated factor 1-l... 33 0.40
TC214106 weakly similar to UP|Q8L685 (Q8L685) Pherophorin-dz1 pr... 33 0.40
TC229603 weakly similar to UP|Q6Z1G1 (Q6Z1G1) Proline-rich prote... 32 0.89
TC224968 similar to UP|Q39835 (Q39835) Extensin, partial (44%) 32 0.89
TC224792 homologue to UP|Q39865 (Q39865) Hydroxyproline-rich gly... 32 0.89
TC225016 similar to UP|Q39835 (Q39835) Extensin, partial (32%) 32 0.89
TC224876 weakly similar to UP|Q39835 (Q39835) Extensin, partial ... 32 0.89
TC225027 similar to UP|Q39835 (Q39835) Extensin, partial (49%) 32 0.89
TC221928 similar to UP|Q9SN46 (Q9SN46) Extensin-like protein, pa... 32 1.2
TC204062 homologue to UP|Q09083 (Q09083) Hydroxyproline-rich gly... 32 1.5
BE474116 32 1.5
TC204046 UP|Q39835 (Q39835) Extensin, complete 32 1.5
>BM885356
Length = 420
Score = 54.7 bits (130), Expect = 2e-07
Identities = 35/96 (36%), Positives = 41/96 (42%), Gaps = 14/96 (14%)
Frame = -3
Query: 78 PSQTQHSALSSPP-----------LKLIKFSGSDPTFWLLNTEVFFLQHPWPLELRFQFI 126
PSQTQ S P L + +F GSD T W+ FF H P R
Sbjct: 316 PSQTQPEMPSFPAAGPSAAPHRLKLDVPRFDGSDATGWIFKITQFFEYHTTPDHERLTIA 137
Query: 127 ALYLEGQALTWFNLWRH---QLDSWEKFREVFKLQF 159
+ Y+EGQAL WF W H QL SW F +F
Sbjct: 136 SFYMEGQALAWFQ-WMHRNGQLSSWPAFLHALHSRF 32
>BI317050
Length = 425
Score = 44.7 bits (104), Expect = 2e-04
Identities = 24/73 (32%), Positives = 40/73 (53%), Gaps = 2/73 (2%)
Frame = +3
Query: 91 LKLIKFSGSDPTFWLLNTEVFFLQHPWPLELRFQFIALYLEGQALTWFN-LWRH-QLDSW 148
L + +F ++ + FF H P + R Q + YL+G+AL+WF L+R+ QL SW
Sbjct: 12 LDVPRFDDTNAPTLIFKISQFFDYHRTPEDERLQVTSFYLDGEALSWFQWLYRNDQLTSW 191
Query: 149 EKFREVFKLQFIQ 161
F + +++F Q
Sbjct: 192 SSFLQALEMRFAQ 230
>TC220401
Length = 1286
Score = 36.6 bits (83), Expect = 0.047
Identities = 25/94 (26%), Positives = 40/94 (41%)
Frame = +1
Query: 42 ISGLIDQVSFLNGHLRSIPPQPPPIYHHNPAPYHNYPSQTQHSALSSPPLKLIKFSGSDP 101
I+ LIDQ++ + + P P P +P+P H H L +P +F+G DP
Sbjct: 145 INSLIDQIAAITTN----PSSPSPSSQPSPSPIHR-----PHMKLDAP-----RFNGHDP 282
Query: 102 TFWLLNTEVFFLQHPWPLELRFQFIALYLEGQAL 135
W+ FF + + Y++G AL
Sbjct: 283 LGWIFKISQFFDYQTILEQEPLTVASFYMDGSAL 384
>TC219576 similar to UP|Q6ZFI5 (Q6ZFI5) Parathymosin-like, partial (12%)
Length = 556
Score = 36.6 bits (83), Expect = 0.047
Identities = 28/105 (26%), Positives = 43/105 (40%)
Frame = +1
Query: 14 PTVLDAIDAKLASMATKLQVDFDTRLSAISGLIDQVSFLNGHLRSIPPQPPPIYHHNPAP 73
P + ++ + + KL +D + S I ID + L ++ P PPP++ AP
Sbjct: 94 PNSITTVNGVVQQLEAKLGLDLSHKASFIRDQIDHL--LRSQPQTFAPHPPPLHKDYFAP 267
Query: 74 YHNYPSQTQHSALSSPPLKLIKFSGSDPTFWLLNTEVFFLQHPWP 118
+ T H A P F L+ E+ FLQHP P
Sbjct: 268 HTQLHFPTTHFA---------------PHF-ALHDEINFLQHPHP 354
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 35.0 bits (79), Expect = 0.14
Identities = 32/147 (21%), Positives = 62/147 (41%), Gaps = 4/147 (2%)
Frame = +1
Query: 17 LDAIDAKLASMATKLQVDFDTRLSAISGLIDQVSFLNGHLRSIPPQPPPIYHHNPAPYHN 76
++ ++A + K++V T S S L +S + L++IP + H + N
Sbjct: 94 IERLEATNHAQMEKIEVMQSTNDSQFSQLNAVMSQVLQRLQNIP-----MSSHGAS---N 249
Query: 77 YPSQTQHSALSSPPLKLI--KFSGSDPTFWLLNTEVFFLQHPWPLELRFQFIALYLEGQA 134
+ Q S+ +KL +F G + W+ E FF + P R +++L+
Sbjct: 250 SQKEQQRSSFQVRSVKLDFPRFDGKNVMDWIFKAEQFFDYYATPDADRLIIASVHLDQDV 429
Query: 135 LTWFNLWR--HQLDSWEKFREVFKLQF 159
+ W+ + + SW+ F +L F
Sbjct: 430 VPWYQMLQKTEPFSSWQAFTRALELDF 510
>TC222980 similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 662
Score = 34.3 bits (77), Expect = 0.24
Identities = 20/87 (22%), Positives = 38/87 (42%), Gaps = 2/87 (2%)
Frame = +3
Query: 75 HNYPSQTQHSALSSPPLKLIKFSGSDPTFWLLNTEVFFLQHPWPLELRFQFIALYLEGQA 134
+N+P Q ++ + P L GSD W+ E FF H P E R +++L+ +
Sbjct: 315 NNHPFQVRNVKIDFPIL-----DGSDVLQWIFKAEQFFNYHKTPGEQRLIIASIHLDKEV 479
Query: 135 LTWFNLW--RHQLDSWEKFREVFKLQF 159
+ + + + +W F + +F
Sbjct: 480 VPCYQMMTRENSFKTWIAFTRALETEF 560
>TC231423 similar to UP|Q9LID4 (Q9LID4) Similarity to disease resistance
response protein, partial (40%)
Length = 650
Score = 33.9 bits (76), Expect = 0.31
Identities = 15/42 (35%), Positives = 21/42 (49%)
Frame = +1
Query: 63 PPPIYHHNPAPYHNYPSQTQHSALSSPPLKLIKFSGSDPTFW 104
P P H +P P ++PS +S LS PP + + S P W
Sbjct: 40 PIPSQHSSPLPSSSFPSSFSYSPLSPPPKHTVSRAPSPPPPW 165
>BQ742762 PIR|T06782|T0 extensin - soybean, partial (32%)
Length = 420
Score = 33.9 bits (76), Expect = 0.31
Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 17/82 (20%)
Frame = +3
Query: 39 LSAISGLIDQVSFLNGHLRSIPPQP-----PPIYHHNPAP---------YHNYPSQTQHS 84
L+ IS + + + ++ S PP P PP Y+H+P P Y++ P +HS
Sbjct: 33 LTIISLTLPSQTLADNYIYSSPPPPKHSPPPPYYYHSPPPPKHSPPPPYYYHSPPPPKHS 212
Query: 85 ---ALSSPPLKLIKFSGSDPTF 103
SPP + K+ P +
Sbjct: 213 PPYKYPSPPPPVYKYKSPPPPY 278
>TC214142 similar to PIR|T49142|T49142 CCR4-associated factor 1-like protein
- Arabidopsis thaliana {Arabidopsis thaliana;} , partial
(85%)
Length = 1439
Score = 33.5 bits (75), Expect = 0.40
Identities = 26/100 (26%), Positives = 41/100 (41%)
Frame = +2
Query: 169 ASSGIVLEKTTVVVPPQKGVTVYHDESETTADPQQKPQVLSDAKPHTPTSMALSSPVSTQ 228
+S+ I+L + T Q ++ D + T +P ++ A TPT+ A SSP ST
Sbjct: 632 SSTTILLSRWT-----QNSPVLFSDHTWTPQNPTTTATTIATA---TPTTTASSSPTSTL 787
Query: 229 KVSAGIIHIEHFCPTVFETPPTKRPLDISTHCSQVISTPP 268
S+ + P F P P ST + + P
Sbjct: 788 STSSRSASLSPTPPATFPISPETAPSGNSTSVTSTLRVTP 907
>TC214106 weakly similar to UP|Q8L685 (Q8L685) Pherophorin-dz1 protein
precursor, partial (22%)
Length = 1296
Score = 33.5 bits (75), Expect = 0.40
Identities = 17/59 (28%), Positives = 25/59 (41%)
Frame = +1
Query: 60 PPQPPPIYHHNPAPYHNYPSQTQHSALSSPPLKLIKFSGSDPTFWLLNTEVFFLQHPWP 118
PP PPP H+P P +P+ SPP ++++ P T V+ P P
Sbjct: 430 PPSPPPCEEHSPPPPSPHPAPYHPPPSPSPPPPPVQYNSPPPPSPPPPTPVYHYNSPPP 606
Score = 32.3 bits (72), Expect = 0.89
Identities = 13/42 (30%), Positives = 21/42 (49%)
Frame = +1
Query: 60 PPQPPPIYHHNPAPYHNYPSQTQHSALSSPPLKLIKFSGSDP 101
PP P P+YH+N P ++P T PP+ + ++ P
Sbjct: 562 PPPPTPVYHYNSPPPPSFPPPTPVYEGPLPPVIGVSYASPPP 687
Score = 29.3 bits (64), Expect = 7.6
Identities = 15/37 (40%), Positives = 20/37 (53%), Gaps = 4/37 (10%)
Frame = +1
Query: 58 SIPPQPPPIYHHNPAP----YHNYPSQTQHSALSSPP 90
S PP PPP++ +P P Y++ P QHS PP
Sbjct: 139 SPPPPPPPVF--SPPPPVQYYYSSPPPPQHSPPPPPP 243
>TC229603 weakly similar to UP|Q6Z1G1 (Q6Z1G1) Proline-rich protein
family-like, partial (57%)
Length = 651
Score = 32.3 bits (72), Expect = 0.89
Identities = 26/119 (21%), Positives = 53/119 (43%), Gaps = 17/119 (14%)
Frame = +1
Query: 389 VNIEQFLMHSIMLRTPNLDSLVLASLIGTSLPA-----------LLGFIITSFDPGICNA 437
++I+ L+H + + T S++ L+ +S A + F IC+
Sbjct: 217 ISIKGILLHLLPITTATTFSMITPVLLPSSKAA*QQFAAVVC*RSVAFSDNHIKSCICSL 396
Query: 438 IFSLSYGDFSKCVYGDSRLSYL-----GLILLLTVAKY-ATHKELKCNATIALSWYNCM 490
++ + YG KC + L+++ L++L+T++ Y H L C+ + S Y C+
Sbjct: 397 VYLVGYGSLDKCC--SNCLTHMCGTWQVLVILVTLSLYRCPHSFLGCSYIVISSLYECI 567
>TC224968 similar to UP|Q39835 (Q39835) Extensin, partial (44%)
Length = 681
Score = 32.3 bits (72), Expect = 0.89
Identities = 14/32 (43%), Positives = 17/32 (52%), Gaps = 3/32 (9%)
Frame = +3
Query: 55 HLRSIPPQPP---PIYHHNPAPYHNYPSQTQH 83
+ S PP PP P Y+H+P P H YP H
Sbjct: 462 YYHSPPPPPPKKKPYYYHSPPPPHPYPHPHPH 557
>TC224792 homologue to UP|Q39865 (Q39865) Hydroxyproline-rich glycoprotein
(Fragment), partial (86%)
Length = 688
Score = 32.3 bits (72), Expect = 0.89
Identities = 13/31 (41%), Positives = 16/31 (50%)
Frame = +2
Query: 60 PPQPPPIYHHNPAPYHNYPSQTQHSALSSPP 90
P PPP Y+H+P P + Y S S PP
Sbjct: 266 PSPPPPYYYHSPPPPYYYQSPPPPSPTPHPP 358
>TC225016 similar to UP|Q39835 (Q39835) Extensin, partial (32%)
Length = 973
Score = 32.3 bits (72), Expect = 0.89
Identities = 14/32 (43%), Positives = 17/32 (52%), Gaps = 3/32 (9%)
Frame = +2
Query: 55 HLRSIPPQPP---PIYHHNPAPYHNYPSQTQH 83
+ S PP PP P Y+H+P P H YP H
Sbjct: 182 YYHSPPPPPPKKKPYYYHSPPPPHPYPHPHPH 277
>TC224876 weakly similar to UP|Q39835 (Q39835) Extensin, partial (66%)
Length = 1340
Score = 32.3 bits (72), Expect = 0.89
Identities = 14/32 (43%), Positives = 17/32 (52%), Gaps = 3/32 (9%)
Frame = +3
Query: 55 HLRSIPPQPP---PIYHHNPAPYHNYPSQTQH 83
+ S PP PP P Y+H+P P H YP H
Sbjct: 774 YYHSPPPPPPKKKPYYYHSPPPPHPYPHPHPH 869
Score = 29.6 bits (65), Expect = 5.8
Identities = 12/31 (38%), Positives = 16/31 (50%)
Frame = +1
Query: 60 PPQPPPIYHHNPAPYHNYPSQTQHSALSSPP 90
PP PP Y+H+P P P + + S PP
Sbjct: 205 PPPPPKYYYHSPPPPSPSPPKKPYYYHSPPP 297
>TC225027 similar to UP|Q39835 (Q39835) Extensin, partial (49%)
Length = 747
Score = 32.3 bits (72), Expect = 0.89
Identities = 14/32 (43%), Positives = 17/32 (52%), Gaps = 3/32 (9%)
Frame = +2
Query: 55 HLRSIPPQPP---PIYHHNPAPYHNYPSQTQH 83
+ S PP PP P Y+H+P P H YP H
Sbjct: 500 YYHSPPPPPPKKKPYYYHSPPPPHPYPHPHPH 595
>TC221928 similar to UP|Q9SN46 (Q9SN46) Extensin-like protein, partial (4%)
Length = 577
Score = 32.0 bits (71), Expect = 1.2
Identities = 15/33 (45%), Positives = 18/33 (54%)
Frame = +2
Query: 58 SIPPQPPPIYHHNPAPYHNYPSQTQHSALSSPP 90
S PP PPP+Y H+P P P +T S PP
Sbjct: 8 SPPPPPPPVY-HSPPPASPPPCETPPSVSPPPP 103
>TC204062 homologue to UP|Q09083 (Q09083) Hydroxyproline-rich glycoprotein
precursor, partial (34%)
Length = 722
Score = 31.6 bits (70), Expect = 1.5
Identities = 14/41 (34%), Positives = 18/41 (43%)
Frame = +1
Query: 61 PQPPPIYHHNPAPYHNYPSQTQHSALSSPPLKLIKFSGSDP 101
P PP YH P P H+ P + + PP K K+ P
Sbjct: 82 PPPPYYYHSPPPPKHSPPPPYYYKSPPPPPKKPYKYPSPPP 204
>BE474116
Length = 427
Score = 31.6 bits (70), Expect = 1.5
Identities = 18/48 (37%), Positives = 24/48 (49%)
Frame = +3
Query: 118 PLELRFQFIALYLEGQALTWFNLWRHQLDSWEKFREVFKLQFIQFRSE 165
P+ELRF LE Q L NL + Q+ W+ + +K IQ R E
Sbjct: 27 PVELRFDASVKKLEPQGLKACNLLKRQMSKWQNSFDSYKEFCIQSRFE 170
>TC204046 UP|Q39835 (Q39835) Extensin, complete
Length = 1627
Score = 31.6 bits (70), Expect = 1.5
Identities = 14/41 (34%), Positives = 18/41 (43%)
Frame = +1
Query: 61 PQPPPIYHHNPAPYHNYPSQTQHSALSSPPLKLIKFSGSDP 101
P PP YH P P H+ P + + PP K K+ P
Sbjct: 556 PPPPYYYHSPPPPKHSPPPPYYYKSPPPPPKKPYKYPSPPP 678
Score = 29.6 bits (65), Expect = 5.8
Identities = 16/57 (28%), Positives = 26/57 (45%), Gaps = 5/57 (8%)
Frame = +1
Query: 39 LSAISGLIDQVSFLNGHLRSIPPQP-----PPIYHHNPAPYHNYPSQTQHSALSSPP 90
L+ IS + + + ++ S PP P PP Y+H+P P + P + PP
Sbjct: 40 LTIISLTLPSQTLADNYIYSSPPPPKHSPPPPYYYHSPPPPKHSPPPPYYYHSPPPP 210
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.322 0.135 0.410
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,823,913
Number of Sequences: 63676
Number of extensions: 721633
Number of successful extensions: 6101
Number of sequences better than 10.0: 95
Number of HSP's better than 10.0 without gapping: 4774
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5664
length of query: 841
length of database: 12,639,632
effective HSP length: 105
effective length of query: 736
effective length of database: 5,953,652
effective search space: 4381887872
effective search space used: 4381887872
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.9 bits)
Lotus: description of TM0201.8