
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC134822.5 - phase: 0
(195 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CF922226 132 8e-32
BE659348 weakly similar to PIR|JC7809|JC78 sulfakinin receptor p... 39 0.001
TC226200 similar to UP|O81126 (O81126) 9G8-like SR protein (RSZp... 36 0.013
TC226201 similar to UP|O81126 (O81126) 9G8-like SR protein (RSZp... 36 0.013
TC226202 similar to UP|O81126 (O81126) 9G8-like SR protein (RSZp... 36 0.013
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 35 0.017
TC219583 weakly similar to UP|GRP2_NICSY (P27484) Glycine-rich p... 35 0.022
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 33 0.064
TC231744 33 0.11
TC206177 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (G... 32 0.19
TC216231 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (G... 32 0.19
TC208012 similar to UP|Q761Z7 (Q761Z7) BRI1-KD interacting prote... 32 0.19
TC206178 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, part... 32 0.19
TC223754 similar to UP|Q86EQ4 (Q86EQ4) Clone ZZD1536 mRNA sequen... 32 0.19
TC210743 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1,... 32 0.19
NP004897 gag-protease polyprotein 32 0.24
CO984873 30 0.54
TC209175 30 0.71
TC209176 30 0.71
TC207931 similar to UP|Q9SKL4 (Q9SKL4) Expressed protein (At2g15... 30 0.92
>CF922226
Length = 667
Score = 132 bits (333), Expect = 8e-32
Identities = 77/192 (40%), Positives = 118/192 (61%), Gaps = 22/192 (11%)
Frame = -3
Query: 9 TKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCSLP 68
TKSL +R KQ LYSFKM E +S+ EQL FNK++ DL NI+V ++DED+ALLLLC LP
Sbjct: 662 TKSLVNRLYXKQSLYSFKMHEDRSVGEQLDLFNKLILDLENIDVTIDDEDQALLLLCYLP 483
Query: 69 KSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNR 128
KS+ HFK+T+L+G++ + +L+E+Q+AL +K+L + K+ +++ + EGL +RG + +
Sbjct: 482 KSYSHFKETLLFGRD-SVSLDEVQTALNSKELNERKEKKSSASGEGL-TARGKTFKKDSE 309
Query: 129 GSSK----------SGNKERYKCFKCHKFGHFKR------------DFSEDNENFAQVVS 166
K GN + +C+ C K GH ++ + +D+ N A V
Sbjct: 308 FDKKKQKPENQKNGEGNIFKIRCYHCKKEGHTRKVCPERQKNGGSNNRKKDSGNAAIVQD 129
Query: 167 EEYEDAGALVVS 178
+ YE A AL+VS
Sbjct: 128 DGYESAEALMVS 93
>BE659348 weakly similar to PIR|JC7809|JC78 sulfakinin receptor protein
DSK-R1 - fruit fly (Drosophila melanogaster), partial
(6%)
Length = 770
Score = 39.3 bits (90), Expect = 0.001
Identities = 33/144 (22%), Positives = 59/144 (40%)
Frame = -1
Query: 6 KYKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLC 65
K T + ++ L + +E +S + + I L N+ + +L+L
Sbjct: 761 KIATXKQTNHDMVXXVLKAQSAVEEXXLSLEANKLEDIKTKLNNLYM--------VLVLR 606
Query: 66 SLPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGR 125
+ F+H +D +L +E + I LR + N + + +RG GGR
Sbjct: 605 GMHPDFDHIRDQVLTSQEVPSLENLITRLLRVPSPKIGGNSVDNIETSVMVSNRGGQGGR 426
Query: 126 GNRGSSKSGNKERYKCFKCHKFGH 149
GN+G ++G R C C + GH
Sbjct: 425 GNQG-GRAGRGGRP*CSYCKRVGH 357
>TC226200 similar to UP|O81126 (O81126) 9G8-like SR protein (RSZp22 splicing
factor), partial (89%)
Length = 936
Score = 35.8 bits (81), Expect = 0.013
Identities = 18/36 (50%), Positives = 23/36 (63%)
Frame = +2
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
SRG GGGRG R SG + KC++C + GHF R+
Sbjct: 284 SRGGGGGRGGR----SGGSD-LKCYECGEPGHFARE 376
>TC226201 similar to UP|O81126 (O81126) 9G8-like SR protein (RSZp22 splicing
factor), partial (72%)
Length = 820
Score = 35.8 bits (81), Expect = 0.013
Identities = 18/36 (50%), Positives = 23/36 (63%)
Frame = +1
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
SRG GGGRG R SG + KC++C + GHF R+
Sbjct: 193 SRGGGGGRGGR----SGGSD-LKCYECGEPGHFARE 285
>TC226202 similar to UP|O81126 (O81126) 9G8-like SR protein (RSZp22 splicing
factor), partial (72%)
Length = 1003
Score = 35.8 bits (81), Expect = 0.013
Identities = 18/36 (50%), Positives = 23/36 (63%)
Frame = +3
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
SRG GGGRG R SG + KC++C + GHF R+
Sbjct: 780 SRGGGGGRGGR----SGGSD-LKCYECGEPGHFARE 872
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 35.4 bits (80), Expect = 0.017
Identities = 31/138 (22%), Positives = 58/138 (41%), Gaps = 14/138 (10%)
Frame = +1
Query: 28 LESKSISEQLAEFNKILDDL-ANIEVNMEDEDKALLLLCSLPKSFEHFKDTILYGKEGTT 86
++S+ I +Q A+ K++ DL A E + E+ + + L E+ +I +G+
Sbjct: 1135 IKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFLNSKLENMTKSIKMLNKGSD 1314
Query: 87 TLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSS-------------KS 133
TL+E+ L K + L N S G + G++ K
Sbjct: 1315 TLDEV--LLLGKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGATMSQHRSRHHGMQQKK 1488
Query: 134 GNKERYKCFKCHKFGHFK 151
+++++C C K+GH K
Sbjct: 1489 SKRKKWRCHYCGKYGHIK 1542
>TC219583 weakly similar to UP|GRP2_NICSY (P27484) Glycine-rich protein 2,
partial (45%)
Length = 995
Score = 35.0 bits (79), Expect = 0.022
Identities = 24/84 (28%), Positives = 34/84 (39%), Gaps = 10/84 (11%)
Frame = +2
Query: 80 YGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERY 139
YG +G T ++ SA+R++ G RG GGGRG G G + R
Sbjct: 257 YGDDGRTMAVDVTSAVRSRL------------PGGF---RGGGGGRGRGGGRYGGGEGRG 391
Query: 140 K----------CFKCHKFGHFKRD 153
+ C+ C + GH RD
Sbjct: 392 RGFGRRGGGPECYNCGRIGHLARD 463
Score = 28.5 bits (62), Expect = 2.1
Identities = 14/37 (37%), Positives = 19/37 (50%), Gaps = 3/37 (8%)
Frame = +2
Query: 120 GNGGGRGNRGSSK---SGNKERYKCFKCHKFGHFKRD 153
G GGG G+ G ++ G CF C + GHF R+
Sbjct: 473 GQGGGGGDDGRNRRRGGGGGGGGGCFNCGEEGHFARE 583
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 33.5 bits (75), Expect = 0.064
Identities = 31/141 (21%), Positives = 61/141 (42%), Gaps = 17/141 (12%)
Frame = +1
Query: 28 LESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS----LPKSFEHFKDTILYGKE 83
++S+ I +Q A+ K++ AN+E E ++ + L L E+ +I +
Sbjct: 1138 IKSEKILQQEAQLKKVI---ANLEAEKEAHEEEISELKGEVGFLNSKLENMTKSIKMLNK 1308
Query: 84 GTTTLEEI---------QSAL----RTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGS 130
G+ L+E+ Q L ++ T + +NS G +S+ G +
Sbjct: 1309 GSDMLDEVLQLGKNVGNQRGLGFNHKSAGRTTMTEFVPAKNSTGATMSQHRSRHHGTQ-- 1482
Query: 131 SKSGNKERYKCFKCHKFGHFK 151
K +++++C C K+GH K
Sbjct: 1483 QKKSKRKKWRCHYCGKYGHIK 1545
>TC231744
Length = 794
Score = 32.7 bits (73), Expect = 0.11
Identities = 36/194 (18%), Positives = 71/194 (36%), Gaps = 6/194 (3%)
Frame = +2
Query: 5 QKYKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLL 64
Q + A L +L S K +I E + E + + L ++++ + ++ L+L
Sbjct: 2 QYFAKNEKAETSNLLDKLISMKYKGKGNIREYIMEISNLASKLKSLKLELGEDLFVHLVL 181
Query: 65 CSLPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGG 124
SLP F FK + K+ + E I ++ ++ + N + +
Sbjct: 182 ISLPAHFGQFKVSYNTQKDKWSLNELISHCVQEEERL*RDRTESAHNKKR---KKTKDVA 352
Query: 125 RGNRGSSKSGNKERYKCFKCHKFGHFKRD------FSEDNENFAQVVSEEYEDAGALVVS 178
K E + C+ C K H K+ + + F +V E + +
Sbjct: 353 EKTS*QKKQQKDEEFTCYFCKKSRHMKKKCPKYAAWRVKKDKFLTLVCSEV-NLAFVPKD 529
Query: 179 CWEDDEGEVSHLDI 192
W D G +H+ +
Sbjct: 530 TWWVDSGATTHISM 571
>TC206177 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (GRP2)
(AT4g38680/F20M13_240), partial (22%)
Length = 406
Score = 32.0 bits (71), Expect = 0.19
Identities = 14/35 (40%), Positives = 16/35 (45%)
Frame = +1
Query: 119 RGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
R GGG G G G C+ C + GHF RD
Sbjct: 40 RYGGGGGGRYGGGGGGGGGGGSCYSCGESGHFARD 144
>TC216231 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (GRP2)
(AT4g38680/F20M13_240), partial (42%)
Length = 891
Score = 32.0 bits (71), Expect = 0.19
Identities = 15/37 (40%), Positives = 18/37 (48%)
Frame = +3
Query: 120 GNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSE 156
G GGG G G G C+KC + GH RD S+
Sbjct: 183 GGGGGYGGGGGGGGGG-----CYKCGETGHIARDCSQ 278
>TC208012 similar to UP|Q761Z7 (Q761Z7) BRI1-KD interacting protein 117
(Fragment), partial (38%)
Length = 993
Score = 32.0 bits (71), Expect = 0.19
Identities = 22/98 (22%), Positives = 40/98 (40%), Gaps = 3/98 (3%)
Frame = +1
Query: 101 TKSKDLRANE-NSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSEDNE 159
+ K ++ E +SE + +G G G + K + C+KC K GH RD E +
Sbjct: 13 SSGKSIKKEETSSENDTLDQGKKPGSGPSDAPKVPSDAPKICYKCKKAGHLSRDCKEQPD 192
Query: 160 NF--AQVVSEEYEDAGALVVSCWEDDEGEVSHLDIDAL 195
+ E E+ + + + D + DI+ +
Sbjct: 193 GLLHRNAIGEAEENPKSTAIDTSQADRVAMEEDDINEI 306
>TC206178 similar to UP|Q8LF59 (Q8LF59) DNA-binding protein, partial (69%)
Length = 1138
Score = 32.0 bits (71), Expect = 0.19
Identities = 15/36 (41%), Positives = 18/36 (49%)
Frame = +2
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
S G GGG G RG G ++ C C + GH RD
Sbjct: 776 SGGGGGGGGARGGGGGGYRD-VVCRNCQQLGHMSRD 880
>TC223754 similar to UP|Q86EQ4 (Q86EQ4) Clone ZZD1536 mRNA sequence, partial
(22%)
Length = 742
Score = 32.0 bits (71), Expect = 0.19
Identities = 15/37 (40%), Positives = 17/37 (45%)
Frame = +2
Query: 120 GNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSE 156
G GGR G G + CF C K GHF R+ E
Sbjct: 359 GMEGGRFGGGGGSGGGGGKSTCFNCGKPGHFARECVE 469
Score = 31.6 bits (70), Expect = 0.24
Identities = 18/45 (40%), Positives = 21/45 (46%)
Frame = +2
Query: 116 CVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSEDNEN 160
CV RG GG G G G+ CF+C FGH RD + N
Sbjct: 173 CV-RGGGGSVGIGGGGGGGS-----CFRCGGFGHMARDCATGKGN 289
Score = 29.6 bits (65), Expect = 0.92
Identities = 15/45 (33%), Positives = 21/45 (46%)
Frame = +2
Query: 117 VSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSEDNENF 161
++R G+GN G SG CF+C + GH RD + F
Sbjct: 257 MARDCATGKGNIGGGGSGGG----CFRCGEVGHLARDCGMEGGRF 379
Score = 28.1 bits (61), Expect = 2.7
Identities = 14/36 (38%), Positives = 15/36 (40%)
Frame = +2
Query: 118 SRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
S +GGG G G CF C FGH RD
Sbjct: 92 SSNSGGGGGGSGGG---------CFNCGGFGHLARD 172
>TC210743 similar to UP|Q42412 (Q42412) RNA-binding protein RZ-1, partial
(53%)
Length = 452
Score = 32.0 bits (71), Expect = 0.19
Identities = 15/31 (48%), Positives = 17/31 (54%)
Frame = +3
Query: 120 GNGGGRGNRGSSKSGNKERYKCFKCHKFGHF 150
G GGGRG+ G +CFKC K GHF
Sbjct: 387 GGGGGRGSNGG---------ECFKCGKPGHF 452
>NP004897 gag-protease polyprotein
Length = 1923
Score = 31.6 bits (70), Expect = 0.24
Identities = 31/141 (21%), Positives = 58/141 (40%), Gaps = 17/141 (12%)
Frame = +1
Query: 28 LESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS----LPKSFEHFKDTILYGKE 83
++S+ I +Q A+ K++ AN+E E ++ + L L E+ +I +
Sbjct: 1138 IKSEKILQQEAQLKKVI---ANLEAEKEAHEEEISELKGEVGFLNSKLENMTKSIKMLNK 1308
Query: 84 GTTTLEEIQSALRTKKLTKSKDLRANENSEGLC---------VSRG----NGGGRGNRGS 130
G+ L+E+ K + + L N S G +S G R +
Sbjct: 1309 GSDMLDEVLQL--GKNVGNQRGLGFNHKSAGRITMTEFVPAKISTGATMSQHRSRHHGTQ 1482
Query: 131 SKSGNKERYKCFKCHKFGHFK 151
K +++++C C K+GH K
Sbjct: 1483 QKKSKRKKWRCHYCGKYGHIK 1545
>CO984873
Length = 754
Score = 30.4 bits (67), Expect = 0.54
Identities = 16/54 (29%), Positives = 28/54 (51%)
Frame = -2
Query: 103 SKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSE 156
+++ R N G+ +G+ + ++K K KCF C+K GH K+D S+
Sbjct: 465 NREKRVNLTLHGM--KKGDQAKNKGKITAKPVIKNESKCFFCNKKGHIKKDCSK 310
>TC209175
Length = 711
Score = 30.0 bits (66), Expect = 0.71
Identities = 18/45 (40%), Positives = 25/45 (55%)
Frame = +3
Query: 91 IQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGN 135
+Q ++K KSKD RAN+ S G +S NGG +G+ K N
Sbjct: 30 LQLKSKSKFKMKSKD-RANKKSHGDVLSSQNGGTQGSNVDHKESN 161
>TC209176
Length = 601
Score = 30.0 bits (66), Expect = 0.71
Identities = 18/45 (40%), Positives = 25/45 (55%)
Frame = +2
Query: 91 IQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGN 135
+Q ++K KSKD RAN+ S G +S NGG +G+ K N
Sbjct: 233 LQLKSKSKFKMKSKD-RANKKSHGDVLSSQNGGTQGSNVDHKESN 364
>TC207931 similar to UP|Q9SKL4 (Q9SKL4) Expressed protein
(At2g15240/F15A23.2), partial (97%)
Length = 1232
Score = 29.6 bits (65), Expect = 0.92
Identities = 23/108 (21%), Positives = 40/108 (36%)
Frame = -1
Query: 42 KILDDLANIEVNMEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLT 101
+ + N + D +K ++ C L ++ G T ++ + ++ L
Sbjct: 440 RTISQSCNCQKKTADHNKCRIITCPLILGLFVIFSMLIDNFRGRT*MKHLPKSIFNIHLL 261
Query: 102 KSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGH 149
D G+ SRG GGG G R +S C+ +FGH
Sbjct: 260 PLDDSAEVGKQHGIVRSRGRGGGGGRRRTSFG------SCW*HLRFGH 135
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.313 0.132 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,482,305
Number of Sequences: 63676
Number of extensions: 87046
Number of successful extensions: 601
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 575
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 591
length of query: 195
length of database: 12,639,632
effective HSP length: 92
effective length of query: 103
effective length of database: 6,781,440
effective search space: 698488320
effective search space used: 698488320
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (26.2 bits)
Medicago: description of AC134822.5