
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0023.9
(388 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC211561 weakly similar to UP|O22675 (O22675) Reverse transcript... 72 6e-13
BI972659 66 2e-11
TC216314 similar to GB|AAP68269.1|31711826|BT008830 At1g70320 {A... 60 2e-09
CO982349 59 4e-09
CA783280 weakly similar to PIR|T01871|T01 RNA-directed DNA polym... 55 7e-08
AI496436 51 2e-07
BU090318 weakly similar to GP|9743345|gb| F21J9.30 {Arabidopsis ... 51 8e-07
TC231922 48 7e-06
CD417230 weakly similar to GP|2244915|emb| reverse transcriptase... 46 2e-05
CF922212 45 6e-05
BM891241 35 1e-04
BM891637 42 6e-04
BI471325 38 0.009
TC233727 similar to UP|Q9FKH8 (Q9FKH8) Similarity to non-LTR ret... 32 0.64
TC205827 31 1.1
CO982818 30 1.4
BI698930 30 1.4
TC208593 similar to UP|Q8RY62 (Q8RY62) At2g04690/F28I8.27, parti... 29 3.2
TC215936 similar to UP|Q39183 (Q39183) Serine/threonine protein ... 29 3.2
TC228270 29 4.1
>TC211561 weakly similar to UP|O22675 (O22675) Reverse transcriptase
(Fragment) , partial (59%)
Length = 1077
Score = 71.6 bits (174), Expect = 6e-13
Identities = 33/104 (31%), Positives = 55/104 (52%)
Frame = +1
Query: 69 YLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGREILIKSIAQAIPSYIMSCFI 128
YLG+P + +++ + + ++L WK + +S GR LIKS+ +IP Y S F
Sbjct: 598 YLGIPIGANSRRCHMWDSIIRKCERKLSKWK*RHISFGGRVTLIKSVLTSIPIYFFSFFR 777
Query: 129 LPDSICADIDRMIACFFWGGDVTRRGLHWTRWANLCKNKRDGGL 172
+P S+ + ++ F WGG ++ + W RW LC K GG+
Sbjct: 778 VPQSVVDKLVKI*RTFLWGGGPDQKKISWIRWEKLCLPKERGGI 909
>BI972659
Length = 453
Score = 66.2 bits (160), Expect = 2e-11
Identities = 33/109 (30%), Positives = 54/109 (49%)
Frame = +1
Query: 49 NCFHELTQLLEVKEVKSFDKYLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGR 108
N H+ QLL +++ YLG+P + S ++ + + +L W +K LS GR
Sbjct: 124 NWAHDAAQLLNCRQLDIPFHYLGMPIAVKASSRMVWEPLINKFKAKLSKWNQKYLSMGGR 303
Query: 109 EILIKSIAQAIPSYIMSCFILPDSICADIDRMIACFFWGGDVTRRGLHW 157
LIKS+ A+P Y++S F +P I + + F WGG+ + W
Sbjct: 304 VTLIKSVLNALPIYLLSFFKIPQRIVDKLVSLQRTFMWGGNQHHNRISW 450
>TC216314 similar to GB|AAP68269.1|31711826|BT008830 At1g70320 {Arabidopsis
thaliana;} , partial (18%)
Length = 759
Score = 59.7 bits (143), Expect = 2e-09
Identities = 29/78 (37%), Positives = 40/78 (51%)
Frame = +2
Query: 107 GREILIKSIAQAIPSYIMSCFILPDSICADIDRMIACFFWGGDVTRRGLHWTRWANLCKN 166
GR LIKS+ A+P +S F +P I + + F WGG + W +WA++C
Sbjct: 8 GRVTLIKSVLSALPIXXLSFFKIPQRIVDKLVTLQRQFLWGGTQHHNRIPWVKWADICNP 187
Query: 167 KRDGGLGFRFFRAFKEAL 184
K DGGLG + F AL
Sbjct: 188 KIDGGLGIKDLSNFNAAL 241
>CO982349
Length = 795
Score = 58.9 bits (141), Expect = 4e-09
Identities = 28/96 (29%), Positives = 46/96 (47%)
Frame = +3
Query: 69 YLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGREILIKSIAQAIPSYIMSCFI 128
YLG+P +I + + + +L WK++ +S GR LI +I A+P Y S F
Sbjct: 501 YLGIPIEANPRCREI*DPIIRKCETKLARWKQRHISLGGRVTLINAILTALPIYFFSFFR 680
Query: 129 LPDSICADIDRMIACFFWGGDVTRRGLHWTRWANLC 164
P + + + F WGG +R + W +W +C
Sbjct: 681 APSKVVQKLVNIQRKFLWGGGSXQRRIAWVKWETVC 788
>CA783280 weakly similar to PIR|T01871|T01 RNA-directed DNA polymerase
homolog T24M8.8 - Arabidopsis thaliana, partial (4%)
Length = 456
Score = 54.7 bits (130), Expect = 7e-08
Identities = 38/147 (25%), Positives = 61/147 (40%), Gaps = 6/147 (4%)
Frame = +2
Query: 125 SCFILPDSICADIDRMIACFFWGGDVTRRGLHWTRWANLCKNKRDGGLGFRFFRAFKEAL 184
S F LP I A ++ + F WGG+ R + W W +C K GGLG + R F L
Sbjct: 5 SFFSLP*KIIAKLESLQRRFLWGGEADSRKIAWVNWKTVCLPKAKGGLGIKDLRTFNTTL 184
Query: 185 VAKN**RIYMFPESLMGRVFKVVYSPSSSI---LMAKKGYITWSSILKAGGKIEGGGGGL 241
+ K ++ + +V + Y ++ K W ++K +++
Sbjct: 185 LGKWRWDLFYIQQEPWAKVLQSKYGGWRALEEGSSGSKDSAWWKDLIKT-QQLQRNIPLK 361
Query: 242 R---WRVGDGFQINLLQDKWLPDGSSL 265
R W+VG G +I +D W SL
Sbjct: 362 RETIWKVGGGDRIKFWEDLWTNTDLSL 442
>AI496436
Length = 414
Score = 51.2 bits (121), Expect(2) = 2e-07
Identities = 26/79 (32%), Positives = 41/79 (50%)
Frame = +1
Query: 69 YLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGREILIKSIAQAIPSYIMSCFI 128
YLG+P + ++ V + ++L WK+K +S GR LI S+ A+P Y++S F
Sbjct: 145 YLGIPIGANSRHSDVWEPVVRKCERKLASWKQKYVSFGGRVTLINSVLTALPIYLLSFFR 324
Query: 129 LPDSICADIDRMIACFFWG 147
+PD + D A F G
Sbjct: 325 IPDKVVDKADYNSA*IFMG 381
Score = 21.9 bits (45), Expect(2) = 2e-07
Identities = 7/14 (50%), Positives = 9/14 (64%)
Frame = +2
Query: 144 FFWGGDVTRRGLHW 157
F WGGD+ R + W
Sbjct: 371 FLWGGDLE*RKIAW 412
>BU090318 weakly similar to GP|9743345|gb| F21J9.30 {Arabidopsis thaliana},
partial (1%)
Length = 422
Score = 51.2 bits (121), Expect = 8e-07
Identities = 33/116 (28%), Positives = 51/116 (43%), Gaps = 4/116 (3%)
Frame = +3
Query: 155 LHWTRWANLCKNKRDGGLGFRFFRAFKEALVAKN**RIYMFPESLMGRVFKVVYSPSSSI 214
+HWT W LC K+ GGLG R ++L+ K+ * + P L RV + Y I
Sbjct: 6 VHWTSWKALCSPKKSGGLGLRLLSHMNKSLLMKST*NMCTKPNDL*VRVVRSKYK*GKDI 185
Query: 215 L----MAKKGYITWSSILKAGGKIEGGGGGLRWRVGDGFQINLLQDKWLPDGSSLV 266
L K G W I+ KI + ++G+G ++ +D W+ L+
Sbjct: 186 LP*IDPKKTGTNFWQGIVHCWEKIT--HDNIARKIGNGRSVHFWKDVWVLQEGKLI 347
>TC231922
Length = 803
Score = 48.1 bits (113), Expect = 7e-06
Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 13/110 (11%)
Frame = -3
Query: 243 WRVGDGFQINLLQDKWLPDGSSLVYMHEAIA------ELRLEKVSDLLDNGR-----WNL 291
WRVG G +I QD WL +G +L + + L + K+ N R W
Sbjct: 705 WRVGCGDKIKFWQDSWLSEGCNLQQKYNQLYTISRQ*NLTISKMGKFSQNARSWEFKWRR 526
Query: 292 ELLEEVFSSSP--MQKILAIPLSNQGNTDALYWYDTFNGVYVCKSGYKFI 339
L + ++ + M +I I + NQ + D+++W +GVY KS Y+ +
Sbjct: 525 RLFDYEYAMAVDFMNEISGISIQNQAH-DSMFWKADSSGVYSTKSAYRLL 379
>CD417230 weakly similar to GP|2244915|emb| reverse transcriptase like
protein {Arabidopsis thaliana}, partial (7%)
Length = 688
Score = 46.2 bits (108), Expect = 2e-05
Identities = 23/72 (31%), Positives = 38/72 (51%)
Frame = -2
Query: 53 ELTQLLEVKEVKSFDKYLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGREILI 112
+++ L ++ KYLG+P + F+F+ +++ KR WK LS GR L
Sbjct: 291 DISSGLGLQSTDDLGKYLGVPIFHKNPTSHTFDFIIDKVNKRFSRWKSNLLSMVGRLTLT 112
Query: 113 KSIAQAIPSYIM 124
K + A+PS+IM
Sbjct: 111 K*VV*ALPSHIM 76
>CF922212
Length = 445
Score = 45.1 bits (105), Expect = 6e-05
Identities = 23/80 (28%), Positives = 37/80 (45%)
Frame = -2
Query: 129 LPDSICADIDRMIACFFWGGDVTRRGLHWTRWANLCKNKRDGGLGFRFFRAFKEALVAKN 188
+P+ + + R+ F WGG + ++ + W W ++C K GLG R R F AL+ K
Sbjct: 441 VPNKVLDKLVRLQQWFLWGGGLGQKKIAWVNWDSVCLPKEKEGLGNRDLRKFNYALLGKR 262
Query: 189 **RIYMFPESLMGRVFKVVY 208
++ L RV Y
Sbjct: 261 RWNLFHHQGELGARVLDSKY 202
>BM891241
Length = 407
Score = 35.0 bits (79), Expect(2) = 1e-04
Identities = 15/41 (36%), Positives = 21/41 (50%)
Frame = -2
Query: 147 GGDVTRRGLHWTRWANLCKNKRDGGLGFRFFRAFKEALVAK 187
GGD + + W +W +C K GGLG + F AL+ K
Sbjct: 406 GGDHDHKKIPWVKWEVVCLPKTKGGLGIKDISKFNIALLGK 284
Score = 28.1 bits (61), Expect(2) = 1e-04
Identities = 18/56 (32%), Positives = 27/56 (48%), Gaps = 3/56 (5%)
Frame = -3
Query: 213 SILMAKKGYITWSSILKAGGKIEGGG---GGLRWRVGDGFQINLLQDKWLPDGSSL 265
SI+++K GY W L+ + W+VG G +IN DKWL + +L
Sbjct: 201 SIIVSKGGYSYWWRDLRKFYHQSDHSIFHQYMSWKVGCGDKINFWTDKWLGEDYTL 34
>BM891637
Length = 427
Score = 41.6 bits (96), Expect = 6e-04
Identities = 29/114 (25%), Positives = 46/114 (39%), Gaps = 5/114 (4%)
Frame = -1
Query: 157 WTRWANLCKNKRDGGLGFRFFRAFKEALVAKN**RIYMFPESLMGRVFKVVYSPSSSILM 216
W W C + GGLG + + AL+ K ++ P L R+ Y +
Sbjct: 406 WISWRQCCASGDVGGLGIKDIKILNNALLIKWKWLMFHQPHQLWNRILISKYKGWRGLDQ 227
Query: 217 AKKGYI--TWSSILKAGGKIEG---GGGGLRWRVGDGFQINLLQDKWLPDGSSL 265
+ Y W + L+A + + W+VG G QI +D W+ DG+ L
Sbjct: 226 GPQKYYFSPWWADLRAINQHQSMIAASNQFCWKVGRGDQILFWEDSWVDDGTPL 65
>BI471325
Length = 421
Score = 37.7 bits (86), Expect = 0.009
Identities = 25/68 (36%), Positives = 39/68 (56%), Gaps = 9/68 (13%)
Frame = +1
Query: 67 DKYLGLPTIIGKSKTQIFNFVKE---RIWKRL------KGWKEKSLSRAGREILIKSIAQ 117
D+YL +P++I K IFN++++ RI+ L KG +EK L +IK +AQ
Sbjct: 238 DEYLDMPSMIRTKKKAIFNYLRDEFGRIYSTLVQQESFKGQREKCL-------IIKYVAQ 396
Query: 118 AIPSYIMS 125
+I +Y MS
Sbjct: 397 SITTYCMS 420
>TC233727 similar to UP|Q9FKH8 (Q9FKH8) Similarity to non-LTR retroelement
reverse transcriptase, partial (7%)
Length = 956
Score = 31.6 bits (70), Expect = 0.64
Identities = 18/59 (30%), Positives = 27/59 (45%)
Frame = -2
Query: 69 YLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGREILIKSIAQAIPSYIMSCF 127
Y G+P GK + V ++I +L WK LS GR L+ S+ + Y S +
Sbjct: 235 YFGVPIFKGKPTRRHLYCVADKIKCKLASWKGSILSIMGRIQLVNSVIHGMLLYSFSVY 59
>TC205827
Length = 2695
Score = 30.8 bits (68), Expect = 1.1
Identities = 17/62 (27%), Positives = 32/62 (51%)
Frame = +1
Query: 106 AGREILIKSIAQAIPSYIMSCFILPDSICADIDRMIACFFWGGDVTRRGLHWTRWANLCK 165
A I ++I + S + I+ D I ++D ++C F G++ R L + R ANL +
Sbjct: 1195 ASEGIKERNIIHKVTSSLTGSIIVEDVIYENVDSEVSCIFPSGELMFRRLVFERAANLVQ 1374
Query: 166 NK 167
++
Sbjct: 1375 SE 1380
>CO982818
Length = 291
Score = 30.4 bits (67), Expect = 1.4
Identities = 13/45 (28%), Positives = 30/45 (65%)
Frame = +3
Query: 269 HEAIAELRLEKVSDLLDNGRWNLELLEEVFSSSPMQKILAIPLSN 313
H A+ + RL + +N W++++++++ S+ +Q+I +IP+SN
Sbjct: 48 HMALNK*RLTLIDH--ENKVWHVDIMQQILSNEDIQQIHSIPISN 176
>BI698930
Length = 384
Score = 30.4 bits (67), Expect = 1.4
Identities = 14/33 (42%), Positives = 20/33 (60%)
Frame = +3
Query: 98 WKEKSLSRAGREILIKSIAQAIPSYIMSCFILP 130
WK + LS AGR LI S+ ++P + S F +P
Sbjct: 90 WKHEVLSFAGRTCLINSMLSSLPLFSFSFFKVP 188
>TC208593 similar to UP|Q8RY62 (Q8RY62) At2g04690/F28I8.27, partial (43%)
Length = 719
Score = 29.3 bits (64), Expect = 3.2
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)
Frame = -1
Query: 51 FHELTQLLEVKE-VKSFDKYLGLPTIIGKSKTQIFNFVKERIWKRLKGWKEKSLSRAGRE 109
+H+ T ++E + + YLG T T I+NF K + L+ K ++ RA RE
Sbjct: 611 YHKHTDFKHIRE*ITMVEGYLGRNTQFETRYTSIYNFSKNTLCVHLQPTKGVAIQRAFRE 432
>TC215936 similar to UP|Q39183 (Q39183) Serine/threonine protein kinase
(Protein kinase 5) (AT5g47750/MCA23_7) , partial (8%)
Length = 693
Score = 29.3 bits (64), Expect = 3.2
Identities = 11/34 (32%), Positives = 17/34 (49%)
Frame = +2
Query: 114 SIAQAIPSYIMSCFILPDSICADIDRMIACFFWG 147
S++ PS +S +C+D + CFFWG
Sbjct: 38 SLSTVFPSLSLSLSAKATKLCSDPSN*VVCFFWG 139
>TC228270
Length = 1186
Score = 28.9 bits (63), Expect = 4.1
Identities = 17/38 (44%), Positives = 21/38 (54%)
Frame = -1
Query: 208 YSPSSSILMAKKGYITWSSILKAGGKIEGGGGGLRWRV 245
Y+ S L+ K YI I KA + GGGGG+ WRV
Sbjct: 1180 YNGRMSSLLCAKIYIVHYQIYKANCERVGGGGGV-WRV 1070
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.334 0.147 0.477
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 19,323,624
Number of Sequences: 63676
Number of extensions: 285496
Number of successful extensions: 2548
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 2472
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2542
length of query: 388
length of database: 12,639,632
effective HSP length: 99
effective length of query: 289
effective length of database: 6,335,708
effective search space: 1831019612
effective search space used: 1831019612
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.6 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0023.9