
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144722.14 - phase: 0
(220 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 92 2e-19
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 86 1e-17
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 70 5e-13
BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440... 64 5e-11
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 62 1e-10
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 54 6e-08
CB894419 weakly similar to GP|10177935|db copia-type polyprotein... 52 2e-07
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 49 2e-06
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 39 0.001
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 38 0.003
TC91257 similar to GP|21434|emb|CAA36616.1|| ORF4 {Solanum tuber... 38 0.003
TC89613 similar to GP|14334448|gb|AAK59422.1 unknown protein {Ar... 31 0.43
BF643344 30 0.73
TC88797 similar to GP|16945788|gb|AAL32135.1 vernalization 2 pro... 30 0.73
BG453508 weakly similar to GP|15625407|gb| like heterochromatin ... 28 2.1
CB892643 similar to PIR|B96582|B96 hypothetical protein F15I1.21... 28 2.1
AW736531 similar to PIR|D84481|D84 probable retroelement pol pol... 28 2.1
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 27 4.7
AL370574 similar to PIR|T51822|T518 N-(5'-phospho-D-ribosylformi... 27 4.7
TC90688 similar to GP|13161405|dbj|BAB33037. VuP5CS {Vigna ungui... 27 4.7
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 92.0 bits (227), Expect = 2e-19
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Frame = +1
Query: 19 IEVEQKNGDIFICQRKYANEMLKKFNMENRKSMSTPMCPKERLCKDVEE-QVDETLYQSL 77
+EV Q I+ICQRKY ++L++F ME P+ P+ +L KD +VD T Y+ +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 78 IGCLMYLTTTRSDILYDVNVLSRFMNCAKESLFKGAKRVL 117
+GCLMYL TR D++Y ++++SRFMNC E KRVL
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVL 300
Score = 40.4 bits (93), Expect = 5e-04
Identities = 19/48 (39%), Positives = 29/48 (59%)
Frame = +1
Query: 121 LAATTAVNQALWLRKVLTDLHLEQRETTEVMVDNHATIAISKNHVFHG 168
+AA Q++W+R+VL L Q + + DN++TI +SKN V HG
Sbjct: 511 IAAAFCACQSVWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLHG 654
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 85.9 bits (211), Expect = 1e-17
Identities = 38/98 (38%), Positives = 65/98 (65%)
Frame = +2
Query: 121 LAATTAVNQALWLRKVLTDLHLEQRETTEVMVDNHATIAISKNHVFHGKTKHFSIKLFFL 180
+A+T+ Q +WLR++L +H EQ T++ DN + IA+SKN VFHG++KH I+ +
Sbjct: 134 IASTSCATQTVWLRRILEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKI 313
Query: 181 RDVQKDGDVCLKYCKTEDQLSYIFTKALPTGRFELLRE 218
R++ + +V ++YC TE++++ IFTK L F L++
Sbjct: 314 RELIAEKEVVIEYCPTEEKIADIFTKPLKIESFYKLKK 427
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 70.5 bits (171), Expect = 5e-13
Identities = 47/120 (39%), Positives = 67/120 (55%), Gaps = 1/120 (0%)
Frame = +2
Query: 5 FEMSDLGEMSYFLVIEVEQKNGDIFICQRKYANEMLKKFNMENRKSMSTPMCPKERLCK- 63
F++ DLG + YFL +EV + I + QRKY E+L+ KS TP +L
Sbjct: 257 FKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELLEDSGNLAVKSTLTPYDISLKLHNS 436
Query: 64 DVEEQVDETLYQSLIGCLMYLTTTRSDILYDVNVLSRFMNCAKESLFKGAKRVLSRVLAA 123
D DET Y+ LIG L+YLTTTR DI + V LS+F++ ++ ++ A RVL + A
Sbjct: 437 DSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQFVSKPQQVHYQAAIRVLQYLKTA 616
>BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440.1
[imported] - Arabidopsis thaliana, partial (20%)
Length = 731
Score = 63.9 bits (154), Expect = 5e-11
Identities = 30/84 (35%), Positives = 52/84 (61%)
Frame = +3
Query: 125 TAVNQALWLRKVLTDLHLEQRETTEVMVDNHATIAISKNHVFHGKTKHFSIKLFFLRDVQ 184
TA QA+WL+ +L+++ E E + +DN + IA+++N VFHG+ H + F+R+
Sbjct: 123 TAARQAMWLQDLLSEVTWEPCEEVVIRIDNQSVIALTRNPVFHGRGNHIHKRYHFIRECV 302
Query: 185 KDGDVCLKYCKTEDQLSYIFTKAL 208
++G V +++ E +YI TKAL
Sbjct: 303 ENGQVEVEHVPGEKHRAYI*TKAL 374
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 62.4 bits (150), Expect = 1e-10
Identities = 29/95 (30%), Positives = 57/95 (59%)
Frame = +3
Query: 126 AVNQALWLRKVLTDLHLEQRETTEVMVDNHATIAISKNHVFHGKTKHFSIKLFFLRDVQK 185
A +A+W+++++ +L +Q + T V D+ + + I++N FH +TKH I+ F+R+V +
Sbjct: 240 ACKEAIWMQRLMEELGHKQEQIT-VYCDSQSALHIARNPAFHSRTKHIGIQYHFVREVVE 416
Query: 186 DGDVCLKYCKTEDQLSYIFTKALPTGRFELLREKF 220
+G V ++ T D L+ TK++ T +F R +
Sbjct: 417 EGSVDMQKIHTNDNLADAMTKSINTDKFIWCRSSY 521
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 53.5 bits (127), Expect = 6e-08
Identities = 24/86 (27%), Positives = 52/86 (59%)
Frame = +1
Query: 121 LAATTAVNQALWLRKVLTDLHLEQRETTEVMVDNHATIAISKNHVFHGKTKHFSIKLFFL 180
+A V A+WL+ ++ +L + Q E ++ D+ + I ++ + V+H +TKH I+L F+
Sbjct: 238 IAFVEGVKDAIWLKGMIGELGITQ-EYVKIHCDSQSAIHLANHQVYHERTKHIDIRLHFI 414
Query: 181 RDVQKDGDVCLKYCKTEDQLSYIFTK 206
RD+ + ++ ++ +E+ + +FTK
Sbjct: 415 RDMIESKEIVVEKMASEENPADVFTK 492
>CB894419 weakly similar to GP|10177935|db copia-type polyprotein
{Arabidopsis thaliana}, partial (2%)
Length = 170
Score = 51.6 bits (122), Expect = 2e-07
Identities = 27/56 (48%), Positives = 41/56 (73%), Gaps = 1/56 (1%)
Frame = +3
Query: 33 RKYANEMLKKFNMENRKSMSTPMCPKERLCKDVE-EQVDETLYQSLIGCLMYLTTT 87
+K A+++LKKF ME+ K +STP+ K +L ++ + ++VD T Y+SLIG L YLT T
Sbjct: 3 KKCASDILKKFKMEHSKPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTAT 170
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 48.5 bits (114), Expect = 2e-06
Identities = 23/60 (38%), Positives = 37/60 (61%)
Frame = +2
Query: 160 ISKNHVFHGKTKHFSIKLFFLRDVQKDGDVCLKYCKTEDQLSYIFTKALPTGRFELLREK 219
++ N V+H + KH SI + F+RD+ + G + +++ T DQL+ TK L R +LLR K
Sbjct: 53 LTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPLSKSRHQLLRNK 232
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 39.3 bits (90), Expect = 0.001
Identities = 20/54 (37%), Positives = 31/54 (57%)
Frame = +1
Query: 1 MMQVFEMSDLGEMSYFLVIEVEQKNGDIFICQRKYANEMLKKFNMENRKSMSTP 54
+ + FE+ DLG + YFL +EV + I QRKY ++LK+ M K++ P
Sbjct: 193 LAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 37.7 bits (86), Expect = 0.003
Identities = 17/31 (54%), Positives = 21/31 (66%)
Frame = +3
Query: 1 MMQVFEMSDLGEMSYFLVIEVEQKNGDIFIC 31
M + F MSDLG+M YFL +EV Q I+IC
Sbjct: 684 MKKEFNMSDLGKMHYFLGVEVTQNEKGIYIC 776
>TC91257 similar to GP|21434|emb|CAA36616.1|| ORF4 {Solanum tuberosum},
partial (8%)
Length = 854
Score = 37.7 bits (86), Expect = 0.003
Identities = 18/34 (52%), Positives = 24/34 (69%)
Frame = +3
Query: 70 DETLYQSLIGCLMYLTTTRSDILYDVNVLSRFMN 103
D Y+ L+G L YLT TR DI Y V+V+S+F+N
Sbjct: 750 DPGRYRRLVGKLNYLTMTRPDISYAVSVVSQFLN 851
>TC89613 similar to GP|14334448|gb|AAK59422.1 unknown protein {Arabidopsis
thaliana}, partial (54%)
Length = 1706
Score = 30.8 bits (68), Expect = 0.43
Identities = 16/42 (38%), Positives = 27/42 (64%)
Frame = -2
Query: 74 YQSLIGCLMYLTTTRSDILYDVNVLSRFMNCAKESLFKGAKR 115
+ S+ L Y+T+T + + Y LSRF++C+ SL+KG+ R
Sbjct: 502 FSSIAYVLAYITSTSTSLRYRRVSLSRFISCS-SSLWKGSTR 380
>BF643344
Length = 649
Score = 30.0 bits (66), Expect = 0.73
Identities = 15/29 (51%), Positives = 20/29 (68%), Gaps = 3/29 (10%)
Frame = +1
Query: 25 NGDIFICQRKYANEMLKKFN---MENRKS 50
NG +F Q KY+NE+L+K N +EN KS
Sbjct: 358 NGFLFDIQAKYSNELLRKMNQIKVENAKS 444
>TC88797 similar to GP|16945788|gb|AAL32135.1 vernalization 2 protein
{Arabidopsis thaliana}, partial (60%)
Length = 1620
Score = 30.0 bits (66), Expect = 0.73
Identities = 16/41 (39%), Positives = 25/41 (60%)
Frame = -1
Query: 115 RVLSRVLAATTAVNQALWLRKVLTDLHLEQRETTEVMVDNH 155
++ S+ L A T ++QA W RKV + L +R T +V+NH
Sbjct: 369 KMKSQTLKART-LHQAKWTRKVFSHFSLSKRIVTVPIVENH 250
>BG453508 weakly similar to GP|15625407|gb| like heterochromatin protein LHP1
{Arabidopsis thaliana}, partial (12%)
Length = 589
Score = 28.5 bits (62), Expect = 2.1
Identities = 11/23 (47%), Positives = 14/23 (60%)
Frame = -3
Query: 162 KNHVFHGKTKHFSIKLFFLRDVQ 184
KNH+FH HF + L FL +Q
Sbjct: 317 KNHLFHHHHHHFHVHLHFLHQLQ 249
>CB892643 similar to PIR|B96582|B96 hypothetical protein F15I1.21 [imported]
- Arabidopsis thaliana, partial (35%)
Length = 758
Score = 28.5 bits (62), Expect = 2.1
Identities = 14/43 (32%), Positives = 22/43 (50%)
Frame = +2
Query: 36 ANEMLKKFNMENRKSMSTPMCPKERLCKDVEEQVDETLYQSLI 78
ANE+L+K + TPM P + + + D T+Y SL+
Sbjct: 593 ANEILRKHARRLKLDSVTPMLPVQGSVFSIGSEEDTTIYSSLL 721
>AW736531 similar to PIR|D84481|D84 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (1%)
Length = 635
Score = 28.5 bits (62), Expect = 2.1
Identities = 17/46 (36%), Positives = 26/46 (55%), Gaps = 2/46 (4%)
Frame = +3
Query: 160 ISKNHVFHGKTKHFSIKLFFLRDVQK-DGDVC-LKYCKTEDQLSYI 203
I+ N VFH +TKH IKL + + + +C LKY + Q +Y+
Sbjct: 459 IASNPVFH*QTKHIEIKLSVYQLIPMINWQICLLKYSGIQGQHTYV 596
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 27.3 bits (59), Expect = 4.7
Identities = 10/19 (52%), Positives = 14/19 (73%)
Frame = +2
Query: 154 NHATIAISKNHVFHGKTKH 172
N T+ I+KN V+H +TKH
Sbjct: 494 NQITLYIAKNQVYHERTKH 550
>AL370574 similar to PIR|T51822|T518
N-(5'-phospho-D-ribosylformimino)-5-amino-1-
(5''-phosphoribosyl)-4-imidazolecarboxamide isomerase,
partial (11%)
Length = 506
Score = 27.3 bits (59), Expect = 4.7
Identities = 17/50 (34%), Positives = 20/50 (40%), Gaps = 13/50 (26%)
Frame = -3
Query: 138 TDLHLEQRETTEVMVDNH-------------ATIAISKNHVFHGKTKHFS 174
T LHL R+ V DNH T I+ NH G TK +S
Sbjct: 342 TALHLHHRDVNVVAEDNHVRMS*QVNGQANAGTTEIATNHQTQGYTKRYS 193
>TC90688 similar to GP|13161405|dbj|BAB33037. VuP5CS {Vigna unguiculata},
partial (35%)
Length = 947
Score = 27.3 bits (59), Expect = 4.7
Identities = 24/77 (31%), Positives = 35/77 (45%), Gaps = 4/77 (5%)
Frame = +1
Query: 74 YQSLIGCLMYLTTTRSDILYDVNVLSRFMNCAKESLFKGAK----RVLSRVLAATTAVNQ 129
Y +LI + S I V+ L + C +SL+ A +LSRVL +
Sbjct: 16 YITLIFFFFFFFLVFSSITVTVSTLH*LV-CVCDSLWTEAPWIQHELLSRVLNVLLLRLE 192
Query: 130 ALWLRKVLTDLHLEQRE 146
LWL++V+ D H E E
Sbjct: 193 QLWLQEVMED*HWEDLE 243
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.324 0.136 0.393
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,370,786
Number of Sequences: 36976
Number of extensions: 80503
Number of successful extensions: 520
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 516
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 517
length of query: 220
length of database: 9,014,727
effective HSP length: 92
effective length of query: 128
effective length of database: 5,612,935
effective search space: 718455680
effective search space used: 718455680
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 56 (26.2 bits)
Medicago: description of AC144722.14