
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC138453.14 - phase: 0
(1715 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 197 1e-68
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 199 6e-51
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 185 1e-46
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 178 1e-44
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 166 7e-41
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 156 7e-38
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 140 4e-33
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 81 4e-29
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 122 9e-28
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 117 4e-26
BI262917 weakly similar to GP|19920130|g Putative retroelement {... 102 2e-22
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 105 2e-22
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 92 2e-18
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 89 2e-17
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 81 3e-15
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p... 80 9e-15
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 68 3e-11
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 67 6e-11
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 60 6e-09
TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol po... 59 1e-08
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 197 bits (501), Expect(2) = 1e-68
Identities = 90/141 (63%), Positives = 121/141 (84%)
Frame = -2
Query: 1265 VTRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLLSYAINHGIILYQMDVESAFLNG 1324
+TRNK++LV QGY+Q+EGIDY E F+PVAR+EAIR+L+++A G LYQMDV+SAF+NG
Sbjct: 424 ITRNKSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFING 245
Query: 1325 VIEEEVYVKQPPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDT 1384
++EEV+VKQPPGFED + P+HV++L K+LYGLKQAPRAWY+RLS FL+KN F+RG++D
Sbjct: 244 DLKEEVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDN 65
Query: 1385 TLFRRTLKKDILIVQIYVDDI 1405
TLF + ++LI+Q+YVDDI
Sbjct: 64 TLFLLKRE*ELLIIQVYVDDI 2
Score = 82.8 bits (203), Expect(2) = 1e-68
Identities = 38/62 (61%), Positives = 47/62 (75%)
Frame = -3
Query: 1198 SLIGLLSIIEPKTVEEALSDDGWILAMQEELNQFQRNDVWDLVPKPSQKNIIGTKWVFRN 1257
S +S IEPK V+EAL D WI +MQEEL+QF+R+ VW LVP+P K +IGT+WVFRN
Sbjct: 615 SFSAFISSIEPKNVKEALRDADWINSMQEELHQFERSKVWYLVPRPEGKTVIGTRWVFRN 436
Query: 1258 KL 1259
KL
Sbjct: 435 KL 430
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 199 bits (507), Expect = 6e-51
Identities = 93/217 (42%), Positives = 143/217 (65%), Gaps = 1/217 (0%)
Frame = +1
Query: 1440 IQINQSKEGVYVHQTKYTKELLKKFKLEDCKVMNTPMHPTCTLSKEDTGTVVDQKLYRGM 1499
+++ Q++EG+Y+ Q KY +LL++F +E + P+ P C L K++ G VD Y+ +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 1500 IGSLLYLTASRPDILFSVCLCARFQSDPRESHLTAVKRIFRYLKGTTNLGLLYRKSLDYK 1559
+G L+YL A+RPD+++ + L +RF + P E H+ AVKR+ RYL GT NLG++Y+++ K
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK 360
Query: 1560 LIGFCDADYAGDRIERKSTSENCQFLGENLISWASKRQATIAMSTAEAEYISAASCCTQL 1619
L + D+DYAGD +RKSTS L +SW+SK+Q + +ST +AE+I+AA C Q
Sbjct: 361 LEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQS 540
Query: 1620 LWMKHQLEDY-QINANSIPIYCDNTAAICLSKNPILH 1655
+WM+ LE + SI +YCDN + I LSKNP+LH
Sbjct: 541 VWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 185 bits (470), Expect = 1e-46
Identities = 97/257 (37%), Positives = 159/257 (61%), Gaps = 5/257 (1%)
Frame = +3
Query: 1200 IGLLSII-EPKTVEEALSDDGWILAMQEELNQFQRNDVWDLVPKPSQKNIIGTKWVFRNK 1258
+G+L++ +P T EEA+ + W +M E+ +RN+ W+L S IG KW+F+ K
Sbjct: 15 LGMLTMTSDPTTFEEAVKSEKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKTK 194
Query: 1259 LNEQGEVTRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLLSYAI---NHGIILYQM 1315
LNE GE+ + KARLVA+GYSQQ G+DYTE FAPVAR + IR++++ A G+ + M
Sbjct: 195 LNENGEIEKYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAAQIKRDGVCIS*M 374
Query: 1316 DVESAFLNGVIEEEVYVKQPPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKN 1375
+ + + + + + + ++K++LYGLKQAPRAWY R+ + K
Sbjct: 375 *KAHSCMEN*MRKFLLINH----RVM*RRVIS*RVKRALYGLKQAPRAWYSRIEAYFTKE 542
Query: 1376 DFERGQVDTTLFRRTLK-KDILIVQIYVDDIIFGSTNASLCKEFSKLMQDEFEMSMMGEL 1434
FE+ + TLF + + ILI+ +YVDD+IF + ++ +EF K M+ EF MS +G++
Sbjct: 543 GFEKCPYEHTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKM 722
Query: 1435 KFFLGIQINQSKEGVYV 1451
+FLG+++ Q+++G+Y+
Sbjct: 723 HYFLGVEVTQNEKGIYI 773
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 178 bits (452), Expect = 1e-44
Identities = 95/199 (47%), Positives = 126/199 (62%)
Frame = +1
Query: 1215 LSDDGWILAMQEELNQFQRNDVWDLVPKPSQKNIIGTKWVFRNKLNEQGEVTRNKARLVA 1274
LSD W+ AM+ E N WDLVP P K IG KWV+R K N G V + KARLVA
Sbjct: 82 LSDPRWLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVYRVKENPDGSVNKFKARLVA 261
Query: 1275 QGYSQQEGIDYTETFAPVARLEAIRLLLSYAINHGIILYQMDVESAFLNGVIEEEVYVKQ 1334
+G+SQ G DYTETF+PV + IRL+L+ AI + + Q+D+ +AFLNG ++EEVY+ Q
Sbjct: 262 KGFSQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAFLNGFLQEEVYMSQ 441
Query: 1335 PPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDTTLFRRTLKKD 1394
P GFE + V KL KSLYGLKQAPRAWY+ L++ I+ F + + D +L
Sbjct: 442 PQGFE-AANKSLVCKLNKSLYGLKQAPRAWYEXLTSAQIQFGFTKSRCDPSLLIYNQNGA 618
Query: 1395 ILIVQIYVDDIIFGSTNAS 1413
+ + IYVDDI+ ++AS
Sbjct: 619 CIYLXIYVDDILITGSSAS 675
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 166 bits (420), Expect = 7e-41
Identities = 82/183 (44%), Positives = 116/183 (62%)
Frame = -1
Query: 1208 PKTVEEALSDDGWILAMQEELNQFQRNDVWDLVPKPSQKNIIGTKWVFRNKLNEQGEVTR 1267
P++ EEA+ D W ++ E +ND W P K + ++W+F K G + R
Sbjct: 558 PRSYEEAMEDKEWKESVGAEAGAMIKNDTWYESELPKGKKAVSSRWIFTIKYKADGSIER 379
Query: 1268 NKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLLSYAINHGIILYQMDVESAFLNGVIE 1327
K RLVA+G++ G DY ETFAPVA+L IR++LS A+N G L+QMDV++AFL G +E
Sbjct: 378 KKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVKNAFLQGELE 199
Query: 1328 EEVYVKQPPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDTTLF 1387
+EVY+ PPG E L +V +LKK++YGLKQ+PRAWY++LS L F + ++D TLF
Sbjct: 198 DEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNKLSTTLNGRGFRKSELDHTLF 19
Query: 1388 RRT 1390
T
Sbjct: 18 TLT 10
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 156 bits (394), Expect = 7e-38
Identities = 87/240 (36%), Positives = 133/240 (55%)
Frame = +2
Query: 1347 VYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDTTLFRRTLKKDILIVQIYVDDII 1406
V +L+KS+YGLKQA R WY +LS LI + + D +LF + + +YVDDI+
Sbjct: 20 VCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFTKFKDSSFTTLLVYVDDIV 199
Query: 1407 FGSTNASLCKEFSKLMQDEFEMSMMGELKFFLGIQINQSKEGVYVHQTKYTKELLKKFKL 1466
+ S + + D F++ +G L++FLG+++ +SK+G+ ++Q KYT ELL+
Sbjct: 200 LAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELLEDSGN 379
Query: 1467 EDCKVMNTPMHPTCTLSKEDTGTVVDQKLYRGMIGSLLYLTASRPDILFSVCLCARFQSD 1526
K TP + L D+ D+ YR +IG L+YLT +RPDI F+V ++F S
Sbjct: 380 LAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQFVSK 559
Query: 1527 PRESHLTAVKRIFRYLKGTTNLGLLYRKSLDYKLIGFCDADYAGDRIERKSTSENCQFLG 1586
P++ H A R+ +YLK GL Y + + KL F D+D+A RKS + FLG
Sbjct: 560 PQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKSVTGYWVFLG 739
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 140 bits (353), Expect = 4e-33
Identities = 72/146 (49%), Positives = 97/146 (66%), Gaps = 1/146 (0%)
Frame = +2
Query: 1566 ADYAGDRIERKSTSENCQFLGENLISWASKRQATIAMSTAEAEYISAASCCTQLLWMKHQ 1625
+D+AGD RKSTS LG ISW+SK+Q +A STAEAEYI++ SC TQ +W++
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 1626 LEDYQINANS-IPIYCDNTAAICLSKNPILHSRAKHIEIKHHFIRDYVQKGILDIQFIDT 1684
LE N+ IYCDN +AI LSKNP+ H R+KHI+I+ H IR+ + + + I++ T
Sbjct: 182 LEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPT 361
Query: 1685 EHQWADIFTKPLSVERFDFIKKNLNM 1710
E + ADIFTKPL +E F +KK L M
Sbjct: 362 EEKIADIFTKPLKIESFYKLKKMLGM 439
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 81.3 bits (199), Expect(2) = 4e-29
Identities = 33/100 (33%), Positives = 61/100 (61%)
Frame = +3
Query: 1611 SAASCCTQLLWMKHQLEDYQINANSIPIYCDNTAAICLSKNPILHSRAKHIEIKHHFIRD 1670
S C + +WM+ +E+ I +YCD+ +A+ +++NP HSR KHI I++HF+R+
Sbjct: 228 SLPQACKEAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVRE 407
Query: 1671 YVQKGILDIQFIDTEHQWADIFTKPLSVERFDFIKKNLNM 1710
V++G +D+Q I T AD TK ++ ++F + + + +
Sbjct: 408 VVEEGSVDMQKIHTNDNLADAMTKSINTDKFIWCRSSYGL 527
Score = 67.0 bits (162), Expect(2) = 4e-29
Identities = 34/78 (43%), Positives = 51/78 (64%)
Frame = +1
Query: 1535 VKRIFRYLKGTTNLGLLYRKSLDYKLIGFCDADYAGDRIERKSTSENCQFLGENLISWAS 1594
VKRI RY+KGT+ + + + S + + G+ D+D+AGD +RKST+ L +SW S
Sbjct: 1 VKRIMRYIKGTSGVAVCFGGS-ELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLS 177
Query: 1595 KRQATIAMSTAEAEYISA 1612
K Q +A+ST EAEY++A
Sbjct: 178 KLQTVVALSTTEAEYMAA 231
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 122 bits (307), Expect = 9e-28
Identities = 71/214 (33%), Positives = 109/214 (50%), Gaps = 2/214 (0%)
Frame = +1
Query: 860 LELLHIDLFGPVNTASLYGSKYGLVIVDDYSRWTWVKFIKSKDYACEVFSSFCTQIQSEK 919
L+ +H DL+GP S G +Y + I+DD+ R WV F++ K+ F + ++++
Sbjct: 97 LDYIHSDLWGPSKVTSYGGRRYMMTIIDDFPRKVWVYFLRYKNETFPTFKKWRILVETQT 276
Query: 920 ELKILKVRSDHGGEFENEPFELFCEKHGILHEFSSPRTPQQNGVVERKNRTLQEMARTMI 979
+ K+ +D+ EF + F FC HGI + PR PQQNGV ER RTL E AR M+
Sbjct: 277 GKNVKKLITDN*LEFCSSDFNEFCTNHGIARHKTIPRNPQQNGVAERMIRTLLERARCML 456
Query: 980 HENNL--AKHFWAEAVNTSCYIQNRIYIRPMLEKTAYELFKGRRPNISYFHQFGCTCYIL 1037
L + W EA +T+C++ NR + K +++ G + S FGC Y L
Sbjct: 457 SNAGL*N*RDLWVEAASTACHLVNRSPHSALDFKVPEDIWSGNLVDYSNLRIFGCPAYAL 636
Query: 1038 NTKDYLKKFDAKAQRGIFLGYSERSKAYKVYNSE 1071
K +A IFL Y+ SK Y+++ S+
Sbjct: 637 VNDG---KLAPRAGECIFLSYASESKGYRLWCSD 729
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein [imported]
- Arabidopsis thaliana, partial (13%)
Length = 718
Score = 117 bits (293), Expect = 4e-26
Identities = 82/247 (33%), Positives = 129/247 (52%)
Frame = -3
Query: 757 VNKDDKSITFKGKRVENVYKINFSDLADQKVVCLLSMNDKKWVWHKRLGHANWRLISKIS 816
V K D + K V N YK +F+ + S+N K +WH RLGH + R ++ +
Sbjct: 680 VTKGDLYMLEKLDPVSN-YKCSFTSSS--------SLN-KDALWHARLGHPHGRALNLM- 534
Query: 817 KLQLVKGLPNIDYHSDALCGACQKGKIVKSSFKSKDIVSTSRPLELLHIDLFGPVNTASL 876
LP + + + C AC GK K+ F V + +L++ DL+ + S
Sbjct: 533 -------LPGVVFENKN-CEACILGKHCKNVFPRTSTVYENC-FDLIYTDLW-TAPSLSR 384
Query: 877 YGSKYGLVIVDDYSRWTWVKFIKSKDYACEVFSSFCTQIQSEKELKILKVRSDHGGEFEN 936
KY + +D+ S++TW+ I SKD + F +F + + KI +RSD+GGE+ +
Sbjct: 383 DNHKYFVTFIDEKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYHAKIKILRSDNGGEYTS 204
Query: 937 EPFELFCEKHGILHEFSSPRTPQQNGVVERKNRTLQEMARTMIHENNLAKHFWAEAVNTS 996
F+ + HGILH+ S P TPQQNGV +RKN+ L E+AR+++ + N V+T+
Sbjct: 203 YAFKSHLDHHGILHQTSCPYTPQQNGVAKRKNKHLMEVARSLMFQAN---------VSTA 51
Query: 997 CYIQNRI 1003
CY+ N I
Sbjct: 50 CYLINWI 30
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza sativa}
[Oryza sativa (japonica cultivar-group)], partial (8%)
Length = 426
Score = 102 bits (254), Expect(2) = 2e-22
Identities = 50/113 (44%), Positives = 69/113 (60%)
Frame = +1
Query: 957 TPQQNGVVERKNRTLQEMARTMIHENNLAKHFWAEAVNTSCYIQNRIYIRPMLEKTAYEL 1016
TPQQNGV ER NRTL E R M+ +AK FWAEAV T+CY+ NR + KT E+
Sbjct: 85 TPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVINRSPSTVIDLKTPMEM 264
Query: 1017 FKGRRPNISYFHQFGCTCYILNTKDYLKKFDAKAQRGIFLGYSERSKAYKVYN 1069
+KG+ + S H FGC Y++ K D K+++ IFLGY++ K Y +++
Sbjct: 265 WKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYXLWD 423
Score = 23.5 bits (49), Expect(2) = 2e-22
Identities = 8/22 (36%), Positives = 15/22 (67%)
Frame = +3
Query: 932 GEFENEPFELFCEKHGILHEFS 953
GE+ + F FC++ GI+ +F+
Sbjct: 9 GEYVDGEFLAFCKQEGIVRQFT 74
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 105 bits (261), Expect = 2e-22
Identities = 56/163 (34%), Positives = 92/163 (56%), Gaps = 2/163 (1%)
Frame = +1
Query: 1534 AVKRIFRYLKGTTNLGLLYRKSLDYK--LIGFCDADYAGDRIERKSTSENCQFLGENLIS 1591
A+K + +YL + L Y K+ + L G+ DADYAG+ RKS S L IS
Sbjct: 4 ALKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTIS 183
Query: 1592 WASKRQATIAMSTAEAEYISAASCCTQLLWMKHQLEDYQINANSIPIYCDNTAAICLSKN 1651
W + +Q+ + +ST +AEYI+ +W+K + + I + I+CD+ +AI L+ +
Sbjct: 184 WKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQEYVKIHCDSQSAIHLANH 363
Query: 1652 PILHSRAKHIEIKHHFIRDYVQKGILDIQFIDTEHQWADIFTK 1694
+ H R KHI+I+ HFIRD ++ + ++ + +E AD+FTK
Sbjct: 364 QVYHERTKHIDIRLHFIRDMIESKEIVVEKMASEENPADVFTK 492
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 92.0 bits (227), Expect = 2e-18
Identities = 50/122 (40%), Positives = 73/122 (58%), Gaps = 3/122 (2%)
Frame = +1
Query: 1510 RPDILFSVCLCARFQSDPRESHLTAVKRIFRYLKGTTNLGLLY---RKSLDYKLIGFCDA 1566
RPDI +SV + ++F DPR+ HL A RI RY++GT GLL+ KS Y+LI + D+
Sbjct: 121 RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDS 300
Query: 1567 DYAGDRIERKSTSENCQFLGENLISWASKRQATIAMSTAEAEYISAASCCTQLLWMKHQL 1626
D+ GD R+STS + ISW +K+Q A+S+ EAEYI+ Q LW+ +
Sbjct: 301 DWCGD---RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 1627 ED 1628
++
Sbjct: 472 KE 477
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 88.6 bits (218), Expect = 2e-17
Identities = 43/86 (50%), Positives = 61/86 (70%)
Frame = +2
Query: 1239 LVPKPSQKNIIGTKWVFRNKLNEQGEVTRNKARLVAQGYSQQEGIDYTETFAPVARLEAI 1298
LV KP+ IG +W+++ K NE G + + KARLVA+GY +Q+GID+ E FAPV R+E I
Sbjct: 65 LVKKPTGVKPIGLRWIYKIKRNEDGTLIKYKARLVAKGYVKQQGIDFDEVFAPVVRIETI 244
Query: 1299 RLLLSYAINHGIILYQMDVESAFLNG 1324
LLL+ A +G ++ +DV+ AFLNG
Sbjct: 245 *LLLALAATNGC*IHHIDVKIAFLNG 322
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 81.3 bits (199), Expect = 3e-15
Identities = 45/125 (36%), Positives = 71/125 (56%), Gaps = 3/125 (2%)
Frame = +1
Query: 1359 QAPRAWYDRLSNFLIKNDFERGQVDTTLFRR---TLKKDILIVQIYVDDIIFGSTNASLC 1415
Q+PR W+DR + + K + + Q D +F + T+KK ILIV YVDDI +
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIV--YVDDIFLTGDHGK*I 174
Query: 1416 KEFSKLMQDEFEMSMMGELKFFLGIQINQSKEGVYVHQTKYTKELLKKFKLEDCKVMNTP 1475
K L+ +EFE+ +G LK+FLG+++ + K+G + Q KY +LLK+ ++ CK + P
Sbjct: 175 KRLKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
Query: 1476 MHPTC 1480
C
Sbjct: 355 YGCNC 369
Score = 53.5 bits (127), Expect = 7e-07
Identities = 25/56 (44%), Positives = 35/56 (61%)
Frame = +2
Query: 1468 DCKVMNTPMHPTCTLSKEDTGTVVDQKLYRGMIGSLLYLTASRPDILFSVCLCARF 1523
D K TPM T L D GT+VD+ Y+ ++G L+YL+ +RPDI F VC ++F
Sbjct: 332 DVKPSETPMDATVKLGTLDNGTLVDKGRYQRLVGKLIYLSHTRPDISFVVCTMSQF 499
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 79.7 bits (195), Expect = 9e-15
Identities = 45/163 (27%), Positives = 79/163 (47%), Gaps = 12/163 (7%)
Frame = -2
Query: 996 SCYIQNRIYIRPMLEKTAYELFKGRRPNISYFHQFGCTCYILNTKDYLKKFDAKAQRGIF 1055
+CY+ NRI R + ++ +E+ R+P+++Y FGC CY+L + K +A++++ +F
Sbjct: 704 ACYLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMF 525
Query: 1056 LGYSERSKAYKVYNSETQCVEESMHVKFDDREPGSKTSEQSESNAGTTDSEDA------- 1108
+GYS K YK Y+ E + V S VKF + + Q + T+D
Sbjct: 524 IGYSTTQKGYKCYDPEARRVLVSRDVKFIEERGYYEEKNQEDLRDLTSDKAGVLRVILEG 345
Query: 1109 ----SESDQPSDSKKYTEVESSP-EAEITPEAESNSEAEPSSK 1146
DQ + S++ E + P A TP + +EP ++
Sbjct: 344 LGIKMNQDQSTRSRQPEESSNEPRRAAQTPHLDHEGGSEPETQ 216
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 67.8 bits (164), Expect = 3e-11
Identities = 32/71 (45%), Positives = 43/71 (60%)
Frame = +2
Query: 1640 CDNTAAICLSKNPILHSRAKHIEIKHHFIRDYVQKGILDIQFIDTEHQWADIFTKPLSVE 1699
CD +A L+ NP+ HSR KHI I HF+RD VQ+G L +Q + T Q AD TKPLS
Sbjct: 29 CDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPLSKS 208
Query: 1700 RFDFIKKNLNM 1710
R ++ + +
Sbjct: 209 RHQLLRNKIGV 241
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 67.0 bits (162), Expect = 6e-11
Identities = 38/104 (36%), Positives = 56/104 (53%)
Frame = -3
Query: 1504 LYLTASRPDILFSVCLCARFQSDPRESHLTAVKRIFRYLKGTTNLGLLYRKSLDYKLIGF 1563
+ LT P+I FS+ L +R+ S P H +K I +YLKG ++GL Y K LIG+
Sbjct: 531 ILLTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGY 352
Query: 1564 CDADYAGDRIERKSTSENCQFLGENLISWASKRQATIAMSTAEA 1607
+A Y D + +S + G +ISW S + +TIA S+ A
Sbjct: 351 VNA*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSSNHA 220
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 60.5 bits (145), Expect = 6e-09
Identities = 37/96 (38%), Positives = 52/96 (53%)
Frame = -2
Query: 1289 FAPVARLEAIRLLLSYAINHGIILYQMDVESAFLNGVIEEEVYVKQPPGFEDLKHPDHVY 1348
F P+ +L I LLS + L +DV++AFL G + E++Y+ QP GF + V
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGF-S*EVGKMVG 378
Query: 1349 KLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDT 1384
KLKKS+YGLKQ PR L + F R ++ T
Sbjct: 377 KLKKSMYGLKQGPRQCI*SLKALCTRKGF-RSEIST 273
>TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol polyprotein
{Oryza sativa}, partial (1%)
Length = 638
Score = 59.3 bits (142), Expect = 1e-08
Identities = 57/203 (28%), Positives = 92/203 (45%), Gaps = 7/203 (3%)
Frame = +2
Query: 1122 EVESSPEAEITPEAESNSEAEPSSKVQNEIASEDFLDNTQQVIQPKFKHKSSHPEELIIG 1181
+VE+S EI + +S+ E + S EI + F + TQ K K SH L+
Sbjct: 26 QVEASDLEEIQEDFKSHGEVQEESNYIEEI--KGFQEPTQL---RKIKE*ESH---LVTL 181
Query: 1182 SKDSPRRTRSHF-------RQEESLIGLLSIIEPKTVEEALSDDGWILAMQEELNQFQRN 1234
+ S H+ + SL + S IEPK +A W+ AM++E+ + N
Sbjct: 182 NSSSKSYHIFHYVGYSFSAKHRASLAAITSNIEPKNYVQAAQ*QEWLAAMEQEIQVLEEN 361
Query: 1235 DVWDLVPKPSQKNIIGTKWVFRNKLNEQGEVTRNKARLVAQGYSQQEGIDYTETFAPVAR 1294
+ L P K + + V++ GE+ + KA+LVA+ + Q EG D+ + +
Sbjct: 362 NTSTLEPLREGKKWVDCRPVYKIIHKANGEIEKYKAQLVAKDFVQVEGEDF*D-LCLSNK 538
Query: 1295 LEAIRLLLSYAINHGIILYQMDV 1317
+ R LL+ A G L+ MDV
Sbjct: 539 DDNCRCLLTIAAAKG*QLHLMDV 607
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.314 0.131 0.375
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,565,075
Number of Sequences: 36976
Number of extensions: 678561
Number of successful extensions: 3198
Number of sequences better than 10.0: 102
Number of HSP's better than 10.0 without gapping: 3070
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3175
length of query: 1715
length of database: 9,014,727
effective HSP length: 110
effective length of query: 1605
effective length of database: 4,947,367
effective search space: 7940524035
effective search space used: 7940524035
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 65 (29.6 bits)
Medicago: description of AC138453.14