
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146757.2 + phase: 0
(1715 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG644690 weakly similar to GP|18542179|gb putative pol protein {... 199 1e-68
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 203 5e-52
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 184 3e-46
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 178 1e-44
BG587156 similar to PIR|G85055|G8 probable polyprotein [imported... 166 6e-41
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 160 4e-39
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 143 5e-34
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 81 5e-30
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 125 2e-28
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 112 2e-24
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 108 2e-23
BI262917 weakly similar to GP|19920130|g Putative retroelement {... 99 1e-21
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 95 2e-19
BG586293 weakly similar to PIR|E84473|E84 probable retroelement ... 89 1e-17
BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sati... 82 2e-15
BG586273 weakly similar to PIR|F86470|F8 probable retroelement p... 78 3e-14
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 70 7e-12
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 68 3e-11
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 62 2e-09
TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol po... 61 4e-09
>BG644690 weakly similar to GP|18542179|gb putative pol protein {Zea mays},
partial (22%)
Length = 629
Score = 199 bits (505), Expect(2) = 1e-68
Identities = 91/141 (64%), Positives = 121/141 (85%)
Frame = -2
Query: 1265 VTRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLLSYAINHGIILYQMDVKSAFLNG 1324
+TRNK++LV QGY+Q+EGIDY E F+PVAR+EAIR+L+++A G LYQMDVKSAF+NG
Sbjct: 424 ITRNKSKLVVQGYNQKEGIDYDEAFSPVARMEAIRILIAFAAFMGFKLYQMDVKSAFING 245
Query: 1325 VIEEEVYVKQPPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDT 1384
++EEV+VKQPPGFED + P+HV++L K+LYGLKQAPRAWY+RLS FL+KN F+RG++D
Sbjct: 244 DLKEEVFVKQPPGFEDAEVPNHVFRLNKTLYGLKQAPRAWYERLSKFLLKNGFKRGKIDN 65
Query: 1385 TLFRKTLKKDILIVQIYVDDI 1405
TLF + ++LI+Q+YVDDI
Sbjct: 64 TLFLLKRE*ELLIIQVYVDDI 2
Score = 81.6 bits (200), Expect(2) = 1e-68
Identities = 38/62 (61%), Positives = 47/62 (75%)
Frame = -3
Query: 1198 SLIGLLSIIEPKTVEEALSDDGWILAMQEELNQFQRNDVWDLVPKPFQKNIIGTKWVFRN 1257
S +S IEPK V+EAL D WI +MQEEL+QF+R+ VW LVP+P K +IGT+WVFRN
Sbjct: 615 SFSAFISSIEPKNVKEALRDADWINSMQEELHQFERSKVWYLVPRPEGKTVIGTRWVFRN 436
Query: 1258 KL 1259
KL
Sbjct: 435 KL 430
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 203 bits (516), Expect = 5e-52
Identities = 94/217 (43%), Positives = 144/217 (66%), Gaps = 1/217 (0%)
Frame = +1
Query: 1440 IQINQSKEGVYVHQTKYTKELLKKFKLEDCKMMNTPMHPTCTLSKEDTGTVVDQKLYRGM 1499
+++ Q++EG+Y+ Q KY +LL++F +E + P+ P C L K++ G VD Y+ +
Sbjct: 1 VEVIQNEEGIYICQRKYVTDLLERFGMEKSNLSRNPIAPRCKLIKDENGVKVDATKYKQI 180
Query: 1500 IGSLLYLTASRPDILFSVCLCARFQSDPRESHLTAVKRIFRYLKGTTNLGLLYRKSLDYK 1559
+G L+YL A+RPD+++ + L +RF + P E H+ AVKR+ RYL GT NLG++Y+++ K
Sbjct: 181 VGCLMYLAATRPDLMYVLSLISRFMNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEK 360
Query: 1560 LIGFCDADYAGDRIERKSTSGNCQFLGENLISWASKRQATIAMSTAEAEYISAASCCTQL 1619
L + D+DYAGD +RKSTSG L +SW+SK+Q + +ST +AE+I+AA C Q
Sbjct: 361 LEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQS 540
Query: 1620 LWMKHQLEDY-QINANSIPIYCDNTAAICLSKNPILH 1655
+WM+ LE + SI +YCDN + I LSKNP+LH
Sbjct: 541 VWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 184 bits (466), Expect = 3e-46
Identities = 96/257 (37%), Positives = 159/257 (61%), Gaps = 5/257 (1%)
Frame = +3
Query: 1200 IGLLSII-EPKTVEEALSDDGWILAMQEELNQFQRNDVWDLVPKPFQKNIIGTKWVFRNK 1258
+G+L++ +P T EEA+ + W +M E+ +RN+ W+L IG KW+F+ K
Sbjct: 15 LGMLTMTSDPTTFEEAVKSEKWRASMNNEMEATERNNTWELTDLRSGAKTIGLKWIFKTK 194
Query: 1259 LNEQGEVTRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLLSYAI---NHGIILYQM 1315
LNE GE+ + KARLVA+GYSQQ G+DYTE FAPVAR + IR++++ A G+ + M
Sbjct: 195 LNENGEIEKYKARLVAKGYSQQYGVDYTEVFAPVARWDTIRMVIALAAQIKRDGVCIS*M 374
Query: 1316 DVKSAFLNGVIEEEVYVKQPPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKN 1375
+ + + + + + + ++K++LYGLKQAPRAWY R+ + K
Sbjct: 375 *KAHSCMEN*MRKFLLINH----RVM*RRVIS*RVKRALYGLKQAPRAWYSRIEAYFTKE 542
Query: 1376 DFERGQVDTTLFRKTLK-KDILIVQIYVDDIIFGSSNASLCKEFSKLMQDEFEMSMMGEL 1434
FE+ + TLF K + ILI+ +YVDD+IF ++ ++ +EF K M+ EF MS +G++
Sbjct: 543 GFEKCPYEHTLFVKLSEGGKILIISLYVDDLIFIGNDENMFEEFKKSMKKEFNMSDLGKM 722
Query: 1435 KFFMGIQINQSKEGVYV 1451
+F+G+++ Q+++G+Y+
Sbjct: 723 HYFLGVEVTQNEKGIYI 773
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 178 bits (452), Expect = 1e-44
Identities = 96/199 (48%), Positives = 126/199 (63%)
Frame = +1
Query: 1215 LSDDGWILAMQEELNQFQRNDVWDLVPKPFQKNIIGTKWVFRNKLNEQGEVTRNKARLVA 1274
LSD W+ AM+ E N WDLVP P K IG KWV+R K N G V + KARLVA
Sbjct: 82 LSDPRWLQAMKTEYKALIDNKTWDLVPLPPHKKAIGCKWVYRVKENPDGSVNKFKARLVA 261
Query: 1275 QGYSQQEGIDYTETFAPVARLEAIRLLLSYAINHGIILYQMDVKSAFLNGVIEEEVYVKQ 1334
+G+SQ G DYTETF+PV + IRL+L+ AI + + Q+D+ +AFLNG ++EEVY+ Q
Sbjct: 262 KGFSQTLGCDYTETFSPVIKPVTIRLILTIAITYKWEIQQIDINNAFLNGFLQEEVYMSQ 441
Query: 1335 PPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDTTLFRKTLKKD 1394
P GFE + V KL KSLYGLKQAPRAWY+ L++ I+ F + + D +L
Sbjct: 442 PQGFE-AANKSLVCKLNKSLYGLKQAPRAWYEXLTSAQIQFGFTKSRCDPSLLIYNQNGA 618
Query: 1395 ILIVQIYVDDIIFGSSNAS 1413
+ + IYVDDI+ S+AS
Sbjct: 619 CIYLXIYVDDILITGSSAS 675
>BG587156 similar to PIR|G85055|G8 probable polyprotein [imported] -
Arabidopsis thaliana, partial (17%)
Length = 618
Score = 166 bits (421), Expect = 6e-41
Identities = 83/183 (45%), Positives = 116/183 (63%)
Frame = -1
Query: 1208 PKTVEEALSDDGWILAMQEELNQFQRNDVWDLVPKPFQKNIIGTKWVFRNKLNEQGEVTR 1267
P++ EEA+ D W ++ E +ND W P K + ++W+F K G + R
Sbjct: 558 PRSYEEAMEDKEWKESVGAEAGAMIKNDTWYESELPKGKKAVSSRWIFTIKYKADGSIER 379
Query: 1268 NKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLLSYAINHGIILYQMDVKSAFLNGVIE 1327
K RLVA+G++ G DY ETFAPVA+L IR++LS A+N G L+QMDVK+AFL G +E
Sbjct: 378 KKTRLVARGFTLTYGEDYIETFAPVAKLHTIRIVLSLAVNLGWGLWQMDVKNAFLQGELE 199
Query: 1328 EEVYVKQPPGFEDLKHPDHVYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDTTLF 1387
+EVY+ PPG E L +V +LKK++YGLKQ+PRAWY++LS L F + ++D TLF
Sbjct: 198 DEVYMYPPPGLEHLVKRGNVLRLKKAIYGLKQSPRAWYNKLSTTLNGRGFRKSELDHTLF 19
Query: 1388 RKT 1390
T
Sbjct: 18 TLT 10
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 160 bits (405), Expect = 4e-39
Identities = 88/240 (36%), Positives = 135/240 (55%)
Frame = +2
Query: 1347 VYKLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDTTLFRKTLKKDILIVQIYVDDII 1406
V +L+KS+YGLKQA R WY +LS LI + + D +LF K + +YVDDI+
Sbjct: 20 VCELQKSIYGLKQASRQWYSKLSESLISFGYLQSSSDFSLFTKFKDSSFTTLLVYVDDIV 199
Query: 1407 FGSSNASLCKEFSKLMQDEFEMSMMGELKFFMGIQINQSKEGVYVHQTKYTKELLKKFKL 1466
++ S + + D F++ +G L++F+G+++ +SK+G+ ++Q KYT ELL+
Sbjct: 200 LAGNDISEIQHVKCFLIDRFKIKDLGSLRYFLGLEVARSKQGILLNQRKYTLELLEDSGN 379
Query: 1467 EDCKMMNTPMHPTCTLSKEDTGTVVDQKLYRGMIGSLLYLTASRPDILFSVCLCARFQSD 1526
K TP + L D+ D+ YR +IG L+YLT +RPDI F+V ++F S
Sbjct: 380 LAVKSTLTPYDISLKLHNSDSPLYNDETQYRRLIGKLIYLTTTRPDISFAVQQLSQFVSK 559
Query: 1527 PRESHLTAVKRIFRYLKGTTNLGLLYRKSLDYKLIGFCDADYAGDRIERKSTSGNCQFLG 1586
P++ H A R+ +YLK GL Y + + KL F D+D+A RKS +G FLG
Sbjct: 560 PQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKSVTGYWVFLG 739
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 143 bits (361), Expect = 5e-34
Identities = 73/146 (50%), Positives = 98/146 (67%), Gaps = 1/146 (0%)
Frame = +2
Query: 1566 ADYAGDRIERKSTSGNCQFLGENLISWASKRQATIAMSTAEAEYISAASCCTQLLWMKHQ 1625
+D+AGD RKSTSG LG ISW+SK+Q +A STAEAEYI++ SC TQ +W++
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 1626 LEDYQINANS-IPIYCDNTAAICLSKNPILHSRAKHIEIKHHFIRDYVQKGILDIQFIDT 1684
LE N+ IYCDN +AI LSKNP+ H R+KHI+I+ H IR+ + + + I++ T
Sbjct: 182 LEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPT 361
Query: 1685 EHQWADIFTKPLSVERFDFIKKNLNM 1710
E + ADIFTKPL +E F +KK L M
Sbjct: 362 EEKIADIFTKPLKIESFYKLKKMLGM 439
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 81.3 bits (199), Expect(2) = 5e-30
Identities = 33/100 (33%), Positives = 61/100 (61%)
Frame = +3
Query: 1611 SAASCCTQLLWMKHQLEDYQINANSIPIYCDNTAAICLSKNPILHSRAKHIEIKHHFIRD 1670
S C + +WM+ +E+ I +YCD+ +A+ +++NP HSR KHI I++HF+R+
Sbjct: 228 SLPQACKEAIWMQRLMEELGHKQEQITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVRE 407
Query: 1671 YVQKGILDIQFIDTEHQWADIFTKPLSVERFDFIKKNLNM 1710
V++G +D+Q I T AD TK ++ ++F + + + +
Sbjct: 408 VVEEGSVDMQKIHTNDNLADAMTKSINTDKFIWCRSSYGL 527
Score = 70.1 bits (170), Expect(2) = 5e-30
Identities = 35/78 (44%), Positives = 52/78 (65%)
Frame = +1
Query: 1535 VKRIFRYLKGTTNLGLLYRKSLDYKLIGFCDADYAGDRIERKSTSGNCQFLGENLISWAS 1594
VKRI RY+KGT+ + + + S + + G+ D+D+AGD +RKST+G L +SW S
Sbjct: 1 VKRIMRYIKGTSGVAVCFGGS-ELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLS 177
Query: 1595 KRQATIAMSTAEAEYISA 1612
K Q +A+ST EAEY++A
Sbjct: 178 KLQTVVALSTTEAEYMAA 231
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 125 bits (313), Expect = 2e-28
Identities = 73/214 (34%), Positives = 110/214 (51%), Gaps = 2/214 (0%)
Frame = +1
Query: 860 LELLHIDLFGPVNTASLYGSKYGLVIVDDYSRWTWVKFIKSKDYACEVFSSFCIQIQSEK 919
L+ +H DL+GP S G +Y + I+DD+ R WV F++ K+ F + I ++++
Sbjct: 97 LDYIHSDLWGPSKVTSYGGRRYMMTIIDDFPRKVWVYFLRYKNETFPTFKKWRILVETQT 276
Query: 920 ELNILKVRSDHGGEFENEPFELFCEKHGILHEFSSPRTPQQNRVVERKNRTLQEMARTMI 979
N+ K+ +D+ EF + F FC HGI + PR PQQN V ER RTL E AR M+
Sbjct: 277 GKNVKKLITDN*LEFCSSDFNEFCTNHGIARHKTIPRNPQQNGVAERMIRTLLERARCML 456
Query: 980 HENNL--AKHFWAEAVNTSCYIQNRIYIRPMLEKTAYELFKGRRPNISYFHQFGCTCYIL 1037
L + W EA +T+C++ NR + K +++ G + S FGC Y L
Sbjct: 457 SNAGL*N*RDLWVEAASTACHLVNRSPHSALDFKVPEDIWSGNLVDYSNLRIFGCPAYAL 636
Query: 1038 NTKDYLKKFDAKAQRGIFLGYSERSKAYRVYNSE 1071
K +A IFL Y+ SK YR++ S+
Sbjct: 637 VNDG---KLAPRAGECIFLSYASESKGYRLWCSD 729
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein [imported]
- Arabidopsis thaliana, partial (13%)
Length = 718
Score = 112 bits (279), Expect = 2e-24
Identities = 80/247 (32%), Positives = 127/247 (51%)
Frame = -3
Query: 757 VNKDDKSITFKGKRVENVYKINFSDLADQKVVCLLSMNDKKWVWHKRLGHANWRLISKIS 816
V K D + K V N YK +F+ + S+N K +WH RLGH + R ++ +
Sbjct: 680 VTKGDLYMLEKLDPVSN-YKCSFTSSS--------SLN-KDALWHARLGHPHGRALNLM- 534
Query: 817 KLQLVKGLPNIDYHSDALCGACQKGKIVKSSFKSKDIVSTSRPLELLHIDLFGPVNTASL 876
LP + + + C AC GK K+ F V + +L++ DL+ + S
Sbjct: 533 -------LPGVVFENKN-CEACILGKHCKNVFPRTSTVYENC-FDLIYTDLW-TAPSLSR 384
Query: 877 YGSKYGLVIVDDYSRWTWVKFIKSKDYACEVFSSFCIQIQSEKELNILKVRSDHGGEFEN 936
KY + +D+ S++TW+ I SKD + F +F + + I +RSD+GGE+ +
Sbjct: 383 DNHKYFVTFIDEKSKYTWLTLIPSKDRVIDAFKNFQAYVTNHYHAKIKILRSDNGGEYTS 204
Query: 937 EPFELFCEKHGILHEFSSPRTPQQNRVVERKNRTLQEMARTMIHENNLAKHFWAEAVNTS 996
F+ + HGILH+ S P TPQQN V +RKN+ L E+AR+++ + N V+T+
Sbjct: 203 YAFKSHLDHHGILHQTSCPYTPQQNGVAKRKNKHLMEVARSLMFQAN---------VSTA 51
Query: 997 CYIQNRI 1003
CY+ N I
Sbjct: 50 CYLINWI 30
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 108 bits (269), Expect = 2e-23
Identities = 57/163 (34%), Positives = 93/163 (56%), Gaps = 2/163 (1%)
Frame = +1
Query: 1534 AVKRIFRYLKGTTNLGLLYRKSLDYK--LIGFCDADYAGDRIERKSTSGNCQFLGENLIS 1591
A+K + +YL + L Y K+ + L G+ DADYAG+ RKS SG L IS
Sbjct: 4 ALKWVLKYLNESLKSSLKYTKAAQEEDALEGYVDADYAGNVDTRKSLSGFVFTLYGTTIS 183
Query: 1592 WASKRQATIAMSTAEAEYISAASCCTQLLWMKHQLEDYQINANSIPIYCDNTAAICLSKN 1651
W + +Q+ + +ST +AEYI+ +W+K + + I + I+CD+ +AI L+ +
Sbjct: 184 WKANQQSVVTLSTTQAEYIAFVEGVKDAIWLKGMIGELGITQEYVKIHCDSQSAIHLANH 363
Query: 1652 PILHSRAKHIEIKHHFIRDYVQKGILDIQFIDTEHQWADIFTK 1694
+ H R KHI+I+ HFIRD ++ + ++ + +E AD+FTK
Sbjct: 364 QVYHERTKHIDIRLHFIRDMIESKEIVVEKMASEENPADVFTK 492
>BI262917 weakly similar to GP|19920130|g Putative retroelement {Oryza sativa}
[Oryza sativa (japonica cultivar-group)], partial (8%)
Length = 426
Score = 99.4 bits (246), Expect(2) = 1e-21
Identities = 49/113 (43%), Positives = 68/113 (59%)
Frame = +1
Query: 957 TPQQNRVVERKNRTLQEMARTMIHENNLAKHFWAEAVNTSCYIQNRIYIRPMLEKTAYEL 1016
TPQQN V ER NRTL E R M+ +AK FWAEAV T+CY+ NR + KT E+
Sbjct: 85 TPQQNGVAERMNRTLLERTRAMLKTAGMAKSFWAEAVKTACYVINRSPSTVIDLKTPMEM 264
Query: 1017 FKGRRPNISYFHQFGCTCYILNTKDYLKKFDAKAQRGIFLGYSERSKAYRVYN 1069
+KG+ + S H FGC Y++ K D K+++ IFLGY++ K Y +++
Sbjct: 265 WKGKPVDYSSLHVFGCPVYVMYNSQERTKLDPKSRKCIFLGYADNVKGYXLWD 423
Score = 23.5 bits (49), Expect(2) = 1e-21
Identities = 8/22 (36%), Positives = 15/22 (67%)
Frame = +3
Query: 932 GEFENEPFELFCEKHGILHEFS 953
GE+ + F FC++ GI+ +F+
Sbjct: 9 GEYVDGEFLAFCKQEGIVRQFT 74
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 95.1 bits (235), Expect = 2e-19
Identities = 51/122 (41%), Positives = 74/122 (59%), Gaps = 3/122 (2%)
Frame = +1
Query: 1510 RPDILFSVCLCARFQSDPRESHLTAVKRIFRYLKGTTNLGLLY---RKSLDYKLIGFCDA 1566
RPDI +SV + ++F DPR+ HL A RI RY++GT GLL+ KS Y+LI + D+
Sbjct: 121 RPDICYSVSVISKFMHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDS 300
Query: 1567 DYAGDRIERKSTSGNCQFLGENLISWASKRQATIAMSTAEAEYISAASCCTQLLWMKHQL 1626
D+ GD R+STSG + ISW +K+Q A+S+ EAEYI+ Q LW+ +
Sbjct: 301 DWCGD---RRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVI 471
Query: 1627 ED 1628
++
Sbjct: 472 KE 477
>BG586293 weakly similar to PIR|E84473|E84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 763
Score = 89.0 bits (219), Expect = 1e-17
Identities = 44/86 (51%), Positives = 60/86 (69%)
Frame = +2
Query: 1239 LVPKPFQKNIIGTKWVFRNKLNEQGEVTRNKARLVAQGYSQQEGIDYTETFAPVARLEAI 1298
LV KP IG +W+++ K NE G + + KARLVA+GY +Q+GID+ E FAPV R+E I
Sbjct: 65 LVKKPTGVKPIGLRWIYKIKRNEDGTLIKYKARLVAKGYVKQQGIDFDEVFAPVVRIETI 244
Query: 1299 RLLLSYAINHGIILYQMDVKSAFLNG 1324
LLL+ A +G ++ +DVK AFLNG
Sbjct: 245 *LLLALAATNGC*IHHIDVKIAFLNG 322
>BE123913 weakly similar to GP|22093573|d polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 503
Score = 81.6 bits (200), Expect = 2e-15
Identities = 45/125 (36%), Positives = 71/125 (56%), Gaps = 3/125 (2%)
Frame = +1
Query: 1359 QAPRAWYDRLSNFLIKNDFERGQVDTTLFRK---TLKKDILIVQIYVDDIIFGSSNASLC 1415
Q+PR W+DR + + K + + Q D +F K T+KK ILIV YVDDI +
Sbjct: 1 QSPRDWFDRFT*VVKKFGYIQCQTDHAMFIKHSSTVKKAILIV--YVDDIFLTGDHGK*I 174
Query: 1416 KEFSKLMQDEFEMSMMGELKFFMGIQINQSKEGVYVHQTKYTKELLKKFKLEDCKMMNTP 1475
K L+ +EFE+ +G LK+F+G+++ + K+G + Q KY +LLK+ ++ CK + P
Sbjct: 175 KRLKNLLAEEFEIKDLGNLKYFLGMEVARWKKGSSISQRKYVLDLLKETRMIGCKTIRDP 354
Query: 1476 MHPTC 1480
C
Sbjct: 355 YGCNC 369
Score = 53.5 bits (127), Expect = 7e-07
Identities = 25/56 (44%), Positives = 35/56 (61%)
Frame = +2
Query: 1468 DCKMMNTPMHPTCTLSKEDTGTVVDQKLYRGMIGSLLYLTASRPDILFSVCLCARF 1523
D K TPM T L D GT+VD+ Y+ ++G L+YL+ +RPDI F VC ++F
Sbjct: 332 DVKPSETPMDATVKLGTLDNGTLVDKGRYQRLVGKLIYLSHTRPDISFVVCTMSQF 499
>BG586273 weakly similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 705
Score = 77.8 bits (190), Expect = 3e-14
Identities = 44/163 (26%), Positives = 78/163 (46%), Gaps = 12/163 (7%)
Frame = -2
Query: 996 SCYIQNRIYIRPMLEKTAYELFKGRRPNISYFHQFGCTCYILNTKDYLKKFDAKAQRGIF 1055
+CY+ NRI R + ++ +E+ R+P+++Y FGC CY+L + K +A++++ +F
Sbjct: 704 ACYLINRIPTRVLKDQAPFEVLNQRKPSLTYMRVFGCLCYVLVPGELRNKLEARSRKAMF 525
Query: 1056 LGYSERSKAYRVYNSETQCVEESMHVKFDDREPGSKTSEQSESNAGTTDSEDA------- 1108
+GYS K Y+ Y+ E + V S VKF + + Q + T+D
Sbjct: 524 IGYSTTQKGYKCYDPEARRVLVSRDVKFIEERGYYEEKNQEDLRDLTSDKAGVLRVILEG 345
Query: 1109 ----SESDQPSDSEKYTEVESSP-EAEITPEAESNSEAEPSSK 1146
DQ + S + E + P A TP + +EP ++
Sbjct: 344 LGIKMNQDQSTRSRQPEESSNEPRRAAQTPHLDHEGGSEPETQ 216
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 70.1 bits (170), Expect = 7e-12
Identities = 39/104 (37%), Positives = 57/104 (54%)
Frame = -3
Query: 1504 LYLTASRPDILFSVCLCARFQSDPRESHLTAVKRIFRYLKGTTNLGLLYRKSLDYKLIGF 1563
+ LT P+I FS+ L +R+ S P H +K I +YLKG ++GL Y K LIG+
Sbjct: 531 ILLTLQGPNITFSINLLSRYSSAPTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGY 352
Query: 1564 CDADYAGDRIERKSTSGNCQFLGENLISWASKRQATIAMSTAEA 1607
+A Y D + +S +G G +ISW S + +TIA S+ A
Sbjct: 351 VNA*YLSDPHKARS*TGYIFTCGNTVISWRSTK*STIATSSNHA 220
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 67.8 bits (164), Expect = 3e-11
Identities = 32/71 (45%), Positives = 43/71 (60%)
Frame = +2
Query: 1640 CDNTAAICLSKNPILHSRAKHIEIKHHFIRDYVQKGILDIQFIDTEHQWADIFTKPLSVE 1699
CD +A L+ NP+ HSR KHI I HF+RD VQ+G L +Q + T Q AD TKPLS
Sbjct: 29 CDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPLSKS 208
Query: 1700 RFDFIKKNLNM 1710
R ++ + +
Sbjct: 209 RHQLLRNKIGV 241
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 62.0 bits (149), Expect = 2e-09
Identities = 38/96 (39%), Positives = 52/96 (53%)
Frame = -2
Query: 1289 FAPVARLEAIRLLLSYAINHGIILYQMDVKSAFLNGVIEEEVYVKQPPGFEDLKHPDHVY 1348
F P+ +L I LLS + L +DVK+AFL G + E++Y+ QP GF + V
Sbjct: 554 FVPIVKLNTIMFLLSIVAIENLYLE*LDVKTAFLRGDLVEDIYMHQPEGF-S*EVGKMVG 378
Query: 1349 KLKKSLYGLKQAPRAWYDRLSNFLIKNDFERGQVDT 1384
KLKKS+YGLKQ PR L + F R ++ T
Sbjct: 377 KLKKSMYGLKQGPRQCI*SLKALCTRKGF-RSEIST 273
>TC92908 similar to GP|10140704|gb|AAG13538.1 putative gag-pol polyprotein
{Oryza sativa}, partial (1%)
Length = 638
Score = 60.8 bits (146), Expect = 4e-09
Identities = 58/203 (28%), Positives = 93/203 (45%), Gaps = 7/203 (3%)
Frame = +2
Query: 1122 EVESSPEAEITPEAESNSEAEPSSKVQNEIASEDFQDNTQQVIQPKFKHKSSHPEELIIG 1181
+VE+S EI + +S+ E + S EI + FQ+ TQ K K SH L+
Sbjct: 26 QVEASDLEEIQEDFKSHGEVQEESNYIEEI--KGFQEPTQL---RKIKE*ESH---LVTL 181
Query: 1182 SKESPRRTRSHF-------RQEESLIGLLSIIEPKTVEEALSDDGWILAMQEELNQFQRN 1234
+ S H+ + SL + S IEPK +A W+ AM++E+ + N
Sbjct: 182 NSSSKSYHIFHYVGYSFSAKHRASLAAITSNIEPKNYVQAAQ*QEWLAAMEQEIQVLEEN 361
Query: 1235 DVWDLVPKPFQKNIIGTKWVFRNKLNEQGEVTRNKARLVAQGYSQQEGIDYTETFAPVAR 1294
+ L P K + + V++ GE+ + KA+LVA+ + Q EG D+ + +
Sbjct: 362 NTSTLEPLREGKKWVDCRPVYKIIHKANGEIEKYKAQLVAKDFVQVEGEDF*D-LCLSNK 538
Query: 1295 LEAIRLLLSYAINHGIILYQMDV 1317
+ R LL+ A G L+ MDV
Sbjct: 539 DDNCRCLLTIAAAKG*QLHLMDV 607
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.315 0.131 0.375
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,586,516
Number of Sequences: 36976
Number of extensions: 679844
Number of successful extensions: 3200
Number of sequences better than 10.0: 104
Number of HSP's better than 10.0 without gapping: 3053
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3172
length of query: 1715
length of database: 9,014,727
effective HSP length: 110
effective length of query: 1605
effective length of database: 4,947,367
effective search space: 7940524035
effective search space used: 7940524035
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 65 (29.6 bits)
Medicago: description of AC146757.2