
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0388a.4
(1489 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG586326 similar to PIR|G84493|G8 probable retroelement pol poly... 217 2e-56
TC77595 weakly similar to PIR|T18350|T18350 probable pol polypro... 171 2e-42
BF003873 similar to GP|14715222|em putative polyprotein {Cicer a... 168 1e-41
TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alterna... 105 2e-22
BG454871 weakly similar to GP|10140673|g putative gag-pol polypr... 67 7e-11
BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - ... 66 9e-11
BG644699 similar to PIR|T07863|T078 probable polyprotein - pinea... 64 6e-10
BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana... 51 3e-06
CA860311 weakly similar to GP|7289872|gb|A CG17427 gene product ... 42 0.002
AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis... 38 0.026
BG586308 weakly similar to PIR|F84528|F8 probable retroelement p... 37 0.044
BG644733 weakly similar to GP|15289942|db putative polyprotein {... 36 0.097
AA660473 34 0.37
TC93746 weakly similar to GP|22830935|dbj|BAC15800. hypothetical... 32 1.4
TC89832 homologue to GP|21618319|gb|AAM67369.1 unknown {Arabidop... 32 2.4
BQ137251 similar to GP|18026135|gb| cytochrome c oxidase subunit... 30 7.0
>BG586326 similar to PIR|G84493|G8 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 736
Score = 217 bits (553), Expect = 2e-56
Identities = 122/244 (50%), Positives = 161/244 (65%), Gaps = 2/244 (0%)
Frame = +2
Query: 834 TTALILALPKEGEPYDVYCDASHNGLGCVMMQEKKVIAYASRQLKIHEKNYPTHDLELAA 893
T+A IL LP E Y VY DAS GLGCV+ Q +KVIAYASRQL+ HE NYPTHDLE+AA
Sbjct: 8 TSAPILVLP-ELITYVVYTDASITGLGCVLTQHEKVIAYASRQLRKHEGNYPTHDLEMAA 184
Query: 894 IVYALKIWRHYLYGSMFTIYSDHKSLKYLFDQKDLNMRQRRWMEFLQDYEFDLQ*HPGKT 953
+V+ALKIWR YLYG+ I++DHKSLKY+F Q +LN+RQRRWMEF+ DY+ D+ +PGK
Sbjct: 185 VVFALKIWRSYLYGAKVQIHTDHKSLKYIFTQPELNLRQRRWMEFVADYDLDITYYPGKA 364
Query: 954 NVVADALSRKAMHVSSLMVKELELLEAF-*DLSLDVKIAPGKLSFGMVTVS-SGLLDEIK 1011
N+VADALSR+ + VS+ +E + L+ L L+V + S G+ V+ + L I+
Sbjct: 365 NLVADALSRRRVDVSA--EREADDLDGMVRALRLNV-LTKATESLGLEAVNQADLFTRIR 535
Query: 1012 SKQETDEGLLEWRKLVTQGKAPEFSIGSDNILRCKGRVCVPNDATMRRLILDEGHKSRLS 1071
Q DE L + V Q E+ D + GR+ VPND +++ I+ E HKSR S
Sbjct: 536 LAQGQDENL----QKVAQNDRTEYQTAKDGTILVNGRISVPNDRSLKEEIMSEAHKSRFS 703
Query: 1072 IHPG 1075
+HPG
Sbjct: 704 VHPG 715
>TC77595 weakly similar to PIR|T18350|T18350 probable pol polyprotein - rice
blast fungus gypsy retroelement (fragment), partial (14%)
Length = 1708
Score = 171 bits (433), Expect = 2e-42
Identities = 134/457 (29%), Positives = 215/457 (46%), Gaps = 17/457 (3%)
Frame = +2
Query: 1034 EFSIGSDNILRCKGRVCVPNDAT-------MRRLILDEGHKSRLSIHPGMNKMYQDLKLH 1086
E + S L +GR+ VP +R ++ E H S + HPG N + +
Sbjct: 77 ECQLDSLKRLTFRGRIWVPGSDDEESPLNELRTKLVQESHDSTAAGHPGRNGTLEIVSRK 256
Query: 1087 FWWPGMKKQVAEYVSTCLTC*NAKVEHQKPAGKLQSLDVPEWKWDSISMDFVTALPLTRR 1146
F+WPG + V +V C C + Q G L+ L VP +SMDF+T+LP TR
Sbjct: 257 FFWPGQSQTVRRFVRNCDVCGGIHIWRQAKRGFLKPLPVPNRLHSDLSMDFITSLPPTRG 436
Query: 1147 RFDA-IWVVVDRLTKTAHFVPISLNYKVEKLAEIHIAKIVRLHGVPSSIVSDRDSRFTSR 1205
R +WV+VDRL+K+ + + E A+ ++ R HG+P SIVSDR S + R
Sbjct: 437 RGSQYLWVIVDRLSKSVTLEEMD-TMEAEACAQRFLSCHYRFHGMPQSIVSDRGSNWVGR 613
Query: 1206 FWGALQQALGTKLRLSSAYNPQTDGQTERTIQSLEDLLRACVLDHKGSWDELLPLIEFTY 1265
FW + G LS++Y+PQTDG TER Q ++ +LRA V + +W +LLP ++
Sbjct: 614 FWREFCRLTGVTQLLSTSYHPQTDGGTERWNQEIQAVLRAYVCWSQDNWGDLLPTVQLAL 793
Query: 1266 NNSFHASIGMAPYEALYGRRCQTPLCWHQDGEHLVIGPE-----LVQQTTEEVKRIQEKM 1320
N ++SIG P+ +G P+ +D +V E LV++ + IQ ++
Sbjct: 794 RNRHNSSIGATPFFVEHGYHVD-PIPTVEDTGGVVSEGEAAAQLLVKRMKDVTGFIQAEI 970
Query: 1321 RIS*SRQKSYADNRRKELE-FQAGDHVFLRVTPMTGVGRAIKSKKLTPKFIGPYQITERV 1379
+ R ++ A+ RR + +Q GD V+L V+ + K L K Y++T V
Sbjct: 971 VAAQQRSEASANKRRCPADRYQVGDKVWLNVSNYKSPRPSKKLDWLHHK----YEVTRFV 1138
Query: 1380 GPVAYRIALPPFLSHIHDVLHVSQLRKYMAD---DSHVLEPDDIQLKDDLTVVMPPIKIV 1436
P + +P ++ HV LR+ +D V++P + DD V ++ +
Sbjct: 1139 TPHVVELNVP---GTVYPKFHVDLLRRAASDPLPGQEVVDPQPPPIVDDDGEVEWEVEEI 1309
Query: 1437 DRSTKRLRNK*VSLVKVVWNQATGDATWELEDKMRES 1473
+ + +V + DATWE D +RE+
Sbjct: 1310 LAARWHQVGRGRRRQALVKWKGFVDATWEAADAIRET 1420
>BF003873 similar to GP|14715222|em putative polyprotein {Cicer arietinum},
partial (82%)
Length = 559
Score = 168 bits (426), Expect = 1e-41
Identities = 83/135 (61%), Positives = 106/135 (78%), Gaps = 1/135 (0%)
Frame = +2
Query: 1355 GVGRAIKSKKLTPKFIGPYQITERVGPVAYRIALPPFLSHIHDVLHVSQLRKYMADDSHV 1414
GVGRA+KSKKLT +FIGPYQI+ERVG VAYR+ LPP L ++HDV HVSQLRKY+ D SHV
Sbjct: 2 GVGRALKSKKLTVRFIGPYQISERVGTVAYRVGLPPHLLNLHDVFHVSQLRKYVPDPSHV 181
Query: 1415 LEPDDIQLKDDLTVVMPPIKIVDRSTKRLRNK*VSLVKVVWNQATGDA-TWELEDKMRES 1473
++ DD+Q++D+LTV P++I DR K LR K + LV+VVW++A G++ TWELE KM ES
Sbjct: 182 IQSDDVQVRDNLTVETLPVRIDDRKVKTLRGKEIPLVRVVWDRANGESLTWELESKMVES 361
Query: 1474 HPDLFVNP*VSRAKI 1488
+P+LF SR KI
Sbjct: 362 YPELFA*GKFSRTKI 406
>TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alternaria
alternata}, partial (21%)
Length = 1540
Score = 105 bits (261), Expect = 2e-22
Identities = 58/166 (34%), Positives = 92/166 (54%), Gaps = 6/166 (3%)
Frame = +1
Query: 740 KCEFWMEEVKFLGHVISS-QGVAVDPSKIESIMSWEQPKIASDIRSFVGLAGYYRRFVKD 798
KCEF + VK++G ++++ +GV+ DP K+ +I W P RSF+G YY+ F+
Sbjct: 1030 KCEFSVTTVKYVGFILTAGKGVSCDPLKLAAIRDWLPPGSVKGARSFLGFCNYYKDFIPG 1209
Query: 799 YAKLTSPLTQLTKKNQPFAWTEKCEESFQEMKKRLTTALILALPKEGEPYDVYCDASHNG 858
Y+++T PLT+LT+K+ PF W + E +F ++K+ +L + V D S
Sbjct: 1210 YSEITEPLTRLTRKDFPFRWGAEQEAAFTKLKRLFAEEPVLRMFDPEAVTTVETDCSGFA 1389
Query: 859 LGCVMMQEKKV-----IAYASRQLKIHEKNYPTHDLELAAIVYALK 899
LG V+ QE +A+ S++L E NYP HD EL A+ L+
Sbjct: 1390 LGGVLTQEDGTGAAHPVAFHSQRLSPAEYNYPIHDKELLAVWACLR 1527
>BG454871 weakly similar to GP|10140673|g putative gag-pol polyprotein {Oryza
sativa (japonica cultivar-group)}, partial (7%)
Length = 674
Score = 66.6 bits (161), Expect = 7e-11
Identities = 34/85 (40%), Positives = 47/85 (55%)
Frame = +2
Query: 1200 SRFTSRFWGALQQALGTKLRLSSAYNPQTDGQTERTIQSLEDLLRACVLDHKGSWDELLP 1259
S S FW L + GT L +SSAY+P +DGQ+E + E LR + W + P
Sbjct: 20 SSLYSNFWKQLFKLHGTILTMSSAYHP*SDGQSEALNKGXEMYLRCLMFTDPLKWSKAFP 199
Query: 1260 LIEFTYNNSFHASIGMAPYEALYGR 1284
E+ YN S++ S M P++ALYGR
Sbjct: 200 WAEYWYNTSYNISAAMTPFKALYGR 274
Score = 33.1 bits (74), Expect = 0.82
Identities = 20/68 (29%), Positives = 32/68 (46%), Gaps = 2/68 (2%)
Frame = +1
Query: 1328 KSYADNRRKELEFQAGDHVFLRVTPMTGVGRAIK--SKKLTPKFIGPYQITERVGPVAYR 1385
K AD +R+ EFQ G+HV +++ P A++ K +P F + A+
Sbjct: 403 KHQADKKRRHFEFQLGEHVLVKLQPYQQSSVALRKYQKFGSPNFGSLLTVCSL*VESAFH 582
Query: 1386 IALPPFLS 1393
PP+LS
Sbjct: 583 CKSPPYLS 606
>BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (13%)
Length = 763
Score = 66.2 bits (160), Expect = 9e-11
Identities = 68/261 (26%), Positives = 108/261 (41%), Gaps = 19/261 (7%)
Frame = +3
Query: 654 RLTSRRPHSELDMGIVSTS*CRLELPMHPQYLWTT*TVCSDHFWTSSWWCSLTTF*FTRR 713
R+ RR S L G +T *CRL L + + CS W + W + TT * +
Sbjct: 9 RMIWRRQRSSLIEGRTATK*CRLVLRTPARLTKDSSIECSQTNWGTRWRSTSTTC**SHS 188
Query: 714 VKKNTKSICVKC*EYCKGKELYANGSKCEFWME----EVKFLGHVISSQGVAVD---PSK 766
V++ +I S + W ++ + S Q D PSK
Sbjct: 189 VRRTI*TI*K---------------SDLKRWTNT**NSIRPNAPLASPQANFWDTSSPSK 323
Query: 767 IES-IMSWEQP-------KIASDIRSFVGLAGYYRRFVKDYAKLTSPLTQLTKKNQPFAW 818
I+S P +IA G RF+ P +L N+ F W
Sbjct: 324 ESR*ILSRSPPY*TSLVQRIAERSSDSRGRIAALNRFISRSTDKCLPFYKLLCGNKRFVW 503
Query: 819 TEKCEESFQEMKKRLTTALILALPKEGEPYDVYCDASHNGLGCVMMQ----EKKVIAYAS 874
EKCEE+F+++K+ LTT +L+ P+ G+ +Y S + V+++ E+K I Y S
Sbjct: 504 DEKCEEAFEQLKQYLTTPPVLSKPEAGDTLSLYIAISSTAVSSVLIREDRGEQKPIFYTS 683
Query: 875 RQLKIHEKNYPTHDLELAAIV 895
+++ E YPT + A++
Sbjct: 684 KRMTDPETRYPTLEKMAFAVI 746
>BG644699 similar to PIR|T07863|T078 probable polyprotein - pineapple
retrotransposon dea1 (fragment), partial (5%)
Length = 231
Score = 63.5 bits (153), Expect = 6e-10
Identities = 30/73 (41%), Positives = 48/73 (65%), Gaps = 1/73 (1%)
Frame = +2
Query: 1344 DHVFLRVTPMT-GVGRAIKSKKLTPKFIGPYQITERVGPVAYRIALPPFLSHIHDVLHVS 1402
+ V L+V P G R K KL+ ++IGP+++ +R+G VAY +ALPP LS +H V HVS
Sbjct: 2 EQVLLKVLPTERGDCRFGKRGKLSLRYIGPFEVIKRIGEVAYELALPPGLSGVHPVFHVS 181
Query: 1403 QLRKYMADDSHVL 1415
++Y D ++++
Sbjct: 182 MFKRYHGDGNYII 220
>BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana}, partial
(10%)
Length = 624
Score = 51.2 bits (121), Expect = 3e-06
Identities = 48/194 (24%), Positives = 86/194 (43%), Gaps = 5/194 (2%)
Frame = +2
Query: 1039 SDNI-LRCKGRVCVPNDATMRRLILDEGHKSRLSIHPGMNKMYQDLK-LHFWWPGMKKQV 1096
+DNI +RC +P IL H S + H ++K ++ FWWP M K
Sbjct: 56 ADNIYIRCVAEEEIPG-------ILFHCHGSNYAGHFAVSKTVSKIQQAGFWWPTMFKDA 214
Query: 1097 AEYVSTCLTC*---NAKVEHQKPAGKLQSLDVPEWKWDSISMDFVTALPLTRRRFDAIWV 1153
++S C C N ++ P + ++V +D +DF+ P + I V
Sbjct: 215 HSFISKCDPCQRQGNIS*RNEMPQNFILEVEV----FDVWGIDFMGPFPSSYNN-KYILV 379
Query: 1154 VVDRLTKTAHFVPISLNYKVEKLAEIHIAKIVRLHGVPSSIVSDRDSRFTSRFWGALQQA 1213
VD ++K + N + ++ + I GVP ++SD S F ++ + L +
Sbjct: 380 AVDYVSKWVEAIASPTN-DATVVVKMFKSVIFPRFGVPRVVISDGGSHFINKVFEKLLKK 556
Query: 1214 LGTKLRLSSAYNPQ 1227
G + ++++AY+PQ
Sbjct: 557 NGVRHKVATAYHPQ 598
>CA860311 weakly similar to GP|7289872|gb|A CG17427 gene product {Drosophila
melanogaster}, partial (20%)
Length = 192
Score = 41.6 bits (96), Expect = 0.002
Identities = 19/43 (44%), Positives = 28/43 (64%)
Frame = +1
Query: 870 IAYASRQLKIHEKNYPTHDLELAAIVYALKIWRHYLYGSMFTI 912
IAYASR L E+NY + E A ++A++ +RHYL+G F +
Sbjct: 64 IAYASRLLTAAERNYTVVERECLAAIWAIRNFRHYLHGPKFEL 192
>AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis thaliana
chromosome II BAC F26H6; putative retroelement pol
polyprotein, partial (1%)
Length = 636
Score = 38.1 bits (87), Expect = 0.026
Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Frame = -2
Query: 893 AIVYALKIWRHYLYGSMFTIYSDHKSLKYLFDQKDLNMRQRRWMEFLQDYEFDLQ*HPG- 951
A+ +A K RHY+ + S +KY+F++ L R RW L +Y+ + +
Sbjct: 623 ALAWAAKRLRHYMINHTTWLVSKMDPIKYIFEKPALTGRIARWQMLLSEYDIEYRSQKAI 444
Query: 952 KTNVVADALSRKAM 965
K +++AD L+ + +
Sbjct: 443 KGSILADHLAHQPL 402
>BG586308 weakly similar to PIR|F84528|F8 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 686
Score = 37.4 bits (85), Expect = 0.044
Identities = 24/71 (33%), Positives = 37/71 (51%), Gaps = 1/71 (1%)
Frame = -2
Query: 1188 HGVPSSIVSDRDSRFTSRFWGALQQALGTKLRLSSAYNPQTDGQTERTIQSLEDLLRACV 1247
HG+P IV+D S F S + + +L +S PQ++GQ E + + + D L+ +
Sbjct: 685 HGLPYEIVTDNGSHFISNKFREFCERWRIRLNTASPRYPQSNGQAEASNKIIIDGLKKRL 506
Query: 1248 LDHKGSW-DEL 1257
KG W DEL
Sbjct: 505 DLKKGCWADEL 473
>BG644733 weakly similar to GP|15289942|db putative polyprotein {Oryza sativa
(japonica cultivar-group)}, partial (1%)
Length = 174
Score = 36.2 bits (82), Expect = 0.097
Identities = 17/43 (39%), Positives = 28/43 (64%)
Frame = -3
Query: 1146 RRFDAIWVVVDRLTKTAHFVPISLNYKVEKLAEIHIAKIVRLH 1188
+ +++I VVVDRLTK+ F+P +Y + A I + +IV +H
Sbjct: 130 KSYESI*VVVDRLTKSTLFIPFKTSYSAK*YARILLDEIVCIH 2
>AA660473
Length = 655
Score = 34.3 bits (77), Expect = 0.37
Identities = 10/22 (45%), Positives = 15/22 (67%)
Frame = -3
Query: 449 WGLKSSM*FWEWIGWRSITFSW 470
W +S + FWEW+GW+S+ W
Sbjct: 305 WN*ESLICFWEWLGWKSLAIRW 240
>TC93746 weakly similar to GP|22830935|dbj|BAC15800. hypothetical
protein~similar to gag-pol polyprotein {Oryza sativa
(japonica cultivar-group)}, partial (4%)
Length = 1019
Score = 32.3 bits (72), Expect = 1.4
Identities = 14/41 (34%), Positives = 26/41 (63%)
Frame = +2
Query: 1125 VPEWKWDSISMDFVTALPLTRRRFDAIWVVVDRLTKTAHFV 1165
VP+ W+ +++DF L T++ D+ VV D+ ++ AHF+
Sbjct: 476 VPKPPWEDVTIDFSLGLL*TQQLKDSKMVVGDKFSRMAHFI 598
>TC89832 homologue to GP|21618319|gb|AAM67369.1 unknown {Arabidopsis
thaliana}, partial (62%)
Length = 1347
Score = 31.6 bits (70), Expect = 2.4
Identities = 17/49 (34%), Positives = 24/49 (48%)
Frame = +1
Query: 1186 RLHGVPSSIVSDRDSRFTSRFWGALQQALGTKLRLSSAYNPQTDGQTER 1234
+++G PS I R FWG++ LG+ R Y+P TDG R
Sbjct: 727 QIYGPPSKINKARQRTNVQVFWGSVH-GLGSFCRRCYCYSPDTDGIARR 870
>BQ137251 similar to GP|18026135|gb| cytochrome c oxidase subunit II
{Sphaerospira fraseri}, partial (12%)
Length = 1073
Score = 30.0 bits (66), Expect = 7.0
Identities = 21/63 (33%), Positives = 28/63 (44%)
Frame = -3
Query: 1219 RLSSAYNPQTDGQTERTIQSLEDLLRACVLDHKGSWDELLPLIEFTYNNSFHASIGMAPY 1278
RLS + P D R + L LLR SW + PL+EF ++ A G AP
Sbjct: 300 RLSCPFRPPPDTGL-RNLSFLPPLLRNFCFSVSLSWRQPSPLVEFRWSPRCAARAGRAPV 124
Query: 1279 EAL 1281
+L
Sbjct: 123 VSL 115
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.342 0.148 0.511
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,733,986
Number of Sequences: 36976
Number of extensions: 758454
Number of successful extensions: 5152
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 2752
Number of HSP's successfully gapped in prelim test: 195
Number of HSP's that attempted gapping in prelim test: 2272
Number of HSP's gapped (non-prelim): 3160
length of query: 1489
length of database: 9,014,727
effective HSP length: 109
effective length of query: 1380
effective length of database: 4,984,343
effective search space: 6878393340
effective search space used: 6878393340
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (22.0 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0388a.4