
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148219.4 + phase: 0 /pseudo
(1109 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 104 2e-22
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 94 4e-19
BG647824 weakly similar to PIR|G96722|G96 hypothetical protein F... 78 3e-18
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 68 3e-17
AW773859 83 7e-16
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 68 2e-11
AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F... 46 1e-10
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 48 8e-09
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 57 4e-08
BG587170 similar to PIR|F86470|F8 probable retroelement polyprot... 57 5e-08
CA921361 36 4e-06
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 47 3e-05
BE942480 42 0.001
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 41 0.002
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 40 0.004
BG453259 homologue to GP|21434|emb|CA ORF4 {Solanum tuberosum}, ... 40 0.005
BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440... 40 0.007
TC93418 35 0.21
AW736531 similar to PIR|D84481|D84 probable retroelement pol pol... 29 1.3
TC90463 32 1.8
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 104 bits (259), Expect = 2e-22
Identities = 53/148 (35%), Positives = 85/148 (56%)
Frame = +2
Query: 958 DWAGCLDSRRSIFGQCFFLGNSLISWRTMKQLTISRSSSEAEYRALSAATYELQWLLYLL 1017
DWAG ++R+S G F LG ISW + KQ ++ S++EAEY A ++ + WL +L
Sbjct: 5 DWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRIL 184
Query: 1018 NDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIEIICHLVRDKLQAGIIKFLLVSSK 1077
+H +YCDN+SA+ + NP+FH +KHI+I H +R+ + + ++
Sbjct: 185 EVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPTE 364
Query: 1078 DQLADIFTNPLLPQPFSTLLSNLGMLNS 1105
+++ADIFT PL + F L LGM+ +
Sbjct: 365 EKIADIFTKPLKIESFYKLKKMLGMMKA 448
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 93.6 bits (231), Expect = 4e-19
Identities = 45/132 (34%), Positives = 83/132 (62%)
Frame = +1
Query: 954 FPDVDWAGCLDSRRSIFGQCFFLGNSLISWRTMKQLTISRSSSEAEYRALSAATYELQWL 1013
+ D D+AG +D+R+S+ G F L + ISW+ +Q ++ S+++AEY A + WL
Sbjct: 97 YVDADYAGNVDTRKSLSGFVFTLYGTTISWKANQQSVVTLSTTQAEYIAFVEGVKDAIWL 276
Query: 1014 LYLLNDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIEIICHLVRDKLQAGIIKFLL 1073
++ +L +T + + +CD+QSA+H+ + ++HE TKHI+I H +RD +++ I
Sbjct: 277 KGMIGELGITQEYVKI-HCDSQSAIHLANHQVYHERTKHIDIRLHFIRDMIESKEIVVEK 453
Query: 1074 VSSKDQLADIFT 1085
++S++ AD+FT
Sbjct: 454 MASEENPADVFT 489
>BG647824 weakly similar to PIR|G96722|G96 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (5%)
Length = 721
Score = 77.8 bits (190), Expect(2) = 3e-18
Identities = 40/78 (51%), Positives = 51/78 (65%), Gaps = 1/78 (1%)
Frame = -3
Query: 1000 YRALSAATYELQWLLYLLNDLHVTTVKLPVLYCDNQSAL-HIGTNPMFHEITKHIEIICH 1058
YR++ + E++WL YLLNDL T +K +LYCDNQSA HI N F E TKHIE+ CH
Sbjct: 374 YRSI*STICEIKWLTYLLNDLKFTFIKPAMLYCDNQSAARHIAANSSFLERTKHIELDCH 195
Query: 1059 LVRDKLQAGIIKFLLVSS 1076
+VR KLQ + L + S
Sbjct: 194 IVRVKLQLKLFHILHILS 141
Score = 33.5 bits (75), Expect(2) = 3e-18
Identities = 15/32 (46%), Positives = 23/32 (71%)
Frame = -2
Query: 957 VDWAGCLDSRRSIFGQCFFLGNSLISWRTMKQ 988
+D + CLD+ +SI C FLG+SLI W++ K+
Sbjct: 489 LD*SSCLDT*KSISYFCIFLGDSLICWKS*KK 394
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 67.8 bits (164), Expect(2) = 3e-17
Identities = 38/104 (36%), Positives = 55/104 (52%)
Frame = +3
Query: 1002 ALSAATYELQWLLYLLNDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIEIICHLVR 1061
+L A E W+ L+ +L ++ V YCD+QSALHI NP FH TKHI I H VR
Sbjct: 228 SLPQACKEAIWMQRLMEELGHKQEQITV-YCDSQSALHIARNPAFHSRTKHIGIQYHFVR 404
Query: 1062 DKLQAGIIKFLLVSSKDQLADIFTNPLLPQPFSTLLSNLGMLNS 1105
+ ++ G + + + D LAD T + F S+ G+L +
Sbjct: 405 EVVEEGSVDMQKIHTNDNLADAMTKSINTDKFIWCRSSYGLLET 536
Score = 39.7 bits (91), Expect(2) = 3e-17
Identities = 19/49 (38%), Positives = 28/49 (56%)
Frame = +1
Query: 954 FPDVDWAGCLDSRRSIFGQCFFLGNSLISWRTMKQLTISRSSSEAEYRA 1002
+ D D+AG D R+S G F L +SW + Q ++ S++EAEY A
Sbjct: 82 YVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLSKLQTVVALSTTEAEYMA 228
>AW773859
Length = 538
Score = 82.8 bits (203), Expect = 7e-16
Identities = 39/78 (50%), Positives = 52/78 (66%)
Frame = -3
Query: 17 AL*HFRLGHVSYERLAHMSQLYPF*LFFDSNATCDICHFARQKQLPFHLSSSVASNKFEL 76
AL HFRLGH+S +L + +PF + D N+ CDICH++R K+LPF LS++ AS +EL
Sbjct: 233 ALWHFRLGHLSNRKLLSLHSNFPF-ITIDQNSVCDICHYSRHKKLPFQLSTNRASKCYEL 57
Query: 77 LHFDI*GPLIVPFIHNHK 94
HFDI GP IHN +
Sbjct: 56 FHFDIWGPFSTQSIHNQR 3
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 68.2 bits (165), Expect = 2e-11
Identities = 34/103 (33%), Positives = 55/103 (53%)
Frame = +1
Query: 945 RNVLVKVFFFPDVDWAGCLDSRRSIFGQCFFLGNSLISWRTMKQLTISRSSSEAEYRALS 1004
RN K+ + D D+AG LD R+S G F L + +SW + KQ ++ S+++AE+ A +
Sbjct: 343 RNGSEKLEAYTDSDYAGDLDDRKSTSGYVFMLSSGAVSWSSKKQPVVTLSTTKAEFIAAA 522
Query: 1005 AATYELQWLLYLLNDLHVTTVKLPVLYCDNQSALHIGTNPMFH 1047
+ W+ +L L T +YCDN S + + NP+ H
Sbjct: 523 FCACQSVWMRRVLEKLGYTQSGSITMYCDNNSTIKLSKNPVLH 651
>AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F21P8.50 -
Arabidopsis thaliana, partial (4%)
Length = 723
Score = 46.2 bits (108), Expect(2) = 1e-10
Identities = 29/77 (37%), Positives = 45/77 (57%)
Frame = +2
Query: 1012 WLLYLLNDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIEIICHLVRDKLQAGIIKF 1071
++ Y+++ H T V + YCDN SALHI N +FHE T H E ++V+ + +++
Sbjct: 242 FIFYMISAKHST*V---LQYCDNISALHIAANMVFHERT*HRETDPYIVQG---SRMLQL 403
Query: 1072 LLVSSKDQLADIFTNPL 1088
+ +SKDQ A T PL
Sbjct: 404 MPSASKDQPAYSLTKPL 454
Score = 38.9 bits (89), Expect(2) = 1e-10
Identities = 20/33 (60%), Positives = 25/33 (75%)
Frame = +3
Query: 975 FLGNSLISWRTMKQLTISRSSSEAEYRALSAAT 1007
FL +SLISW++ KQ +SRS SEA RAL+ AT
Sbjct: 126 FLSSSLISWKSKKQCVVSRSFSEA**RALANAT 224
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 48.1 bits (113), Expect(2) = 8e-09
Identities = 51/149 (34%), Positives = 74/149 (49%)
Frame = +3
Query: 802 FMLMMLLLLVIHLMNFNSSNIFYILHSKLKILVS*STFLV*SLLTLHKEYFCARESIVLI 861
+MLM L L + + N N+F ++ K K LV F V LL +K ++ +E+I+L
Sbjct: 180 YMLMTLF*LEMIYLKSNMLNVFLLIVLKSKTLVPYDIF*VLRLLEANKAFYLIKENILLN 359
Query: 862 FSLIQVTLHLNLFLHHMIHHANFIVIIVHLTLICLHTKD**GD*FT*PTLDLTLLLPLNN 921
F I V L N L MI N ++I T++ L+ +D * + F * L L L NN
Sbjct: 360 F*RIVVILL*NPLLLLMIFL*NSTILIRPFTMMKLNIEDS*ANLFI*LLLVLIYPLLFNN 539
Query: 922 LASSSLLLLRNTT*LPLES*DI*RNVLVK 950
A+ L + T L LE +I*+ L K
Sbjct: 540 *ANLFRNLSKFTIRLLLEFYNI*KLPLPK 626
Score = 30.8 bits (68), Expect(2) = 8e-09
Identities = 13/30 (43%), Positives = 19/30 (63%)
Frame = +2
Query: 949 VKVFFFPDVDWAGCLDSRRSIFGQCFFLGN 978
+K+ F D DWA C +R+S+ G FLG+
Sbjct: 653 LKLSSFADSDWATCPTTRKSVTGYWVFLGS 742
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 57.0 bits (136), Expect = 4e-08
Identities = 30/74 (40%), Positives = 41/74 (54%)
Frame = +2
Query: 1029 VLYCDNQSALHIGTNPMFHEITKHIEIICHLVRDKLQAGIIKFLLVSSKDQLADIFTNPL 1088
+L CD SA ++ NP++H KHI I H VRD +Q G +K V + DQLAD T PL
Sbjct: 20 LLRCDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPL 199
Query: 1089 LPQPFSTLLSNLGM 1102
L + +G+
Sbjct: 200 SKSRHQLLRNKIGV 241
>BG587170 similar to PIR|F86470|F8 probable retroelement polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 718
Score = 56.6 bits (135), Expect = 5e-08
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 2/111 (1%)
Frame = -3
Query: 17 AL*HFRLGHVSYERLAHMSQLYPF*LFFDSNATCDICHFARQKQLPFHLSSSVASNKFEL 76
AL H RLGH L M F N C+ C + + F +S+V N F+L
Sbjct: 587 ALWHARLGHPHGRALNLMLPGVVF-----ENKNCEACILGKHCKNVFPRTSTVYENCFDL 423
Query: 77 LHFDI*GPLIVPFIH--NHKYFLTILDDSSRFVWIVLLKSKAEVSQHVKNF 125
++ D+ P + NHKYF+T +D+ S++ W+ L+ SK V KNF
Sbjct: 422 IYTDL---WTAPSLSRDNHKYFVTFIDEKSKYTWLTLIPSKDRVIDAFKNF 279
>CA921361
Length = 466
Score = 36.2 bits (82), Expect(2) = 4e-06
Identities = 18/34 (52%), Positives = 25/34 (72%)
Frame = -2
Query: 989 LTISRSSSEAEYRALSAATYELQWLLYLLNDLHV 1022
+TIS+SS + +YR +++ ELQWL YLLND V
Sbjct: 399 VTISKSS*D-KYRVMTSTICELQWLAYLLNDFKV 301
Score = 33.5 bits (75), Expect(2) = 4e-06
Identities = 19/39 (48%), Positives = 24/39 (60%)
Frame = -1
Query: 1050 TKHIEIICHLVRDKLQAGIIKFLLVSSKDQLADIFTNPL 1088
T+HIE+ C +V +KL + LLVSS LAD T PL
Sbjct: 238 TEHIELDCRIV*EKLPQNLFH-LLVSSSLHLADCVTKPL 125
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 47.4 bits (111), Expect = 3e-05
Identities = 27/76 (35%), Positives = 42/76 (54%)
Frame = +1
Query: 945 RNVLVKVFFFPDVDWAGCLDSRRSIFGQCFFLGNSLISWRTMKQLTISRSSSEAEYRALS 1004
++ + ++ + D DW G RRS G F ++ ISW T KQ + SS EAEY A +
Sbjct: 262 KSEVYELICYSDSDWCG---DRRSTSGYVFKFNDAAISWCTKKQPITALSSYEAEYIAGT 432
Query: 1005 AATYELQWLLYLLNDL 1020
AT++ WL ++ +L
Sbjct: 433 FATFQALWLDSVIKEL 480
>BE942480
Length = 396
Score = 42.4 bits (98), Expect = 0.001
Identities = 18/43 (41%), Positives = 30/43 (68%)
Frame = -2
Query: 1000 YRALSAATYELQWLLYLLNDLHVTTVKLPVLYCDNQSALHIGT 1042
YRA+S+ E++WL Y+++ L V ++K + Y DNQ+A HI +
Sbjct: 317 YRAMSSIVCEIEWLTYIVDVLKVQSIKPTLPYYDNQAARHIAS 189
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 41.2 bits (95), Expect = 0.002
Identities = 34/100 (34%), Positives = 47/100 (47%), Gaps = 8/100 (8%)
Frame = +1
Query: 1010 LQWLLYLL----NDLHVTTVKLPVLYCDNQSALHI---GTNPMFHEITKHIEIICHLVRD 1062
LQWLLY + +H TT L C + + +H G + + K ++I ++R
Sbjct: 421 LQWLLYFFTKPTSSMHTTTSHL---LCQSNNFIHCKKPGLS*KNQTLRKQLDIF--VLRK 585
Query: 1063 KLQAGIIKFLLVSSKDQLADIF-TNPLLPQPFSTLLSNLG 1101
+ GI + LADIF T LLP PF LLS LG
Sbjct: 586 SCKLGIASNFQYLPRTNLADIFFTKSLLP*PFHILLSKLG 705
Score = 40.8 bits (94), Expect = 0.003
Identities = 20/48 (41%), Positives = 28/48 (57%), Gaps = 5/48 (10%)
Frame = +2
Query: 1012 WLLY-----LLNDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIE 1054
W+LY L +L V +L ++YC NQ L+I N ++HE TKH E
Sbjct: 413 WILYNGCYTFLRNLQVQCTRLLLIYCVNQITLYIAKNQVYHERTKH*E 556
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 40.4 bits (93), Expect = 0.004
Identities = 19/63 (30%), Positives = 32/63 (50%)
Frame = +1
Query: 55 FARQKQLPFHLSSSVASNKFELLHFDI*GPLIVPFIHNHKYFLTILDDSSRFVWIVLLKS 114
F +K++ F ++ + +H D+ GP V +Y +TI+DD R VW+ L+
Sbjct: 40 FGNRKKVSFSTATHRTKGILDYIHSDLWGPSKVTSYGGRRYMMTIIDDFPRKVWVYFLRY 219
Query: 115 KAE 117
K E
Sbjct: 220 KNE 228
>BG453259 homologue to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (5%)
Length = 657
Score = 40.0 bits (92), Expect = 0.005
Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 2/112 (1%)
Frame = -2
Query: 960 AGCLDSRRSIFGQCFFLGNSLISWRTMKQLTISRSSSEAEYRALSAATYEL--QWLLYLL 1017
AG + R S G FLG +++ KQ ++R + S + + L L
Sbjct: 473 AGWIVDRGSTSGY*MFLGGNMVE*---KQNVVAR*VQRHNFELCSQGL*RVMDEELKIKL 303
Query: 1018 NDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIEIICHLVRDKLQAGII 1069
+DL + L+ +N I NP+ H TKHIEI H + +KL +G+I
Sbjct: 302 DDLIINYKDPMTLF*NNNFVSRIAHNPVQHYRTKHIEIDQHFIIEKLYSGLI 147
>BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440.1 [imported]
- Arabidopsis thaliana, partial (20%)
Length = 731
Score = 39.7 bits (91), Expect = 0.007
Identities = 27/99 (27%), Positives = 47/99 (47%)
Frame = +3
Query: 1006 ATYELQWLLYLLNDLHVTTVKLPVLYCDNQSALHIGTNPMFHEITKHIEIICHLVRDKLQ 1065
A + WL LL+++ + V+ DNQS + + NP+FH HI H +R+ ++
Sbjct: 126 AARQAMWLQDLLSEVTWEPCEEVVIRIDNQSVIALTRNPVFHGRGNHIHKRYHFIRECVE 305
Query: 1066 AGIIKFLLVSSKDQLADIFTNPLLPQPFSTLLSNLGMLN 1104
G ++ V + A I T L F + +GM++
Sbjct: 306 NGQVEVEHVPGEKHRAYI*TKALGRIIFREIRYYIGMID 422
>TC93418
Length = 533
Score = 34.7 bits (78), Expect = 0.21
Identities = 14/22 (63%), Positives = 18/22 (81%)
Frame = +3
Query: 1033 DNQSALHIGTNPMFHEITKHIE 1054
DNQSALH+ +N +FHE T HI+
Sbjct: 225 DNQSALHVTSNLIFHEWTNHID 290
>AW736531 similar to PIR|D84481|D84 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (1%)
Length = 635
Score = 28.9 bits (63), Expect(2) = 1.3
Identities = 12/18 (66%), Positives = 15/18 (82%)
Frame = +3
Query: 1038 LHIGTNPMFHEITKHIEI 1055
L+I +NP+FH TKHIEI
Sbjct: 453 LYIASNPVFH*QTKHIEI 506
Score = 21.6 bits (44), Expect(2) = 1.3
Identities = 9/17 (52%), Positives = 12/17 (69%)
Frame = +2
Query: 1072 LLVSSKDQLADIFTNPL 1088
L ++ DQLAD+FT L
Sbjct: 518 LSINPNDQLADMFTKVL 568
>TC90463
Length = 1175
Score = 31.6 bits (70), Expect = 1.8
Identities = 14/29 (48%), Positives = 17/29 (58%)
Frame = -2
Query: 1080 LADIFTNPLLPQPFSTLLSNLGMLNSYHS 1108
L D T L P F + +S LGMLN YH+
Sbjct: 322 LPDFLTKALPPPKFHSFISKLGMLNIYHA 236
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.362 0.160 0.603
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 46,646,226
Number of Sequences: 36976
Number of extensions: 889695
Number of successful extensions: 12535
Number of sequences better than 10.0: 57
Number of HSP's better than 10.0 without gapping: 3983
Number of HSP's successfully gapped in prelim test: 749
Number of HSP's that attempted gapping in prelim test: 8074
Number of HSP's gapped (non-prelim): 5672
length of query: 1109
length of database: 9,014,727
effective HSP length: 106
effective length of query: 1003
effective length of database: 5,095,271
effective search space: 5110556813
effective search space used: 5110556813
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (22.0 bits)
S2: 64 (29.3 bits)
Medicago: description of AC148219.4