
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126781.9 + phase: 0
(195 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 115 1e-26
AJ502495 weakly similar to GP|18071369|g putative gag-pol polypr... 111 2e-25
TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-relate... 70 2e-24
BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2... 94 4e-20
BG647824 weakly similar to PIR|G96722|G96 hypothetical protein F... 82 1e-16
BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F2... 73 8e-14
BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vu... 71 3e-13
BE941052 weakly similar to PIR|B85188|B85 retrotransposon like p... 65 1e-11
AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F... 43 3e-11
BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440... 52 1e-07
BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T... 45 2e-05
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 42 2e-05
CA921361 32 1e-04
BG453259 homologue to GP|21434|emb|CA ORF4 {Solanum tuberosum}, ... 42 1e-04
BE942480 40 6e-04
BG644691 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, pa... 39 0.001
TC93418 36 0.008
BG645347 weakly similar to GP|18033111|g functional candidate re... 32 0.20
AW736531 similar to PIR|D84481|D84 probable retroelement pol pol... 32 0.20
TC77790 similar to GP|15294268|gb|AAK95311.1 At1g32920/F9L11_25 ... 30 0.59
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 115 bits (287), Expect = 1e-26
Identities = 54/136 (39%), Positives = 88/136 (64%)
Frame = +1
Query: 37 LHGWCDSDWASCPLTRRSVTGWIVQLGDSPISWKTKKQHTVSRSSAEAEYRSMANTTCEL 96
L G+ D+D+A TR+S++G++ L + ISWK +Q V+ S+ +AEY + +
Sbjct: 88 LEGYVDADYAGNVDTRKSLSGFVFTLYGTTISWKANQQSVVTLSTTQAEYIAFVEGVKDA 267
Query: 97 KWLKGILSNLGVVHDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNIL 156
WLKG++ LG+ + + IHCDSQ+AIH+A + V+HERTKHI++ HF+R+ + I+
Sbjct: 268 IWLKGMIGELGITQEY-VKIHCDSQSAIHLANHQVYHERTKHIDIRLHFIRDMIESKEIV 444
Query: 157 PVYVSTANQPADIFTK 172
+++ PAD+FTK
Sbjct: 445 VEKMASEENPADVFTK 492
>AJ502495 weakly similar to GP|18071369|g putative gag-pol polyprotein {Oryza
sativa}, partial (9%)
Length = 542
Score = 111 bits (277), Expect = 2e-25
Identities = 58/146 (39%), Positives = 86/146 (58%)
Frame = +2
Query: 43 SDWASCPLTRRSVTGWIVQLGDSPISWKTKKQHTVSRSSAEAEYRSMANTTCELKWLKGI 102
SDWA TR+S +G+ LG ISW +KKQ V+ S+AEAEY + + + WL+ I
Sbjct: 2 SDWAGDTETRKSTSGYAFHLGTGAISWSSKKQPVVAFSTAEAEYIASTSCATQTVWLRRI 181
Query: 103 LSNLGVVHDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNILPVYVST 162
L + + P I+CD+++AI ++KNPVFH R+KHI++ H +R + + ++ Y T
Sbjct: 182 LEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIQFHKIRELIAEKEVVIEYCPT 361
Query: 163 ANQPADIFTKALGKPQFEFLLGKLGI 188
+ ADIFTK L F L LG+
Sbjct: 362 EEKIADIFTKPLKIESFYKLKKMLGM 439
>TC85125 weakly similar to SP|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
(EC 3.4.23.-);, partial (7%)
Length = 705
Score = 70.1 bits (170), Expect(2) = 2e-24
Identities = 34/101 (33%), Positives = 58/101 (56%)
Frame = +3
Query: 88 SMANTTCELKWLKGILSNLGVVHDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVR 147
S+ E W++ ++ LG ++ + ++CDSQ+A+HIA+NP FH RTKHI + HFVR
Sbjct: 228 SLPQACKEAIWMQRLMEELGHKQEQ-ITVYCDSQSALHIARNPAFHSRTKHIGIQYHFVR 404
Query: 148 NEVLKNNILPVYVSTANQPADIFTKALGKPQFEFLLGKLGI 188
V + ++ + T + AD TK++ +F + G+
Sbjct: 405 EVVEEGSVDMQKIHTNDNLADAMTKSINTDKFIWCRSSYGL 527
Score = 58.9 bits (141), Expect(2) = 2e-24
Identities = 29/73 (39%), Positives = 45/73 (60%)
Frame = +1
Query: 14 RVVRFLKGNPGQGIFFGSKSDLRLHGWCDSDWASCPLTRRSVTGWIVQLGDSPISWKTKK 73
R++R++KG G + FG S+L + G+ DSD+A R+S TG++ L +SW +K
Sbjct: 7 RIMRYIKGTSGVAVCFGG-SELTVRGYVDSDFAGDHDKRKSTTGYVFTLAGGAVSWLSKL 183
Query: 74 QHTVSRSSAEAEY 86
Q V+ S+ EAEY
Sbjct: 184QTVVALSTTEAEY 222
>BG586159 weakly similar to PIR|T47841|T4 hypothetical protein T2O9.150 -
Arabidopsis thaliana, partial (11%)
Length = 732
Score = 93.6 bits (231), Expect = 4e-20
Identities = 44/133 (33%), Positives = 72/133 (54%)
Frame = +1
Query: 1 MNQPRTEHWEAALRVVRFLKGNPGQGIFFGSKSDLRLHGWCDSDWASCPLTRRSVTGWIV 60
MN P H A RV+R+L G GI + +L + DSD+A R+S +G++
Sbjct: 253 MNCPTELHMHAVKRVLRYLNGTINLGIMYKRNGSEKLEAYTDSDYAGDLDDRKSTSGYVF 432
Query: 61 QLGDSPISWKTKKQHTVSRSSAEAEYRSMANTTCELKWLKGILSNLGVVHDKPMIIHCDS 120
L +SW +KKQ V+ S+ +AE+ + A C+ W++ +L LG + ++CD+
Sbjct: 433 MLSSGAVSWSSKKQPVVTLSTTKAEFIAAAFCACQSVWMRRVLEKLGYTQSGSITMYCDN 612
Query: 121 QAAIHIAKNPVFH 133
+ I ++KNPV H
Sbjct: 613 NSTIKLSKNPVLH 651
>BG647824 weakly similar to PIR|G96722|G96 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (5%)
Length = 721
Score = 82.4 bits (202), Expect = 1e-16
Identities = 58/150 (38%), Positives = 86/150 (56%), Gaps = 4/150 (2%)
Frame = -3
Query: 2 NQPRTEHWEAALRVVRFLKGNPGQGIFFGSKSDLRLHGWCDSD---WASCPLTRRSVTGW 58
+Q +H +AA++++ +LK +P Q IFF S +++ +CDSD A+ L +SV
Sbjct: 608 HQSTAQHPQAAIQIL-YLKISPSQ*IFF--PS*IQIKAFCDSD*IDQAA*TLENQSVIFA 438
Query: 59 IVQLGDSPISWKTKKQHTVSRSSAEAEYRSMANTTCELKWLKGILSNLGVVHDKPMIIHC 118
S KK++ YRS+ +T CE+KWL +L++L KP +++C
Sbjct: 437 SS*ATHSYAGNLKKKRYNFKI------YRSI*STICEIKWLTYLLNDLKFTFIKPAMLYC 276
Query: 119 DSQ-AAIHIAKNPVFHERTKHIEVDCHFVR 147
D+Q AA HIA N F ERTKHIE+DCH VR
Sbjct: 275 DNQSAARHIAANSSFLERTKHIELDCHIVR 186
>BI309716 weakly similar to PIR|G96722|G9 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana, partial (10%)
Length = 744
Score = 72.8 bits (177), Expect = 8e-14
Identities = 30/63 (47%), Positives = 50/63 (78%)
Frame = +2
Query: 1 MNQPRTEHWEAALRVVRFLKGNPGQGIFFGSKSDLRLHGWCDSDWASCPLTRRSVTGWIV 60
+++P+ H++AA+RV+++LK P +G+F+ + S+L+L + DSDWA+CP TR+SVTG+ V
Sbjct: 551 VSKPQQVHYQAAIRVLQYLKTAPAKGLFYSATSNLKLSSFADSDWATCPTTRKSVTGYWV 730
Query: 61 QLG 63
LG
Sbjct: 731 FLG 739
>BF650113 weakly similar to GP|4753889|emb| Tpv2-1c {Phaseolus vulgaris},
partial (13%)
Length = 494
Score = 70.9 bits (172), Expect = 3e-13
Identities = 41/109 (37%), Positives = 62/109 (56%), Gaps = 3/109 (2%)
Frame = +1
Query: 1 MNQPRTEHWEAALRVVRFLKGNPGQGIFF--GSKSDL-RLHGWCDSDWASCPLTRRSVTG 57
M+ PR H AA R++R+++G G+ F G+KS++ L + DSDW RRS +G
Sbjct: 163 MHDPRKPHLIAANRILRYVRGTMEYGLLFPYGAKSEVYELICYSDSDWCG---DRRSTSG 333
Query: 58 WIVQLGDSPISWKTKKQHTVSRSSAEAEYRSMANTTCELKWLKGILSNL 106
++ + D+ ISW TKKQ + SS EAEY + T + WL ++ L
Sbjct: 334 YVFKFNDAAISWCTKKQPITALSSYEAEYIAGTFATFQALWLDSVIKEL 480
>BE941052 weakly similar to PIR|B85188|B85 retrotransposon like protein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 480
Score = 65.5 bits (158), Expect = 1e-11
Identities = 29/74 (39%), Positives = 45/74 (60%)
Frame = +2
Query: 115 IIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNILPVYVSTANQPADIFTKAL 174
++ CD +A ++ NPV+H R KHI +D HFVR+ V + + +V T +Q AD TK L
Sbjct: 20 LLRCDYLSATYLTHNPVYHSRMKHISIDIHFVRDLVQQGKLKVQHVCTVDQLADCLTKPL 199
Query: 175 GKPQFEFLLGKLGI 188
K + + L K+G+
Sbjct: 200 SKSRHQLLRNKIGV 241
>AJ497569 weakly similar to PIR|T04833|T04 hypothetical protein F21P8.50 -
Arabidopsis thaliana, partial (4%)
Length = 723
Score = 43.1 bits (100), Expect(2) = 3e-11
Identities = 25/65 (38%), Positives = 39/65 (59%)
Frame = +2
Query: 110 HDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNILPVYVSTANQPADI 169
H ++ +CD+ +A+HIA N VFHERT H E D + V+ + ++P ++ +QPA
Sbjct: 269 HST*VLQYCDNISALHIAANMVFHERT*HRETDPYIVQGSRML-QLMP--SASKDQPAYS 439
Query: 170 FTKAL 174
TK L
Sbjct: 440 LTKPL 454
Score = 41.2 bits (95), Expect(2) = 3e-11
Identities = 31/78 (39%), Positives = 43/78 (54%)
Frame = +3
Query: 16 VRFLKGNPGQGIFFGSKSDLRLHGWCDSDWASCPLTRRSVTGWIVQLGDSPISWKTKKQH 75
+ +LK PG+ IF + S+ + SCP + R T + L S ISWK+KKQ
Sbjct: 15 LHYLK-TPGKCIFVSNASNPHFNR------GSCPYSIR*TTEFCF-LSSSLISWKSKKQC 170
Query: 76 TVSRSSAEAEYRSMANTT 93
VSRS +EA R++AN T
Sbjct: 171VVSRSFSEA**RALANAT 224
>BG587141 similar to PIR|H86461|H86 hypothetical protein AAF32440.1
[imported] - Arabidopsis thaliana, partial (20%)
Length = 731
Score = 52.0 bits (123), Expect = 1e-07
Identities = 31/94 (32%), Positives = 49/94 (51%)
Frame = +3
Query: 98 WLKGILSNLGVVHDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNILP 157
WL+ +LS + + ++I D+Q+ I + +NPVFH R HI HF+R V +
Sbjct: 144 WLQDLLSEVTWEPCEEVVIRIDNQSVIALTRNPVFHGRGNHIHKRYHFIRECVENGQVEV 323
Query: 158 VYVSTANQPADIFTKALGKPQFEFLLGKLGIRNL 191
+V A I TKALG+ F + +G+ +L
Sbjct: 324 EHVPGEKHRAYI*TKALGRIIFREIRYYIGMIDL 425
>BG644677 weakly similar to GP|7682800|gb| Hypothetical protein T15F17.l
{Arabidopsis thaliana}, partial (3%)
Length = 539
Score = 45.1 bits (105), Expect = 2e-05
Identities = 25/81 (30%), Positives = 41/81 (49%)
Frame = -3
Query: 4 PRTEHWEAALRVVRFLKGNPGQGIFFGSKSDLRLHGWCDSDWASCPLTRRSVTGWIVQLG 63
P H + ++LKG G+F+ L G+ ++ + S P RS TG+I G
Sbjct: 462 PTMRH*NGIKHICKYLKGIIDMGLFYSKDCSPDLIGYVNA*YLSDPHKARS*TGYIFTCG 283
Query: 64 DSPISWKTKKQHTVSRSSAEA 84
++ ISW++ K T++ SS A
Sbjct: 282 NTVISWRSTK*STIATSSNHA 220
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 42.4 bits (98), Expect(2) = 2e-05
Identities = 22/51 (43%), Positives = 33/51 (64%)
Frame = +2
Query: 103 LSNLGVVHDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKN 153
L NL V + ++I+C +Q ++IAKN V+HERTKH E + F+ E + N
Sbjct: 443 LRNLQVQCTRLLLIYCVNQITLYIAKNQVYHERTKH*ENNWTFLF*EKVAN 595
Score = 21.6 bits (44), Expect(2) = 2e-05
Identities = 12/29 (41%), Positives = 14/29 (47%)
Frame = +1
Query: 159 YVSTANQPADIFTKALGKPQFEFLLGKLG 187
Y+ N FTK+L F LL KLG
Sbjct: 619 YLPRTNLADIFFTKSLLP*PFHILLSKLG 705
>CA921361
Length = 466
Score = 31.6 bits (70), Expect(2) = 1e-04
Identities = 15/34 (44%), Positives = 25/34 (73%)
Frame = -2
Query: 76 TVSRSSAEAEYRSMANTTCELKWLKGILSNLGVV 109
T+S+SS + +YR M +T CEL+WL +L++ V+
Sbjct: 396 TISKSS*D-KYRVMTSTICELQWLAYLLNDFKVL 298
Score = 30.0 bits (66), Expect(2) = 1e-04
Identities = 17/40 (42%), Positives = 24/40 (59%)
Frame = -1
Query: 135 RTKHIEVDCHFVRNEVLKNNILPVYVSTANQPADIFTKAL 174
+T+HIE+DC V E L N+ + VS++ AD TK L
Sbjct: 241 KTEHIELDCRIV-*EKLPQNLFHLLVSSSLHLADCVTKPL 125
>BG453259 homologue to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (5%)
Length = 657
Score = 42.4 bits (98), Expect = 1e-04
Identities = 23/58 (39%), Positives = 33/58 (56%)
Frame = -2
Query: 99 LKGILSNLGVVHDKPMIIHCDSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNIL 156
LK L +L + + PM + ++ IA NPV H RTKHIE+D HF+ ++ IL
Sbjct: 317 LKIKLDDLIINYKDPMTLF*NNNFVSRIAHNPVQHYRTKHIEIDQHFIIEKLYSGLIL 144
>BE942480
Length = 396
Score = 40.0 bits (92), Expect = 6e-04
Identities = 19/42 (45%), Positives = 28/42 (66%)
Frame = -2
Query: 86 YRSMANTTCELKWLKGILSNLGVVHDKPMIIHCDSQAAIHIA 127
YR+M++ CE++WL I+ L V KP + + D+QAA HIA
Sbjct: 317 YRAMSSIVCEIEWLTYIVDVLKVQSIKPTLPYYDNQAARHIA 192
>BG644691 similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial (5%)
Length = 753
Score = 39.3 bits (90), Expect = 0.001
Identities = 22/58 (37%), Positives = 32/58 (54%)
Frame = +1
Query: 119 DSQAAIHIAKNPVFHERTKHIEVDCHFVRNEVLKNNILPVYVSTANQPADIFTKALGK 176
D+ I IA NP+ H+RTKH E+D H + E L + ++V + N + TK L K
Sbjct: 445 DNIIPISIAHNPI*HDRTKHTEIDRHLHQRESLVTP*VLLFVQSINNQR-MLTKGLSK 615
>TC93418
Length = 533
Score = 36.2 bits (82), Expect = 0.008
Identities = 15/29 (51%), Positives = 21/29 (71%)
Frame = +3
Query: 117 HCDSQAAIHIAKNPVFHERTKHIEVDCHF 145
H D+Q+A+H+ N +FHE T HI+ D HF
Sbjct: 219 HYDNQSALHVTSNLIFHEWTNHID-DHHF 302
>BG645347 weakly similar to GP|18033111|g functional candidate resistance
protein KR1 {Glycine max}, partial (9%)
Length = 760
Score = 31.6 bits (70), Expect = 0.20
Identities = 11/21 (52%), Positives = 16/21 (75%)
Frame = -1
Query: 122 AAIHIAKNPVFHERTKHIEVD 142
+ I++ P FHERTKH+E+D
Sbjct: 760 STIYLTYYPTFHERTKHLEID 698
>AW736531 similar to PIR|D84481|D84 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (1%)
Length = 635
Score = 31.6 bits (70), Expect = 0.20
Identities = 12/18 (66%), Positives = 16/18 (88%)
Frame = +3
Query: 124 IHIAKNPVFHERTKHIEV 141
++IA NPVFH +TKHIE+
Sbjct: 453 LYIASNPVFH*QTKHIEI 506
>TC77790 similar to GP|15294268|gb|AAK95311.1 At1g32920/F9L11_25
{Arabidopsis thaliana}, partial (23%)
Length = 1206
Score = 30.0 bits (66), Expect = 0.59
Identities = 12/27 (44%), Positives = 17/27 (62%)
Frame = +1
Query: 98 WLKGILSNLGVVHDKPMIIHCDSQAAI 124
W+ G++S LG+V I CDSQ+ I
Sbjct: 547 WITGLVSELGLVDRSTPCIQCDSQSVI 627
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.321 0.135 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,142,309
Number of Sequences: 36976
Number of extensions: 106039
Number of successful extensions: 500
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 495
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 498
length of query: 195
length of database: 9,014,727
effective HSP length: 91
effective length of query: 104
effective length of database: 5,649,911
effective search space: 587590744
effective search space used: 587590744
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (26.2 bits)
Medicago: description of AC126781.9