
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0108.16
(709 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG585866 100 1e-21
BQ148771 92 7e-19
TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imp... 59 8e-09
TC83398 similar to PIR|T05150|T05150 hypothetical protein F18E5.... 55 1e-07
BQ122106 similar to GP|9366656|emb|C probable similar to ring-h2... 53 4e-07
BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-li... 51 1e-06
TC83479 similar to GP|23476992|emb|CAD48949. hypothetical protei... 50 4e-06
BG585499 48 6e-06
BG584442 48 1e-05
TC88926 similar to GP|7290507|gb|AAF45960.1| peb gene product {D... 48 1e-05
BG647708 weakly similar to GP|13786450|gb| putative reverse tran... 47 2e-05
TC86742 similar to PIR|T01541|T01541 hypothetical protein A_IG00... 43 4e-04
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At... 43 5e-04
BG586862 41 0.001
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot... 38 0.015
AW690000 37 0.026
BG452711 35 0.099
BE248682 similar to GP|18568269|gb putative gag-pol polyprotein ... 35 0.13
AW127516 34 0.22
BG449067 33 0.29
>BG585866
Length = 828
Score = 100 bits (249), Expect(2) = 1e-21
Identities = 57/209 (27%), Positives = 95/209 (45%), Gaps = 26/209 (12%)
Frame = +3
Query: 289 ILRARDRLRGGFKFRLGTGNSSIWYNDWSGHGNLCEHLPFVHITDTQLHLRDLVVNNAWD 348
I+R ++ L+ G+ +R G+GNSS WY +WS G L PFV I D L ++D+
Sbjct: 84 IIRDKNVLKSGYTWRAGSGNSSFWYTNWSSLGLLGTQAPFVDIHDLHLTVKDVFTTGGQH 263
Query: 349 FSKLFTSIPEEMKQWFLEVSPSVQNTSHDVWTWGESELGL------------DLEAQSSR 396
L+T +P ++ + + + D + W + G+ E +
Sbjct: 264 TQSLYTILPTDIAEVINNTHLNFNASIGDAYIWPHNSNGVYTAKSGYSWILSQTETVNYN 443
Query: 397 ES--------------EFFVWLCLHDALPVNSKRFHCHLANSEACSRCSYIREDGIHSLR 442
S +FF+WL H+A+P S H ++ NS CSRC E H +R
Sbjct: 444 NSSWSWIWRLKIPEKYKFFLWLACHNAVPTLSLLNHRNMVNSAICSRCGEHEESFFHCVR 623
Query: 443 DCTHSRELWSRMGAGRWNNFWTLNLSDWI 471
DC S+ +W ++G + F + ++ DW+
Sbjct: 624 DCRFSKIIWHKIGFSSPDFFSSSSVQDWL 710
Score = 21.2 bits (43), Expect(2) = 1e-21
Identities = 10/25 (40%), Positives = 13/25 (52%), Gaps = 2/25 (8%)
Frame = +2
Query: 486 GLWGIWKWRCNM--VMDTAPWTIDT 508
GLW IW+ R M M +T+ T
Sbjct: 752 GLWWIWRHRTLMCLTMKLVQFTVST 826
>BQ148771
Length = 680
Score = 92.0 bits (227), Expect = 7e-19
Identities = 49/153 (32%), Positives = 83/153 (54%), Gaps = 1/153 (0%)
Frame = -3
Query: 140 RGRFNFLLENINRKMAAWKTNLLNLAGRACLAKSVIAAMPTYTMQVFWLPRSIIHHIDRA 199
+ RF+ + ++ +A WK N L+LA R LAKSVI A+P Y M +P++ I I +
Sbjct: 612 KSRFSVYYQ-VHVMLANWKANHLSLARRVTLAKSVIEAVPLYPMMTTIIPKACIEEIQKL 436
Query: 200 MRSFIWSKGSGQRGWNLVNWSTVVRDKSHGGLGMKDMSDHNTALLGKAVWLLLKNSNKLW 259
R F+W R ++ V W T+ + K+ GLG++ + N A + K W + SN L
Sbjct: 435 QRKFVWGDTEVSRRYHAVGWETMSKPKTIYGLGLRRLDVMNKACIMKLGWSIYSGSNSLC 256
Query: 260 VQVMQHKYLRDHTILNA-PHRASSSAVWKGILR 291
+VM+ KY R ++ + + S++WK +++
Sbjct: 255 TEVMRGKYQRSESLEEIFLEKPTDSSLWKALVK 157
>TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imported] -
Arabidopsis thaliana, partial (6%)
Length = 951
Score = 58.5 bits (140), Expect = 8e-09
Identities = 41/141 (29%), Positives = 63/141 (44%)
Frame = +2
Query: 54 VSHLLFADDVLLFCQASTTQVQLVADTLRDFFASSGLKVNIDKSKAISSKGVHPSIRDDI 113
VSHL FA+D LL + ++ + L F A SGLKVN KS + + PS +
Sbjct: 542 VSHLQFANDTLLLETKNWANIRALRAALVIF*AMSGLKVNFHKS-GLVCVNIAPSWLSEA 718
Query: 114 RGIAPIPLVNDLGKYLGFPLSGGRVSRGRFNFLLENINRKMAAWKTNLLNLAGRACLAKS 173
+ + YLG P+ G + ++ I ++ W + L+ GR L KS
Sbjct: 719 ASVLSWKVGKVPFLYLGMPIEGNSRRLSFWEPIVNRIKARLTGWNSRFLSFGGRLVLLKS 898
Query: 174 VIAAMPTYTMQVFWLPRSIIH 194
V+ ++ Y LP S +H
Sbjct: 899 VLTSLSVYA-----LPSSKLH 946
>TC83398 similar to PIR|T05150|T05150 hypothetical protein F18E5.40 -
Arabidopsis thaliana, partial (6%)
Length = 766
Score = 54.7 bits (130), Expect = 1e-07
Identities = 37/104 (35%), Positives = 59/104 (56%), Gaps = 1/104 (0%)
Frame = +2
Query: 574 GSWVTGFMAHYDNGNAFI-AEMLALRDGLKIAWEQGCRRLICESDCLELVRNLASVDWVS 632
G + +GF H D+ N + AE+ A+ GL++A ++C SD L V NL + V
Sbjct: 89 GFFNSGFSGHIDHSNDILFAELHAILMGLQLAQTLNIVDVVCYSDSLHYV-NLINGPSVV 265
Query: 633 LHSHGHILSEIRTMLRWSWRIDIAWIDREGNRVADWLAKRGASI 676
H++ ++ +I+ ++R S + REGNR AD+LAK GAS+
Sbjct: 266 YHAYATLIQDIKDLIRLSKLHTL----REGNRCADFLAKLGASV 385
>BQ122106 similar to GP|9366656|emb|C probable similar to ring-h2 finger
protein rha1a. {Trypanosoma brucei}, partial (18%)
Length = 693
Score = 53.1 bits (126), Expect = 4e-07
Identities = 44/161 (27%), Positives = 76/161 (46%), Gaps = 1/161 (0%)
Frame = -2
Query: 538 VVWKPPPNQCMKMNVDGSFRSDEGLMGTGGALRDSSGSWVTGFMAHYDN-GNAFIAEMLA 596
V W N C +NVDGS + G G +R+ +G + +GF + N + +AE+ A
Sbjct: 575 VKWNCDNNSCYILNVDGSCLGNP*PTGFNGLIRNIAGLFNSGFPGNITNTSDILLAELHA 396
Query: 597 LRDGLKIAWEQGCRRLICESDCLELVRNLASVDWVSLHSHGHILSEIRTMLRWSWRIDIA 656
+ GL++ + G +C D L V +L + + H + ++ +I+ ++ S + +
Sbjct: 395 IFQGLRMISDMGISDFVCYFDSLHYV-SLINGPSMKFHVYATLIQDIKDLVITS-KASVF 222
Query: 657 WIDREGNRVADWLAKRGASIALSEVQELVEPPSELQILLLK 697
EGN AD+L GA A V + P + I L+K
Sbjct: 221 HTLCEGNYCADFLEMLGA--ASDSVLTIHVSPPDGMIQLIK 105
>BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-like protein
{Arabidopsis thaliana}, partial (18%)
Length = 789
Score = 51.2 bits (121), Expect = 1e-06
Identities = 26/80 (32%), Positives = 43/80 (53%)
Frame = -3
Query: 52 PPVSHLLFADDVLLFCQASTTQVQLVADTLRDFFASSGLKVNIDKSKAISSKGVHPSIRD 111
PP++HLLFADD + F +++ + ++ + + A+SG +N KS S +I D
Sbjct: 259 PPINHLLFADDTMFFGKSNASSCAILLSIMDKYRAASGRCIN*TKSAITFSSKTSQAIID 80
Query: 112 DIRGIAPIPLVNDLGKYLGF 131
++G I GKYLG+
Sbjct: 79 RVKGELKIAKEGGTGKYLGY 20
>TC83479 similar to GP|23476992|emb|CAD48949. hypothetical protein {Plasmodium
falciparum 3D7}, partial (0%)
Length = 1222
Score = 49.7 bits (117), Expect = 4e-06
Identities = 27/84 (32%), Positives = 43/84 (51%)
Frame = +2
Query: 540 WKPPPNQCMKMNVDGSFRSDEGLMGTGGALRDSSGSWVTGFMAHYDNGNAFIAEMLALRD 599
WK P K+N DGS + G GG LRD G + F++ G+ F+ E+ A+
Sbjct: 977 WKKPEIGWTKLNTDGSVNKETA--GFGGLLRDYRGEPICAFVSKAPQGDTFLVELWAIWR 1150
Query: 600 GLKIAWEQGCRRLICESDCLELVR 623
GL ++ G + + ESD + +V+
Sbjct: 1151 GLVLSLGLGIKSIWVESDSMSVVK 1222
>BG585499
Length = 792
Score = 48.1 bits (113), Expect(2) = 6e-06
Identities = 29/113 (25%), Positives = 48/113 (41%), Gaps = 10/113 (8%)
Frame = +3
Query: 398 SEFFVWLCLHDALPVNSKRFHCHLANSEACSRCSYIREDGIHSLRDCTHSRELWSRMGAG 457
++ F+WL H + N +R C C E +H L DC + ++W R+
Sbjct: 261 TQTFMWLVAHGCILTNYRRSRWGTRVLATCPCCGNADETVLHVLCDCRPASQVWIRLVPS 440
Query: 458 RW-NNFWTL-NLSDWI--TLHARSDHAVK------FLAGLWGIWKWRCNMVMD 500
W NF++ + DW+ L RS+ K F+ W +W WR + +
Sbjct: 441 DWITNFFSFDDCRDWVFKNLSKRSNGVSKFKWQPTFMTTCWHMWTWRNKAIFE 599
Score = 20.4 bits (41), Expect(2) = 6e-06
Identities = 7/17 (41%), Positives = 10/17 (58%)
Frame = +2
Query: 538 VVWKPPPNQCMKMNVDG 554
+ WK P + K+N DG
Sbjct: 719 IAWKRPLDGWAKLNCDG 769
>BG584442
Length = 775
Score = 48.1 bits (113), Expect = 1e-05
Identities = 25/93 (26%), Positives = 49/93 (51%), Gaps = 1/93 (1%)
Frame = +1
Query: 170 LAKSVIAAMPTYTMQVFWLPRSIIHHIDRAMRSFIWSK-GSGQRGWNLVNWSTVVRDKSH 228
+ K + ++ +Y M +F L S + I++ M +F W G ++G + ++ + K++
Sbjct: 433 MIKYALQSISSYVMSIFLLLNSQVDEIEKIMNTFSWVHVGENRKGMHWMS*EKLFVHKNY 612
Query: 229 GGLGMKDMSDHNTALLGKAVWLLLKNSNKLWVQ 261
GG+G D + N +LGK V L N L+++
Sbjct: 613 GGMGFTDFTTFNIPMLGKQV*SFLLNRTTLFLE 711
>TC88926 similar to GP|7290507|gb|AAF45960.1| peb gene product {Drosophila
melanogaster}, partial (1%)
Length = 1073
Score = 48.1 bits (113), Expect = 1e-05
Identities = 47/153 (30%), Positives = 72/153 (46%), Gaps = 8/153 (5%)
Frame = +3
Query: 550 MNVDGSFRSDEGL--MGTGGALRDSSGSWVTGFMAHYDNGNAFI--AEMLALRDGLKIAW 605
+NVDGS + + G GG L DSSG W+ GF A N N + E A+ GL
Sbjct: 348 LNVDGSLLREREVPSAGCGGVLSDSSGKWLCGF-AQKLNPNLKVDETEKEAILRGLLWVK 524
Query: 606 EQGCRRLICESDCLELVRNLASVDWVSLHSHGHILSEIRTMLRW-SWRIDIAWIDREGNR 664
E+G R+++ +SD +V ++ S+ ++ IR +L W + I N
Sbjct: 525 EKGKRKILVKSDNEGVVYSVN----CGGRSNDPLVCGIRDLLNSPHWEATLTCIHGRSNA 692
Query: 665 VADWLAKRGASIALSEVQELVEPP---SELQIL 694
VAD LA + S ++ + PP + LQI+
Sbjct: 693 VADRLAHKAHSFTSFDLCQFDYPPENCTSLQIM 791
>BG647708 weakly similar to GP|13786450|gb| putative reverse transcriptase
{Oryza sativa}, partial (9%)
Length = 708
Score = 47.4 bits (111), Expect = 2e-05
Identities = 22/54 (40%), Positives = 34/54 (62%)
Frame = +1
Query: 52 PPVSHLLFADDVLLFCQASTTQVQLVADTLRDFFASSGLKVNIDKSKAISSKGV 105
P ++HLLFADD LLF +A+ T+ + L + ++SG VN +KS+ S+ V
Sbjct: 139 PKITHLLFADDSLLFARANLTEAATIMQVLHSYQSASGQLVNFEKSEVSYSQNV 300
>TC86742 similar to PIR|T01541|T01541 hypothetical protein A_IG005I10.16 -
Arabidopsis thaliana, partial (19%)
Length = 2073
Score = 43.1 bits (100), Expect = 4e-04
Identities = 38/143 (26%), Positives = 58/143 (39%), Gaps = 10/143 (6%)
Frame = -1
Query: 569 LRDSSGSWVTGFMAHYDNGNAFIAEMLALRDGLKIAWEQGCRRLICESDCLELVRNLASV 628
+RDS ++ + + AE A ++ A E G + E+D L++V
Sbjct: 465 IRDSQFGFLGALSCNIGHATPLEAEFCACMIAIEKAMELGLNNICLETDSLKVVNAF--- 295
Query: 629 DWVSLHSHGHILSEIRTMLRWSWRIDIAW----------IDREGNRVADWLAKRGASIAL 678
H + I +R W I + I REGN VAD LA+ G ++L
Sbjct: 294 ---------HKIVGIPWQMRVRWHNCIRFCHSIACVCVHIPREGNLVADALARHGQGLSL 142
Query: 679 SEVQELVEPPSELQILLLKDSLG 701
+Q PPS +Q L +D G
Sbjct: 141 FFLQWWPAPPSFIQSFLAQDRYG 73
>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (10%)
Length = 767
Score = 42.7 bits (99), Expect = 5e-04
Identities = 47/233 (20%), Positives = 84/233 (35%), Gaps = 54/233 (23%)
Frame = -3
Query: 274 LNAPHRASSSAVWKGILRARDRLRGGFKFRLGTG-NSSIWYNDWSGHGNLCEHLPFVHIT 332
LNAP + +S W+ I A+ ++ G K +G G N++IW + + + P H +
Sbjct: 762 LNAPLGSWASYAWRSIHSAQHLIKQGAKVIIGNGENTNIWEREMAWKLTCVTNHPNKHSS 583
Query: 333 -----------------DTQLHLRDLVVNNA-------------------------WDFS 350
D R+ + N+ W++S
Sbjct: 582 RAY*APTLYGYEGCRSDDPMRRERNANLINSIFPEGTRRKILSIHPQGPIGEDSYSWEYS 403
Query: 351 K-----------LFTSIPEEMKQWFLEVSPSVQNTSHDVWTWGESELGLDLEAQSSRESE 399
K + T+I Q PS+ + VW + +S +
Sbjct: 402 KSGHYSVKSGYYVQTNIIAAANQRGTVDQPSLDDLYQRVWKYN-----------TSPKVR 256
Query: 400 FFVWLCLHDALPVNSKRFHCHLANSEACSRCSYIREDGIHSLRDCTHSRELWS 452
F+W C+ ++LP + H++ +CSRC E H L C ++R +W+
Sbjct: 255 HFLWRCISNSLPTAANMRSRHISKDGSCSRCGMESETVNHILFQCPYARLIWA 97
>BG586862
Length = 804
Score = 41.2 bits (95), Expect = 0.001
Identities = 30/105 (28%), Positives = 48/105 (45%), Gaps = 5/105 (4%)
Frame = -1
Query: 401 FVWLCLHDALPVNSKRFHCHLANSEACSRCSYIREDGIHSLRDCTHSRELW--SRMGAGR 458
F+W LH+ALPV + + S C RC E H +C +++ W S++G
Sbjct: 618 FLWRLLHNALPVKDELHKRGIRCSLLCPRCESKIETVQHLFLNCEVTQKEWFGSQLGI-N 442
Query: 459 WNNFWTLNLSDWIT---LHARSDHAVKFLAGLWGIWKWRCNMVMD 500
+++ L+ DWIT L + + A L+ IW R V +
Sbjct: 441 FHSSGVLHFHDWITNFILKNDEETIIALTALLYSIWHARNQKVFE 307
>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
dioxygenase1 {Pisum sativum}, partial (43%)
Length = 1865
Score = 37.7 bits (86), Expect = 0.015
Identities = 19/49 (38%), Positives = 30/49 (60%)
Frame = -3
Query: 219 WSTVVRDKSHGGLGMKDMSDHNTALLGKAVWLLLKNSNKLWVQVMQHKY 267
W + R K GGLG++D+ N +LL K W LL++ + LW +V++ Y
Sbjct: 975 WHCLPRCK--GGLGVRDIRLVNVSLLAKWWWRLLQDQSSLWKEVLEDIY 835
>AW690000
Length = 652
Score = 37.0 bits (84), Expect = 0.026
Identities = 21/60 (35%), Positives = 29/60 (48%)
Frame = +3
Query: 536 LSVVWKPPPNQCMKMNVDGSFRSDEGLMGTGGALRDSSGSWVTGFMAHYDNGNAFIAEML 595
+ V+W+PP +K N DGS RS GG R+ + F + NAF AE+L
Sbjct: 399 IEVIWRPPIPHWIKCNTDGSSRSHSS--ACGGIFRNHDTDLLLCFAENTGECNAFHAELL 572
>BG452711
Length = 672
Score = 35.0 bits (79), Expect = 0.099
Identities = 12/34 (35%), Positives = 22/34 (64%)
Frame = +3
Query: 419 CHLANSEACSRCSYIREDGIHSLRDCTHSRELWS 452
C+L +S C C+ +D +H+L CT ++++WS
Sbjct: 384 CNLISSNLCPICNQRSQDMLHALFSCTRAKDVWS 485
>BE248682 similar to GP|18568269|gb putative gag-pol polyprotein {Zea mays},
partial (1%)
Length = 441
Score = 34.7 bits (78), Expect = 0.13
Identities = 19/47 (40%), Positives = 25/47 (52%)
Frame = +3
Query: 54 VSHLLFADDVLLFCQASTTQVQLVADTLRDFFASSGLKVNIDKSKAI 100
VSHL +ADD L + + + L+ F +SGLKVN KS I
Sbjct: 159 VSHLQYADDTLCIGMPTVDNLWTLKALLQGFEMASGLKVNFHKSSLI 299
>AW127516
Length = 379
Score = 33.9 bits (76), Expect = 0.22
Identities = 19/42 (45%), Positives = 27/42 (64%)
Frame = +3
Query: 660 REGNRVADWLAKRGASIALSEVQELVEPPSELQILLLKDSLG 701
REGN AD++AK GAS + S+ + PP +L LL D++G
Sbjct: 252 REGNHCADYMAKLGAS-SNSDFSVHLTPPHDLLGLLRNDAIG 374
>BG449067
Length = 578
Score = 33.5 bits (75), Expect = 0.29
Identities = 27/104 (25%), Positives = 46/104 (43%), Gaps = 1/104 (0%)
Frame = -2
Query: 538 VVWKPPPNQCMKMNVDGSFRSDEGLMGTGGALRDSSG-SWVTGFMAHYDNGNAFIAEMLA 596
V WK P +K+N D + S + L G G R+ G + +G + +A AE
Sbjct: 313 VKWKKPEKDIIKLNSDANLSSTD-LWGIGVVARNDEGFAMASGTWFRFGFPSATTAEAWG 137
Query: 597 LRDGLKIAWEQGCRRLICESDCLELVRNLASVDWVSLHSHGHIL 640
+ + A E G ++ ESD +++ L + V+ G I+
Sbjct: 136 IYQAMIFAGEYGFSKVQFESDNERVIQMLNGTEEVNRLYLGSII 5
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.324 0.137 0.457
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 29,200,323
Number of Sequences: 36976
Number of extensions: 506343
Number of successful extensions: 2966
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 2913
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2964
length of query: 709
length of database: 9,014,727
effective HSP length: 103
effective length of query: 606
effective length of database: 5,206,199
effective search space: 3154956594
effective search space used: 3154956594
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 62 (28.5 bits)
Lotus: description of TM0108.16