
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0118c.6
(278 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC83226 weakly similar to PIR|G86419|G86419 probable reverse tra... 63 1e-10
BG585499 55 3e-08
BF640104 45 4e-05
TC83479 similar to GP|23476992|emb|CAD48949. hypothetical protei... 43 1e-04
TC93101 42 2e-04
BG585866 41 4e-04
BG449067 39 0.002
BG586862 38 0.004
CB892607 similar to GP|8927657|gb| EST gb|N38213 comes from this... 37 0.006
TC89958 similar to GP|12483623|dbj|BAB21445. NADH dehydrogenase ... 36 0.014
TC88926 similar to GP|7290507|gb|AAF45960.1| peb gene product {D... 35 0.024
BF006335 similar to GP|14334538|gb| unknown protein {Arabidopsis... 31 0.46
BG452711 31 0.60
AJ496898 similar to GP|23171295|gb CG3731-PA {Drosophila melanog... 31 0.60
TC93136 31 0.60
TC82520 30 1.0
BF003590 29 1.7
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At... 29 1.7
TC92402 similar to GP|20198252|gb|AAM15483.1 subtilisin-like ser... 28 5.1
CA918228 weakly similar to GP|18478606|gb| repetitive proline ri... 28 5.1
>TC83226 weakly similar to PIR|G86419|G86419 probable reverse transcriptase
100033-105622 [imported] - Arabidopsis thaliana, partial
(2%)
Length = 885
Score = 63.2 bits (152), Expect = 1e-10
Identities = 54/216 (25%), Positives = 94/216 (43%), Gaps = 18/216 (8%)
Frame = +3
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIWLRLGALAWTNFGVGDHGSWIRSQARS------RNS 54
CPRC + E I H CP S+ +W G+ NF + ++I + +
Sbjct: 243 CPRCHSKTETITHLFMSCPLSKRVW--FGSNLCINFDNLPNPNFIN*LYEAIL*KDECIT 416
Query: 55 VRFLAGVWGVWKWRNNMVFEDSPWLVDEACRRICHEHDEFITFSNGV-----------AG 103
+ A ++ +W RN V ED L + +R + ++ +
Sbjct: 417 I*IAAIIYNLWHARNLSVLEDQTILEMDIIQRASNCISDYKQANTQAPPSMARTGYDPRS 596
Query: 104 DHGSWLSSRWQPPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQGIWLGGFYAHRSGGN-A 162
H +++W+ P G+VK+N D + + G G +IRD+ G+ + G + A
Sbjct: 597 QHRPAKNTKWKRPNLGLVKVNTDANLQNHGK-WGLGIIIRDEVGLVMAASTWETDGNDRA 773
Query: 163 LIAEATALLLGLELVWDLGYRQVMVEVDCGELLQVM 198
L AEA ALL G+ D G+ +V E D +L++++
Sbjct: 774 LEAEAYALLTGMRFAKDCGFXKVXFEGDNEKLMKMV 881
>BG585499
Length = 792
Score = 55.1 bits (131), Expect = 3e-08
Identities = 28/85 (32%), Positives = 43/85 (49%), Gaps = 10/85 (11%)
Frame = +3
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIWLRLGALAW-TN-FGVGDHGSWI-RSQARSRNSV-- 55
CP C +E +LH L DC + ++W+RL W TN F D W+ ++ ++ N V
Sbjct: 348 CPCCGNADETVLHVLCDCRPASQVWIRLVPSDWITNFFSFDDCRDWVFKNLSKRSNGVSK 527
Query: 56 -----RFLAGVWGVWKWRNNMVFED 75
F+ W +W WRN +FE+
Sbjct: 528 FKWQPTFMTTCWHMWTWRNKAIFEE 602
>BF640104
Length = 344
Score = 44.7 bits (104), Expect = 4e-05
Identities = 28/92 (30%), Positives = 47/92 (50%), Gaps = 1/92 (1%)
Frame = -1
Query: 112 RWQPPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQGIWLGGFYAHRSGGNA-LIAEATAL 170
+W P QGV+K+N D + +D G G + R+D GI + +R G + AEA +
Sbjct: 341 KWIKPHQGVIKINCDANLTSED-VWGIGVITRNDNGIVMASGTWNRPGFMCPITAEAWGV 165
Query: 171 LLGLELVWDLGYRQVMVEVDCGELLQVMADEE 202
D G++ V+ E D +L+ +++ EE
Sbjct: 164 YQAALFALDQGFQNVLFENDNEKLISMLSREE 69
>TC83479 similar to GP|23476992|emb|CAD48949. hypothetical protein {Plasmodium
falciparum 3D7}, partial (0%)
Length = 1222
Score = 43.1 bits (100), Expect = 1e-04
Identities = 27/78 (34%), Positives = 40/78 (50%)
Frame = +2
Query: 113 WQPPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQGIWLGGFYAHRSGGNALIAEATALLL 172
W+ P+ G KLN D S ++ G GGL+RD +G + F + G+ + E A+
Sbjct: 977 WKKPEIGWTKLNTDGSVNKETA--GFGGLLRDYRGEPICAFVSKAPQGDTFLVELWAIWR 1150
Query: 173 GLELVWDLGYRQVMVEVD 190
GL L LG + + VE D
Sbjct: 1151 GLVLSLGLGIKSIWVESD 1204
>TC93101
Length = 675
Score = 42.4 bits (98), Expect = 2e-04
Identities = 48/211 (22%), Positives = 77/211 (35%), Gaps = 28/211 (13%)
Frame = -1
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIWLRLG-ALAWTNFGVGDHGSWIRSQARSRNS---VR 56
CPRC E H C +Q +W ++ + + + W+ ++ ++
Sbjct: 612 CPRCYFNLETTNHIFMSCERTQRVWFGSQLSIRFPDNSTINFSDWLFDAISNQTEEIIIK 433
Query: 57 FLAGVWGVWKWRNNMVFED----SPWLVDEACRRICHEHDEFI-------------TFSN 99
A + +W RN +FE+ ++ A I I SN
Sbjct: 432 ISAITYSIWHARNKAIFENQFVSEDTIIQ*AQNSILAYEQATIKPQNPNIVLSSLSATSN 253
Query: 100 GVAGDHGSWLSSRWQPPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQG------IW-LGG 152
S + SRWQ P ++K N D + Q R G G +IR+ G W + G
Sbjct: 252 TNTTRRRSNVRSRWQKPLNNILKANCDANLQVQGR-WGLGCIIRNADGEAKVTATWCING 76
Query: 153 FYAHRSGGNALIAEATALLLGLELVWDLGYR 183
F A AE A+L + L D G++
Sbjct: 75 FDC------AATAETYAILAAMYLAKDYGFK 1
>BG585866
Length = 828
Score = 41.2 bits (95), Expect = 4e-04
Identities = 19/63 (30%), Positives = 26/63 (41%)
Frame = +3
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIWLRLGALAWTNFGVGDHGSWIRSQARSRNSVRFLAG 60
C RC EE HC+RDC S+ IW ++G + F W++ FL
Sbjct: 576 CSRCGEHEESFFHCVRDCRFSKIIWHKIGFSSPDFFSSSSVQDWLKDGISCHRPTTFLED 755
Query: 61 VWG 63
G
Sbjct: 756 CGG 764
>BG449067
Length = 578
Score = 38.9 bits (89), Expect = 0.002
Identities = 26/99 (26%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
Frame = -2
Query: 112 RWQPPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQGIWLGGFYAHRSG-GNALIAEATAL 170
+W+ P++ ++KLN D + D G G + R+D+G + R G +A AEA +
Sbjct: 310 KWKKPEKDIIKLNSDANLSSTD-LWGIGVVARNDEGFAMASGTWFRFGFPSATTAEAWGI 134
Query: 171 LLGLELVWDLGYRQVMVEVDCGELLQVMADEESCRFLCL 209
+ + G+ +V E D ++Q++ E L L
Sbjct: 133 YQAMIFAGEYGFSKVQFESDNERVIQMLNGTEEVNRLYL 17
>BG586862
Length = 804
Score = 38.1 bits (87), Expect = 0.004
Identities = 49/203 (24%), Positives = 75/203 (36%), Gaps = 7/203 (3%)
Frame = -1
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIWLRLGALAWTNF---GVGDHGSWIRSQARSRNS--- 54
CPRC + E + H +C +Q+ W G+ NF GV WI + +
Sbjct: 540 CPRCESKIETVQHLFLNCEVTQKEWF--GSQLGINFHSSGVLHFHDWITNFILKNDEETI 367
Query: 55 VRFLAGVWGVWKWRNNMVFEDSPWLVDEACRRICHEHDEFITFSNGVAGDHGSWLSSRWQ 114
+ A ++ +W RN VFE+ D +R F +A S L S
Sbjct: 366 IALTALLYSIWHARNQKVFENIDVPGDVVIQRASSSLHSF-----KMAQVSDSVLPSNAI 202
Query: 115 PPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQGIWL-GGFYAHRSGGNALIAEATALLLG 173
P S W G G + R+ +G+ + G + A AEA +
Sbjct: 201 P----------SYSLW------GIGVVARNCEGLAMASGTWLRHGIPCATTAEAWGIYQA 70
Query: 174 LELVWDLGYRQVMVEVDCGELLQ 196
+ D G+ + E D G LL+
Sbjct: 69 MVFAGDCGFSKFEFESDNGNLLR 1
>CB892607 similar to GP|8927657|gb| EST gb|N38213 comes from this gene.
{Arabidopsis thaliana}, partial (1%)
Length = 782
Score = 37.4 bits (85), Expect = 0.006
Identities = 26/74 (35%), Positives = 37/74 (49%), Gaps = 1/74 (1%)
Frame = +1
Query: 115 PPKQGVVKLNMDESFWEQDRCMGAGGLIRDDQGIWLGGFYAHRSGG-NALIAEATALLLG 173
PP V L D D GGLI+DDQG ++ YA++ G + L AE + G
Sbjct: 559 PPNSCEVALKCDYVVLNCDLNAACGGLIQDDQGHFV-FHYANKLGSCSVLQAELWGI*HG 735
Query: 174 LELVWDLGYRQVMV 187
L + W+ GY ++ V
Sbjct: 736 LSIDWNRGYSKIRV 777
>TC89958 similar to GP|12483623|dbj|BAB21445. NADH dehydrogenase subunit 4L
{Gonostoma gracile}, partial (17%)
Length = 1040
Score = 36.2 bits (82), Expect = 0.014
Identities = 14/33 (42%), Positives = 22/33 (66%)
Frame = +2
Query: 113 WQPPKQGVVKLNMDESFWEQDRCMGAGGLIRDD 145
W+PP G VK NMD + +++ C+GAG + D+
Sbjct: 551 WEPPS*GYVKCNMDAALFKEYNCVGAGFRV*DE 649
>TC88926 similar to GP|7290507|gb|AAF45960.1| peb gene product {Drosophila
melanogaster}, partial (1%)
Length = 1073
Score = 35.4 bits (80), Expect = 0.024
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 4/74 (5%)
Frame = +3
Query: 121 VKLNMDESFWEQDRC--MGAGGLIRDDQGIWLGGFYAHRSGGNALI--AEATALLLGLEL 176
V LN+D S + G GG++ D G WL GF A + N + E A+L GL
Sbjct: 342 VILNVDGSLLREREVPSAGCGGVLSDSSGKWLCGF-AQKLNPNLKVDETEKEAILRGLLW 518
Query: 177 VWDLGYRQVMVEVD 190
V + G R+++V+ D
Sbjct: 519 VKEKGKRKILVKSD 560
>BF006335 similar to GP|14334538|gb| unknown protein {Arabidopsis thaliana},
partial (3%)
Length = 505
Score = 31.2 bits (69), Expect = 0.46
Identities = 18/42 (42%), Positives = 22/42 (51%)
Frame = +2
Query: 137 GAGGLIRDDQGIWLGGFYAHRSGGNALIAEATALLLGLELVW 178
G GGL R+ + GFY N L AE LL+GL+L W
Sbjct: 209 GFGGLDRNYDKAFQFGFYGSIGWSNILHAEIQDLLVGLKLWW 334
>BG452711
Length = 672
Score = 30.8 bits (68), Expect = 0.60
Identities = 9/25 (36%), Positives = 16/25 (64%)
Frame = +3
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIW 25
CP C+ +D+LH L C ++++W
Sbjct: 408 CPICNQRSQDMLHALFSCTRAKDVW 482
>AJ496898 similar to GP|23171295|gb CG3731-PA {Drosophila melanogaster},
partial (35%)
Length = 698
Score = 30.8 bits (68), Expect = 0.60
Identities = 18/69 (26%), Positives = 28/69 (40%), Gaps = 22/69 (31%)
Frame = +2
Query: 42 GSWIRSQARSRNSVRFLA----------------------GVWGVWKWRNNMVFEDSPWL 79
G+W RSQ N+ LA G+WGV+ + M E+ W
Sbjct: 92 GAWDRSQGGGANNASGLAAIVAEDNCAHSFQSFNTCYKDTGLWGVYFVSDGMTIENMVWN 271
Query: 80 VDEACRRIC 88
+ EA +++C
Sbjct: 272 IQEAWKKMC 298
>TC93136
Length = 722
Score = 30.8 bits (68), Expect = 0.60
Identities = 22/79 (27%), Positives = 35/79 (43%), Gaps = 4/79 (5%)
Frame = +1
Query: 13 HCLRDCPHSQEIWLR-LGALAWTNFGVGDHGSWIRSQARSRNSVRFL---AGVWGVWKWR 68
H CP +W + LG L ++ F + + RSR +L A +W +WK
Sbjct: 274 HLFLHCPFLSSVWSKILGWLYYSKFVC----VFRLLERRSRTKGVWLIWHATIWVIWKGI 441
Query: 69 NNMVFEDSPWLVDEACRRI 87
NN +F++ +DE I
Sbjct: 442 NNRIFKNISKAIDEIVEEI 498
>TC82520
Length = 833
Score = 30.0 bits (66), Expect = 1.0
Identities = 11/22 (50%), Positives = 16/22 (72%)
Frame = +2
Query: 55 VRFLAGVWGVWKWRNNMVFEDS 76
+ + A VW +WK RNN VF++S
Sbjct: 497 IMWFASVWVLWKKRNNRVFQNS 562
>BF003590
Length = 531
Score = 29.3 bits (64), Expect = 1.7
Identities = 21/78 (26%), Positives = 33/78 (41%), Gaps = 8/78 (10%)
Frame = +1
Query: 57 FLAGVWGVWKWRNNMVFED--------SPWLVDEACRRICHEHDEFITFSNGVAGDHGSW 108
F W +W+ RN MVF++ S +++ C H T + V D
Sbjct: 313 FCITFWKIWQVRNQMVFQNQQLNPIQISISILEFT*ELNCSNHSAIATVT--VRSD---- 474
Query: 109 LSSRWQPPKQGVVKLNMD 126
+ W PP GV+ LN++
Sbjct: 475 -AEFWCPPPPGVMNLNLN 525
>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (10%)
Length = 767
Score = 29.3 bits (64), Expect = 1.7
Identities = 10/25 (40%), Positives = 14/25 (56%)
Frame = -3
Query: 1 CPRCSAPEEDILHCLRDCPHSQEIW 25
C RC E + H L CP+++ IW
Sbjct: 174 CSRCGMESETVNHILFQCPYARLIW 100
>TC92402 similar to GP|20198252|gb|AAM15483.1 subtilisin-like serine
protease AIR3 {Arabidopsis thaliana}, partial (16%)
Length = 704
Score = 27.7 bits (60), Expect = 5.1
Identities = 20/65 (30%), Positives = 27/65 (40%), Gaps = 2/65 (3%)
Frame = +1
Query: 90 EHDEFITFSNGVAGDHGSWLSSRWQPPKQG--VVKLNMDESFWEQDRCMGAGGLIRDDQG 147
EH T S G HG+ ++S WQ + G + N+D W + R G I
Sbjct: 64 EHKLHTTRSWEFLGLHGNDINSAWQKGRFGENTIIANIDTGVWPESRSFSDRG-IGPIPA 240
Query: 148 IWLGG 152
W GG
Sbjct: 241 KWRGG 255
>CA918228 weakly similar to GP|18478606|gb| repetitive proline rich protein
{Oryza sativa}, partial (13%)
Length = 303
Score = 27.7 bits (60), Expect = 5.1
Identities = 15/37 (40%), Positives = 17/37 (45%), Gaps = 2/37 (5%)
Frame = +1
Query: 42 GSWIRSQARSR--NSVRFLAGVWGVWKWRNNMVFEDS 76
GSW+RS R N RF W VW WR + S
Sbjct: 130 GSWLRSWLWMRCWNRFRF---AWWVWIWRRRWALQSS 231
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.333 0.143 0.509
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,062,738
Number of Sequences: 36976
Number of extensions: 201197
Number of successful extensions: 1132
Number of sequences better than 10.0: 45
Number of HSP's better than 10.0 without gapping: 1118
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1128
length of query: 278
length of database: 9,014,727
effective HSP length: 95
effective length of query: 183
effective length of database: 5,502,007
effective search space: 1006867281
effective search space used: 1006867281
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 58 (26.9 bits)
Lotus: description of TM0118c.6