
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0230.8
(334 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC86770 similar to GP|13872875|dbj|BAB43982. contains EST AU0965... 582 e-167
AW980815 homologue to GP|13872875|dbj contains EST AU096506(S140... 54 2e-11
TC81684 weakly similar to GP|21593080|gb|AAM65029.1 unknown {Ara... 31 0.76
TC89427 weakly similar to GP|4557063|gb|AAD22502.1| expressed pr... 30 1.00
TC87277 similar to PIR|T45588|T45588 arm repeat containing prote... 30 1.7
TC85237 similar to GP|4200165|emb|CAA76145.1 neutral invertase {... 29 2.2
AL387992 weakly similar to GP|507345|gb|AA TonB {Haemophilus inf... 29 2.2
BG585180 29 2.9
TC89890 weakly similar to GP|17065226|gb|AAL32767.1 Unknown prot... 28 3.8
TC80893 similar to GP|4557063|gb|AAD22502.1| expressed protein {... 28 4.9
BG644345 weakly similar to GP|19697333|gb putative protein poten... 28 4.9
TC78863 weakly similar to GP|18481710|gb|AAL73532.1 hypothetical... 28 4.9
BE326007 similar to GP|13676415|d hypothetical protein {Glycine ... 28 6.5
TC79190 similar to PIR|E96542|E96542 scarecrow-like protein [imp... 28 6.5
TC84617 similar to GP|1151236|gb|AAB68209.1| Lpg18p {Saccharomyc... 28 6.5
TC82232 similar to GP|10177316|dbj|BAB10642. kinesin-like protei... 27 8.4
TC77337 weakly similar to GP|4019275|gb|AAC95573.1| orf 48 {Atel... 27 8.4
BQ751419 weakly similar to GP|21629340|gb L509.2 {Leishmania maj... 27 8.4
BG454776 weakly similar to PIR|T49896|T4 glycine/proline-rich pr... 27 8.4
BQ751031 27 8.4
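The summary lines above can be read mechanically: the first token is the sequence identifier, the last two tokens are the bit score and the E value, and anything in between is the (possibly truncated or empty) description. The sketch below assumes that layout; `parse_summary_line` is a hypothetical helper, not part of BLAST. Note that this BLAST version prints very small E values without a leading mantissa (`e-167`), which has to be normalized before conversion.

```python
def parse_summary_line(line):
    """Split one hit-summary line into (identifier, description, bit_score, e_value).

    Assumes whitespace-separated tokens with the bit score and E value
    as the last two tokens, as in the listing above.
    """
    tokens = line.split()
    identifier = tokens[0]
    # Everything between the identifier and the two numbers is the description;
    # it is empty for hits such as BG585180 that carry no annotation.
    description = " ".join(tokens[1:-2])
    bit_score = float(tokens[-2])
    e_text = tokens[-1]
    # "e-167" is shorthand for "1e-167"; float() needs the mantissa.
    if e_text.startswith("e"):
        e_text = "1" + e_text
    return identifier, description, bit_score, float(e_text)


if __name__ == "__main__":
    examples = [
        "TC86770 similar to GP|13872875|dbj|BAB43982. contains EST AU0965... 582 e-167",
        "BG585180 29 2.9",
    ]
    for example in examples:
        print(parse_summary_line(example))
```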
>TC86770 similar to GP|13872875|dbj|BAB43982. contains EST
AU096506(S14064)~unknown protein {Oryza sativa (japonica
cultivar-group)}, partial (73%)
Length = 2061
Score = 582 bits (1499), Expect = e-167
Identities = 293/336 (87%), Positives = 311/336 (92%), Gaps = 2/336 (0%)
Frame = +2
Query: 1 DDQEGVRNLDDDNFIDDTGVEPGFYGNYNEPSSPGEAPQAEEGEEDDEINDLFKMGKKK- 59
DD EG RN+DDDNFIDDTGVEP YG Y+EP SPG+APQAEEGEEDDEI DLFKMGKKK
Sbjct: 590 DDNEGARNMDDDNFIDDTGVEPALYG-YDEPRSPGDAPQAEEGEEDDEIKDLFKMGKKKK 766
Query: 60 -NERSPAEIALLVENVVAELEVTAEEDAELNRQGKPAINKLKKLPLLTEVLSKKQLQLEF 118
NERSPAEIALLVENV+AELEVTAEEDAELNRQ KPA+NKLKKLPLL EVLSKKQLQLEF
Sbjct: 767 KNERSPAEIALLVENVMAELEVTAEEDAELNRQHKPAVNKLKKLPLLIEVLSKKQLQLEF 946
Query: 119 LDHGVLTLLKNWLEPLPDGSLPNINIRTAILKILNDFPIDLEQIDRREQLKRSGLGKVIM 178
LDHGVL LLK+WLEPLPDGSLPNINIRTAILKILND PIDLE DRREQLKRSGLGKVIM
Sbjct: 947 LDHGVLNLLKSWLEPLPDGSLPNINIRTAILKILNDLPIDLEHYDRREQLKRSGLGKVIM 1126
Query: 179 FLSKSDEEINVNRKLTKELVDKWSRPIFNKSTRFEDMRNIEDERAPFRRPSVKKPANKAP 238
FLS+SDEEINVNR+L K+LVDKWSRPIFNKSTRFEDMRN ED+R P+RRPSVKKPA KA
Sbjct: 1127 FLSRSDEEINVNRRLAKDLVDKWSRPIFNKSTRFEDMRNTEDDRVPYRRPSVKKPAAKAA 1306
Query: 239 GMQSRDSDLDLDLPQPRSGQSSSRQHASRPEATPMDFVIRPQSKVDPEEVRARAKQASHD 298
GMQSRD DLDLDL QPRSG+SSSRQHASRPEATP+DFVIRPQSK+DP+E+RARAKQA+ D
Sbjct: 1307 GMQSRDGDLDLDLSQPRSGESSSRQHASRPEATPLDFVIRPQSKIDPDEIRARAKQATQD 1486
Query: 299 QQRMKMNKKLQQLRAPKKRQLQATKLSVEGRGMIKY 334
Q RMKMNKKLQQLRAPKK+QLQATKLSVEGRGM KY
Sbjct: 1487 QHRMKMNKKLQQLRAPKKKQLQATKLSVEGRGMAKY 1594
>AW980815 homologue to GP|13872875|dbj contains EST AU096506(S14064)~unknown
protein {Oryza sativa (japonica cultivar-group)},
partial (9%)
Length = 175
Score = 53.9 bits (128), Expect(2) = 2e-11
Identities = 24/26 (92%), Positives = 26/26 (99%)
Frame = +2
Query: 259 SSSRQHASRPEATPMDFVIRPQSKVD 284
SSSRQHASRPEATP+DFVIRPQSK+D
Sbjct: 2 SSSRQHASRPEATPLDFVIRPQSKID 79
Score = 32.3 bits (72), Expect(2) = 2e-11
Identities = 14/18 (77%), Positives = 16/18 (88%)
Frame = +3
Query: 298 DQQRMKMNKKLQQLRAPK 315
DQ +KMNKKLQQLRAP+
Sbjct: 120 DQSCLKMNKKLQQLRAPR 173
>TC81684 weakly similar to GP|21593080|gb|AAM65029.1 unknown {Arabidopsis
thaliana}, partial (29%)
Length = 732
Score = 30.8 bits (68), Expect = 0.76
Identities = 18/59 (30%), Positives = 32/59 (53%), Gaps = 7/59 (11%)
Frame = +3
Query: 3 QEGVRNLDDDNFIDDTGV------EPGF-YGNYNEPSSPGEAPQAEEGEEDDEINDLFK 54
+E + LD+D +DD G+ +PG + + +EP P + P+ GE+ DE + F+
Sbjct: 273 KEPIPVLDEDPNVDDCGIRLFRHSKPGIVFDHADEPQPPMKRPKLVPGEDIDEKSKKFR 449
>TC89427 weakly similar to GP|4557063|gb|AAD22502.1| expressed protein
{Arabidopsis thaliana}, partial (50%)
Length = 753
Score = 30.4 bits (67), Expect = 1.00
Identities = 16/51 (31%), Positives = 27/51 (52%)
Frame = +1
Query: 1 DDQEGVRNLDDDNFIDDTGVEPGFYGNYNEPSSPGEAPQAEEGEEDDEIND 51
DD + V + DDDN D +G E + + P A +++ +EDD+ +D
Sbjct: 328 DDDDDVNDEDDDNDEDFSGDEDDEDADPEDDPVPNGAGGSDDDDEDDDDDD 480
>TC87277 similar to PIR|T45588|T45588 arm repeat containing protein homolog
- Arabidopsis thaliana, partial (48%)
Length = 1393
Score = 29.6 bits (65), Expect = 1.7
Identities = 33/142 (23%), Positives = 58/142 (40%), Gaps = 1/142 (0%)
Frame = +1
Query: 12 DNFIDDTGVEPGFYGNYNEPSSPGEAPQAEEGEEDDEINDLFKMGKKKNERSPA-EIALL 70
+ + + G+EP + +EPS A E + + + G +++RS A EI LL
Sbjct: 49 EQWCEANGIEPPKRPSTSEPSKSASACTPAERSKIESLIQKLTSGGPEDQRSAAGEIRLL 228
Query: 71 VENVVAELEVTAEEDAELNRQGKPAINKLKKLPLLTEVLSKKQLQLEFLDHGVLTLLKNW 130
A+ N + AI + +PLL +LS + + +H V LL
Sbjct: 229 ---------------AKRNADNRVAIAEAGAIPLLVGLLSVPDSRTQ--EHAVTALLNLS 357
Query: 131 LEPLPDGSLPNINIRTAILKIL 152
+ GS+ + I+ +L
Sbjct: 358 IYESNKGSIVSSGAVPGIVHVL 423
>TC85237 similar to GP|4200165|emb|CAA76145.1 neutral invertase {Daucus
carota}, partial (68%)
Length = 2255
Score = 29.3 bits (64), Expect = 2.2
Identities = 14/37 (37%), Positives = 22/37 (58%), Gaps = 5/37 (13%)
Frame = -3
Query: 130 WLEPLP-----DGSLPNINIRTAILKILNDFPIDLEQ 161
W +PLP +G LP + ++ KIL++FP +EQ
Sbjct: 687 WHQPLPWAVAINGLLPTLQLQRVK*KILDNFPFTIEQ 577
>AL387992 weakly similar to GP|507345|gb|AA TonB {Haemophilus influenzae},
partial (8%)
Length = 446
Score = 29.3 bits (64), Expect = 2.2
Identities = 23/58 (39%), Positives = 32/58 (54%), Gaps = 4/58 (6%)
Frame = +1
Query: 81 TAEEDAELNRQGKPAINKLKKLPLLT--EVL--SKKQLQLEFLDHGVLTLLKNWLEPL 134
T E +L + K +NK + PLL E+L KKQ++ E L+ G L LLK + PL
Sbjct: 151 TNPESQQLPQLKKHLLNKRRLQPLLRRPELLLHPKKQVKREQLEEGRLPLLKFIINPL 324
>BG585180
Length = 589
Score = 28.9 bits (63), Expect = 2.9
Identities = 17/57 (29%), Positives = 26/57 (44%)
Frame = +1
Query: 15 IDDTGVEPGFYGNYNEPSSPGEAPQAEEGEEDDEINDLFKMGKKKNERSPAEIALLV 71
I DT +P + YN EEGEE+ + +GK+ SP +IA ++
Sbjct: 82 ISDTLPKPDLFYKYNNNIIINNGLMLEEGEEEMSRDQSEALGKQLCHASPNDIAKMI 252
>TC89890 weakly similar to GP|17065226|gb|AAL32767.1 Unknown protein
{Arabidopsis thaliana}, partial (76%)
Length = 1013
Score = 28.5 bits (62), Expect = 3.8
Identities = 22/58 (37%), Positives = 27/58 (45%)
Frame = -1
Query: 96 INKLKKLPLLTEVLSKKQLQLEFLDHGVLTLLKNWLEPLPDGSLPNINIRTAILKILN 153
I K+ KLPLL L+ L ++ NW PLPD I I+ AILK N
Sbjct: 767 IFKINKLPLLP---------LDILKSSSK*VISNWPRPLPDDF---IGIKLAILKKSN 630
>TC80893 similar to GP|4557063|gb|AAD22502.1| expressed protein {Arabidopsis
thaliana}, partial (39%)
Length = 883
Score = 28.1 bits (61), Expect = 4.9
Identities = 17/57 (29%), Positives = 28/57 (48%), Gaps = 6/57 (10%)
Frame = +3
Query: 1 DDQEGVRNLDDDNFIDDTGVEPGFYGNYNEPSSPGEAPQA------EEGEEDDEIND 51
DD + V++ DDD +D + G E P + P+A ++GE+DD+ D
Sbjct: 342 DDDDDVQDEDDDGEEEDYSGDEG-----EEEGDPEDDPEANGAGGSDDGEDDDDDGD 497
>BG644345 weakly similar to GP|19697333|gb putative protein potential
transcriptional repressor Not4hp - Mus musculus, partial
(7%)
Length = 635
Score = 28.1 bits (61), Expect = 4.9
Identities = 25/97 (25%), Positives = 44/97 (44%), Gaps = 4/97 (4%)
Frame = +1
Query: 228 PSVKKPAN-KAPGMQSRDSDLDLDLPQPRSGQSSSRQHASRPEATPMDFVIRPQSKVDP- 285
P ++P+ + PG Q Q Q+ +Q AS P TP P S++ P
Sbjct: 136 PQAQQPSLFQTPGQQQASPFQTPGQQQSSPFQTPGQQQAS-PFQTPGQQQPSPFSQITPF 312
Query: 286 --EEVRARAKQASHDQQRMKMNKKLQQLRAPKKRQLQ 320
++ + + +Q QQ+ ++ QQL+ ++ QLQ
Sbjct: 313 SQQQQQLQFQQQQQQQQQQLQQQQQQQLQQQQQLQLQ 423
>TC78863 weakly similar to GP|18481710|gb|AAL73532.1 hypothetical protein
{Sorghum bicolor}, partial (13%)
Length = 1830
Score = 28.1 bits (61), Expect = 4.9
Identities = 13/45 (28%), Positives = 24/45 (52%)
Frame = +1
Query: 6 VRNLDDDNFIDDTGVEPGFYGNYNEPSSPGEAPQAEEGEEDDEIN 50
+ +DD +D GV +Y ++ +SP P++E E+D + N
Sbjct: 1006 ITGMDDPRSMDKVGVAQQWYDELDDLASPKADPESEVIEDDLDQN 1140
>BE326007 similar to GP|13676415|d hypothetical protein {Glycine max},
partial (8%)
Length = 544
Score = 27.7 bits (60), Expect = 6.5
Identities = 23/96 (23%), Positives = 43/96 (43%), Gaps = 7/96 (7%)
Frame = +1
Query: 10 DDDNF-IDDTGVEPGFYGNYNE------PSSPGEAPQAEEGEEDDEINDLFKMGKKKNER 62
DDD F I++ G + ++ E PSS + E+D E + + +
Sbjct: 58 DDDIFEIENGKARGGGFDSFKEGDADCSPSSSWKRVDNNTDEDDSEDSGSDRAESSSPDA 237
Query: 63 SPAEIALLVENVVAELEVTAEEDAELNRQGKPAINK 98
S A+I +++ + L+V A + A L+ G A ++
Sbjct: 238 SMADIMPMLDELHPLLDVDAPQPAHLSHDGSDAASE 345
>TC79190 similar to PIR|E96542|E96542 scarecrow-like protein [imported] -
Arabidopsis thaliana, partial (64%)
Length = 1669
Score = 27.7 bits (60), Expect = 6.5
Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 5/75 (6%)
Frame = +2
Query: 258 QSSSRQHASRPEATPMDFVIRPQSKVD-----PEEVRARAKQASHDQQRMKMNKKLQQLR 312
Q+S H+ P +P +RPQ ++ PEE + HD R KM++ +R
Sbjct: 122 QNSPSTHSFSPNNSPGS-TLRPQHSLEFVNGSPEEEDSYLIYHDHDDLRHKMSELESVMR 298
Query: 313 APKKRQLQATKLSVE 327
P L+ V+
Sbjct: 299 GPNVEMLEMYDTKVQ 343
>TC84617 similar to GP|1151236|gb|AAB68209.1| Lpg18p {Saccharomyces
cerevisiae}, partial (86%)
Length = 769
Score = 27.7 bits (60), Expect = 6.5
Identities = 25/98 (25%), Positives = 39/98 (39%)
Frame = +1
Query: 203 RPIFNKSTRFEDMRNIEDERAPFRRPSVKKPANKAPGMQSRDSDLDLDLPQPRSGQSSSR 262
R FN S +D+R R + KKP KAP +Q + L L + R R
Sbjct: 361 RKFFNLSKE-DDVRKFVIRREITPKNQNKKPYTKAPKIQRLVTPLTLQRKRHRISLKRRR 537
Query: 263 QHASRPEATPMDFVIRPQSKVDPEEVRARAKQASHDQQ 300
A++ D ++ ++K E K+ S Q+
Sbjct: 538 AKAAKAAKEEYDVLLAKRNKEQKERKADLKKRRSAVQK 651
>TC82232 similar to GP|10177316|dbj|BAB10642. kinesin-like protein
{Arabidopsis thaliana}, partial (21%)
Length = 1181
Score = 27.3 bits (59), Expect = 8.4
Identities = 18/73 (24%), Positives = 34/73 (45%)
Frame = +3
Query: 244 DSDLDLDLPQPRSGQSSSRQHASRPEATPMDFVIRPQSKVDPEEVRARAKQASHDQQRMK 303
DSD DL L +++S + P+D ++ +E+ + Q D++ +
Sbjct: 51 DSD-DLGLQSKTGLFGDGNEYSSDCDVKPVDITDVEPVEIHEKELEHSSAQQKLDRELKE 227
Query: 304 MNKKLQQLRAPKK 316
++KKL+Q A K
Sbjct: 228 LDKKLEQKEAEMK 266
>TC77337 weakly similar to GP|4019275|gb|AAC95573.1| orf 48 {Ateline
herpesvirus 3}, partial (8%)
Length = 986
Score = 27.3 bits (59), Expect = 8.4
Identities = 18/49 (36%), Positives = 25/49 (50%), Gaps = 1/49 (2%)
Frame = +3
Query: 1 DDQEGV-RNLDDDNFIDDTGVEPGFYGNYNEPSSPGEAPQAEEGEEDDE 48
DD++G ++ DDD DD E G G +E E EE E++DE
Sbjct: 603 DDEDGDDQDEDDDEDEDDDDEEEG--GEEDEEEGVDEEDNEEEEEDEDE 743
>BQ751419 weakly similar to GP|21629340|gb L509.2 {Leishmania major}, partial
(1%)
Length = 766
Score = 27.3 bits (59), Expect = 8.4
Identities = 28/95 (29%), Positives = 40/95 (41%), Gaps = 5/95 (5%)
Frame = +2
Query: 223 APFRRPSVKKPANKAPGMQSRDSDLDLDLPQPRSGQSSSRQHAS-RPEATPMDFVIRP-- 279
AP RR +PA + LDL L QPR R HA P T ++ R
Sbjct: 215 APRRRLPHPRPAPR----------LDLHLHQPRHRVRRQRHHAQHHPHPTVLERHRRRHP 364
Query: 280 -QSKVDPEEVRARAKQASH-DQQRMKMNKKLQQLR 312
+ + P + RAR + H + R ++ + QLR
Sbjct: 365 LRREAQPHDPRARGPEGQHPGRHRGRLRRGPVQLR 469
>BG454776 weakly similar to PIR|T49896|T4 glycine/proline-rich protein -
Arabidopsis thaliana, partial (7%)
Length = 680
Score = 27.3 bits (59), Expect = 8.4
Identities = 17/82 (20%), Positives = 29/82 (34%)
Frame = +1
Query: 191 RKLTKELVDKWSRPIFNKSTRFEDMRNIEDERAPFRRPSVKKPANKAPGMQSRDSDLDLD 250
R +T+ RP +TR + P RP+ ++ P +R +
Sbjct: 361 RPMTRRTTRPTPRPTTRPTTRHTTRLTTKPTPRPTTRPTTRRTTRPTPRPTTRPTTRRTT 540
Query: 251 LPQPRSGQSSSRQHASRPEATP 272
P PR + + +RP P
Sbjct: 541 RPTPRPTTRPTTRLMTRPTTRP 606
>BQ751031
Length = 278
Score = 27.3 bits (59), Expect = 8.4
Identities = 18/50 (36%), Positives = 20/50 (40%), Gaps = 3/50 (6%)
Frame = +2
Query: 227 RPSVKKPANKAPGMQSRD---SDLDLDLPQPRSGQSSSRQHASRPEATPM 273
RP P P M S S D P+P SS RQ ASR P+
Sbjct: 11 RPCSAPPQRPVPSMASTTAPRSPTDALKPRPTLSPSSRRQRASRAPMAPL 160
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.312 0.133 0.367
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,568,061
Number of Sequences: 36976
Number of extensions: 89330
Number of successful extensions: 467
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 454
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 462
length of query: 334
length of database: 9,014,727
effective HSP length: 97
effective length of query: 237
effective length of database: 5,428,055
effective search space: 1286449035
effective search space used: 1286449035
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 58 (26.9 bits)
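The length statistics and bit scores reported above are consistent with one another, and the arithmetic can be checked by hand. The sketch below assumes the usual conventions: a TBLASTN database length is counted in codons (nucleotides divided by three), the effective HSP length is subtracted once from the query and once per database sequence, and a raw score S is converted to bits with the standard Karlin-Altschul formula (lambda * S - ln K) / ln 2 using the gapped parameters. The variable and function names are illustrative only. Because the report prints lambda and K rounded, recomputed values may differ slightly in the last digit.

```python
import math

# Values copied from the report above.
TOTAL_NUCLEOTIDES = 27_044_181
NUM_SEQUENCES = 36_976
QUERY_LENGTH = 334
EFFECTIVE_HSP_LENGTH = 97
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

# The nucleotide database is translated, so its length is counted in codons.
database_length = TOTAL_NUCLEOTIDES // 3

# The effective HSP length is removed from the query and from every sequence.
effective_query_length = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH
effective_database_length = database_length - NUM_SEQUENCES * EFFECTIVE_HSP_LENGTH
effective_search_space = effective_query_length * effective_database_length


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    print("length of database:", database_length)
    print("effective length of query:", effective_query_length)
    print("effective length of database:", effective_database_length)
    print("effective search space:", effective_search_space)
    # Raw scores taken from the alignments for TC86770, AW980815 and TC81684.
    for raw in (1499, 128, 72, 68):
        print(raw, "->", round(bit_score(raw), 1), "bits")
```

Run against the figures in this report, the four derived lengths reproduce the printed values (9,014,727; 237; 5,428,055; 1,286,449,035), and the raw scores 1499, 128, 72 and 68 convert to the reported 582, 53.9, 32.3 and 30.8 bits.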
Lotus: description of TM0230.8