Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0019a.4
         (310 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BF633248 similar to SP|P45598|ARAE_ Arabinose-proton symporter (...   105  2e-23
TC80890 similar to PIR|T04270|T04270 hypothetical protein F20B18...    84  5e-17
BG583956 homologue to GP|10177159|dbj gb|AAF63824.1~gene_id:K21P...    80  8e-16
BF005228 similar to GP|17473547|gb| unknown protein {Arabidopsis...    74  5e-14
BE240450 similar to GP|21555178|gb| transcription factor-like pr...    72  2e-13
TC88635 similar to GP|12711287|emb|CAC28528. GATA-1 zinc finger ...    62  4e-10
TC80004 similar to GP|8778844|gb|AAF79843.1| T6D22.9 {Arabidopsi...    58  4e-09
BE124805 similar to GP|10177426|db GATA-binding transcription fa...    58  5e-09
AW256569 similar to GP|10177426|dbj GATA-binding transcription f...    57  7e-09
TC78378 similar to GP|10177426|dbj|BAB10711. GATA-binding transc...    54  1e-07
CA921267 homologue to GP|10177426|dbj GATA-binding transcription...    53  2e-07
TC89696 similar to GP|10177426|dbj|BAB10711. GATA-binding transc...    52  4e-07
TC85373 similar to GP|15028099|gb|AAK76580.1 putative flowering ...    48  4e-06
TC88671 similar to PIR|JC7336|JC7336 zinc-finger protein - Arabi...    47  1e-05
TC81215 similar to GP|17064972|gb|AAL32640.1 Unknown protein {Ar...    44  8e-05
BG452497 similar to GP|15028099|gb| putative flowering protein C...    43  2e-04
TC85333 similar to GP|15028099|gb|AAK76580.1 putative flowering ...    43  2e-04
TC91918 homologue to GP|9369375|gb|AAF87124.1| F10A5.29 {Arabido...    35  0.028
TC83506 similar to PIR|T08179|T08179 LRG5 protein - Chlamydomona...    34  0.082
TC81171 similar to GP|9795609|gb|AAF98427.1| Unknown protein {Ar...    32  0.41

>BF633248 similar to SP|P45598|ARAE_ Arabinose-proton symporter (Arabinose
           transporter). {Klebsiella oxytoca}, partial (5%)
          Length = 514

 Score =  105 bits (263), Expect = 2e-23
 Identities = 73/141 (51%), Positives = 88/141 (61%), Gaps = 8/141 (5%)
 Frame = +1

Query: 1   MTPVSLNPPGPS--IQGQNHLFN-SLNNQDYHASLFNILDRRQGIGIGELREN----DHQ 53
           MTPVSLNPPGP+  +QGQN  FN S  NQD   + FN+L        G+  EN     HQ
Sbjct: 94  MTPVSLNPPGPNSLLQGQNQFFNISPVNQDT-PTFFNLL--------GDFGENYDHHHHQ 246

Query: 54  DDKLVVWHDGSSSSSSSNHLYNSTFISPQPVMDNPSSSTCDPNLSFS-KMEEEDIKNVHG 112
           D KL   HDGSSSS+    LYNS   S + VM + SSS  D NLS S ++E+   KN HG
Sbjct: 247 DHKLAFHHDGSSSSNHQQQLYNS---SSESVMVD-SSSARDTNLSSSLELEDSSKKNSHG 414

Query: 113 SAKWMSSKMRLMKKMMSTPAT 133
           S KW+SSKMR+M KM++T AT
Sbjct: 415 SEKWISSKMRVMNKMINTTAT 477


>TC80890 similar to PIR|T04270|T04270 hypothetical protein F20B18.260 -
           Arabidopsis thaliana, partial (14%)
          Length = 710

 Score = 84.3 bits (207), Expect = 5e-17
 Identities = 48/111 (43%), Positives = 60/111 (53%), Gaps = 4/111 (3%)
 Frame = +1

Query: 184 PLWRSGPNGPKSLCNACGIRQRKARRAMAEAANG----LATPINTASTKTRVHHNKEKKP 239
           PLWRSGP GPKSLCNACGIRQRKARRA+A AANG    +A        K ++   + K  
Sbjct: 7   PLWRSGPTGPKSLCNACGIRQRKARRALAAAANGETLVVAEKPYVKGKKLQIKRKRSKTD 186

Query: 240 RANHFAQFKNKSKSTSSNAGSSQETKKLECFKDFAISLRNNSTFQQVFPRD 290
           +     + K KS++  +N            F+D   S  NN    QVFP+D
Sbjct: 187 QCAQLLKRKGKSENKCNN------------FEDLITSWSNNLASHQVFPQD 303


>BG583956 homologue to GP|10177159|dbj gb|AAF63824.1~gene_id:K21P3.18~similar
           to unknown protein {Arabidopsis thaliana}, partial (43%)
          Length = 818

 Score = 80.5 bits (197), Expect = 8e-16
 Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 10/167 (5%)
 Frame = +2

Query: 153 HENRYSQRSPRNNN-----NSSNTTRVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQRKA 207
           H ++ S+    N N     ++SN  + C+DC TS TPLWR GP GPKSLCNACGIR RK 
Sbjct: 188 HSDKVSEGEDSNPNAAVSSDNSNPKKTCADCGTSKTPLWRGGPAGPKSLCNACGIRSRKK 367

Query: 208 RRAMAEAANGLATPINTASTKTRVHHNKEKKPRANHFAQFKNKSKSTSSNAGSSQETKKL 267
           +RA+   + G                N E+  R       K K  ++    G S+    L
Sbjct: 368 KRAILGISKG----------------NNEEGTR-------KGKKSNSGGGGGGSKVGDNL 478

Query: 268 ECFKDFAISL-----RNNSTFQQVFPRDEVAEAALLLMDLSCGYVHS 309
              K   ++L      N S ++++    E  +AA+LLM LS G V++
Sbjct: 479 N-MKQRLLNLGKEVFMNRSHWKKL---GEDEQAAVLLMSLSYGSVYA 607


>BF005228 similar to GP|17473547|gb| unknown protein {Arabidopsis thaliana},
           partial (5%)
          Length = 742

 Score = 74.3 bits (181), Expect = 5e-14
 Identities = 50/85 (58%), Positives = 55/85 (63%), Gaps = 8/85 (9%)
 Frame = +2

Query: 194 KSLCNACGIRQRKARRAMAEAANGLATPINTASTKTRVHHNKEKKPRANHFAQFKNKSK- 252
           +SLCNACGIRQRKARRAMAEAANGLAT     S KT+V   K KKP      QFK K+K 
Sbjct: 503 QSLCNACGIRQRKARRAMAEAANGLAT-----SPKTKV--LKIKKP-----TQFKTKNKA 646

Query: 253 -------STSSNAGSSQETKKLECF 270
                  ST+S   SSQ+ KKLE F
Sbjct: 647 STSTSSTSTTSAGSSSQDVKKLESF 721


>BE240450 similar to GP|21555178|gb| transcription factor-like protein
           {Arabidopsis thaliana}, partial (18%)
          Length = 529

 Score = 72.4 bits (176), Expect = 2e-13
 Identities = 34/58 (58%), Positives = 42/58 (71%), Gaps = 1/58 (1%)
 Frame = +1

Query: 164 NNNNSSNTTRVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQRK-ARRAMAEAANGLAT 220
           N+NN S   R C+ C+++STPLWR+GP GP+SLCNACGIR +K  RRA A  A   AT
Sbjct: 1   NSNNDSLLARRCASCDSTSTPLWRNGPRGPESLCNACGIRYKKEERRANAAVATTAAT 174


>TC88635 similar to GP|12711287|emb|CAC28528. GATA-1 zinc finger protein
           {Nicotiana tabacum}, partial (21%)
          Length = 1327

 Score = 61.6 bits (148), Expect = 4e-10
 Identities = 33/81 (40%), Positives = 43/81 (52%)
 Frame = +1

Query: 158 SQRSPRNNNNSSNTTRVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQRKARRAMAEAANG 217
           S  + R++ + S   R C+ C  + TP WR GP GPK+LCNACG+R R  R  +      
Sbjct: 775 SIETKRSSLHESIAPRKCTHCEVTETPQWREGPKGPKTLCNACGVRYRSGR--LFPEYRP 948

Query: 218 LATPINTASTKTRVHHNKEKK 238
            A+P   AS    VH N  KK
Sbjct: 949 AASPTFEAS----VHSNSHKK 999


>TC80004 similar to GP|8778844|gb|AAF79843.1| T6D22.9 {Arabidopsis
           thaliana}, partial (8%)
          Length = 1058

 Score = 58.2 bits (139), Expect = 4e-09
 Identities = 30/75 (40%), Positives = 42/75 (56%)
 Frame = +3

Query: 164 NNNNSSNTTRVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQRKARRAMAEAANGLATPIN 223
           NN  +   TR C+ C +  TP WR+GP GPK+LCNACG+R  K+ R + E       P  
Sbjct: 636 NNGQNPIPTRRCTHCLSQRTPQWRAGPLGPKTLCNACGVRY-KSGRLLPE-----YRPAK 797

Query: 224 TASTKTRVHHNKEKK 238
           + +  + +H N  KK
Sbjct: 798 SPTFVSFLHSNSHKK 842


>BE124805 similar to GP|10177426|db GATA-binding transcription factor-like
           protein {Arabidopsis thaliana}, partial (36%)
          Length = 715

 Score = 57.8 bits (138), Expect = 5e-09
 Identities = 26/81 (32%), Positives = 41/81 (50%)
 Frame = +2

Query: 128 MSTPATDKANNSTTIPISPRIQNQGHENRYSQRSPRNNNNSSNTTRVCSDCNTSSTPLWR 187
           +S   +   ++S T+  S   +      + ++R    +  +    R CS C    TP WR
Sbjct: 287 ISLANSSSTSSSATLSSSNLEECSKPAEKKAKRMVSPDGEARGVPRRCSHCGVQKTPQWR 466

Query: 188 SGPNGPKSLCNACGIRQRKAR 208
           +GP GPK+LCNACG+R +  R
Sbjct: 467 TGPGGPKTLCNACGVRYKSGR 529


>AW256569 similar to GP|10177426|dbj GATA-binding transcription factor-like
           protein {Arabidopsis thaliana}, partial (20%)
          Length = 613

 Score = 57.4 bits (137), Expect = 7e-09
 Identities = 22/50 (44%), Positives = 31/50 (62%)
 Frame = -3

Query: 159 QRSPRNNNNSSNTTRVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQRKAR 208
           ++ P      ++  R CS C+   TP WR+GP GPK+LCNACG+R +  R
Sbjct: 551 RKKPEAQTGGAHFQRRCSHCHVQKTPQWRAGPLGPKTLCNACGVRFKSGR 402


>TC78378 similar to GP|10177426|dbj|BAB10711. GATA-binding transcription
           factor-like protein {Arabidopsis thaliana}, partial
           (23%)
          Length = 1284

 Score = 53.5 bits (127), Expect = 1e-07
 Identities = 31/93 (33%), Positives = 42/93 (44%)
 Frame = +3

Query: 146 PRIQNQGHENRYSQRSPRNNNNSSNTTRVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQR 205
           P  Q  GHE +                R CS C    TP WR+GP G K+LCNACG+R  
Sbjct: 762 PEAQVVGHEAQ----------EEGQLQRRCSHCQVQKTPQWRTGPMGAKTLCNACGVRY- 908

Query: 206 KARRAMAEAANGLATPINTASTKTRVHHNKEKK 238
           K+ R  +E       P  + +  + +H N  +K
Sbjct: 909 KSGRLFSE-----YRPACSPTFSSEIHSNSHRK 992


>CA921267 homologue to GP|10177426|dbj GATA-binding transcription factor-like
           protein {Arabidopsis thaliana}, partial (20%)
          Length = 694

 Score = 52.8 bits (125), Expect = 2e-07
 Identities = 21/36 (58%), Positives = 25/36 (69%)
 Frame = -1

Query: 173 RVCSDCNTSSTPLWRSGPNGPKSLCNACGIRQRKAR 208
           R CS C  + TP WRSGP G K+LCNACG+R +  R
Sbjct: 538 RRCSHCGVTKTPQWRSGPLGAKTLCNACGVRFKSGR 431


>TC89696 similar to GP|10177426|dbj|BAB10711. GATA-binding transcription
           factor-like protein {Arabidopsis thaliana}, partial
           (25%)
          Length = 806

 Score = 51.6 bits (122), Expect = 4e-07
 Identities = 19/29 (65%), Positives = 21/29 (71%)
 Frame = +2

Query: 173 RVCSDCNTSSTPLWRSGPNGPKSLCNACG 201
           R C  C    TP WR+GPNGPK+LCNACG
Sbjct: 572 RKCHHCGVDDTPQWRAGPNGPKTLCNACG 658


>TC85373 similar to GP|15028099|gb|AAK76580.1 putative flowering protein
           CONSTANS {Arabidopsis thaliana}, partial (22%)
          Length = 683

 Score = 48.1 bits (113), Expect = 4e-06
 Identities = 20/41 (48%), Positives = 26/41 (62%), Gaps = 2/41 (4%)
 Frame = +1

Query: 164 NNNNSSNTTRVCSDCNTSS--TPLWRSGPNGPKSLCNACGI 202
           +N+ S     VC  CN S   TP+ R GP GP++LCNACG+
Sbjct: 217 DNDGSQQQDNVCRQCNISEKCTPMMRRGPEGPRTLCNACGL 339


>TC88671 similar to PIR|JC7336|JC7336 zinc-finger protein - Arabidopsis
           thaliana, partial (44%)
          Length = 1272

 Score = 46.6 bits (109), Expect = 1e-05
 Identities = 20/41 (48%), Positives = 30/41 (72%), Gaps = 2/41 (4%)
 Frame = +3

Query: 164 NNNNSSNTTRVCSDCNTSS--TPLWRSGPNGPKSLCNACGI 202
           ++ ++S +   C+ C TSS  TP+ R GP+GP+SLCNACG+
Sbjct: 693 SSQDASPSEISCTHCGTSSKSTPMMRRGPSGPRSLCNACGL 815


>TC81215 similar to GP|17064972|gb|AAL32640.1 Unknown protein {Arabidopsis
           thaliana}, partial (14%)
          Length = 938

 Score = 43.9 bits (102), Expect = 8e-05
 Identities = 18/31 (58%), Positives = 20/31 (64%)
 Frame = +3

Query: 175 CSDCNTSSTPLWRSGPNGPKSLCNACGIRQR 205
           C  C  +STPLWR+GP     LCNACG R R
Sbjct: 447 CFHCGVTSTPLWRNGPPEKPILCNACGSRWR 539


>BG452497 similar to GP|15028099|gb| putative flowering protein CONSTANS
           {Arabidopsis thaliana}, partial (12%)
          Length = 648

 Score = 42.7 bits (99), Expect = 2e-04
 Identities = 17/30 (56%), Positives = 21/30 (69%), Gaps = 2/30 (6%)
 Frame = +1

Query: 175 CSDCNTSS--TPLWRSGPNGPKSLCNACGI 202
           C  CN S   TP+ R GP GP++LCNACG+
Sbjct: 196 CRQCNISEKCTPMMRRGPEGPRTLCNACGL 285


>TC85333 similar to GP|15028099|gb|AAK76580.1 putative flowering protein
           CONSTANS {Arabidopsis thaliana}, partial (22%)
          Length = 1421

 Score = 42.7 bits (99), Expect = 2e-04
 Identities = 17/30 (56%), Positives = 21/30 (69%), Gaps = 2/30 (6%)
 Frame = +1

Query: 175 CSDCNTSS--TPLWRSGPNGPKSLCNACGI 202
           C  CN S   TP+ R GP GP++LCNACG+
Sbjct: 907 CRQCNISEKCTPMMRRGPEGPRTLCNACGL 996


>TC91918 homologue to GP|9369375|gb|AAF87124.1| F10A5.29 {Arabidopsis
           thaliana}, partial (38%)
          Length = 1171

 Score = 35.4 bits (80), Expect = 0.028
 Identities = 22/85 (25%), Positives = 41/85 (47%)
 Frame = +2

Query: 98  SFSKMEEEDIKNVHGSAKWMSSKMRLMKKMMSTPATDKANNSTTIPISPRIQNQGHENRY 157
           +F    E  +     + K   S  +L    +STP  +    S+  P  P++Q Q  ++ +
Sbjct: 209 TFETSHEAALAYDAAARKLYGSDAKLNLPELSTPPQN--TTSSPSPTPPQMQQQ-QQHPH 379

Query: 158 SQRSPRNNNNSSNTTRVCSDCNTSS 182
            Q  P NNNN +N+  +C++ N ++
Sbjct: 380 IQIQPNNNNNINNSFNICNNINMNN 454


>TC83506 similar to PIR|T08179|T08179 LRG5 protein - Chlamydomonas
           reinhardtii, partial (2%)
          Length = 539

 Score = 33.9 bits (76), Expect = 0.082
 Identities = 21/93 (22%), Positives = 39/93 (41%), Gaps = 5/93 (5%)
 Frame = +3

Query: 89  SSSTCDPNLSF-----SKMEEEDIKNVHGSAKWMSSKMRLMKKMMSTPATDKANNSTTIP 143
           S S C P+L        + ++E  KN H +  W  S+    +    T +  +   S+  P
Sbjct: 219 SPSICSPDLDIL*RPDKETKKEKKKNHHKTRTWPMSRSGRKQPPQQTRSPRRCRRSSARP 398

Query: 144 ISPRIQNQGHENRYSQRSPRNNNNSSNTTRVCS 176
            +PR          +  +PR ++++ + TR  S
Sbjct: 399 AAPRAATSTRRGAGTSSTPRTSSSTPSRTRPTS 497


>TC81171 similar to GP|9795609|gb|AAF98427.1| Unknown protein {Arabidopsis
           thaliana}, partial (40%)
          Length = 932

 Score = 31.6 bits (70), Expect = 0.41
 Identities = 28/117 (23%), Positives = 52/117 (43%), Gaps = 10/117 (8%)
 Frame = +3

Query: 134 DKANNSTTIPISPRIQNQGHENRYSQ-RSPRNNNNSSNTTRVCSDCNT---------SST 183
           ++ ++ T++P SP+ Q+ GH + ++Q  SPR  +  S  T+  S+ +          S  
Sbjct: 42  EQKSSLTSLP-SPKTQSNGHNHSHNQIPSPRPISLPSPKTQTQSNGHNHNHNHNQIPSPR 218

Query: 184 PLWRSGPNGPKSLCNACGIRQRKARRAMAEAANGLATPINTASTKTRVHHNKEKKPR 240
           P+ RS P  P             + + + +   G +     AST T+ +HN    P+
Sbjct: 219 PITRSEPGNPYPTTFVQA--DTTSFKQVVQMLTGSSETAKQASTSTKANHNHNIPPK 383


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.311    0.125    0.363 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,956,653
Number of Sequences: 36976
Number of extensions: 174741
Number of successful extensions: 1145
Number of sequences better than 10.0: 63
Number of HSP's better than 10.0 without gapping: 1103
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1125
length of query: 310
length of database: 9,014,727
effective HSP length: 96
effective length of query: 214
effective length of database: 5,465,031
effective search space: 1169516634
effective search space used: 1169516634
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.9 bits)


Lotus: description of TM0019a.4