Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0219.2
         (365 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC77681 similar to GP|22597156|gb|AAN03465.1 nucleolar histone d...    35  0.060
BQ151362 weakly similar to GP|13561980|gb| flagelliform silk pro...    34  0.078
TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-inf...    33  0.17
TC77763 similar to GP|15215674|gb|AAK91382.1 AT4g27500/F27G19_10...    32  0.30
TC80893 similar to GP|4557063|gb|AAD22502.1| expressed protein {...    32  0.51
TC89427 weakly similar to GP|4557063|gb|AAD22502.1| expressed pr...    32  0.51
TC90297 similar to GP|3204101|emb|CAA07227.1 hypothetical protei...    31  0.66
TC77337 weakly similar to GP|4019275|gb|AAC95573.1| orf 48 {Atel...    31  0.66
BF641220 weakly similar to PIR|G86203|G86 probable N-arginine di...    31  0.66
CB066689 homologue to GP|13646986|dbj DNA-binding protein DF1 {P...    31  0.86
BI311490 weakly similar to GP|21751020|dbj unnamed protein produ...    31  0.86
TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical prot...    30  1.1
TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical prot...    30  1.1
BG648593 weakly similar to GP|9294451|dbj A37 protein; ethylene-...    30  1.1
TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome...    30  1.5
CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect...    30  1.5
TC89984 similar to GP|16604601|gb|AAL24093.1 unknown protein {Ar...    30  1.5
TC77395 similar to GP|4874305|gb|AAD31367.1| expressed protein {...    30  1.9
TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein...    30  1.9
TC87387 homologue to PIR|T47775|T47775 hypothetical protein F24I...    30  1.9

>TC77681 similar to GP|22597156|gb|AAN03465.1 nucleolar histone deacetylase
           HD2-P39 {Glycine max}, partial (47%)
          Length = 1233

 Score = 34.7 bits (78), Expect = 0.060
 Identities = 20/53 (37%), Positives = 26/53 (48%)
 Frame = +2

Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNEN 363
           DDD S D +   DDE E   D +  +E   +ED E+E P     QG    NE+
Sbjct: 584 DDDESDDEIGSSDDEME-NADSDSEDEDDSDEDDEEETPVKKVDQGKKRPNES 739


>BQ151362 weakly similar to GP|13561980|gb| flagelliform silk protein
           {Argiope trifasciata}, partial (3%)
          Length = 1136

 Score = 34.3 bits (77), Expect = 0.078
 Identities = 41/151 (27%), Positives = 55/151 (36%), Gaps = 21/151 (13%)
 Frame = +1

Query: 24  AKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSSAAGVKPLHQSTLDPKGRPTEKKKGHDN 83
           A  R   T Q KK  G       ++   Q+  SAAGV    +S     G     K+ HD 
Sbjct: 163 AGSRPKNTAQAKKTTGEREDPQPKEKTYQR--SAAGVATGRESA---GGTTNNSKRQHD- 324

Query: 84  VPPHQPDSGALINRPSTPFIQA---------GPSSAIGGE------------ALPPLLNL 122
            PP+ P++   +N P  P   A         GP++   G             A PPLL  
Sbjct: 325 -PPY-PNTMPCVNHPPPPTTPAKAPPAPRDSGPAAPTNGGGAKAPLRCRVACAFPPLLCC 498

Query: 123 SDPRFNGLDFMNRTFDNRIHKDVSGQGPPNI 153
           +D      D   R    +  +    Q PPNI
Sbjct: 499 NDKNHLK*DVDRRRSTRKKTQK*KNQAPPNI 591


>TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-infected
           erythrocyte surface antigen {Plasmodium falciparum},
           partial (2%)
          Length = 2007

 Score = 33.1 bits (74), Expect = 0.17
 Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 3/65 (4%)
 Frame = +1

Query: 301 KEIKDGQVV---GDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGN 357
           +EIKDG+ +    +++   +   Q ++E++ EE  +  NE  KNE  EKE  +    +  
Sbjct: 172 EEIKDGEKIQQENEENKDEEKSQQENEENKDEEKSQQENELKKNEGGEKETGEITEEKSK 351

Query: 358 NANNE 362
             N E
Sbjct: 352 QENEE 366


>TC77763 similar to GP|15215674|gb|AAK91382.1 AT4g27500/F27G19_100
            {Arabidopsis thaliana}, partial (38%)
          Length = 2297

 Score = 32.3 bits (72), Expect = 0.30
 Identities = 23/99 (23%), Positives = 43/99 (43%), Gaps = 3/99 (3%)
 Frame = +1

Query: 194  TAYEQAK---ADAETANKNLKAAEERCAKLTDDLAASDLLLQKTKSLKETINDKHTAVQA 250
            T  +Q K    D +   K  +A   +  ++ D L A D+ +Q  ++    +  K      
Sbjct: 922  TIQDQVKLIGGDLDGVKKERQAIRSKIKQIDDVLKAIDIDIQSLQAELVAVTQKREQAFE 1101

Query: 251  KCQKLEKKYERLNASILGRASLLFAQGFLAAKEQISVVE 289
              QKL K+ +  N+      +LL     LAAK+ ++ ++
Sbjct: 1102 SIQKLRKQRDEGNSYFYQSRTLLTKARELAAKKDVAAID 1218


>TC80893 similar to GP|4557063|gb|AAD22502.1| expressed protein {Arabidopsis
           thaliana}, partial (39%)
          Length = 883

 Score = 31.6 bits (70), Expect = 0.51
 Identities = 12/26 (46%), Positives = 17/26 (65%)
 Frame = +3

Query: 323 DDESEPEEDGEDGNEQHKNEDHEKED 348
           DD+   EED EDG ++   ED E++D
Sbjct: 480 DDDDGDEEDEEDGEDEEDEEDEEEDD 557



 Score = 30.4 bits (67), Expect = 1.1
 Identities = 16/53 (30%), Positives = 28/53 (52%)
 Frame = +3

Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNEN 363
           DDD   D+  + DD  E +  G++G E+   ED    DP+A  + G++   ++
Sbjct: 336 DDDDDDDVQDEDDDGEEEDYSGDEGEEEGDPED----DPEANGAGGSDDGEDD 482



 Score = 29.3 bits (64), Expect = 2.5
 Identities = 17/63 (26%), Positives = 31/63 (48%), Gaps = 1/63 (1%)
 Frame = +3

Query: 301 KEIKDGQVVGDDDISLDLLPQFDDESEPEE-DGEDGNEQHKNEDHEKEDPQAGTSQGNNA 359
           KE K      +DD   D +   DD+ E E+  G++G E+   ED  + +   G+  G + 
Sbjct: 303 KENKSDTEDDEDDDDDDDVQDEDDDGEEEDYSGDEGEEEGDPEDDPEANGAGGSDDGEDD 482

Query: 360 NNE 362
           +++
Sbjct: 483 DDD 491



 Score = 28.9 bits (63), Expect = 3.3
 Identities = 13/28 (46%), Positives = 18/28 (63%)
 Frame = +3

Query: 323 DDESEPEEDGEDGNEQHKNEDHEKEDPQ 350
           D + E EEDGED  E  ++E+ + E PQ
Sbjct: 489 DGDEEDEEDGED-EEDEEDEEEDDETPQ 569


>TC89427 weakly similar to GP|4557063|gb|AAD22502.1| expressed protein
           {Arabidopsis thaliana}, partial (50%)
          Length = 753

 Score = 31.6 bits (70), Expect = 0.51
 Identities = 28/97 (28%), Positives = 43/97 (43%), Gaps = 6/97 (6%)
 Frame = +1

Query: 260 ERLNASILGRASLLFAQGFLAA--KEQISVVEPGFDLSRIG----WLKEIKDGQVVGDDD 313
           + L A+    A LL A G L       IS ++  F L ++       +E KDG    DDD
Sbjct: 142 QSLIATAKSLAYLLIATGSLLTDVNHLISPMDGRFPLDQLSKGNHTCEENKDGSETEDDD 321

Query: 314 ISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQ 350
                  + DD+   +ED ++  +   +ED E  DP+
Sbjct: 322 ------DEDDDDDVNDEDDDNDEDFSGDEDDEDADPE 414



 Score = 30.4 bits (67), Expect = 1.1
 Identities = 15/47 (31%), Positives = 24/47 (50%)
 Frame = +1

Query: 303 IKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDP 349
           + +G    DDD   D     DD+ +  +DGED +E  + +D E + P
Sbjct: 424 VPNGAGGSDDDDEDD-----DDDDDDNDDGEDEDEDEEEDDDEDQPP 549


>TC90297 similar to GP|3204101|emb|CAA07227.1 hypothetical protein {Cicer
           arietinum}, partial (56%)
          Length = 853

 Score = 31.2 bits (69), Expect = 0.66
 Identities = 21/81 (25%), Positives = 33/81 (39%), Gaps = 3/81 (3%)
 Frame = +3

Query: 6   VDANPIKMKEYLAQSAAAAKKRAAETEQKKKNEGTSGSDNVRDPKRQKTSS---AAGVKP 62
           +D +P   K+Y+   A         +       G   +DN R P RQ T+    +    P
Sbjct: 387 LDGDP---KQYVDSPARHDNASNRSSNDSTPRLGVGSADNRRRPSRQSTAGSEHSVERSP 557

Query: 63  LHQSTLDPKGRPTEKKKGHDN 83
           LH+    P GR +   +G +N
Sbjct: 558 LHRQARAPAGRDSPSWEGKNN 620


>TC77337 weakly similar to GP|4019275|gb|AAC95573.1| orf 48 {Ateline
           herpesvirus 3}, partial (8%)
          Length = 986

 Score = 31.2 bits (69), Expect = 0.66
 Identities = 14/38 (36%), Positives = 23/38 (59%)
 Frame = +3

Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKED 348
           +DD   D   + DDE E ++D E+G E+ + E  ++ED
Sbjct: 600 EDDEDGDDQDEDDDEDEDDDDEEEGGEEDEEEGVDEED 713



 Score = 31.2 bits (69), Expect = 0.66
 Identities = 11/25 (44%), Positives = 20/25 (80%)
 Frame = +3

Query: 324 DESEPEEDGEDGNEQHKNEDHEKED 348
           DE+  EED EDG++Q +++D +++D
Sbjct: 582 DENGEEEDDEDGDDQDEDDDEDEDD 656



 Score = 29.3 bits (64), Expect = 2.5
 Identities = 15/43 (34%), Positives = 25/43 (57%), Gaps = 2/43 (4%)
 Frame = +3

Query: 310 GDDDISLDLLPQFDDESEP--EEDGEDGNEQHKNEDHEKEDPQ 350
           GDD    D   + DD+ E   EED E+G ++  NE+ E+++ +
Sbjct: 615 GDDQDEDDDEDEDDDDEEEGGEEDEEEGVDEEDNEEEEEDEDE 743



 Score = 28.5 bits (62), Expect = 4.3
 Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
 Frame = +3

Query: 323 DDESEPEEDGE----DGNEQHKNEDHEKEDPQAGTSQGNNANNEN 363
           DDE E ++DG+    +G ++  +ED        G   GNN+NN++
Sbjct: 432 DDEDEEDDDGDGAFGEGEDELSSED--------GGGYGNNSNNKS 542


>BF641220 weakly similar to PIR|G86203|G86 probable N-arginine dibasic
           convertase [imported] - Arabidopsis thaliana, partial
           (5%)
          Length = 634

 Score = 31.2 bits (69), Expect = 0.66
 Identities = 40/150 (26%), Positives = 61/150 (40%), Gaps = 17/150 (11%)
 Frame = +2

Query: 231 LQKTKSLKETIN---DKHTAVQAKCQKLEKKYERLNASILGRASLLFAQGFLAAKEQISV 287
           +QKTK  K  I     KH+    +    +K+   L  +     +   A   L++ + + V
Sbjct: 65  VQKTKPNKTLITIVAPKHSLSPFRFST-DKESMGLKGAPAAATTAATAAVALSSSDDVIV 241

Query: 288 VEPGFD-LSRIGWLKEIKDGQVVGDDDISLDLLPQF-----DDESEPEEDGEDGNEQ--- 338
             P  + L R+  LK      +V D +I  +  P+      DDE E +ED ED  E    
Sbjct: 242 KSPNDNRLYRLVHLKNGLQALIVHDPEIYPEGAPKDGSIDEDDEEEDDEDEEDDEEDDDE 421

Query: 339 -HKNEDHEKEDPQ----AGTSQGNNANNEN 363
              +ED E ED       G   G  A N++
Sbjct: 422 GEDDEDEEXEDEDEXXVXGREGGKGAANQS 511


>CB066689 homologue to GP|13646986|dbj DNA-binding protein DF1 {Pisum
           sativum}, partial (8%)
          Length = 508

 Score = 30.8 bits (68), Expect = 0.86
 Identities = 17/56 (30%), Positives = 28/56 (49%)
 Frame = +3

Query: 307 QVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNE 362
           QVV  ++++  L+ Q + +  P + GED  EQ  N   E+ED       G   ++E
Sbjct: 171 QVVQPENMAAPLMVQPEQQWRPPQQGEDNMEQ--NRGQEEEDMDEDDKDGEEEDDE 332


>BI311490 weakly similar to GP|21751020|dbj unnamed protein product {Homo
           sapiens}, partial (22%)
          Length = 387

 Score = 30.8 bits (68), Expect = 0.86
 Identities = 18/47 (38%), Positives = 27/47 (57%)
 Frame = -3

Query: 301 KEIKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKE 347
           K+ KD  + G+DD   D   +F +ES+PE + E+  E+ K E  E E
Sbjct: 343 KQEKDA-IDGEDDGEEDNNDEFFNESQPEFEDENEKEESKEEIDESE 206


>TC86146 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
           arietinum}, partial (95%)
          Length = 1084

 Score = 30.4 bits (67), Expect = 1.1
 Identities = 18/52 (34%), Positives = 25/52 (47%)
 Frame = +3

Query: 310 GDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANN 361
           GDD   L      DDE + E+D  D  E +++ED +  D     S G+  NN
Sbjct: 264 GDDFDDLHDGTDVDDEDDDEDD--DNEEDYEDEDEDAFDVHDHASVGDRENN 413


>TC86145 homologue to GP|10334499|emb|CAC10211. hypothetical protein {Cicer
           arietinum}, partial (91%)
          Length = 1311

 Score = 30.4 bits (67), Expect = 1.1
 Identities = 18/52 (34%), Positives = 25/52 (47%)
 Frame = +1

Query: 310 GDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANN 361
           GDD   L      DDE + E+D  D  E +++ED +  D     S G+  NN
Sbjct: 445 GDDFDDLHDGTDVDDEDDDEDD--DNEEDYEDEDEDAFDVHDHASVGDRENN 594


>BG648593 weakly similar to GP|9294451|dbj A37 protein; ethylene-inducible
           protein-like {Arabidopsis thaliana}, partial (75%)
          Length = 724

 Score = 30.4 bits (67), Expect = 1.1
 Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
 Frame = +2

Query: 62  PLHQSTLDPKGRPTEKKKG-HDNVPPHQPDSGALINRPST 100
           P  QST  P+  PT  K   H N P H+P   A  +R  T
Sbjct: 5   PPSQSTTPPQSPPTHSKPHTHSNYPSHKPSVAAQFSRSPT 124


>TC91658 similar to GP|14329812|emb|CAC40753. putative nucleosome assembly
           protein 1 {Atropa belladonna}, partial (46%)
          Length = 583

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
 Frame = +2

Query: 312 DDISLDLLPQFDDESEPEE---DGEDGNEQHKNEDHEKEDPQAGTSQGNNAN 360
           DDI +D     DDE   EE   D +D ++  ++ED E+ED   G S+    +
Sbjct: 71  DDIEVDE----DDEDGDEEDDDDDDDDDDDEEDEDDEEEDEGKGKSKSKRGS 214


>CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect virus}
           [Heliothis zea virus 1], partial (2%)
          Length = 803

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 20/47 (42%), Positives = 26/47 (54%)
 Frame = -1

Query: 302 EIKDGQVVGDDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKED 348
           E  D ++V + D     +   DDE E +ED ED NE   NED E+ED
Sbjct: 701 ECVDVELVDNSDKVRYKMDDDDDEEEEDED-EDENE---NEDEEEED 573


>TC89984 similar to GP|16604601|gb|AAL24093.1 unknown protein {Arabidopsis
           thaliana}, partial (32%)
          Length = 1303

 Score = 30.0 bits (66), Expect = 1.5
 Identities = 40/179 (22%), Positives = 77/179 (42%), Gaps = 28/179 (15%)
 Frame = +3

Query: 162 LSAASIVAGMAQCVKELIATKNRYEKKAADYKTAYEQAKADAETANKNLKAAEERCAKLT 221
           L+  SIV      +++L A + + E  +  Y     ++KAD +   K +K+      +L 
Sbjct: 603 LTKESIVQQNDVLLQDLDAAREQLEILSKQYGELEAKSKADIKVLVKEVKSLRSSQTELK 782

Query: 222 DDLAASDLLLQKTKSLKETINDKHTAVQA---------KCQKLEKK---------YERLN 263
            +L  S+ + +K ++ K  ++++  + QA         KC  L K+         YE  +
Sbjct: 783 KEL--SESIKEKYEAEKLLLHEREKSEQAEIAWRKQLEKCGLLLKQLQECSVELPYEDED 956

Query: 264 ASILGRASLLFAQGFL-AAKEQISVV---------EPGFDLSRIGWLKEIKDGQVVGDD 312
            + L  +S   A   L  + +QI ++         + G   S +    +IKDG + GD+
Sbjct: 957 RTFLQSSSSTDAFNKLKTSDDQIDILLAEVENLEKDAGSAASNVDKTNDIKDGVICGDE 1133


>TC77395 similar to GP|4874305|gb|AAD31367.1| expressed protein {Arabidopsis
           thaliana}, partial (65%)
          Length = 1649

 Score = 29.6 bits (65), Expect = 1.9
 Identities = 14/41 (34%), Positives = 21/41 (51%)
 Frame = +2

Query: 311 DDDISLDLLPQFDDESEPEEDGEDGNEQHKNEDHEKEDPQA 351
           D   S D    F+D    E+  +DG+E+ + +DHE E   A
Sbjct: 566 DPQPSEDKPEHFEDTESEEDILDDGDEEEEEQDHEPEPEHA 688


>TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein T28I19.100
           - Arabidopsis thaliana, partial (14%)
          Length = 1460

 Score = 29.6 bits (65), Expect = 1.9
 Identities = 14/40 (35%), Positives = 24/40 (60%)
 Frame = +3

Query: 323 DDESEPEEDGEDGNEQHKNEDHEKEDPQAGTSQGNNANNE 362
           D+E + EE+G+D     + E+ +KED + G    N+ N+E
Sbjct: 699 DEEKDKEEEGDD-----ETENEDKEDEEKGGLVENHENHE 803


>TC87387 homologue to PIR|T47775|T47775 hypothetical protein F24I3.230 -
           Arabidopsis thaliana, partial (14%)
          Length = 1074

 Score = 29.6 bits (65), Expect = 1.9
 Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 6/88 (6%)
 Frame = +3

Query: 12  KMKEYLAQSAAAAKKRAAETEQKKKN--EGTSGSDNV----RDPKRQKTSSAAGVKPLHQ 65
           K K+     AAA+ +   E E+KKK+  +G  GS  V    +  K+ K +S  G   + +
Sbjct: 402 KKKKDKENGAAASDEEKVEKEKKKKHKEKGEDGSPEVEKSDKKKKKHKETSEVGSPEVDK 581

Query: 66  STLDPKGRPTEKKKGHDNVPPHQPDSGA 93
           S    K +  E K    ++     +S A
Sbjct: 582 SEKKKKKKDKEAKDNAADISNGNDESNA 665


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.308    0.127    0.350 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,715,531
Number of Sequences: 36976
Number of extensions: 108602
Number of successful extensions: 726
Number of sequences better than 10.0: 66
Number of HSP's better than 10.0 without gapping: 640
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 699
length of query: 365
length of database: 9,014,727
effective HSP length: 97
effective length of query: 268
effective length of database: 5,428,055
effective search space: 1454718740
effective search space used: 1454718740
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 59 (27.3 bits)


Lotus: description of TM0219.2