Lotus
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0201.10
         (137 letters)

Database: GMGI 
           63,676 sequences; 37,918,896 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

TC220681 similar to UP|Q6Z2V7 (Q6Z2V7) NHL repeat-containing pro...    90  4e-19
TC228343 similar to UP|Q6Z2V7 (Q6Z2V7) NHL repeat-containing pro...    87  3e-18
TC228344                                                               77  3e-15
TC220821 similar to UP|Q72D38 (Q72D38) Sodium-dependent symporte...    61  1e-10
TC229294                                                               58  1e-09
AI930730                                                               50  3e-07
CD400517                                                               43  4e-05
TC221000 similar to UP|Q9CSQ9 (Q9CSQ9) Mus musculus 10 days embr...    39  6e-04
CD396933                                                               37  0.002
TC222443                                                               37  0.004
TC232260                                                               36  0.007
TC234016                                                               36  0.007
TC232194 weakly similar to UP|Q9LSN6 (Q9LSN6) Arabidopsis thalia...    35  0.009
TC227116 similar to UP|WAS2_HUMAN (Q9Y6W5) Wiskott-Aldrich syndr...    27  0.051
AW734146 similar to GP|3056592|gb|A T1F9.13 {Arabidopsis thalian...    32  0.12
CD390814 similar to GP|13122434|dbj P0666G04.17 {Oryza sativa (j...    31  0.16
TC225822 homologue to PRF|1609232A.0|226866|1609232A 31kD glycop...    31  0.16
TC224734 homologue to UP|Q09085 (Q09085) Hydroxyproline-rich gly...    30  0.36
TC229883 similar to GB|BAC20813.1|23617133|AP005183 transcriptio...    30  0.36
TC224753 homologue to UP|Q09085 (Q09085) Hydroxyproline-rich gly...    30  0.36

>TC220681 similar to UP|Q6Z2V7 (Q6Z2V7) NHL repeat-containing protein-like,
           partial (27%)
          Length = 703

 Score = 89.7 bits (221), Expect = 4e-19
 Identities = 50/115 (43%), Positives = 60/115 (51%), Gaps = 16/115 (13%)
 Frame = +3

Query: 18  CSSSSETSLSWWERVRSPENK------EWWWAHGWN---KVREWSEIIVGPKWKTFIRRF 68
           C        SWWERVRS  +       + WW+ G     K+REWSEI+ GPKWKTFIRRF
Sbjct: 180 CFGGQRRRSSWWERVRSHSHSAPASAGDRWWSRGLRALKKLREWSEIVAGPKWKTFIRRF 359

Query: 69  NRNNNRAGAASYDKKGSFHYDSLSYALNFDDG-------EEDVYSYGGFSTRFAS 116
           NRN       +Y       YD  SYALNFD+G         D  +   FSTR+A+
Sbjct: 360 NRNRPNKRIPTY------QYDPFSYALNFDEGPNGDFNHHHDDAALRNFSTRYAA 506


>TC228343 similar to UP|Q6Z2V7 (Q6Z2V7) NHL repeat-containing protein-like,
           partial (30%)
          Length = 706

 Score = 86.7 bits (213), Expect = 3e-18
 Identities = 52/136 (38%), Positives = 70/136 (51%), Gaps = 31/136 (22%)
 Frame = +1

Query: 14  YLIPCS-----SSSETSLSWWERVRSPENKEW-----------------WWAHG---WNK 48
           +  PC       S+     WWERVR+  +  W                 WW+ G   + K
Sbjct: 163 FCFPCCFGSRRDSATLGFGWWERVRATSS--WSESRSEAQPVTGSPGGRWWSGGVRAFMK 336

Query: 49  VREWSEIIVGPKWKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDDGEEDVYSYG 108
           VREWSE+  GP+WKTFIRRF+R  +R+G + +   G + YD LSYALNFD+G    +   
Sbjct: 337 VREWSELAAGPRWKTFIRRFSR--SRSGGSRH-AAGKYQYDPLSYALNFDEGHNGYFDGD 507

Query: 109 G------FSTRFASVP 118
           G      FSTR+A+ P
Sbjct: 508 GYDGLRNFSTRYAAPP 555


>TC228344 
          Length = 430

 Score = 77.0 bits (188), Expect = 3e-15
 Identities = 46/107 (42%), Positives = 61/107 (56%), Gaps = 20/107 (18%)
 Frame = +1

Query: 13  CYLIPCSSSSETSLSWWERVR--------SPENKEW---------WWAHG---WNKVREW 52
           C+   C S S    +WWERVR        SP ++           WW+ G   + KVREW
Sbjct: 121 CFGSRCHSVS-VGFAWWERVRATSSWSETSPHSEAQPATGSSSGSWWSGGVRAFMKVREW 297

Query: 53  SEIIVGPKWKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDD 99
           SE+  GP+WKTFIRRF+R  +R+G A +   G + YD LSY+LNFD+
Sbjct: 298 SELAAGPRWKTFIRRFSR--SRSGWARH-AAGKYQYDPLSYSLNFDE 429


>TC220821 similar to UP|Q72D38 (Q72D38) Sodium-dependent symporter family
           protein, partial (5%)
          Length = 576

 Score = 61.2 bits (147), Expect = 1e-10
 Identities = 35/87 (40%), Positives = 50/87 (57%), Gaps = 3/87 (3%)
 Frame = +2

Query: 41  WWAHGWNKVREWSEIIVGPKWKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDDG 100
           W +    K++E+SE+I GPKWKTFIR+ +    +       +K  F YD  SYALNF+ G
Sbjct: 218 WLSCKLRKIKEFSEVIAGPKWKTFIRKISGYGRK-----QQQKNRFQYDEHSYALNFNSG 382

Query: 101 E--EDVYSYGGFSTRF-ASVPASTKPT 124
           +  ED  +   FS RF A  P++ + T
Sbjct: 383 DKSEDDDTPPSFSARFSAPFPSARRQT 463


>TC229294 
          Length = 741

 Score = 58.2 bits (139), Expect = 1e-09
 Identities = 34/80 (42%), Positives = 43/80 (53%), Gaps = 2/80 (2%)
 Frame = +1

Query: 39  EWWWAHGWNKVREWSEIIVGPKWKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFD 98
           E W      K +E SE+I GPKWKTFIR+     +  G     ++  F YD  SYALNF+
Sbjct: 319 ESWVVEKLRKAKEVSEVIAGPKWKTFIRKI----SGYGKKVKQQRNRFQYDEHSYALNFN 486

Query: 99  DG--EEDVYSYGGFSTRFAS 116
            G   ED      FS+RFA+
Sbjct: 487 SGAQSEDEGMPHSFSSRFAA 546


>AI930730 
          Length = 494

 Score = 50.1 bits (118), Expect = 3e-07
 Identities = 25/60 (41%), Positives = 34/60 (56%)
 Frame = +2

Query: 39  EWWWAHGWNKVREWSEIIVGPKWKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFD 98
           E W  +   K +E SE+I GPKWKTFIR+ +    +       ++  F YD  SYALNF+
Sbjct: 308 ESWVVNKLRKAKEVSEVIAGPKWKTFIRKISGYGKKVKY----QRNRFQYDEHSYALNFN 475


>CD400517 
          Length = 561

 Score = 43.1 bits (100), Expect = 4e-05
 Identities = 29/65 (44%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
 Frame = -2

Query: 53  SEIIVGPKWKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDDG--EEDVYSYGGF 110
           SEI+V PK    +RR     + A      +K  F YD  SYALNFDDG  EED   +  F
Sbjct: 344 SEILVWPKKSFSLRRLRTCISEA--KEMKRKMPFQYDPXSYALNFDDGSIEEDDGVFLDF 171

Query: 111 STRFA 115
           S R+A
Sbjct: 170 SARYA 156


>TC221000 similar to UP|Q9CSQ9 (Q9CSQ9) Mus musculus 10 days embryo whole
           body cDNA, RIKEN full-length enriched library,
           clone:2610528D21 product:formin binding protein 4, full
           insert sequence. (Fragment), partial (5%)
          Length = 727

 Score = 39.3 bits (90), Expect = 6e-04
 Identities = 27/72 (37%), Positives = 33/72 (45%), Gaps = 5/72 (6%)
 Frame = +2

Query: 64  FIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDD----GEEDVYSYGGFSTRF-ASVP 118
           FI R    + R  +A       FHYD+LSYALNF+D     E  V     FS R  AS P
Sbjct: 212 FINRIGGRHRRRHSAD------FHYDALSYALNFEDYATADERHVDELKSFSARLPASPP 373

Query: 119 ASTKPTFTPVMV 130
               PT   + +
Sbjct: 374 PKVSPTSAAIAI 409


>CD396933 
          Length = 572

 Score = 37.4 bits (85), Expect = 0.002
 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 5/59 (8%)
 Frame = -3

Query: 47  NKVREWSEIIVGPK-----WKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDDG 100
           +K  E+S I   P+     W+  +RRF R +         K  SF YD +SY+ NFD+G
Sbjct: 411 HKEYEYSRIYYSPRKRVRRWRNILRRFMRESK---TLCRSKPMSFQYDPVSYSQNFDEG 244


>TC222443 
          Length = 503

 Score = 36.6 bits (83), Expect = 0.004
 Identities = 30/103 (29%), Positives = 48/103 (46%)
 Frame = +1

Query: 17  PCSSSSETSLSWWERVRSPENKEWWWAHGWNKVREWSEIIVGPKWKTFIRRFNRNNNRAG 76
           P S    TS SW+ ++ SP +  +   H   +VR       G   K+ + R   ++ ++ 
Sbjct: 37  PISPQGSTSSSWF-KIMSPTSTNFGDPHP--RVR-------GRSLKSRMGRKLLHHRQSQ 186

Query: 77  AASYDKKGSFHYDSLSYALNFDDGEEDVYSYGGFSTRFASVPA 119
           +A       F YD  SYALNF+D     + +  FS+R  S P+
Sbjct: 187 SAD------FSYDPSSYALNFEDDSPQEFPFRNFSSRLPSSPS 297


>TC232260 
          Length = 559

 Score = 35.8 bits (81), Expect = 0.007
 Identities = 16/42 (38%), Positives = 22/42 (52%)
 Frame = +2

Query: 82  KKGSFHYDSLSYALNFDDGEEDVYSYGGFSTRFASVPASTKP 123
           +   F YD  SYALNF+D   +   +  FS+R    P S+ P
Sbjct: 146 QSADFSYDPSSYALNFEDDSPEEIPFRNFSSRLPPSPPSSTP 271


>TC234016 
          Length = 668

 Score = 35.8 bits (81), Expect = 0.007
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
 Frame = +2

Query: 51  EWSEIIVGPK-----WKTFIRRFNRNNNRAGAASYDKKGSFHYDSLSYALNFDDG 100
           E+S I   P+     W+  +RRF R +         K  SF YD +SY+ NFD+G
Sbjct: 164 EYSRIHYNPRKRVRRWRNILRRFMRESK---TLCRSKPMSFQYDPVSYSQNFDEG 319


>TC232194 weakly similar to UP|Q9LSN6 (Q9LSN6) Arabidopsis thaliana genomic
           DNA, chromosome 3, TAC clone:K14A17, partial (40%)
          Length = 550

 Score = 35.4 bits (80), Expect = 0.009
 Identities = 13/33 (39%), Positives = 17/33 (51%)
 Frame = +1

Query: 20  SSSETSLSWWERVRSPENKEWWWAHGWNKVREW 52
           S S T+   W R      K WWW   WN++R+W
Sbjct: 235 SHSSTTYWIWRRWWKWNWKRWWWLRRWNRLRKW 333



 Score = 25.4 bits (54), Expect = 8.9
 Identities = 8/25 (32%), Positives = 9/25 (36%)
 Frame = +1

Query: 28  WWERVRSPENKEWWWAHGWNKVREW 52
           WW          WWW   W   R+W
Sbjct: 337 WWR-------SRWWWTRTWCWDRKW 390


>TC227116 similar to UP|WAS2_HUMAN (Q9Y6W5) Wiskott-Aldrich syndrome protein
           family member 2 (WASP-family protein member 2)
           (Verprolin homology domain-containing protein 2),
           partial (5%)
          Length = 1154

 Score = 26.9 bits (58), Expect(2) = 0.051
 Identities = 12/28 (42%), Positives = 18/28 (63%), Gaps = 1/28 (3%)
 Frame = -3

Query: 19  SSSSETSLSWWERV-RSPENKEWWWAHG 45
           +S  E S+S+WERV  S  +++W W  G
Sbjct: 225 TSKREDSVSFWERVGESMSSRKWVWQCG 142



 Score = 24.6 bits (52), Expect(2) = 0.051
 Identities = 7/17 (41%), Positives = 10/17 (58%)
 Frame = -2

Query: 40 WWWAHGWNKVREWSEII 56
          WWW   W + R+W E +
Sbjct: 58 WWWRWRWRR-RKWEEAL 11


>AW734146 similar to GP|3056592|gb|A T1F9.13 {Arabidopsis thaliana}, partial
           (9%)
          Length = 409

 Score = 31.6 bits (70), Expect = 0.12
 Identities = 17/37 (45%), Positives = 22/37 (58%), Gaps = 2/37 (5%)
 Frame = +3

Query: 82  KKGSFHYDSLSYALNFDDG--EEDVYSYGGFSTRFAS 116
           ++  F YD  SYALNF+ G   ED      FS+RFA+
Sbjct: 264 QRNRFQYDEHSYALNFNSGAQSEDEGMPHSFSSRFAA 374


>CD390814 similar to GP|13122434|dbj P0666G04.17 {Oryza sativa (japonica
           cultivar-group)}, partial (14%)
          Length = 492

 Score = 31.2 bits (69), Expect = 0.16
 Identities = 30/95 (31%), Positives = 42/95 (43%), Gaps = 10/95 (10%)
 Frame = -1

Query: 32  VRSPENKEW--WWAHGWNKVREWSEIIVGPK---WKTFIRRFNRNNNRAGAASYDKKGSF 86
           +RS  N  W  +   G++ +   +E++   K   WK   R+  R   R     +     F
Sbjct: 450 IRSNSNLSWSSYERIGYDPIVCVNELVTRLKMGSWKALWRKIKRERRRF----FRPSPVF 283

Query: 87  H--YDSLSYALNFDDG---EEDVYSYGGFSTRFAS 116
           H  YD  SY  NFDDG   + D  S   FS RFA+
Sbjct: 282 HVQYDPTSYLQNFDDGYSTDPDNVS-RSFSARFAA 181


>TC225822 homologue to PRF|1609232A.0|226866|1609232A 31kD glycoprotein.
           {Glycine max;} , partial (44%)
          Length = 820

 Score = 31.2 bits (69), Expect = 0.16
 Identities = 14/43 (32%), Positives = 22/43 (50%), Gaps = 2/43 (4%)
 Frame = +1

Query: 12  ICYLIPCSSSSETSLSW--WERVRSPENKEWWWAHGWNKVREW 52
           +C+L  CSS    ++ W    +V SP     WWA    K+R++
Sbjct: 106 VCFLCCCSSFGSMAMPWRRLPKVPSPNENWLWWAFFGGKMRKF 234


>TC224734 homologue to UP|Q09085 (Q09085) Hydroxyproline-rich glycoprotein
           (HRGP) (Fragment), partial (48%)
          Length = 990

 Score = 30.0 bits (66), Expect = 0.36
 Identities = 21/69 (30%), Positives = 25/69 (35%), Gaps = 5/69 (7%)
 Frame = -3

Query: 11  CICYLIPCSSSSETSLSWWERVRSPENKEWWWAHGW-----NKVREWSEIIVGPKWKTFI 65
           C C+LIP    S     WW    S     WWW  GW       +R W     G  W+ F+
Sbjct: 559 CCCFLIP----SLLVDGWWRG--SINVNWWWWRLGWWWRAHVVIRRWGRRRWG--WRRFV 404

Query: 66  RRFNRNNNR 74
                 N R
Sbjct: 403 VVGRMRNGR 377


>TC229883 similar to GB|BAC20813.1|23617133|AP005183 transcription
           factor-like protein {Oryza sativa (japonica
           cultivar-group);} , partial (15%)
          Length = 1063

 Score = 30.0 bits (66), Expect = 0.36
 Identities = 19/69 (27%), Positives = 29/69 (41%)
 Frame = -3

Query: 20  SSSETSLSWWERVRSPENKEWWWAHGWNKVREWSEIIVGPKWKTFIRRFNRNNNRAGAAS 79
           S +  ++ W  R RS  +K  WW   W  + +WS I     W  +I   N      G  S
Sbjct: 461 SGNN*TIVWRGRGRSSRSK*QWWIWRWKWL*QWSTI-----WAMYI---NSRMKWRGRRS 306

Query: 80  YDKKGSFHY 88
             + G +H+
Sbjct: 305 R*RYGGYHH 279


>TC224753 homologue to UP|Q09085 (Q09085) Hydroxyproline-rich glycoprotein
           (HRGP) (Fragment), partial (48%)
          Length = 1088

 Score = 30.0 bits (66), Expect = 0.36
 Identities = 21/69 (30%), Positives = 25/69 (35%), Gaps = 5/69 (7%)
 Frame = -2

Query: 11  CICYLIPCSSSSETSLSWWERVRSPENKEWWWAHGW-----NKVREWSEIIVGPKWKTFI 65
           C C+LIP    S     WW    S     WWW  GW       +R W     G  W+ F+
Sbjct: 634 CCCFLIP----SLLVDGWWRG--SINVNWWWWRLGWWWRAHVVIRRWGRRRWG--WRRFV 479

Query: 66  RRFNRNNNR 74
                 N R
Sbjct: 478 VVGRMRNGR 452


  Database: GMGI
    Posted date:  Oct 22, 2004  4:58 PM
  Number of letters in database: 37,918,896
  Number of sequences in database:  63,676
  
Lambda     K      H
   0.319    0.133    0.452 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,995,929
Number of Sequences: 63676
Number of extensions: 139008
Number of successful extensions: 1466
Number of sequences better than 10.0: 123
Number of HSP's better than 10.0 without gapping: 1404
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1447
length of query: 137
length of database: 12,639,632
effective HSP length: 87
effective length of query: 50
effective length of database: 7,099,820
effective search space: 354991000
effective search space used: 354991000
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (25.4 bits)


Lotus: description of TM0201.10