Medicago
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC147714.16 - phase: 0 
         (322 letters)

Database: MTGI 
           36,976 sequences; 27,044,181 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

BE319145                                                              302  1e-82
TC93186 similar to PIR|A86193|A86193 hypothetical protein [impor...    94  9e-20
TC89535 weakly similar to PIR|A96808|A96808 hypothetical protein...    70  8e-13
TC91564 weakly similar to GP|6573754|gb|AAF17674.1| F28K19.1 {Ar...    64  1e-10
BG645509 similar to PIR|T47966|T47 hypothetical protein F15G16.1...    40  0.001
AW560016 similar to GP|8778713|gb T25N20.3 {Arabidopsis thaliana...    39  0.004
BG644635 weakly similar to GP|8843772|db contains similarity to ...    37  0.008
BF645457 similar to PIR|T52301|T5 GYMNOS/PICKLE protein [importe...    35  0.039
TC84117 weakly similar to GP|439289|emb|CAA81388.1| verprolin {S...    35  0.051
TC76771 homologue to SP|Q42961|PGKH_TOBAC Phosphoglycerate kinas...    34  0.066
TC80666 similar to PIR|E85112|E85112 probable methyltransferase ...    34  0.066
BG453645                                                               34  0.066
TC78762 similar to GP|9294670|dbj|BAB03019.1 gene_id:F14O13.20~u...    34  0.087
BE997694 similar to GP|13646986|db DNA-binding protein DF1 {Pisu...    34  0.087
TC88622 weakly similar to GP|3336892|emb|CAA12389.1 Hsp20.0 prot...    32  0.25
TC81171 similar to GP|9795609|gb|AAF98427.1| Unknown protein {Ar...    32  0.25
TC87476 homologue to PIR|T11622|T11622 extensin class 1 precurso...    32  0.33
TC93132                                                                32  0.33
TC86323 weakly similar to GP|15450363|gb|AAK96475.1 At1g16840/F1...    32  0.43
BQ751144 weakly similar to PIR|PQ0479|PQ04 pistil extensin-like ...    32  0.43

>BE319145 
          Length = 455

 Score =  302 bits (774), Expect = 1e-82
 Identities = 145/146 (99%), Positives = 145/146 (99%)
 Frame = +1

Query: 1   MDFTKNITDLPPNKRLRFIYQQQQQQEEQDLSHCSLLPTKKRKESRNSSLFHTPPSSPTP 60
           MDFTKNITDLPPNKRLRFIYQQQQQQEEQDLSHCSLLPTKKRKESRNSSLFHTPPSSPTP
Sbjct: 16  MDFTKNITDLPPNKRLRFIYQQQQQQEEQDLSHCSLLPTKKRKESRNSSLFHTPPSSPTP 195

Query: 61  PPSTYSLPTKKRITALQPHLHHHNNIPNDAVPLIDLNVEYSPSLPSATPIEKQSQKQDIE 120
           PPSTYSLPTKKRITALQPHLHHHNNIPNDAVPLIDLNVEYSPSLPSATPIEKQSQKQDIE
Sbjct: 196 PPSTYSLPTKKRITALQPHLHHHNNIPNDAVPLIDLNVEYSPSLPSATPIEKQSQKQDIE 375

Query: 121 FEVDDDEILCCVCHSTDANAEDPIVF 146
           FEVDDDEILC VCHSTDANAEDPIVF
Sbjct: 376 FEVDDDEILCWVCHSTDANAEDPIVF 453


>TC93186 similar to PIR|A86193|A86193 hypothetical protein [imported] -
           Arabidopsis thaliana, partial (11%)
          Length = 639

 Score = 93.6 bits (231), Expect = 9e-20
 Identities = 48/122 (39%), Positives = 62/122 (50%), Gaps = 1/122 (0%)
 Frame = +3

Query: 130 CCVCHSTDANAEDPIVFCDGCNLMVHASCYGNPLVKQIPDGDWFCDQCRFKNDIDTDTGP 189
           C VCH  +    +  + CD C +MVHA CYG    + +    W C+ CR      +   P
Sbjct: 297 CNVCHMDEEYENNLFLQCDKCRMMVHARCYGEH--EPVNGVLWLCNLCR------SGAPP 452

Query: 190 IRCSLCPTKEGAMKQTTDGKWVHLVCALLVPEVFFVDPEGREGID-CSKIPKKRWLEKCY 248
             C LCP   GAMK TTDG+W HL CA+ +PE    D +  E ID   +I K RW   C 
Sbjct: 453 PPCCLCPLIGGAMKPTTDGRWAHLACAMWIPETCLADVKRMEPIDGLRRISKDRWKLLCS 632

Query: 249 VC 250
           +C
Sbjct: 633 IC 638


>TC89535 weakly similar to PIR|A96808|A96808 hypothetical protein T32E8.13
            [imported] - Arabidopsis thaliana, partial (9%)
          Length = 1729

 Score = 70.5 bits (171), Expect = 8e-13
 Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 10/148 (6%)
 Frame = +3

Query: 124  DDDEILCCVCHSTDANAE-DPIVFCDGCNLMVHASCYGNPLVKQIPDGDWFCDQC-RFKN 181
            D D+  C  C   D++ + + +V C  C + VH  CYG   V+   D  W C  C + K 
Sbjct: 1113 DGDQPYCHYCGRGDSDTDSNRVVVCASCKVAVHRKCYG---VQDDVDDSWLCSWCSKQKG 1283

Query: 182  DIDTDTGPIRCSLCPTKEGAMK---QTTDG----KWVHLVCALLVPEVFFVDPEGREGI- 233
            D+D    P  C LC  K GA+K      DG     +VHL C L +PEV+  D +  E + 
Sbjct: 1284 DVDDSVNP--CVLCSKKGGALKPVYSAVDGVGSSPFVHLYCCLWMPEVYIEDLKKMEPVM 1457

Query: 234  DCSKIPKKRWLEKCYVCGCFDGCALVCS 261
            +   I + R    C +C    G  + C+
Sbjct: 1458 NVGGIKENRRKLMCNICKLRCGACVQCT 1541


>TC91564 weakly similar to GP|6573754|gb|AAF17674.1| F28K19.1 {Arabidopsis
           thaliana}, partial (22%)
          Length = 875

 Score = 63.5 bits (153), Expect = 1e-10
 Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 12/108 (11%)
 Frame = +1

Query: 130 CCVCHSTDANAEDPIVFCDGCNLMVHASCYGNPLVKQIPDGDWFCDQCRFKNDIDTDTGP 189
           C +C   + N  +PI+ C GC + VH+ CY +  VK+   G W+C+ C  ++ +   +GP
Sbjct: 358 CDICRRFE-NVLNPILVCSGCKVAVHSVCYRS--VKETT-GPWYCELC--EDLLSRSSGP 519

Query: 190 ------------IRCSLCPTKEGAMKQTTDGKWVHLVCALLVPEVFFV 225
                         C+LC    GA ++++DG+WVH  CA +    FF+
Sbjct: 520 SAINSWEKPYFVAECALCGGTTGAFRKSSDGQWVHAFCAEVTLSFFFL 663


>BG645509 similar to PIR|T47966|T47 hypothetical protein F15G16.130 -
           Arabidopsis thaliana, partial (15%)
          Length = 631

 Score = 40.4 bits (93), Expect = 0.001
 Identities = 22/64 (34%), Positives = 32/64 (49%), Gaps = 6/64 (9%)
 Frame = +3

Query: 130 CCVC----HSTDANAEDPIVFCDGCNLMVHASC--YGNPLVKQIPDGDWFCDQCRFKNDI 183
           C +C    H +DA      V CDGCN+ VHA C    +   K + + D++C  C+ K+D 
Sbjct: 441 CGICKKIWHHSDAG---DWVCCDGCNVWVHAECDKISSKRFKDLENIDYYCPDCKGKSDC 611

Query: 184 DTDT 187
              T
Sbjct: 612 KLST 623


>AW560016 similar to GP|8778713|gb T25N20.3 {Arabidopsis thaliana}, partial
           (13%)
          Length = 657

 Score = 38.5 bits (88), Expect = 0.004
 Identities = 24/89 (26%), Positives = 33/89 (36%), Gaps = 24/89 (26%)
 Frame = +1

Query: 134 HSTDANAEDP-------------IVFCDGCNLMVHASCYGNPLVKQIPDGDWFCDQCRFK 180
           HS D +  DP             ++ CDGC    H SC     ++ +P G+W C  C  K
Sbjct: 295 HSVDVDGNDPNDDTCGICGDGGDLICCDGCPSTFHQSCLD---IQMLPPGEWRCPNCTCK 465

Query: 181 ----------NDIDTDTGPIR-CSLCPTK 198
                      + D     +R C LC  K
Sbjct: 466 FCGLASATTDKEDDATVNALRTCDLCEKK 552


>BG644635 weakly similar to GP|8843772|db contains similarity to zinc finger
           protein~gene_id:MYN8.4 {Arabidopsis thaliana}, partial
           (15%)
          Length = 784

 Score = 37.4 bits (85), Expect = 0.008
 Identities = 19/58 (32%), Positives = 28/58 (47%), Gaps = 3/58 (5%)
 Frame = +2

Query: 130 CCVCHSTDANAEDPI-VFCDGCNLMVHASC--YGNPLVKQIPDGDWFCDQCRFKNDID 184
           C +C     +++    V CDGC + VHA C    +   K +   D+FC  CR K D +
Sbjct: 248 CGICKKVSNHSDSGSWVRCDGCKVWVHAECDKISSNHFKDLETTDYFCPTCRGKFDFE 421


>BF645457 similar to PIR|T52301|T5 GYMNOS/PICKLE protein [imported] -
           Arabidopsis thaliana, partial (9%)
          Length = 560

 Score = 35.0 bits (79), Expect = 0.039
 Identities = 31/115 (26%), Positives = 48/115 (40%)
 Frame = +2

Query: 86  IPNDAVPLIDLNVEYSPSLPSATPIEKQSQKQDIEFEVDDDEILCCVCHSTDANAEDPIV 145
           + +D  P+ +L+           P   Q + + I+   D  E LC  C  +       ++
Sbjct: 125 VRSDRKPVYNLDESDDEDFLLKKPGASQEKFERID-RSDAKEDLCQACGESG-----DLL 286

Query: 146 FCDGCNLMVHASCYGNPLVKQIPDGDWFCDQCRFKNDIDTDTGPIRCSLCPTKEG 200
            C  CN   H+SC   PL    PD +W C +C     ID D   + C + PT +G
Sbjct: 287 SCATCNYAYHSSCLLPPLKGPAPD-NWRCPEC-VTPLIDIDK-LLDCEMRPTVQG 442


>TC84117 weakly similar to GP|439289|emb|CAA81388.1| verprolin
           {Saccharomyces cerevisiae}, partial (3%)
          Length = 674

 Score = 34.7 bits (78), Expect = 0.051
 Identities = 28/88 (31%), Positives = 39/88 (43%), Gaps = 9/88 (10%)
 Frame = +2

Query: 32  SHCSLLPTKKRKESRNSSLF------HTPPSSPTPPP---STYSLPTKKRITALQPHLHH 82
           S  S +P    K SR+S         H PP  PTPPP   S Y+LP+    +  QP  ++
Sbjct: 353 SFSSSVPMPDSKYSRHSISSPSGPSRHAPPLPPTPPPYASSPYNLPSSTNTSVSQPAPYN 532

Query: 83  HNNIPNDAVPLIDLNVEYSPSLPSATPI 110
              I N    L    + +S +  SA P+
Sbjct: 533 QAGIGN--TELSXAXIAHSGARLSAYPL 610


>TC76771 homologue to SP|Q42961|PGKH_TOBAC Phosphoglycerate kinase
           chloroplast precursor (EC 2.7.2.3). [Common tobacco]
           {Nicotiana tabacum}, partial (87%)
          Length = 1708

 Score = 34.3 bits (77), Expect = 0.066
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 5/47 (10%)
 Frame = -1

Query: 78  PHLHHHNNIPNDAVPLIDL-----NVEYSPSLPSATPIEKQSQKQDI 119
           P + HH+N+P+ + PL  L      +  SPS P A P E Q Q+  +
Sbjct: 316 PSIQHHSNLPHSSSPLTPLLLLLSRLLSSPSTPPAPPTEGQLQRTQV 176


>TC80666 similar to PIR|E85112|E85112 probable methyltransferase [imported]
           - Arabidopsis thaliana, partial (39%)
          Length = 1616

 Score = 34.3 bits (77), Expect = 0.066
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
 Frame = +3

Query: 48  SSLFHTP--PSSPTPPPSTYSLPTKKRITAL--QPHLHHHNNIPNDAVPLIDLNVE 99
           SS  H P  P+ P PPP T   P+K  +  L   P  +++NN+ N  + L+  N E
Sbjct: 348 SSFNHRPFAPTPPLPPPLTNFNPSKSSLQQLPQNPFNNNNNNLQNPKISLVTKNPE 515


>BG453645 
          Length = 622

 Score = 34.3 bits (77), Expect = 0.066
 Identities = 25/75 (33%), Positives = 35/75 (46%), Gaps = 10/75 (13%)
 Frame = -2

Query: 32  SHCSLLPTKKRKESRNSSLFHTPPSSPTPPPST-YSLPTKKRITALQP---------HLH 81
           + C + PT   K +  S L H PP +  P  S  +S PTK +     P         H H
Sbjct: 366 TEC*ISPTSISKTTP*SLLSHPPPHNLAP*KSNHFS*PTKPK**YATPTNGLLLPYLHSH 187

Query: 82  HHNNIPNDAVPLIDL 96
            H+NIPN+ + L +L
Sbjct: 186 LHSNIPNEQINLEEL 142


>TC78762 similar to GP|9294670|dbj|BAB03019.1 gene_id:F14O13.20~unknown
           protein {Arabidopsis thaliana}, partial (87%)
          Length = 976

 Score = 33.9 bits (76), Expect = 0.087
 Identities = 16/60 (26%), Positives = 27/60 (44%)
 Frame = +1

Query: 118 DIEFEVDDDEILCCVCHSTDANAEDPIVFCDGCNLMVHASCYGNPLVKQIPDGDWFCDQC 177
           D++  VD +E   C C+         +V CD  +  +    +G   +K+ P G W+C  C
Sbjct: 592 DLDLPVDPNEPTYCFCNQVSYGE---MVACDNPDCKIEWFHFGCVGLKEQPKGKWYCSSC 762


>BE997694 similar to GP|13646986|db DNA-binding protein DF1 {Pisum sativum},
           partial (19%)
          Length = 609

 Score = 33.9 bits (76), Expect = 0.087
 Identities = 28/132 (21%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
 Frame = +3

Query: 16  LRFIYQQQQQQEEQDL-----SHCSLLPTKKRKESRNSSLFHTPPSSPTPPPSTYSLPTK 70
           + F+ +  +QQE+Q+L     ++ +++P ++++  + +    TP  +PTP P+   LP  
Sbjct: 219 MAFLQKIAEQQEQQNLVPPVLNNSTIVP-QQQQAPQETIPTPTPKPTPTPTPTPVPLPAA 395

Query: 71  KRITALQ-PHLHHHNNIPNDAVPLIDLNVEYSPSLPSATPIEKQSQKQDIEFEVDDDEIL 129
                +  P +     + N  V +     + +  +P A+P+  Q Q+Q  + +V   +++
Sbjct: 396 AAPLPIPIPAIPTPQQVQNPTVTV----QQQTSVIPQASPL-PQHQQQQQQVQVQQQQVM 560

Query: 130 CCVCHSTDANAE 141
                 +D N E
Sbjct: 561 NMEVAKSDNNGE 596


>TC88622 weakly similar to GP|3336892|emb|CAA12389.1 Hsp20.0 protein
           {Lycopersicon peruvianum}, partial (64%)
          Length = 1032

 Score = 32.3 bits (72), Expect = 0.25
 Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 14/103 (13%)
 Frame = +2

Query: 48  SSLFHTPPSSPTPPPSTYSLPTKKR--ITALQPHLHHHNNIP---NDAVPLIDLNVE--- 99
           +S+F    S P     T+  P+ +       Q +  HH   P   N+  P+I+  +E   
Sbjct: 65  NSIFGRRRSEPKDHHQTWHHPSYQNHGYGISQTNTPHHITPPPFHNEPSPIINTQIEWKE 244

Query: 100 ------YSPSLPSATPIEKQSQKQDIEFEVDDDEILCCVCHST 136
                 Y   LP         ++ D+  EVD+D +LC +C  +
Sbjct: 245 THEAHIYKAHLPGL-------KRSDVRVEVDEDRVLCIICEKS 352


>TC81171 similar to GP|9795609|gb|AAF98427.1| Unknown protein {Arabidopsis
           thaliana}, partial (40%)
          Length = 932

 Score = 32.3 bits (72), Expect = 0.25
 Identities = 31/114 (27%), Positives = 43/114 (37%)
 Frame = +3

Query: 26  QEEQDLSHCSLLPTKKRKESRNSSLFHTPPSSPTPPPSTYSLPTKKRITALQPHLHHHNN 85
           QEEQ  S  SL   K +    N S       +  P P   SLP+ K  T    H H+HN+
Sbjct: 36  QEEQKSSLTSLPSPKTQSNGHNHS------HNQIPSPRPISLPSPKTQTQSNGHNHNHNH 197

Query: 86  IPNDAVPLIDLNVEYSPSLPSATPIEKQSQKQDIEFEVDDDEILCCVCHSTDAN 139
               +   I  +   +P   +    +  S KQ ++      E       ST AN
Sbjct: 198 NQIPSPRPITRSEPGNPYPTTFVQADTTSFKQVVQMLTGSSETAKQASTSTKAN 359


>TC87476 homologue to PIR|T11622|T11622 extensin class 1 precursor - cowpea,
           partial (48%)
          Length = 939

 Score = 32.0 bits (71), Expect = 0.33
 Identities = 20/62 (32%), Positives = 28/62 (44%)
 Frame = +3

Query: 48  SSLFHTPPSSPTPPPSTYSLPTKKRITALQPHLHHHNNIPNDAVPLIDLNVEYSPSLPSA 107
           S+  H PPS   PPP  Y  P     +   P+++     P+ + P     V  SP  PSA
Sbjct: 351 STSHHRPPSPSPPPPYVYKSPPPPSPSPPPPYVYKSPPPPSPSPP--PPYVYKSPPPPSA 524

Query: 108 TP 109
           +P
Sbjct: 525 SP 530


>TC93132 
          Length = 638

 Score = 32.0 bits (71), Expect = 0.33
 Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 10/91 (10%)
 Frame = +3

Query: 24  QQQEEQDLSHCSLLPTKKRKESRNSSLFHTPPSSPT----------PPPSTYSLPTKKRI 73
           Q  EE+++S  S L  +K  +      +HTPP+SP+          PP     L    R 
Sbjct: 156 QHTEEKEVS--SDLERRKEVQEEEHYYYHTPPTSPSKNSFDLVCPPPPKKRQRLAVTTRR 329

Query: 74  TALQPHLHHHNNIPNDAVPLIDLNVEYSPSL 104
           T+ Q        +P+D   +  L  + S  L
Sbjct: 330 TSTQSQERKFFQVPDDLTSIFLLRTKPSHQL 422


>TC86323 weakly similar to GP|15450363|gb|AAK96475.1 At1g16840/F17F16.27
           {Arabidopsis thaliana}, partial (28%)
          Length = 1069

 Score = 31.6 bits (70), Expect = 0.43
 Identities = 22/63 (34%), Positives = 29/63 (45%), Gaps = 8/63 (12%)
 Frame = +1

Query: 31  LSHCSLLPTKKRKESRNS---SLFHTPPSSP---TPPPST--YSLPTKKRITALQPHLHH 82
           LSH    PT K + +R     S  +  PSSP   T PPS   +  P +    +   H HH
Sbjct: 82  LSHTIFSPTTKTQPNRQPWPVSSRNPKPSSPIHQTSPPSVTVHHAPQENSTKSTHHHHHH 261

Query: 83  HNN 85
           HN+
Sbjct: 262 HNH 270


>BQ751144 weakly similar to PIR|PQ0479|PQ04 pistil extensin-like protein
           (clone pMG14) - common tobacco (fragment), partial (11%)
          Length = 632

 Score = 31.6 bits (70), Expect = 0.43
 Identities = 25/69 (36%), Positives = 30/69 (43%), Gaps = 4/69 (5%)
 Frame = +1

Query: 45  SRNSSLFHTPPSSPT----PPPSTYSLPTKKRITALQPHLHHHNNIPNDAVPLIDLNVEY 100
           +R SSLF  PP SP+    PPPST S                    P   +PL+ L  E 
Sbjct: 295 TRPSSLFSPPPPSPSRSRPPPPSTPS--------------------PLLPLPLLPLATEP 414

Query: 101 SPSLPSATP 109
           SP  PS+ P
Sbjct: 415 SPFPPSSPP 441


  Database: MTGI
    Posted date:  Oct 22, 2004  3:39 PM
  Number of letters in database: 27,044,181
  Number of sequences in database:  36,976
  
Lambda     K      H
   0.319    0.137    0.445 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,298,139
Number of Sequences: 36976
Number of extensions: 288655
Number of successful extensions: 3449
Number of sequences better than 10.0: 141
Number of HSP's better than 10.0 without gapping: 2929
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3297
length of query: 322
length of database: 9,014,727
effective HSP length: 96
effective length of query: 226
effective length of database: 5,465,031
effective search space: 1235097006
effective search space used: 1235097006
frameshift window, decay const: 50,  0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.9 bits)


Medicago: description of AC147714.16