Miyakogusa Predicted Gene

Lj4g3v1687540.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v1687540.1 Non Characterized Hit- tr|I1JZK0|I1JZK0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.10388
PE,62.86,0,coiled-coil,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.49618.1
         (1640 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G07940.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   250   7e-66
AT5G07940.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   250   7e-66
AT5G07940.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   250   7e-66
AT5G07970.1 | Symbols:  | dentin sialophosphoprotein-related | c...   225   2e-58
AT5G07980.1 | Symbols:  | dentin sialophosphoprotein-related | c...   197   5e-50
AT3G29385.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   111   5e-24

>AT5G07940.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
            INVOLVED IN: biological_process unknown; LOCATED IN:
            cellular_component unknown; EXPRESSED IN: pollen tube;
            BEST Arabidopsis thaliana protein match is: dentin
            sialophosphoprotein-related (TAIR:AT5G07980.1). |
            chr5:2534720-2540086 FORWARD LENGTH=1526
          Length = 1526

 Score =  250 bits (638), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 203/360 (56%), Gaps = 29/360 (8%)

Query: 1287 VTNSNHVTSVRSEPSMINPQMAPSWFEQYGTFKNGKMLPMYDARTMIPQKNLDQPFFVKN 1346
             ++ NH  SVR++   I+PQMAPSW+ QYGTFKNG + PM D     P K  +Q      
Sbjct: 1190 FSSKNHAASVRADHQQISPQMAPSWYSQYGTFKNGLVQPMNDTGRFTPLKIGEQ------ 1243

Query: 1347 RSGSLNLGNSMEQVNSLNDASQLGNARQSPVSTPIASELVPSQLLPPAVEPDLLMRL--- 1403
               S N+ +S++  +++    Q     Q   S P         LL  A   D L+++   
Sbjct: 1244 ---SSNVESSVDGTHTVQSCKQCL-MEQMSGSAPGVETPSSDSLLHGAT--DKLLKVDKP 1297

Query: 1404 KKRKRVTTELMPWHKELKQGSERLRDISAAELGWAQASNRLIEKVEKDAELFEDLPTIKS 1463
            KKRK  T+EL  W+KE+ Q S+RL+ +S AE+ WA+ +NR  EKVE +  L ED P I+S
Sbjct: 1298 KKRKTATSELQSWNKEVMQDSQRLKTLSEAEINWARETNRFAEKVEFET-LLEDSPPIRS 1356

Query: 1464 KRRXXXXXXXXXXXXNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVP 1523
            KRR            +PPPA V+S     ++D V Y+  R  LGDACSS S   S+   P
Sbjct: 1357 KRRLIHTTQLMQQLFSPPPARVISLVASSNYDVVAYTAGRAALGDACSSSSTDRSEGFSP 1416

Query: 1524 PGSQNHLPDTLKPSEKIDQCISK-VEDFVGRARKLENDILRLDSRASILDLRVELQDLER 1582
            P + N L +  +  +  DQ ISK  EDF+ R RKLE D   L++  +I DLRVE+QDLE+
Sbjct: 1417 PNNSNPLSERTENEKISDQYISKAAEDFISRTRKLETDFAGLENGTTIPDLRVEVQDLEK 1476

Query: 1583 FSVINRFAKFHGHGRGQKDGAETSSSSDTTAQ--KAYPQKYVTAVPMPKNLPDRVQCLSL 1640
            F+VINRFAKFH            SSS + T    K   Q+YVT  PMP+N+PDRVQCLSL
Sbjct: 1477 FAVINRFAKFH----------PPSSSMNRTVNSLKLNLQRYVTIAPMPQNIPDRVQCLSL 1526



 Score =  184 bits (468), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 135/386 (34%), Positives = 185/386 (47%), Gaps = 31/386 (8%)

Query: 56  RTDAAESPVNYDFFGGQQHISGRHPGMLQSFPRQQSGIXXXXXXXXXXXXXXXXXXXXXX 115
           R +  ESPVNYDFFGGQQ  + +  GMLQ  PRQQ                         
Sbjct: 149 RLEMGESPVNYDFFGGQQQSNTQLSGMLQPLPRQQMTFNDMQLLKQQVMVKQMHEYQMQQ 208

Query: 116 XXXXXXXXXXSSMTPPASSISKQTVASHSASLINGIPINEASNHLWQPEVMAASANWLQR 175
                        +   ++++    +   + +INGIP+  AS++ +QP++M  + NW+ R
Sbjct: 209 QLQKQQLEARQLNSLNRNAVNGSCASDTQSRMINGIPLQNASSNWFQPDLMTGNTNWMHR 268

Query: 176 GASPVMQGSPNGFVLSPEQMRLMGLVPNQGDQSLYGLPISVSRGTPSLYSHVQPDKPAAS 235
           G SP +QGS +G +++PE  +   L+  Q   SLYG+P+S   GT         + P   
Sbjct: 269 GISPAVQGSSSGLMITPEHGQ-SNLMAQQFGPSLYGMPVS---GT---------NAP--- 312

Query: 236 QVSIQNQYSHVQGDKQAVPHISTGGNSFPAHHYPGISDQMNSNDGTSVSRQDIQGKIMFG 295
               QN +S VQ ++ A PH S   +    +      +Q +  D     R   Q K +F 
Sbjct: 313 ----QNAFSSVQMNRLAAPHGSANRSYSLTNQPTSFLNQGDVQDSQMHPRSTYQEKALFS 368

Query: 296 SIAQ-GMNSGLNMENLQQVNSEQRDIPMEDFHGRQALAGSSEASQDKX-XXXXXXXXXXX 353
             +    N+  N EN QQ +S +R+I  +D   +   +G +E S  K             
Sbjct: 369 QTSVPDSNNRPNFENFQQDDSRERNISAQDKFCQMEDSGPAEKSFMKVPENMNALQKSSA 428

Query: 354 LDPTEEKILFGSDDNLWDGFGRNTGFS-----MLDGSDSLSGFPSLQSGSWSALMQSAVA 408
           LDPTEEKILFGSDDNLWD FG +T  S     M   SD     PSLQSGSWSALMQSAVA
Sbjct: 429 LDPTEEKILFGSDDNLWDAFGSSTDMSLQGNLMSSNSDLFDACPSLQSGSWSALMQSAVA 488

Query: 409 ETSSSEMGIQEEWSGLSFRNTEPSGN 434
           ET+S + G+     G    NT P  N
Sbjct: 489 ETTSDDAGVH----GWVNSNTVPHAN 510


>AT5G07940.3 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
            INVOLVED IN: biological_process unknown; LOCATED IN:
            cellular_component unknown; EXPRESSED IN: pollen tube;
            BEST Arabidopsis thaliana protein match is: dentin
            sialophosphoprotein-related (TAIR:AT5G07980.1). |
            chr5:2534720-2540086 FORWARD LENGTH=1526
          Length = 1526

 Score =  250 bits (638), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 203/360 (56%), Gaps = 29/360 (8%)

Query: 1287 VTNSNHVTSVRSEPSMINPQMAPSWFEQYGTFKNGKMLPMYDARTMIPQKNLDQPFFVKN 1346
             ++ NH  SVR++   I+PQMAPSW+ QYGTFKNG + PM D     P K  +Q      
Sbjct: 1190 FSSKNHAASVRADHQQISPQMAPSWYSQYGTFKNGLVQPMNDTGRFTPLKIGEQ------ 1243

Query: 1347 RSGSLNLGNSMEQVNSLNDASQLGNARQSPVSTPIASELVPSQLLPPAVEPDLLMRL--- 1403
               S N+ +S++  +++    Q     Q   S P         LL  A   D L+++   
Sbjct: 1244 ---SSNVESSVDGTHTVQSCKQCL-MEQMSGSAPGVETPSSDSLLHGAT--DKLLKVDKP 1297

Query: 1404 KKRKRVTTELMPWHKELKQGSERLRDISAAELGWAQASNRLIEKVEKDAELFEDLPTIKS 1463
            KKRK  T+EL  W+KE+ Q S+RL+ +S AE+ WA+ +NR  EKVE +  L ED P I+S
Sbjct: 1298 KKRKTATSELQSWNKEVMQDSQRLKTLSEAEINWARETNRFAEKVEFET-LLEDSPPIRS 1356

Query: 1464 KRRXXXXXXXXXXXXNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVP 1523
            KRR            +PPPA V+S     ++D V Y+  R  LGDACSS S   S+   P
Sbjct: 1357 KRRLIHTTQLMQQLFSPPPARVISLVASSNYDVVAYTAGRAALGDACSSSSTDRSEGFSP 1416

Query: 1524 PGSQNHLPDTLKPSEKIDQCISK-VEDFVGRARKLENDILRLDSRASILDLRVELQDLER 1582
            P + N L +  +  +  DQ ISK  EDF+ R RKLE D   L++  +I DLRVE+QDLE+
Sbjct: 1417 PNNSNPLSERTENEKISDQYISKAAEDFISRTRKLETDFAGLENGTTIPDLRVEVQDLEK 1476

Query: 1583 FSVINRFAKFHGHGRGQKDGAETSSSSDTTAQ--KAYPQKYVTAVPMPKNLPDRVQCLSL 1640
            F+VINRFAKFH            SSS + T    K   Q+YVT  PMP+N+PDRVQCLSL
Sbjct: 1477 FAVINRFAKFH----------PPSSSMNRTVNSLKLNLQRYVTIAPMPQNIPDRVQCLSL 1526



 Score =  184 bits (468), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 135/386 (34%), Positives = 185/386 (47%), Gaps = 31/386 (8%)

Query: 56  RTDAAESPVNYDFFGGQQHISGRHPGMLQSFPRQQSGIXXXXXXXXXXXXXXXXXXXXXX 115
           R +  ESPVNYDFFGGQQ  + +  GMLQ  PRQQ                         
Sbjct: 149 RLEMGESPVNYDFFGGQQQSNTQLSGMLQPLPRQQMTFNDMQLLKQQVMVKQMHEYQMQQ 208

Query: 116 XXXXXXXXXXSSMTPPASSISKQTVASHSASLINGIPINEASNHLWQPEVMAASANWLQR 175
                        +   ++++    +   + +INGIP+  AS++ +QP++M  + NW+ R
Sbjct: 209 QLQKQQLEARQLNSLNRNAVNGSCASDTQSRMINGIPLQNASSNWFQPDLMTGNTNWMHR 268

Query: 176 GASPVMQGSPNGFVLSPEQMRLMGLVPNQGDQSLYGLPISVSRGTPSLYSHVQPDKPAAS 235
           G SP +QGS +G +++PE  +   L+  Q   SLYG+P+S   GT         + P   
Sbjct: 269 GISPAVQGSSSGLMITPEHGQ-SNLMAQQFGPSLYGMPVS---GT---------NAP--- 312

Query: 236 QVSIQNQYSHVQGDKQAVPHISTGGNSFPAHHYPGISDQMNSNDGTSVSRQDIQGKIMFG 295
               QN +S VQ ++ A PH S   +    +      +Q +  D     R   Q K +F 
Sbjct: 313 ----QNAFSSVQMNRLAAPHGSANRSYSLTNQPTSFLNQGDVQDSQMHPRSTYQEKALFS 368

Query: 296 SIAQ-GMNSGLNMENLQQVNSEQRDIPMEDFHGRQALAGSSEASQDKX-XXXXXXXXXXX 353
             +    N+  N EN QQ +S +R+I  +D   +   +G +E S  K             
Sbjct: 369 QTSVPDSNNRPNFENFQQDDSRERNISAQDKFCQMEDSGPAEKSFMKVPENMNALQKSSA 428

Query: 354 LDPTEEKILFGSDDNLWDGFGRNTGFS-----MLDGSDSLSGFPSLQSGSWSALMQSAVA 408
           LDPTEEKILFGSDDNLWD FG +T  S     M   SD     PSLQSGSWSALMQSAVA
Sbjct: 429 LDPTEEKILFGSDDNLWDAFGSSTDMSLQGNLMSSNSDLFDACPSLQSGSWSALMQSAVA 488

Query: 409 ETSSSEMGIQEEWSGLSFRNTEPSGN 434
           ET+S + G+     G    NT P  N
Sbjct: 489 ETTSDDAGVH----GWVNSNTVPHAN 510


>AT5G07940.1 | Symbols:  | BEST Arabidopsis thaliana protein match is:
            dentin sialophosphoprotein-related (TAIR:AT5G07980.1);
            Has 1906 Blast hits to 1127 proteins in 203 species:
            Archae - 2; Bacteria - 210; Metazoa - 401; Fungi - 205;
            Plants - 136; Viruses - 0; Other Eukaryotes - 952
            (source: NCBI BLink). | chr5:2534720-2540086 FORWARD
            LENGTH=1526
          Length = 1526

 Score =  250 bits (638), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 203/360 (56%), Gaps = 29/360 (8%)

Query: 1287 VTNSNHVTSVRSEPSMINPQMAPSWFEQYGTFKNGKMLPMYDARTMIPQKNLDQPFFVKN 1346
             ++ NH  SVR++   I+PQMAPSW+ QYGTFKNG + PM D     P K  +Q      
Sbjct: 1190 FSSKNHAASVRADHQQISPQMAPSWYSQYGTFKNGLVQPMNDTGRFTPLKIGEQ------ 1243

Query: 1347 RSGSLNLGNSMEQVNSLNDASQLGNARQSPVSTPIASELVPSQLLPPAVEPDLLMRL--- 1403
               S N+ +S++  +++    Q     Q   S P         LL  A   D L+++   
Sbjct: 1244 ---SSNVESSVDGTHTVQSCKQCL-MEQMSGSAPGVETPSSDSLLHGAT--DKLLKVDKP 1297

Query: 1404 KKRKRVTTELMPWHKELKQGSERLRDISAAELGWAQASNRLIEKVEKDAELFEDLPTIKS 1463
            KKRK  T+EL  W+KE+ Q S+RL+ +S AE+ WA+ +NR  EKVE +  L ED P I+S
Sbjct: 1298 KKRKTATSELQSWNKEVMQDSQRLKTLSEAEINWARETNRFAEKVEFET-LLEDSPPIRS 1356

Query: 1464 KRRXXXXXXXXXXXXNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVP 1523
            KRR            +PPPA V+S     ++D V Y+  R  LGDACSS S   S+   P
Sbjct: 1357 KRRLIHTTQLMQQLFSPPPARVISLVASSNYDVVAYTAGRAALGDACSSSSTDRSEGFSP 1416

Query: 1524 PGSQNHLPDTLKPSEKIDQCISK-VEDFVGRARKLENDILRLDSRASILDLRVELQDLER 1582
            P + N L +  +  +  DQ ISK  EDF+ R RKLE D   L++  +I DLRVE+QDLE+
Sbjct: 1417 PNNSNPLSERTENEKISDQYISKAAEDFISRTRKLETDFAGLENGTTIPDLRVEVQDLEK 1476

Query: 1583 FSVINRFAKFHGHGRGQKDGAETSSSSDTTAQ--KAYPQKYVTAVPMPKNLPDRVQCLSL 1640
            F+VINRFAKFH            SSS + T    K   Q+YVT  PMP+N+PDRVQCLSL
Sbjct: 1477 FAVINRFAKFH----------PPSSSMNRTVNSLKLNLQRYVTIAPMPQNIPDRVQCLSL 1526



 Score =  184 bits (468), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 135/386 (34%), Positives = 185/386 (47%), Gaps = 31/386 (8%)

Query: 56  RTDAAESPVNYDFFGGQQHISGRHPGMLQSFPRQQSGIXXXXXXXXXXXXXXXXXXXXXX 115
           R +  ESPVNYDFFGGQQ  + +  GMLQ  PRQQ                         
Sbjct: 149 RLEMGESPVNYDFFGGQQQSNTQLSGMLQPLPRQQMTFNDMQLLKQQVMVKQMHEYQMQQ 208

Query: 116 XXXXXXXXXXSSMTPPASSISKQTVASHSASLINGIPINEASNHLWQPEVMAASANWLQR 175
                        +   ++++    +   + +INGIP+  AS++ +QP++M  + NW+ R
Sbjct: 209 QLQKQQLEARQLNSLNRNAVNGSCASDTQSRMINGIPLQNASSNWFQPDLMTGNTNWMHR 268

Query: 176 GASPVMQGSPNGFVLSPEQMRLMGLVPNQGDQSLYGLPISVSRGTPSLYSHVQPDKPAAS 235
           G SP +QGS +G +++PE  +   L+  Q   SLYG+P+S   GT         + P   
Sbjct: 269 GISPAVQGSSSGLMITPEHGQ-SNLMAQQFGPSLYGMPVS---GT---------NAP--- 312

Query: 236 QVSIQNQYSHVQGDKQAVPHISTGGNSFPAHHYPGISDQMNSNDGTSVSRQDIQGKIMFG 295
               QN +S VQ ++ A PH S   +    +      +Q +  D     R   Q K +F 
Sbjct: 313 ----QNAFSSVQMNRLAAPHGSANRSYSLTNQPTSFLNQGDVQDSQMHPRSTYQEKALFS 368

Query: 296 SIAQ-GMNSGLNMENLQQVNSEQRDIPMEDFHGRQALAGSSEASQDKX-XXXXXXXXXXX 353
             +    N+  N EN QQ +S +R+I  +D   +   +G +E S  K             
Sbjct: 369 QTSVPDSNNRPNFENFQQDDSRERNISAQDKFCQMEDSGPAEKSFMKVPENMNALQKSSA 428

Query: 354 LDPTEEKILFGSDDNLWDGFGRNTGFS-----MLDGSDSLSGFPSLQSGSWSALMQSAVA 408
           LDPTEEKILFGSDDNLWD FG +T  S     M   SD     PSLQSGSWSALMQSAVA
Sbjct: 429 LDPTEEKILFGSDDNLWDAFGSSTDMSLQGNLMSSNSDLFDACPSLQSGSWSALMQSAVA 488

Query: 409 ETSSSEMGIQEEWSGLSFRNTEPSGN 434
           ET+S + G+     G    NT P  N
Sbjct: 489 ETTSDDAGVH----GWVNSNTVPHAN 510


>AT5G07970.1 | Symbols:  | dentin sialophosphoprotein-related |
            chr5:2544126-2547916 REVERSE LENGTH=1097
          Length = 1097

 Score =  225 bits (574), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 159/464 (34%), Positives = 230/464 (49%), Gaps = 52/464 (11%)

Query: 1189 KCMPDASQSSPAATYRDIEDFGRSLRPNTLLHQSMKNMDFNPSNQEQQLDSSRGQPSYGY 1248
            K M ++++   +      + F     P +L   + +++   P  +E Q       PS   
Sbjct: 674  KVMTESNEMGNSGKENSSDSFRSKFSPESLTQVNARDLSVLPGGKETQ------SPSRS- 726

Query: 1249 NNMVKDRLGDNSSVPCDGRDTNATSQEVIGYGQKNALHVTNSNHVTSVRSEPSMINPQMA 1308
            + +++D L    S  C           ++ +G   +    N NH  S  S+   I+PQ+A
Sbjct: 727  DGLIRDGLNHKDSANC-----------MLQFGPTISQSFFNKNHAVSAGSDHQQISPQIA 775

Query: 1309 PSWFEQYGTFKNGKMLPMYDARTMIPQKNLDQPFFVKNRSGSLNLGNSMEQVNSLNDASQ 1368
            PS F QY  FKNG + P+ D       K       +  R    NLGNS + ++S+  + Q
Sbjct: 776  PSRFSQYEAFKNGLVQPVNDTGRFTLLK-------IGERYS--NLGNSDDGLHSVQSSKQ 826

Query: 1369 LGNA--------RQSPVSTPIASELVPSQLLPPAVEPDLLMRL---KKRKRVTTELMPWH 1417
            L  A        +Q   STP    L  + L  P    D L+++   KKRK VT+EL+ W 
Sbjct: 827  LNTADPGYIVHMQQISGSTPGVETLSSASL--PCGATDQLLKVYKPKKRKNVTSELLSWS 884

Query: 1418 KELKQGSERLRDISAAELGWAQASNRLIEKVEKDAELFEDLPTIKSKRRXXXXXXXXXXX 1477
            KE+ Q  +RL+ +  AE+ WA+A+NR  EKVE  A L ED P I+SKRR           
Sbjct: 885  KEVMQRPQRLKTLGEAEVDWARATNRFAEKVEF-ATLLEDGPPIRSKRRLIYTTQLMQQL 943

Query: 1478 XNPPPAAVLSADVKLHHDSVVYSVARLVLGDACSSVSLCGSDTLVPPGSQNHLPDTLKPS 1537
              P P  V S  +   ++ V YS AR  LGDACSS S    +  +   + N L +  +  
Sbjct: 944  FRPLPGRVKS--LVTSYEFVAYSAARAALGDACSSTSTDRIEGFLLQNNLNPLSERTETE 1001

Query: 1538 EKIDQCISKV-EDFVGRARKLENDILRLDSRASILDLRVELQDLERFSVINRFAKFHGHG 1596
            +  DQ ISK  EDF+ R +KLE D   L+   +I DLRVE+QDLERF+VINRFA FH   
Sbjct: 1002 KMSDQYISKAAEDFISRTKKLETDFAGLEKGTTITDLRVEVQDLERFAVINRFASFH--- 1058

Query: 1597 RGQKDGAETSSSSDTTAQKAYPQKYVTAVPMPKNLPDRVQCLSL 1640
                  + +S     ++ +  PQ+YVT  P+P+++PDRVQCLS 
Sbjct: 1059 -----QSSSSMDRSVSSLRLNPQRYVTVAPVPRHIPDRVQCLSF 1097



 Score =  194 bits (492), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 177/563 (31%), Positives = 251/563 (44%), Gaps = 97/563 (17%)

Query: 1   MQGHQIFQSRHNEANNILGMDTEADLHGMSGLSRGMSMLESQQGAGLDHYKKNLTRTDAA 60
           M G+ + Q+  NE +  +G+D E+    +SG            G  LD +K  + R D  
Sbjct: 105 MHGNLMLQASPNEGS-FVGVDVESSRDRLSG-----------SGFTLDRHKTPM-RFDMG 151

Query: 61  ESPVNYDFFGGQQHISGRHPGMLQSFPRQQSGIXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
           ESPVNYDFFGGQQ ++ + PGM+Q FPRQQ                              
Sbjct: 152 ESPVNYDFFGGQQQLNNQLPGMIQPFPRQQMTFNDMQLLKQHAMAKQMHEYQIQQQLQKQ 211

Query: 121 XXXXXSSMTPPASSISKQTVA-SHSASLINGIPINEASNHLWQPEVMAASANWLQRGASP 179
                   +  +++++    + + S   I+G+P+ +ASN+  QP++M  + NW+ RG SP
Sbjct: 212 QLEARQLNSLHSNAVNGSLSSDNQSHPSISGVPLQDASNNWLQPDLMTGNTNWMHRGISP 271

Query: 180 VMQGSPNGFVLSPEQMRLMGLVPNQGDQSLYGLPISVSRGTPSLYSHVQPDKPAASQVSI 239
           ++Q S +G V++PE      L+  Q + SLYG+P+    GT         D P       
Sbjct: 272 IVQSSSSGLVITPEHGH-ANLMAQQFETSLYGMPVG---GT---------DAP------- 311

Query: 240 QNQYSHVQGDKQAVPHISTGGNSFPAHHYPGISDQMNSNDGTSVSRQDIQGKIMFGSIAQ 299
           QN +S  Q    A  H S   +S   +        +N +D   + R   Q  +       
Sbjct: 312 QNAFSSFQMKMLAAQHGSANMSSSLTNQPTSF---LNQSDSHMLPRSTYQENLYSHISVP 368

Query: 300 GMNSGLNMENLQQVNSEQRDIPMEDFHGRQALAGSSEASQDKX-XXXXXXXXXXXLDPTE 358
           G N   N E+ QQ NS Q++I  ++  G+   +G SE S  K             LDPTE
Sbjct: 369 GSNDRPNFESFQQDNSGQQNISGQEEFGQMDGSGLSEKSFMKVPENINTLQKSTTLDPTE 428

Query: 359 EKILFGSDDNLWDGFGRNTGFS-----MLDGSDSLSGFPSLQSGSWSALMQSAVAETSSS 413
           EKILFGSDDNLW+ FG +T  S     M   SD     PSLQSGSWSALMQSAVAET+S 
Sbjct: 429 EKILFGSDDNLWEAFGNSTDMSLTGNLMSSSSDLFDACPSLQSGSWSALMQSAVAETASD 488

Query: 414 EMGIQEEWSGLSFRNTEPSGNERPSTIDNSKEQSLWANNNSQSAPNINARPFPQQDDVSR 473
           + G+  EW                     SK+QS+WAN       NINA P P     SR
Sbjct: 489 DAGVH-EWG--------------------SKQQSVWAN-------NINA-PHPD----SR 515

Query: 474 PSTTVNYSGHPGFHQPGADAAHEQHGRLHTDSPQRSIPQFLERGKWL-DCNPQQKPVAEG 532
                  SG                   HTDS + ++    ++G  + D    +KP+   
Sbjct: 516 IGNRAQVSGG------------------HTDSTRSTVQHLQDKGNIVSDHGLLEKPMTPQ 557

Query: 533 GHIYGNAAE--SSGLEANEKAIS 553
             + GN  +  SSG++    + S
Sbjct: 558 SQMAGNMFQSLSSGIDVQNNSCS 580


>AT5G07980.1 | Symbols:  | dentin sialophosphoprotein-related |
           chr5:2549432-2554669 REVERSE LENGTH=1501
          Length = 1501

 Score =  197 bits (501), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 148/429 (34%), Positives = 210/429 (48%), Gaps = 41/429 (9%)

Query: 1   MQGHQIFQSRHNEANNILGMDTEADLHGMSGLSRGMSMLESQQGAGLDHYKKNLTRTDAA 60
           M G+   Q+  NEAN +LGMD E+    +S   RG +              K  TR +  
Sbjct: 105 MHGNLGLQTMPNEAN-VLGMDVESSRDKLS--ERGFT----------PDLHKIPTRFEMG 151

Query: 61  ESPVNYDFFGGQQHISGRHPGMLQSFPRQQSGIXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
           ESPVNYDFFGGQQ  + + PGMLQ  PRQQ                              
Sbjct: 152 ESPVNYDFFGGQQQSNTQLPGMLQPLPRQQVSFNDMQLLKQQVMVKQMHEYQMQQQLQKQ 211

Query: 121 XXXXXSSMTPPASSISKQTVASHSASLINGIPINEASNHLWQPEVMAASANWLQRGASPV 180
                   +   ++++   V+ + + +INGIP+  AS++  QP++M  + NW+ RG SP 
Sbjct: 212 RLEARQLNSLNRNAVNGSCVSDNQSHMINGIPLQNASSNWLQPDLMTGNTNWMHRGISPA 271

Query: 181 MQGSPNGFVLSPEQMRLMGLVPNQGDQSLYGLPISVSRGTPSLYSHVQPDKPAASQVSIQ 240
           +QGS +G +++P+  +   L+  Q + SLYG+P+S   GT + +                
Sbjct: 272 VQGSSSGLMITPDHGQ-ANLMAQQFEPSLYGMPVS---GTNAPH---------------- 311

Query: 241 NQYSHVQGDKQAVPHISTGGNSFPAHHYPGISDQMNSNDGTSVSRQDIQGKIMFGSIAQ- 299
           N +S  Q ++ A  H S    S   +      +Q +  D   + R     K++F   +  
Sbjct: 312 NAFSSSQMNRLAAQHGSANRTSSVTNQPTSFLNQGDVQDSHMLPRSTYPEKLLFSQTSVP 371

Query: 300 GMNSGLNMENLQQVNSEQRDIPMEDFHGRQALAGSSEASQDKX-XXXXXXXXXXXLDPTE 358
             NS  N E+LQ+ +S +R+I ++   G+   +G SE S  K             LDPTE
Sbjct: 372 SSNSMPNFESLQEDDSRERNISVQAKFGQMEGSGPSEQSFIKAPENINALQKSTALDPTE 431

Query: 359 EKILFGSDDNLWDGFGRNTGFS-----MLDGSDSLSGFPSLQSGSWSALMQSAVAETSSS 413
           EKILFGSDDNLW+ FG +T  S     M   SD   G PSLQSGSWSALMQSAVAETSS 
Sbjct: 432 EKILFGSDDNLWEAFGNSTDMSLTGNLMSSSSDLFDGCPSLQSGSWSALMQSAVAETSSD 491

Query: 414 EMGIQEEWS 422
           + G+  EW+
Sbjct: 492 DAGVH-EWA 499



 Score = 67.8 bits (164), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 84/268 (31%), Positives = 116/268 (43%), Gaps = 50/268 (18%)

Query: 764  RPSLTRKFQYHPMGDVGGDIEPQGNKHVINSQPMPHHPYGGLKGQDQSYPGQSNYGHSDG 823
            RPS+ RKFQYHPMG++    EP                    +G+   +    + G    
Sbjct: 693  RPSIPRKFQYHPMGNIDVTDEP-------------------CRGKVSRFGQSQSLGQPAM 733

Query: 824  NYMETEKGDRKSND-----SASKSALPSHIAKTLTPFDRSVGNYG-LNKAASHSQNILEL 877
            N + T+KG    ND      A K   P +   T    DRSV     +N A+S     LEL
Sbjct: 734  NTL-TDKGHVSQNDLNRTNKAFKGMGPENSPSTSASADRSVDRCNQVNSASSR----LEL 788

Query: 878  LHKVDQSREHGIATNTSTSNCHLSSRVMDTEYSDGPVVHPQRNQSSSSHGFGLQLAPPTQ 937
            LHKVD S E+   TN   +  H ++   D     G   H   NQ+S+S GF LQLAPP+Q
Sbjct: 789  LHKVDPSPENSSETN--VTGIHEANAFADY---GGQFRH---NQASASQGFNLQLAPPSQ 840

Query: 938  RLHMGSSHATPHVTSETVDRGPTW-LAATQTSASRESSHENRNNVSGSSGQSFDKASQYN 996
                      P   +    R     L +  T   +  + ++R    GS+ QSF +++   
Sbjct: 841  --------LAPSPDNMQFFRNSLQPLNSFHTGPEKGGTSQSRFAPWGSN-QSFHQSTHQG 891

Query: 997  ALGNIPQA--FTSGFPFSRIHSQSQNMA 1022
                I      TSGFP+SR + Q+Q MA
Sbjct: 892  PFPGILGGSNMTSGFPYSRGYHQNQQMA 919


>AT3G29385.1 | Symbols:  | BEST Arabidopsis thaliana protein match is:
            dentin sialophosphoprotein-related (TAIR:AT5G07980.1);
            Has 74 Blast hits to 74 proteins in 11 species: Archae -
            0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74;
            Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). |
            chr3:11284395-11285402 REVERSE LENGTH=218
          Length = 218

 Score =  111 bits (277), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 112/239 (46%), Gaps = 29/239 (12%)

Query: 1404 KKRKRVTTELMPWHKELKQGSERLRDISAAELGWAQASNRLIEKVEKDAELFEDLPTIKS 1463
            KKRK  T    PWHK   QGSE   +I  AE  W  A+N L EKV+ +  +        S
Sbjct: 7    KKRKSSTFLQSPWHKVYLQGSELCHNIRIAEQEWNLATNTLSEKVDTNEAISP------S 60

Query: 1464 KRRXXXXXXXXXXXXNPPPAAVLSAD-VKLHHDSVVYSVARLVLGDACSSVSLCGSDTLV 1522
            KRR             P P  V   D   L+++ V+Y V+R+ L ++CS    C SD   
Sbjct: 61   KRRLLSSTHLMQQLLQPAPTFVFLGDNAALNYEIVLYYVSRINLANSCSLK--CRSDLDK 118

Query: 1523 PPGSQNHLPDTLKPSEKIDQCISK-VEDFVGRARKLENDILRLDSRASILDLRVELQDLE 1581
                Q     T K +   DQ  S  V  F  + +KLE++   L+   SILD+  E+QDLE
Sbjct: 119  SINRQ-----TSKTASNQDQQHSLLVNAFNEKIQKLESNFQSLERTTSILDIIFEIQDLE 173

Query: 1582 RFSVINRFAKFHGHGRGQKDGAETSSSSDTTAQKAYPQKYVTAVPMPKNLPDRVQCLSL 1640
            RFS+IN   KFH   +              T ++  P KY  A+ MP NLP+ + CL L
Sbjct: 174  RFSMINHLGKFHNRAK--------------TFKRLIPHKYAVAIQMPMNLPEPLHCLPL 218