Miyakogusa Predicted Gene

Lj2g3v0343880.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v0343880.1 Non Characterized Hit- tr|C5X6M2|C5X6M2_SORBI
Putative uncharacterized protein Sb02g012570
OS=Sorghu,43.9,1e-16,SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.34504.1
         (469 letters)

Database: Medicago_aa4.0v1 
           62,319 sequences; 21,947,249 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Medtr6g045493.1 | hydroxyproline-rich glycoprotein family protei...   614   e-176
Medtr8g098495.1 | hydroxyproline-rich glycoprotein family protei...   211   2e-54
Medtr1g026560.1 | hydroxyproline-rich glycoprotein family protei...   126   4e-29

>Medtr6g045493.1 | hydroxyproline-rich glycoprotein family protein |
           HC | chr6:16415254-16411073 | 20130731
          Length = 487

 Score =  614 bits (1584), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 330/491 (67%), Positives = 369/491 (75%), Gaps = 27/491 (5%)

Query: 2   MGSLNN-SVDTVNXXXXXXXXXESRVQPAAVPKKRWXXXXXXXXXXXXQ-KSSKRIGHXX 59
           MGSLNN S+DTVN         ESRVQP + PKKRW              K+SKRIGH  
Sbjct: 1   MGSLNNNSIDTVNAAATAIVSAESRVQPTSSPKKRWGSCFSLPSCFGSHNKTSKRIGHAV 60

Query: 60  XXXXXXXXXXXXXXXXXQNPSTSILMPFIXXXXXXXXFLQSDPPSATHSPA-GLLSLTSL 118
                             NPST+I++PFI        FLQSDPPS+THSPA GLLSL+SL
Sbjct: 61  LVPEPVAPTVPVANAA-PNPSTAIVIPFIAPPSSPASFLQSDPPSSTHSPAAGLLSLSSL 119

Query: 119 AANAYXXXXXXXIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPESVQLTTPSSPEV 178
           +ANAY       +FTIGPYAYETQLVSPPVFSNFTTEPSTA+FTPPPESV +TTPSSPEV
Sbjct: 120 SANAYSTSGPASMFTIGPYAYETQLVSPPVFSNFTTEPSTANFTPPPESVLMTTPSSPEV 179

Query: 179 PFAQLLASSLDRARKSNGSQKFALYNYEFQPYQQYPGSPGGQLISPGSAFSTSGTSTPFP 238
           PFAQLLASSLDRARKSN   KFALYNYE+QPYQQYPGSPG QL+SPGS  STSGTSTPFP
Sbjct: 180 PFAQLLASSLDRARKSN--HKFALYNYEYQPYQQYPGSPGAQLVSPGSVISTSGTSTPFP 237

Query: 239 DRRP-------------------TRKWSSRMGSGSLTPESAGQGSRLGSGSLTPNGVGLA 279
           DRR                    TRKW SR+GSGSLTP+  GQGSRLGSGSLTP+GV   
Sbjct: 238 DRRSSLELRKGEAPKILGFEHFSTRKWMSRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHT 297

Query: 280 SRLGSGCVTPDGLGQDSRLGSGSLTPDGAGPSSQDRISVQNQFSGEASLANTENGIQSNS 339
           SRLGSGC TPDGLGQDSRLGSGSLTPDG GP+++D I VQNQ     S+AN+++G Q+N+
Sbjct: 298 SRLGSGCATPDGLGQDSRLGSGSLTPDGVGPTTRDSIDVQNQIPVGVSVANSDHGSQTNA 357

Query: 340 TLVDHRVSFELTGEDVARCLANKTGILLRNISRSSQGILAKDPIERDNIQRDSSSCCDVC 399
           TLVDHRVSFELTGEDVARCLANKTG LLRN+S SSQGILAKDPI+R+ I ++++SCCDVC
Sbjct: 358 TLVDHRVSFELTGEDVARCLANKTGALLRNMSSSSQGILAKDPIDREKILKETNSCCDVC 417

Query: 400 SGET-NDKQCCQKHHSVNSSSKEFNFDSRKGDVSGTAANSSEWWANKKVVGKESKSANSW 458
           SG+    + CC K +SV SSSKEFNFD+RKGDVSGT+AN S WW NKKV GKESKS NSW
Sbjct: 418 SGKAIGGEHCCPKRNSV-SSSKEFNFDNRKGDVSGTSANGSSWWTNKKVDGKESKSVNSW 476

Query: 459 AFFPMLQPEIS 469
           AFFPMLQP+IS
Sbjct: 477 AFFPMLQPDIS 487


>Medtr8g098495.1 | hydroxyproline-rich glycoprotein family protein |
           HC | chr8:41086006-41083003 | 20130731
          Length = 460

 Score =  211 bits (536), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 172/490 (35%), Positives = 236/490 (48%), Gaps = 69/490 (14%)

Query: 6   NNSVDTVNXXXXXXXXXESRVQP--AAVPKKRWXXXXXXXXXXXXQKSS-KRIGHX---- 58
           NN++DT+N           R+QP  A   KK+W            QK++ KRIGH     
Sbjct: 14  NNTLDTINAAAFAIASSHDRLQPSTATNQKKKWGNWLNITGCFGYQKNNRKRIGHAVLVP 73

Query: 59  XXXXXXXXXXXXXXXXXXQNPSTSILMPFIXXXXXXXXFLQSDPPSATHSPAGLLSLTSL 118
                             Q P  SI +PFI        F QS+PPS   SP G+LS TS+
Sbjct: 74  ETTPTGADAAANAVSSTAQAPP-SITLPFIAPPSSPASFFQSEPPSTAQSPVGILSKTSV 132

Query: 119 AANAYXXXXXXXIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPESVQLTTPSSPEV 178
           +A+ Y       IF IGPYA+ETQLVSPPVFS      STA FTPPPESV LTTPSSPEV
Sbjct: 133 SASMYSPGGPNSIFAIGPYAHETQLVSPPVFS----ASSTAPFTPPPESVHLTTPSSPEV 188

Query: 179 PFAQLLASSLDRARKSNGSQKFALYNYEFQPYQQYPGSPGGQLISPGSAFSTSGTSTPFP 238
           PFAQL  S+   +R S   Q+  + +Y+FQ YQ  PGSP G LISP SA   SGTS+P P
Sbjct: 189 PFAQLFDSN---SRNSETYQRLQISHYDFQNYQFQPGSPVGPLISPRSAI--SGTSSPLP 243

Query: 239 DRRPTRKWSSRMGSGSLTPESAGQGSRLGSGSLTPNGVGLASRLGSGCVTPDGLGQDSRL 298
           D         ++ +  L          L   S+        S   SG +TPD +      
Sbjct: 244 DEF------KKLDTAKLL--------HLDKLSIY---GKQKSSQSSGSITPDAV------ 280

Query: 299 GSGSLTPDGAGPSSQDRISVQNQFSGEASLANTENGIQSNSTLVDHRVSFELTGEDVARC 358
              + T  G  P         N +  +  ++   +    N T V+HRVSFEL+ +  +  
Sbjct: 281 -KATTTQAGFFP---------NHWVSDIKISPCPSNNHRNETSVNHRVSFELSAQKASSS 330

Query: 359 LANK-------TGIL--LRNISRSSQGILAKDP---IER--DNIQRDSSSCCDV-----C 399
           + NK       T +L   +N + ++     K+    IE   D+ Q  + +  D       
Sbjct: 331 VENKPPASSQWTKVLSKFKNDAAAAAKTTDKEENHSIENECDDKQVVTETLIDTTKQRKA 390

Query: 400 SGETNDKQCCQKHHSVNSSSKEFNFDSRKGDVSGTAANSSEWWANKKVVGKESKSANSWA 459
           +  T D++  Q     +SS+KEFNF + +G  S      ++WWAN+KV G E+ ++  W+
Sbjct: 391 AEATVDEKDHQSLTLSSSSTKEFNFANAEGGDSPAPNIVADWWANEKVAGNENAASKDWS 450

Query: 460 FFPMLQPEIS 469
           FFP++QP +S
Sbjct: 451 FFPIIQPHVS 460


>Medtr1g026560.1 | hydroxyproline-rich glycoprotein family protein,
           putative | HC | chr1:8668237-8665743 | 20130731
          Length = 484

 Score =  126 bits (317), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 111/207 (53%), Gaps = 15/207 (7%)

Query: 78  NPSTSILMPFIXXXXXXXXFLQSDPPSATHSPAGLLSLTSLAANAYXXXXXXXIFTIGPY 137
           N +T I    +        F  S  PS   SP+  LSL++ +           ++  GPY
Sbjct: 73  NQATGIAPSLLAPPSSPASFTHSALPSTAQSPSCFLSLSANSPGG----PSNSMYATGPY 128

Query: 138 AYETQLVSPPVFSNFTTEPSTASFTPPPESVQLTTPSSPEVPFAQLLASSLDRARKSNGS 197
           A+ETQLVSPPVFSNFTTEPSTA  TPPPE   LTTPSSP+VPFA  L SS +   K+ G 
Sbjct: 129 AHETQLVSPPVFSNFTTEPSTAPLTPPPELAHLTTPSSPDVPFAHFLTSSAN--LKNGGK 186

Query: 198 QKFALYNYEFQPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRRPTRKWSSRMGSGSLTP 257
             +   N     Y  YPGSP   LISP S  S    ST FP+R    +W S     SL P
Sbjct: 187 GNYITANDLQTTYSLYPGSPASSLISPISRNSGDCLSTSFPEREFRPQWDS-----SLYP 241

Query: 258 ESAGQGSRLGSGSLT---PNGVGLASR 281
           E+ G+  R GSG ++    N V +AS+
Sbjct: 242 EN-GKYQRTGSGRVSGHDTNDVTMASQ 267
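
The per-alignment statistics lines in the report above (Score, Expect, Identities, Positives, Gaps) can be read programmatically. The following is a minimal sketch in Python, not part of the BLAST output itself. The names `parse_evalue` and `parse_hsp_stats` are hypothetical helpers introduced for illustration. It assumes that an Expect value printed without a mantissa, such as `e-176`, stands for `1e-176`, and that the percentages in parentheses are truncated rather than rounded (consistent with `111/207 (53%)` in the third alignment).

```python
import re

# Matches e.g. "Score =  614 bits (1584), Expect = e-176,"
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)"
)
# Matches e.g. "Identities = 330/491", "Positives = 369/491", "Gaps = 27/491"
COUNTS_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert an Expect string to float.

    Assumption: a value printed as 'e-176' (no mantissa) means 1e-176.
    """
    text = text.strip()
    if text.startswith(("e", "E")):
        text = "1" + text
    return float(text)


def parse_hsp_stats(block):
    """Extract score, E-value and match counts from one alignment header."""
    match = STATS_RE.search(block)
    if match is None:
        raise ValueError("no Score/Expect line found")
    stats = {
        "bits": float(match.group("bits")),
        "raw_score": int(match.group("raw")),
        "evalue": parse_evalue(match.group("evalue")),
    }
    for name, num, den in COUNTS_RE.findall(block):
        stats[name.lower()] = (int(num), int(den))
    return stats


def percent(count_pair):
    """Truncated integer percentage, as the report appears to print it."""
    num, den = count_pair
    return 100 * num // den


# Header of the first alignment (Medtr6g045493.1), copied from the report.
example = (
    " Score =  614 bits (1584), Expect = e-176,   "
    "Method: Compositional matrix adjust.\n"
    " Identities = 330/491 (67%), Positives = 369/491 (75%), "
    "Gaps = 27/491 (5%)\n"
)
stats = parse_hsp_stats(example)
print(stats["bits"], stats["evalue"], percent(stats["identities"]))
```

For routine work, an established parser for BLAST text output is likely a safer choice than hand-written regular expressions; the sketch only illustrates how the fields in these two lines relate to each other.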