Miyakogusa Predicted Gene

Lj2g3v2365820.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v2365820.1 Non Characterized Hit- tr|I1JHJ0|I1JHJ0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.55659 PE,91.38,0,SMALL
NUCLEAR RIBONUCLEOPROTEIN,NULL; U1 SMALL NUCLEAR RIBONUCLEOPROTEIN
A/U2 SMALL NUCLEAR RIBONUCL,CUFF.38896.1
         (231 letters)

Database: Medicago_aa4.0v1 
           62,319 sequences; 21,947,249 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Medtr5g074430.1 | U1 small nuclear ribonucleoprotein | HC | chr5...   398   e-111
Medtr5g074430.2 | U1 small nuclear ribonucleoprotein | HC | chr5...   394   e-110
Medtr1g055405.1 | U2 small nuclear ribonucleoprotein B, putative...   275   3e-74
Medtr4g084990.2 | U1 small nuclear ribonucleoprotein | HC | chr4...   263   8e-71
Medtr4g084990.1 | U1 small nuclear ribonucleoprotein | HC | chr4...   262   2e-70
Medtr7g108715.1 | small nuclear ribonucleoprotein | HC | chr7:44...   150   8e-37
Medtr4g084990.3 | U1 small nuclear ribonucleoprotein | HC | chr4...   148   5e-36
Medtr3g072860.1 | RNA recognition motif protein | HC | chr3:3279...    95   6e-20
Medtr7g108725.1 | U1 small nuclear A-like ribonucleoprotein | HC...    93   3e-19
Medtr4g085040.1 | RNA recognition motif, a.k.a. RRM, RBD protein...    89   4e-18
Medtr4g085000.1 | small nuclear ribonucleoprotein | HC | chr4:33...    83   3e-16
Medtr3g091860.1 | RNA-binding protein with multiple splicing pro...    54   1e-07
Medtr3g091860.2 | RNA-binding protein with multiple splicing pro...    54   1e-07
Medtr6g034835.1 | RNA recognition motif, a.k.a. RRM, RBD protein...    51   8e-07
Medtr3g091860.3 | RNA-binding protein with multiple splicing pro...    51   9e-07
Medtr1g099190.1 | RNA-binding (RRM/RBD/RNP motif) family protein...    51   1e-06
Medtr3g104670.1 | RNA recognition motif, a.k.a. RRM, RBD protein...    50   1e-06

>Medtr5g074430.1 | U1 small nuclear ribonucleoprotein | HC |
           chr5:31637698-31631234 | 20130731
          Length = 233

 Score =  398 bits (1023), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 195/233 (83%), Positives = 202/233 (86%), Gaps = 2/233 (0%)

Query: 1   MLSGDIPPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFS 60
           MLSGDIPP+QTIYIKNLNEK+KKDELKRSLYCLFSQYGRILD++ALKTPKLRGQAWVCFS
Sbjct: 1   MLSGDIPPNQTIYIKNLNEKIKKDELKRSLYCLFSQYGRILDIIALKTPKLRGQAWVCFS 60

Query: 61  EVPSASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVPXXXXXXXXXXXXXX--X 118
           EV +ASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGS+VP                 
Sbjct: 61  EVTAASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSFVPREKKKKQEEKAEKKKYA 120

Query: 119 XXXTQQPAPANGTHGASNGGPTASFRPGVGAQEAAAPNNILFIENLPYETTGRMLEMLFE 178
               Q   P  GTHGASNGG TASFRPG GAQEAAAPNNILFIENLPYETTGRMLEMLFE
Sbjct: 121 DESKQSAVPNGGTHGASNGGSTASFRPGSGAQEAAAPNNILFIENLPYETTGRMLEMLFE 180

Query: 179 QYPGFKEVRLIEAKPGIAFVDFEDEVQSSMAMQALQGFKITPQNPMIITFAKK 231
           QYPGFKEVRLIEAKPGIAFVDFED+ QSSMAMQALQGFKITPQNPMII FAKK
Sbjct: 181 QYPGFKEVRLIEAKPGIAFVDFEDDGQSSMAMQALQGFKITPQNPMIINFAKK 233


>Medtr5g074430.2 | U1 small nuclear ribonucleoprotein | HC |
           chr5:31637698-31631271 | 20130731
          Length = 234

 Score =  394 bits (1012), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 195/234 (83%), Positives = 202/234 (86%), Gaps = 3/234 (1%)

Query: 1   MLSGDIPPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFS 60
           MLSGDIPP+QTIYIKNLNEK+KKDELKRSLYCLFSQYGRILD++ALKTPKLRGQAWVCFS
Sbjct: 1   MLSGDIPPNQTIYIKNLNEKIKKDELKRSLYCLFSQYGRILDIIALKTPKLRGQAWVCFS 60

Query: 61  EVPSASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVPXXXXXXXXXXXXXX--X 118
           EV +ASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGS+VP                 
Sbjct: 61  EVTAASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSFVPREKKKKQEEKAEKKKYA 120

Query: 119 XXXTQQPAPANGTHGASNGGPT-ASFRPGVGAQEAAAPNNILFIENLPYETTGRMLEMLF 177
               Q   P  GTHGASNGG T ASFRPG GAQEAAAPNNILFIENLPYETTGRMLEMLF
Sbjct: 121 DESKQSAVPNGGTHGASNGGSTQASFRPGSGAQEAAAPNNILFIENLPYETTGRMLEMLF 180

Query: 178 EQYPGFKEVRLIEAKPGIAFVDFEDEVQSSMAMQALQGFKITPQNPMIITFAKK 231
           EQYPGFKEVRLIEAKPGIAFVDFED+ QSSMAMQALQGFKITPQNPMII FAKK
Sbjct: 181 EQYPGFKEVRLIEAKPGIAFVDFEDDGQSSMAMQALQGFKITPQNPMIINFAKK 234


>Medtr1g055405.1 | U2 small nuclear ribonucleoprotein B, putative |
           LC | chr1:24534556-24533368 | 20130731
          Length = 279

 Score =  275 bits (703), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 140/177 (79%), Positives = 146/177 (82%), Gaps = 2/177 (1%)

Query: 57  VCFSEVPSASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVPXXXXXXXXXXXXX 116
           VCFSEV +ASNAVRQMQNFPFY KPMRIQY KTKSDC++KEEGS+VP             
Sbjct: 103 VCFSEVTAASNAVRQMQNFPFYVKPMRIQYTKTKSDCVSKEEGSFVPREKKKKQEEKAEK 162

Query: 117 XX-XXXTQQPAPANGTHGASNGGPT-ASFRPGVGAQEAAAPNNILFIENLPYETTGRMLE 174
                 ++Q A  NGTHGASNGG T ASF PG GAQEAAAPNNILFIENLPYETTGRMLE
Sbjct: 163 KWYADESKQSAVPNGTHGASNGGSTQASFCPGSGAQEAAAPNNILFIENLPYETTGRMLE 222

Query: 175 MLFEQYPGFKEVRLIEAKPGIAFVDFEDEVQSSMAMQALQGFKITPQNPMIITFAKK 231
           MLFEQYPGFKEVRLIEAKPGIAFVDFEDE QSSMAMQALQGFKITPQNPMII FAKK
Sbjct: 223 MLFEQYPGFKEVRLIEAKPGIAFVDFEDEGQSSMAMQALQGFKITPQNPMIINFAKK 279


>Medtr4g084990.2 | U1 small nuclear ribonucleoprotein | HC |
           chr4:33189422-33193785 | 20130731
          Length = 245

 Score =  263 bits (673), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 136/235 (57%), Positives = 165/235 (70%), Gaps = 10/235 (4%)

Query: 7   PPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFSEVPSAS 66
           P + TIYI NLNEK+K DELK+SL+ +FSQ+G+IL+V+A KT K +GQAWV F +V SAS
Sbjct: 11  PQNMTIYINNLNEKIKIDELKKSLHAVFSQFGKILEVLAFKTLKHKGQAWVIFEDVTSAS 70

Query: 67  NAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVPXXXXXXXXXXXXXXXXXXTQQPA 126
           NA+RQMQ FPFY+KPMRIQYA+TKSD IAK EG++VP                       
Sbjct: 71  NALRQMQGFPFYDKPMRIQYARTKSDVIAKAEGTFVPREKRKRHDDKAGKKRKDQNDANL 130

Query: 127 PANGTH----GASNGGPTASFRPGVGAQEA------AAPNNILFIENLPYETTGRMLEML 176
              G +    GA    P  S  P  G  ++      A PNNILFI+NLP ETT  ML+ML
Sbjct: 131 AGTGLNPAYAGAYGATPALSQIPYPGGAKSLLPEAPAPPNNILFIQNLPNETTPMMLQML 190

Query: 177 FEQYPGFKEVRLIEAKPGIAFVDFEDEVQSSMAMQALQGFKITPQNPMIITFAKK 231
           F QYPGFKEVR++EAKPGIAFV++ DE+QS+MAMQALQGFKI PQNPM+IT+AKK
Sbjct: 191 FLQYPGFKEVRMVEAKPGIAFVEYGDEMQSTMAMQALQGFKIAPQNPMLITYAKK 245


>Medtr4g084990.1 | U1 small nuclear ribonucleoprotein | HC |
           chr4:33189286-33193785 | 20130731
          Length = 244

 Score =  262 bits (670), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 139/237 (58%), Positives = 167/237 (70%), Gaps = 15/237 (6%)

Query: 7   PPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFSEVPSAS 66
           P + TIYI NLNEK+K DELK+SL+ +FSQ+G+IL+V+A KT K +GQAWV F +V SAS
Sbjct: 11  PQNMTIYINNLNEKIKIDELKKSLHAVFSQFGKILEVLAFKTLKHKGQAWVIFEDVTSAS 70

Query: 67  NAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVPXXXXXXXXXXXXXXXXXXTQQPA 126
           NA+RQMQ FPFY+KPMRIQYA+TKSD IAK EG++VP                   Q  A
Sbjct: 71  NALRQMQGFPFYDKPMRIQYARTKSDVIAKAEGTFVP---REKRKRHDDKGKKRKDQNDA 127

Query: 127 PANGT------HGASNGGPTASFRPGVGAQEA------AAPNNILFIENLPYETTGRMLE 174
              GT       GA    P  S  P  G  ++      A PNNILFI+NLP ETT  ML+
Sbjct: 128 NLAGTGLNPAYAGAYGATPALSQIPYPGGAKSLLPEAPAPPNNILFIQNLPNETTPMMLQ 187

Query: 175 MLFEQYPGFKEVRLIEAKPGIAFVDFEDEVQSSMAMQALQGFKITPQNPMIITFAKK 231
           MLF QYPGFKEVR++EAKPGIAFV++ DE+QS+MAMQALQGFKI PQNPM+IT+AKK
Sbjct: 188 MLFLQYPGFKEVRMVEAKPGIAFVEYGDEMQSTMAMQALQGFKIAPQNPMLITYAKK 244


>Medtr7g108715.1 | small nuclear ribonucleoprotein | HC |
           chr7:44399841-44398655 | 20130731
          Length = 161

 Score =  150 bits (380), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 83/159 (52%), Positives = 101/159 (63%), Gaps = 10/159 (6%)

Query: 83  RIQYAKTKSDCIAKEEGSYVPXXXXXXXXXXXXXXXXXXTQQPAPANGTH----GASNGG 138
           RIQYA+TKSD IAK +G++VP                          G +    GA    
Sbjct: 3   RIQYARTKSDVIAKADGTFVPREKRKRHDDKAGKKRKDQNDANLAGTGLNPAYAGAYGAT 62

Query: 139 PTASFRPGVGAQEA------AAPNNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEAK 192
           P  S  P  G  ++      A PNNILFI+NLP ETT  ML+MLF QYPGFKEVR++EAK
Sbjct: 63  PALSQIPYPGGAKSLLPEAPAPPNNILFIQNLPNETTPMMLQMLFLQYPGFKEVRMVEAK 122

Query: 193 PGIAFVDFEDEVQSSMAMQALQGFKITPQNPMIITFAKK 231
           PGIAFV++ DE+QS++AMQALQGFKI PQNPM+IT+AKK
Sbjct: 123 PGIAFVEYGDEMQSTVAMQALQGFKIAPQNPMLITYAKK 161


>Medtr4g084990.3 | U1 small nuclear ribonucleoprotein | HC |
           chr4:33189422-33193785 | 20130731
          Length = 186

 Score =  148 bits (373), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 67/97 (69%), Positives = 83/97 (85%)

Query: 7   PPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFSEVPSAS 66
           P + TIYI NLNEK+K DELK+SL+ +FSQ+G+IL+V+A KT K +GQAWV F +V SAS
Sbjct: 11  PQNMTIYINNLNEKIKIDELKKSLHAVFSQFGKILEVLAFKTLKHKGQAWVIFEDVTSAS 70

Query: 67  NAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVP 103
           NA+RQMQ FPFY+KPMRIQYA+TKSD IAK EG++VP
Sbjct: 71  NALRQMQGFPFYDKPMRIQYARTKSDVIAKAEGTFVP 107


>Medtr3g072860.1 | RNA recognition motif protein | HC |
           chr3:32792380-32791534 | 20130731
          Length = 151

 Score = 94.7 bits (234), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 44/58 (75%), Positives = 48/58 (82%)

Query: 46  LKTPKLRGQAWVCFSEVPSASNAVRQMQNFPFYEKPMRIQYAKTKSDCIAKEEGSYVP 103
           LK    +    VCFSEV +ASNAVRQMQNFPFY KPMRIQYAKTKSDC+AKEEGS+VP
Sbjct: 62  LKLDNYKCFTMVCFSEVTAASNAVRQMQNFPFYVKPMRIQYAKTKSDCVAKEEGSFVP 119


>Medtr7g108725.1 | U1 small nuclear A-like ribonucleoprotein | HC
          | chr7:44402147-44401062 | 20130731
          Length = 95

 Score = 92.8 bits (229), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 41/65 (63%), Positives = 53/65 (81%)

Query: 7  PPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFSEVPSAS 66
          P + TIYI NLNEK+K DELK+SL+ +FSQ+G+IL+V+A KT K +GQAWV F +V SAS
Sbjct: 11 PQNMTIYINNLNEKIKIDELKKSLHAVFSQFGKILEVLAFKTLKHKGQAWVIFEDVTSAS 70

Query: 67 NAVRQ 71
          NA+R 
Sbjct: 71 NALRH 75


>Medtr4g085040.1 | RNA recognition motif, a.k.a. RRM, RBD protein
          | LC | chr4:33227987-33226801 | 20130731
          Length = 160

 Score = 88.6 bits (218), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 41/65 (63%), Positives = 53/65 (81%)

Query: 9  SQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFSEVPSASNA 68
          + TIYI NLNEK+K DELK+SL+ +FSQ+ +IL+V+A KT   +GQAWV F +V SASNA
Sbjct: 13 NMTIYINNLNEKIKIDELKKSLHAVFSQFKKILEVLAFKTLIHKGQAWVIFEDVTSASNA 72

Query: 69 VRQMQ 73
          +RQMQ
Sbjct: 73 LRQMQ 77


>Medtr4g085000.1 | small nuclear ribonucleoprotein | HC |
          chr4:33195873-33197281 | 20130731
          Length = 75

 Score = 82.8 bits (203), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 37/60 (61%), Positives = 49/60 (81%)

Query: 5  DIPPSQTIYIKNLNEKVKKDELKRSLYCLFSQYGRILDVVALKTPKLRGQAWVCFSEVPS 64
          ++P + TIYI NLNEK+K DELKRSL+ +FSQ+G+IL+V+A KT K +GQAWV F +V S
Sbjct: 9  ELPQNMTIYINNLNEKIKIDELKRSLHAVFSQFGKILEVLAFKTLKHKGQAWVIFEDVTS 68


>Medtr3g091860.1 | RNA-binding protein with multiple splicing
           protein | HC | chr3:41912150-41919016 | 20130731
          Length = 228

 Score = 53.9 bits (128), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 44/92 (47%), Gaps = 16/92 (17%)

Query: 135 SNGGPTASFRPGVGAQEAAAPNNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEA--- 191
           S G P  S  PG          N LF+E LP   T R +  +F  + G+KEVRL+     
Sbjct: 116 SGGRPDHSLPPGA--------TNTLFVEGLPSNCTRREVAHIFRPFVGYKEVRLVSKESR 167

Query: 192 KPG-----IAFVDFEDEVQSSMAMQALQGFKI 218
           +PG     + FVDF     ++ AM AL G+K 
Sbjct: 168 QPGGDPLLLCFVDFVSPAHAATAMDALHGYKF 199


>Medtr3g091860.2 | RNA-binding protein with multiple splicing
           protein | HC | chr3:41912150-41919016 | 20130731
          Length = 228

 Score = 53.9 bits (128), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 44/92 (47%), Gaps = 16/92 (17%)

Query: 135 SNGGPTASFRPGVGAQEAAAPNNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEA--- 191
           S G P  S  PG          N LF+E LP   T R +  +F  + G+KEVRL+     
Sbjct: 116 SGGRPDHSLPPGA--------TNTLFVEGLPSNCTRREVAHIFRPFVGYKEVRLVSKESR 167

Query: 192 KPG-----IAFVDFEDEVQSSMAMQALQGFKI 218
           +PG     + FVDF     ++ AM AL G+K 
Sbjct: 168 QPGGDPLLLCFVDFVSPAHAATAMDALHGYKF 199


>Medtr6g034835.1 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr6:12131803-12138897 | 20130731
          Length = 261

 Score = 51.2 bits (121), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 7/73 (9%)

Query: 158 ILFIENLPYETTGRMLEMLFEQYPGFKEVRLI--EAKPG-----IAFVDFEDEVQSSMAM 210
            L+IE LP + T R +  +F  + G++EVRL+  EAK       + FVDF +   ++ A+
Sbjct: 152 TLYIEGLPSDCTRREVAHIFRPFVGYREVRLVSKEAKHRGDPLILCFVDFANPACAATAL 211

Query: 211 QALQGFKITPQNP 223
            ALQG+K+   NP
Sbjct: 212 SALQGYKVDEINP 224


>Medtr3g091860.3 | RNA-binding protein with multiple splicing
           protein | HC | chr3:41912150-41918988 | 20130731
          Length = 243

 Score = 51.2 bits (121), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 42/89 (47%), Gaps = 16/89 (17%)

Query: 135 SNGGPTASFRPGVGAQEAAAPNNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEA--- 191
           S G P  S  PG          N LF+E LP   T R +  +F  + G+KEVRL+     
Sbjct: 116 SGGRPDHSLPPGA--------TNTLFVEGLPSNCTRREVAHIFRPFVGYKEVRLVSKESR 167

Query: 192 KPG-----IAFVDFEDEVQSSMAMQALQG 215
           +PG     + FVDF     ++ AM AL G
Sbjct: 168 QPGGDPLLLCFVDFVSPAHAATAMDALHG 196


>Medtr1g099190.1 | RNA-binding (RRM/RBD/RNP motif) family protein |
           HC | chr1:44752922-44747555 | 20130731
          Length = 252

 Score = 50.8 bits (120), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 46/76 (60%), Gaps = 8/76 (10%)

Query: 156 NNILFIENLPYETTGRMLEMLFEQYPGFKEVRLI----EAKPG----IAFVDFEDEVQSS 207
           ++ L++E LP ++T R +  +F  + G++EVRL+    + + G    + FVDF +   ++
Sbjct: 151 SSTLYVEGLPSDSTKREVAHIFRPFVGYREVRLVAKESKHRGGDPLILCFVDFANPACAA 210

Query: 208 MAMQALQGFKITPQNP 223
            A+ ALQG+K+   NP
Sbjct: 211 TALSALQGYKVDEINP 226


>Medtr3g104670.1 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr3:48246099-48241678 | 20130731
          Length = 229

 Score = 50.4 bits (119), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 45/80 (56%), Gaps = 8/80 (10%)

Query: 152 AAAPNNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEAKPG--------IAFVDFEDE 203
           +AA +NILF+  LP + T R +  LF  + G+K+++++  +P           FV+F + 
Sbjct: 128 SAAESNILFVGGLPKDCTRREVGHLFRPFIGYKDIKVVHKEPRRSGDKAMIFCFVEFTEP 187

Query: 204 VQSSMAMQALQGFKITPQNP 223
             +  AM+ALQG+K   + P
Sbjct: 188 KCALTAMEALQGYKFDDKKP 207