Miyakogusa Predicted Gene

Lj1g3v1008810.2

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v1008810.2 Non Characterized Hit- tr|I1NH08|I1NH08_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.13293
PE,67.19,0.00000000000001,RRM_1,RNA recognition motif domain; no
description,Nucleotide-binding, alpha-beta plait; seg,NULL;
R,CUFF.26661.2
         (232 letters)

Database: Medicago_aa4.0v1 
           62,319 sequences; 21,947,249 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Medtr3g091860.1 | RNA-binding protein with multiple splicing pro...   347   4e-96
Medtr3g091860.2 | RNA-binding protein with multiple splicing pro...   347   4e-96
Medtr3g091860.3 | RNA-binding protein with multiple splicing pro...   313   7e-86
Medtr1g099190.1 | RNA-binding (RRM/RBD/RNP motif) family protein...   169   2e-42
Medtr6g034835.1 | RNA recognition motif, a.k.a. RRM, RBD protein...   168   4e-42
Medtr1g099190.3 | RNA-binding (RRM/RBD/RNP motif) family protein...   108   4e-24
Medtr3g104670.1 | RNA recognition motif, a.k.a. RRM, RBD protein...   108   4e-24
Medtr3g104670.5 | RNA recognition motif, a.k.a. RRM, RBD protein...    91   6e-19
Medtr1g099190.2 | RNA-binding (RRM/RBD/RNP motif) family protein...    75   4e-14
Medtr3g104670.4 | RNA recognition motif, a.k.a. RRM, RBD protein...    65   5e-11
Medtr3g104670.3 | RNA recognition motif, a.k.a. RRM, RBD protein...    65   5e-11
Medtr5g074430.1 | U1 small nuclear ribonucleoprotein | HC | chr5...    50   1e-06
Medtr5g074430.2 | U1 small nuclear ribonucleoprotein | HC | chr5...    50   1e-06
Medtr1g055405.1 | U2 small nuclear ribonucleoprotein B, putative...    49   3e-06

>Medtr3g091860.1 | RNA-binding protein with multiple splicing
           protein | HC | chr3:41912150-41919016 | 20130731
          Length = 228

 Score =  347 bits (891), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 169/215 (78%), Positives = 187/215 (86%), Gaps = 1/215 (0%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDVSGVHDMPGYYPHDDDRGGLRVIRDTESLDA 60
           MSDAYWRYA   +H P  I  KR R++YDVSGVH++  Y+PHDDDRG L+VIRDTESLDA
Sbjct: 1   MSDAYWRYAESQQHAPPTIPGKRPRTEYDVSGVHNLANYFPHDDDRGRLQVIRDTESLDA 60

Query: 61  SYERYLRSAQVSSFGEGQSTRTIRGRLPSHSFDDSHVTSIGGVDRGPSAKEKILGLSSGR 120
           SYERYLR+A +SS G GQSTRTI G +PSHS DDSHVTS+GGVDR  + K++IL LS GR
Sbjct: 61  SYERYLRNA-ISSHGSGQSTRTIDGGVPSHSIDDSHVTSMGGVDRRTNVKDQILELSGGR 119

Query: 121 PDHSLPPDATSTLFVEGLPTNCSRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLVLCFV 180
           PDHSLPP AT+TLFVEGLP+NC+RREVAHIFRPFVGYKEVRLVSKESRQPGGDPL+LCFV
Sbjct: 120 PDHSLPPGATNTLFVEGLPSNCTRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLLLCFV 179

Query: 181 DFESPAHAATAKDALQGYKFDELDRNSANLRFQFA 215
           DF SPAHAATA DAL GYKFDELDRNS NLRFQFA
Sbjct: 180 DFVSPAHAATAMDALHGYKFDELDRNSVNLRFQFA 214


>Medtr3g091860.2 | RNA-binding protein with multiple splicing
           protein | HC | chr3:41912150-41919016 | 20130731
          Length = 228

 Score =  347 bits (891), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 169/215 (78%), Positives = 187/215 (86%), Gaps = 1/215 (0%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDVSGVHDMPGYYPHDDDRGGLRVIRDTESLDA 60
           MSDAYWRYA   +H P  I  KR R++YDVSGVH++  Y+PHDDDRG L+VIRDTESLDA
Sbjct: 1   MSDAYWRYAESQQHAPPTIPGKRPRTEYDVSGVHNLANYFPHDDDRGRLQVIRDTESLDA 60

Query: 61  SYERYLRSAQVSSFGEGQSTRTIRGRLPSHSFDDSHVTSIGGVDRGPSAKEKILGLSSGR 120
           SYERYLR+A +SS G GQSTRTI G +PSHS DDSHVTS+GGVDR  + K++IL LS GR
Sbjct: 61  SYERYLRNA-ISSHGSGQSTRTIDGGVPSHSIDDSHVTSMGGVDRRTNVKDQILELSGGR 119

Query: 121 PDHSLPPDATSTLFVEGLPTNCSRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLVLCFV 180
           PDHSLPP AT+TLFVEGLP+NC+RREVAHIFRPFVGYKEVRLVSKESRQPGGDPL+LCFV
Sbjct: 120 PDHSLPPGATNTLFVEGLPSNCTRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLLLCFV 179

Query: 181 DFESPAHAATAKDALQGYKFDELDRNSANLRFQFA 215
           DF SPAHAATA DAL GYKFDELDRNS NLRFQFA
Sbjct: 180 DFVSPAHAATAMDALHGYKFDELDRNSVNLRFQFA 214


>Medtr3g091860.3 | RNA-binding protein with multiple splicing
           protein | HC | chr3:41912150-41918988 | 20130731
          Length = 243

 Score =  313 bits (803), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 152/197 (77%), Positives = 170/197 (86%), Gaps = 1/197 (0%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDVSGVHDMPGYYPHDDDRGGLRVIRDTESLDA 60
           MSDAYWRYA   +H P  I  KR R++YDVSGVH++  Y+PHDDDRG L+VIRDTESLDA
Sbjct: 1   MSDAYWRYAESQQHAPPTIPGKRPRTEYDVSGVHNLANYFPHDDDRGRLQVIRDTESLDA 60

Query: 61  SYERYLRSAQVSSFGEGQSTRTIRGRLPSHSFDDSHVTSIGGVDRGPSAKEKILGLSSGR 120
           SYERYLR+A +SS G GQSTRTI G +PSHS DDSHVTS+GGVDR  + K++IL LS GR
Sbjct: 61  SYERYLRNA-ISSHGSGQSTRTIDGGVPSHSIDDSHVTSMGGVDRRTNVKDQILELSGGR 119

Query: 121 PDHSLPPDATSTLFVEGLPTNCSRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLVLCFV 180
           PDHSLPP AT+TLFVEGLP+NC+RREVAHIFRPFVGYKEVRLVSKESRQPGGDPL+LCFV
Sbjct: 120 PDHSLPPGATNTLFVEGLPSNCTRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLLLCFV 179

Query: 181 DFESPAHAATAKDALQG 197
           DF SPAHAATA DAL G
Sbjct: 180 DFVSPAHAATAMDALHG 196


>Medtr1g099190.1 | RNA-binding (RRM/RBD/RNP motif) family protein |
           HC | chr1:44752922-44747555 | 20130731
          Length = 252

 Score =  169 bits (429), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 102/238 (42%), Positives = 145/238 (60%), Gaps = 25/238 (10%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDV--SGVHDMPGYYPHDDDRGGLRVIRDTESL 58
           M+D +W    +    P  +  KR R++YD   SGV      + +     G +++ DT+ L
Sbjct: 1   MADGFWNRQQQHLPPPGGML-KRPRTEYDTAPSGVTSGNEVHNYIAQNNGHQMLNDTKIL 59

Query: 59  DASYERYLRSAQVSSFGEGQST--------RTIRGRLPSHSFDD-SHVTSIGGVDRGP-- 107
            ++Y+R+L+SA ++SF  G+++        R + G LP HS  D S +  + GV  GP  
Sbjct: 60  GSAYDRFLQSAGLTSFNSGEASVIGGVGFARGV-GELPGHSLGDPSAMGHLSGVGGGPDL 118

Query: 108 ---------SAKEKILGLSSGRPDH-SLPPDATSTLFVEGLPTNCSRREVAHIFRPFVGY 157
                      +  I  +S   P+   LP DA+STL+VEGLP++ ++REVAHIFRPFVGY
Sbjct: 119 SRNGRDVNFGGQLPIDAVSRPGPETIPLPRDASSTLYVEGLPSDSTKREVAHIFRPFVGY 178

Query: 158 KEVRLVSKESRQPGGDPLVLCFVDFESPAHAATAKDALQGYKFDELDRNSANLRFQFA 215
           +EVRLV+KES+  GGDPL+LCFVDF +PA AATA  ALQGYK DE++  S+ LR QF+
Sbjct: 179 REVRLVAKESKHRGGDPLILCFVDFANPACAATALSALQGYKVDEINPESSYLRLQFS 236


>Medtr6g034835.1 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr6:12131803-12138897 | 20130731
          Length = 261

 Score =  168 bits (425), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 104/240 (43%), Positives = 147/240 (61%), Gaps = 31/240 (12%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDV--SGV---HDMPGYYPHDDDRGGLRVIRDT 55
           M+D YW    +    P +   KR R DY++  SG+   ++M  Y   ++DR G  +++D+
Sbjct: 1   MADGYWN--RQQSLLPHSGLHKRPRPDYEMPASGLPSGNEM-HYLSREEDRSGHPMVKDS 57

Query: 56  ESLDASYERYLRSAQVSSFGEGQST--------RTIRGRLPSHSFDDSHVT----SIGGV 103
           +++ ++Y+RYL+  QV SF  G+++        R I G LP+HS  D          GG 
Sbjct: 58  KTIGSAYDRYLQ-GQVPSFTSGEASTVGALGLQRGIGG-LPNHSLSDPSAMIGRHGGGGP 115

Query: 104 DRGPSAKEKILGLSS--------GRPDHSLPPDATSTLFVEGLPTNCSRREVAHIFRPFV 155
           D  P+ +    G           G     LPPDA+ TL++EGLP++C+RREVAHIFRPFV
Sbjct: 116 DLAPNGRGMNYGFQPPMDPVSRHGPEPALLPPDASPTLYIEGLPSDCTRREVAHIFRPFV 175

Query: 156 GYKEVRLVSKESRQPGGDPLVLCFVDFESPAHAATAKDALQGYKFDELDRNSANLRFQFA 215
           GY+EVRLVSKE++   GDPL+LCFVDF +PA AATA  ALQGYK DE++  S++LR QF+
Sbjct: 176 GYREVRLVSKEAKH-RGDPLILCFVDFANPACAATALSALQGYKVDEINPESSHLRLQFS 234


>Medtr1g099190.3 | RNA-binding (RRM/RBD/RNP motif) family protein |
           HC | chr1:44752913-44747593 | 20130731
          Length = 195

 Score =  108 bits (270), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 72/192 (37%), Positives = 109/192 (56%), Gaps = 25/192 (13%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDV--SGVHDMPGYYPHDDDRGGLRVIRDTESL 58
           M+D +W    +    P  +  KR R++YD   SGV      + +     G +++ DT+ L
Sbjct: 1   MADGFWNRQQQHLPPPGGML-KRPRTEYDTAPSGVTSGNEVHNYIAQNNGHQMLNDTKIL 59

Query: 59  DASYERYLRSAQVSSFGEGQST--------RTIRGRLPSHSFDD-SHVTSIGGVDRGP-- 107
            ++Y+R+L+SA ++SF  G+++        R + G LP HS  D S +  + GV  GP  
Sbjct: 60  GSAYDRFLQSAGLTSFNSGEASVIGGVGFARGV-GELPGHSLGDPSAMGHLSGVGGGPDL 118

Query: 108 ---------SAKEKILGLSSGRPDH-SLPPDATSTLFVEGLPTNCSRREVAHIFRPFVGY 157
                      +  I  +S   P+   LP DA+STL+VEGLP++ ++REVAHIFRPFVGY
Sbjct: 119 SRNGRDVNFGGQLPIDAVSRPGPETIPLPRDASSTLYVEGLPSDSTKREVAHIFRPFVGY 178

Query: 158 KEVRLVSKESRQ 169
           +EVRLV+KES+ 
Sbjct: 179 REVRLVAKESKH 190


>Medtr3g104670.1 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr3:48246099-48241678 | 20130731
          Length = 229

 Score =  108 bits (270), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 79/124 (63%), Gaps = 16/124 (12%)

Query: 108 SAKEKILGLSSGRPD--------------HSLPPDA--TSTLFVEGLPTNCSRREVAHIF 151
           + ++ +LG+S+G PD               +LP  A  ++ LFV GLP +C+RREV H+F
Sbjct: 94  TKRDALLGVSTGVPDPIANNERSISKSNYDALPVSAAESNILFVGGLPKDCTRREVGHLF 153

Query: 152 RPFVGYKEVRLVSKESRQPGGDPLVLCFVDFESPAHAATAKDALQGYKFDELDRNSANLR 211
           RPF+GYK++++V KE R+ G   ++ CFV+F  P  A TA +ALQGYKFD+   +S  L+
Sbjct: 154 RPFIGYKDIKVVHKEPRRSGDKAMIFCFVEFTEPKCALTAMEALQGYKFDDKKPDSPTLK 213

Query: 212 FQFA 215
            +FA
Sbjct: 214 IKFA 217


>Medtr3g104670.5 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr3:48246099-48244267 | 20130731
          Length = 215

 Score = 91.3 bits (225), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 45/109 (41%), Positives = 68/109 (62%), Gaps = 16/109 (14%)

Query: 108 SAKEKILGLSSGRPD--------------HSLPPDA--TSTLFVEGLPTNCSRREVAHIF 151
           + ++ +LG+S+G PD               +LP  A  ++ LFV GLP +C+RREV H+F
Sbjct: 94  TKRDALLGVSTGVPDPIANNERSISKSNYDALPVSAAESNILFVGGLPKDCTRREVGHLF 153

Query: 152 RPFVGYKEVRLVSKESRQPGGDPLVLCFVDFESPAHAATAKDALQGYKF 200
           RPF+GYK++++V KE R+ G   ++ CFV+F  P  A TA +ALQ  K+
Sbjct: 154 RPFIGYKDIKVVHKEPRRSGDKAMIFCFVEFTEPKCALTAMEALQVSKY 202


>Medtr1g099190.2 | RNA-binding (RRM/RBD/RNP motif) family protein |
           HC | chr1:44752913-44747576 | 20130731
          Length = 180

 Score = 75.5 bits (184), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 58/177 (32%), Positives = 92/177 (51%), Gaps = 25/177 (14%)

Query: 1   MSDAYWRYAAESRHNPSAIAAKRARSDYDV--SGVHDMPGYYPHDDDRGGLRVIRDTESL 58
           M+D +W    +    P  +  KR R++YD   SGV      + +     G +++ DT+ L
Sbjct: 1   MADGFWNRQQQHLPPPGGML-KRPRTEYDTAPSGVTSGNEVHNYIAQNNGHQMLNDTKIL 59

Query: 59  DASYERYLRSAQVSSFGEGQST--------RTIRGRLPSHSFDD-SHVTSIGGVDRGP-- 107
            ++Y+R+L+SA ++SF  G+++        R + G LP HS  D S +  + GV  GP  
Sbjct: 60  GSAYDRFLQSAGLTSFNSGEASVIGGVGFARGV-GELPGHSLGDPSAMGHLSGVGGGPDL 118

Query: 108 ---------SAKEKILGLSSGRPDH-SLPPDATSTLFVEGLPTNCSRREVAHIFRPF 154
                      +  I  +S   P+   LP DA+STL+VEGLP++ ++REVA IF  F
Sbjct: 119 SRNGRDVNFGGQLPIDAVSRPGPETIPLPRDASSTLYVEGLPSDSTKREVARIFFIF 175


>Medtr3g104670.4 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr3:48246099-48244805 | 20130731
          Length = 178

 Score = 65.5 bits (158), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 50/78 (64%), Gaps = 16/78 (20%)

Query: 108 SAKEKILGLSSGRPD--------------HSLPPDA--TSTLFVEGLPTNCSRREVAHIF 151
           + ++ +LG+S+G PD               +LP  A  ++ LFV GLP +C+RREV H+F
Sbjct: 94  TKRDALLGVSTGVPDPIANNERSISKSNYDALPVSAAESNILFVGGLPKDCTRREVGHLF 153

Query: 152 RPFVGYKEVRLVSKESRQ 169
           RPF+GYK++++V KE R+
Sbjct: 154 RPFIGYKDIKVVHKEPRR 171


>Medtr3g104670.3 | RNA recognition motif, a.k.a. RRM, RBD protein |
           HC | chr3:48246099-48244805 | 20130731
          Length = 178

 Score = 65.5 bits (158), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 50/78 (64%), Gaps = 16/78 (20%)

Query: 108 SAKEKILGLSSGRPD--------------HSLPPDA--TSTLFVEGLPTNCSRREVAHIF 151
           + ++ +LG+S+G PD               +LP  A  ++ LFV GLP +C+RREV H+F
Sbjct: 94  TKRDALLGVSTGVPDPIANNERSISKSNYDALPVSAAESNILFVGGLPKDCTRREVGHLF 153

Query: 152 RPFVGYKEVRLVSKESRQ 169
           RPF+GYK++++V KE R+
Sbjct: 154 RPFIGYKDIKVVHKEPRR 171


>Medtr5g074430.1 | U1 small nuclear ribonucleoprotein | HC |
           chr5:31637698-31631234 | 20130731
          Length = 233

 Score = 50.4 bits (119), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 8/71 (11%)

Query: 130 TSTLFVEGLPTNCSRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLVLCFVDFESPAHAA 189
            + LF+E LP   + R +  +F  + G+KEVRL+     +PG     + FVDFE    ++
Sbjct: 158 NNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEA---KPG-----IAFVDFEDDGQSS 209

Query: 190 TAKDALQGYKF 200
            A  ALQG+K 
Sbjct: 210 MAMQALQGFKI 220


>Medtr5g074430.2 | U1 small nuclear ribonucleoprotein | HC |
           chr5:31637698-31631271 | 20130731
          Length = 234

 Score = 50.4 bits (119), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 8/71 (11%)

Query: 130 TSTLFVEGLPTNCSRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLVLCFVDFESPAHAA 189
            + LF+E LP   + R +  +F  + G+KEVRL+     +PG     + FVDFE    ++
Sbjct: 159 NNILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEA---KPG-----IAFVDFEDDGQSS 210

Query: 190 TAKDALQGYKF 200
            A  ALQG+K 
Sbjct: 211 MAMQALQGFKI 221


>Medtr1g055405.1 | U2 small nuclear ribonucleoprotein B, putative |
           LC | chr1:24534556-24533368 | 20130731
          Length = 279

 Score = 49.3 bits (116), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 8/69 (11%)

Query: 131 STLFVEGLPTNCSRREVAHIFRPFVGYKEVRLVSKESRQPGGDPLVLCFVDFESPAHAAT 190
           + LF+E LP   + R +  +F  + G+KEVRL+     +PG     + FVDFE    ++ 
Sbjct: 205 NILFIENLPYETTGRMLEMLFEQYPGFKEVRLIEA---KPG-----IAFVDFEDEGQSSM 256

Query: 191 AKDALQGYK 199
           A  ALQG+K
Sbjct: 257 AMQALQGFK 265