Miyakogusa Predicted Gene
- Lj1g3v1063550.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1063550.1 Non Characterized Hit- tr|C5XL49|C5XL49_SORBI
Putative uncharacterized protein Sb03g002180
OS=Sorghu,46.07,0.000000000008,RRM,RNA recognition motif domain;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL; RRM_1,RNA
recogni,CUFF.26718.1
(314 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr3g092510.1 | CTC-interacting domain protein | HC | chr3:422... 509 e-144
Medtr3g092550.1 | CTC-interacting domain protein | HC | chr3:422... 477 e-135
Medtr2g070540.1 | CTC-interacting domain protein | HC | chr2:297... 269 2e-72
Medtr8g081640.3 | RNA recognition motif (RRM) containing protein... 55 8e-08
Medtr8g081640.2 | RNA recognition motif (RRM) containing protein... 55 8e-08
Medtr8g081640.1 | RNA recognition motif (RRM) containing protein... 55 8e-08
Medtr8g081640.4 | RNA recognition motif (RRM) containing protein... 55 8e-08
Medtr4g085540.1 | polyadenylate-binding protein | HC | chr4:3341... 54 2e-07
Medtr3g102040.1 | polyadenylate-binding protein | HC | chr3:4701... 54 2e-07
Medtr4g074930.3 | RNA recognition motif, a.k.a. RRM, RBD protein... 48 1e-05
>Medtr3g092510.1 | CTC-interacting domain protein | HC |
chr3:42267137-42262435 | 20130731
Length = 376
Score = 509 bits (1311), Expect = e-144, Method: Compositional matrix adjust.
Identities = 257/329 (78%), Positives = 280/329 (85%), Gaps = 22/329 (6%)
Query: 1 MAVAENAGAKIGSSGQNLDNNNTVVSAEDSSEVEKSKTRTDQNLSNGG-FNHEHHPGNIA 59
MAVAEN GAKIGSS QNLDNNN + DS+EVEKSK RTDQ+++N FNH+H
Sbjct: 1 MAVAENVGAKIGSSSQNLDNNNNHAVSSDSTEVEKSKPRTDQDVNNNSVFNHQHQ----- 55
Query: 60 VPNGNYSYNA-QVGQMQANGVQNQQLVMNNDGY-------GENGDESFKRDMRDLAELLS 111
NGNYS+ Q+GQM ANGVQN Q V+NNDGY GENG ESFKR+MRDL ELLS
Sbjct: 56 --NGNYSFKTHQMGQMHANGVQNHQFVVNNDGYVMNGLRNGENGGESFKREMRDLEELLS 113
Query: 112 KLNPMAEEFVPPSLTNSHGYLA-GPNAGFGYPNNFILQNDF----GQTNRRRKNVYNS-G 165
KLNPMAEEFVPPSLTN+HGYLA GP AGFGYPNNFIL N++ GQTNRRRKN Y + G
Sbjct: 114 KLNPMAEEFVPPSLTNNHGYLAAGPAAGFGYPNNFILLNNYANANGQTNRRRKNGYTTNG 173
Query: 166 KRRIFHKIEMEKRDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRF 225
KRR HK++MEKR+EMIRRTVYVSDIDQLVTEEQLA+LFLNCGQVVDCRVCGDPNSILRF
Sbjct: 174 KRRANHKVDMEKREEMIRRTVYVSDIDQLVTEEQLASLFLNCGQVVDCRVCGDPNSILRF 233
Query: 226 AFVEFTDEEGARTALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCSRTIYC 285
AF+EFTDEE AR A++LSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCSRTIYC
Sbjct: 234 AFIEFTDEESARAAVSLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCSRTIYC 293
Query: 286 TNIDKKLTQSDVKNFFESICGEVQRLRLL 314
TNIDKKLTQ+DVK+FFESICGEV RLRLL
Sbjct: 294 TNIDKKLTQADVKHFFESICGEVHRLRLL 322
Score = 72.8 bits (177), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 55/98 (56%), Gaps = 4/98 (4%)
Query: 176 EKRDEMIRRTVYVSDIDQLVTEEQLAALFLN-CGQVVDCRVCGDPNSILRFAFVEFTDEE 234
E EM RT+Y ++ID+ +T+ + F + CG+V R+ GD R AFVEF E
Sbjct: 281 EDEREMCSRTIYCTNIDKKLTQADVKHFFESICGEVHRLRLLGDYQHSTRIAFVEFAVAE 340
Query: 235 GARTALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRS 272
A AL+ SG +LG P+RV PSKT PV PRS
Sbjct: 341 SAIAALSCSGVILGALPIRVSPSKT---PVRARSSPRS 375
>Medtr3g092550.1 | CTC-interacting domain protein | HC |
chr3:42290596-42284870 | 20130731
Length = 384
Score = 477 bits (1227), Expect = e-135, Method: Compositional matrix adjust.
Identities = 250/334 (74%), Positives = 275/334 (82%), Gaps = 25/334 (7%)
Query: 1 MAVAENAGAKIGSSGQNLDNNNTVVSAEDSSEVEKSKTRTDQNLSNGGFNHE-----HHP 55
MAVAEN G KI SS +NLDN +VVS++ + VEKSK +TDQNL+ N +H
Sbjct: 1 MAVAENVGTKIDSSSENLDN--SVVSSDSTEVVEKSKPKTDQNLNTNSVNTNVVGVINHQ 58
Query: 56 GNIAVPNGNYSYNA-QVGQMQANGVQNQQLVMNNDGYGENGDESFKRDMRDLAELLSKLN 114
+VPNGN+ + A Q+ QM NGVQNQ LV DGYG NG ESFKR+MRDL ELLSKLN
Sbjct: 59 QQDSVPNGNHGFIAHQMSQMHGNGVQNQHLV---DGYGGNGGESFKREMRDLEELLSKLN 115
Query: 115 PMAEEFVPPSL-TNSHGYLA-GPNAGFGYPNN-FILQNDF-----------GQTNRRRKN 160
PMAEEFVPPSL TN HGYLA GPNAGFGYPNN F+LQN+F GQ NRRRKN
Sbjct: 116 PMAEEFVPPSLVTNYHGYLAAGPNAGFGYPNNNFMLQNNFGNANANATANNGQINRRRKN 175
Query: 161 VYNSGKRRIFHKIEMEKRDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPN 220
YN+ KRR++HK++MEKR+EMIRRTVYVSDIDQ VTEEQLAALFLNCGQVVDCRVCGDPN
Sbjct: 176 GYNNAKRRVYHKMDMEKREEMIRRTVYVSDIDQQVTEEQLAALFLNCGQVVDCRVCGDPN 235
Query: 221 SILRFAFVEFTDEEGARTALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCS 280
SILRFAFVEFTDE GAR ALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMC+
Sbjct: 236 SILRFAFVEFTDEVGARAALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCT 295
Query: 281 RTIYCTNIDKKLTQSDVKNFFESICGEVQRLRLL 314
RTIYCTN+DKKLTQ+DVK+FFESICGEVQRLRLL
Sbjct: 296 RTIYCTNLDKKLTQADVKHFFESICGEVQRLRLL 329
Score = 69.7 bits (169), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 52/87 (59%), Gaps = 1/87 (1%)
Query: 176 EKRDEMIRRTVYVSDIDQLVTEEQLAALFLN-CGQVVDCRVCGDPNSILRFAFVEFTDEE 234
E EM RT+Y +++D+ +T+ + F + CG+V R+ GD + R AFVEF E
Sbjct: 288 EDEREMCTRTIYCTNLDKKLTQADVKHFFESICGEVQRLRLLGDYHHSTRIAFVEFAVAE 347
Query: 235 GARTALNLSGTMLGYYPLRVLPSKTAI 261
A AL+ SG +LG P+RV PSKT +
Sbjct: 348 SAIAALSCSGVVLGSLPIRVSPSKTPV 374
>Medtr2g070540.1 | CTC-interacting domain protein | HC |
chr2:29723724-29729859 | 20130731
Length = 298
Score = 269 bits (688), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 134/219 (61%), Positives = 170/219 (77%), Gaps = 10/219 (4%)
Query: 103 MRDLAELLSKLNPMAEEFVPPSLTNSHGYLAGPNAGFGY--PNNFILQND-----FGQTN 155
++ L ++ +KLNP+A+EF P S + +H + GF PN+F++ N
Sbjct: 28 VQKLVDMFTKLNPLAKEFFPSSYSPNHDH---GRQGFNLITPNHFLVNTKPSANDNNPNN 84
Query: 156 RRRKNVYNSGKRRIFHKIEMEKRDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRV 215
RRR+N + G+RR+ + +R++ +RRTVYVSDIDQ VTEE+LAALF NCGQV+DCR+
Sbjct: 85 RRRRNNFTQGRRRLNGRSLKAQREDSVRRTVYVSDIDQHVTEERLAALFTNCGQVIDCRI 144
Query: 216 CGDPNSILRFAFVEFTDEEGARTALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDE 275
CGDP+S+LRFAFVEF DE GAR ALNL GT+LGYYP+RVLPSKTAI PVNPTFLPRS+DE
Sbjct: 145 CGDPHSVLRFAFVEFADEHGARAALNLGGTVLGYYPVRVLPSKTAILPVNPTFLPRSDDE 204
Query: 276 REMCSRTIYCTNIDKKLTQSDVKNFFESICGEVQRLRLL 314
REMC+RT+YCTNIDKK++Q++VKNFFES CGEV RLRLL
Sbjct: 205 REMCTRTVYCTNIDKKISQAEVKNFFESSCGEVTRLRLL 243
Score = 73.6 bits (179), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 52/85 (61%), Gaps = 1/85 (1%)
Query: 180 EMIRRTVYVSDIDQLVTEEQLAALF-LNCGQVVDCRVCGDPNSILRFAFVEFTDEEGART 238
EM RTVY ++ID+ +++ ++ F +CG+V R+ GD R AFVEF E A
Sbjct: 206 EMCTRTVYCTNIDKKISQAEVKNFFESSCGEVTRLRLLGDQVHSTRIAFVEFAMAESAIV 265
Query: 239 ALNLSGTMLGYYPLRVLPSKTAIAP 263
ALN SG +LG P+RV PSKT + P
Sbjct: 266 ALNCSGMLLGTQPIRVSPSKTPVRP 290
>Medtr8g081640.3 | RNA recognition motif (RRM) containing protein |
HC | chr8:35236572-35228497 | 20130731
Length = 1047
Score = 55.5 bits (132), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 53/91 (58%), Gaps = 6/91 (6%)
Query: 178 RDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRFAFVEFTDEEGAR 237
+++++++T+ VS++ L+T EQL LF CG VV+C + + FA++E++ E A
Sbjct: 324 KEDVLKKTLQVSNLSPLLTVEQLKQLFGFCGTVVECTITDSKH----FAYIEYSKPEEAA 379
Query: 238 TALNLSGTMLGYYPLRVLPSKTAIAPVNPTF 268
A+ L+ +G PL V +K+ P PT
Sbjct: 380 AAMALNNIDVGGRPLNVEMAKS--LPPKPTM 408
>Medtr8g081640.2 | RNA recognition motif (RRM) containing protein |
HC | chr8:35236572-35228497 | 20130731
Length = 1047
Score = 55.5 bits (132), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 53/91 (58%), Gaps = 6/91 (6%)
Query: 178 RDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRFAFVEFTDEEGAR 237
+++++++T+ VS++ L+T EQL LF CG VV+C + + FA++E++ E A
Sbjct: 324 KEDVLKKTLQVSNLSPLLTVEQLKQLFGFCGTVVECTITDSKH----FAYIEYSKPEEAA 379
Query: 238 TALNLSGTMLGYYPLRVLPSKTAIAPVNPTF 268
A+ L+ +G PL V +K+ P PT
Sbjct: 380 AAMALNNIDVGGRPLNVEMAKS--LPPKPTM 408
>Medtr8g081640.1 | RNA recognition motif (RRM) containing protein |
HC | chr8:35236572-35228497 | 20130731
Length = 1047
Score = 55.5 bits (132), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 53/91 (58%), Gaps = 6/91 (6%)
Query: 178 RDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRFAFVEFTDEEGAR 237
+++++++T+ VS++ L+T EQL LF CG VV+C + + FA++E++ E A
Sbjct: 324 KEDVLKKTLQVSNLSPLLTVEQLKQLFGFCGTVVECTITDSKH----FAYIEYSKPEEAA 379
Query: 238 TALNLSGTMLGYYPLRVLPSKTAIAPVNPTF 268
A+ L+ +G PL V +K+ P PT
Sbjct: 380 AAMALNNIDVGGRPLNVEMAKS--LPPKPTM 408
>Medtr8g081640.4 | RNA recognition motif (RRM) containing protein |
HC | chr8:35236578-35228497 | 20130731
Length = 1047
Score = 55.5 bits (132), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 53/91 (58%), Gaps = 6/91 (6%)
Query: 178 RDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRFAFVEFTDEEGAR 237
+++++++T+ VS++ L+T EQL LF CG VV+C + + FA++E++ E A
Sbjct: 324 KEDVLKKTLQVSNLSPLLTVEQLKQLFGFCGTVVECTITDSKH----FAYIEYSKPEEAA 379
Query: 238 TALNLSGTMLGYYPLRVLPSKTAIAPVNPTF 268
A+ L+ +G PL V +K+ P PT
Sbjct: 380 AAMALNNIDVGGRPLNVEMAKS--LPPKPTM 408
>Medtr4g085540.1 | polyadenylate-binding protein | HC |
chr4:33416965-33423437 | 20130731
Length = 654
Score = 53.9 bits (128), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 14/109 (12%)
Query: 170 FHKIEMEKRDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRFA-FV 228
F + E D+ +YV ++D + +E+L LF + G + C+V DPN + R + FV
Sbjct: 302 FEQSMKEAADKYQGANLYVKNLDDSIADEKLKELFSSYGTITSCKVMRDPNGVSRGSGFV 361
Query: 229 EF-TDEEGARTALNLSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDER 276
F T EE +R L ++G M+ PL V T R ED R
Sbjct: 362 AFSTPEEASRALLEMNGKMVASKPLYV------------TLAQRKEDRR 398
Score = 51.2 bits (121), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 60/126 (47%), Gaps = 15/126 (11%)
Query: 182 IRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNS--ILRFAFVEFTDEEGARTA 239
+ ++YV D+D VT+ QL LF GQVV RVC D + L + +V +++ + A A
Sbjct: 32 VTTSLYVGDLDMNVTDSQLYDLFNQLGQVVSVRVCRDLTTRRSLGYGYVNYSNPQDAARA 91
Query: 240 LN-LSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCSRTIYCTNIDKKLTQSDVK 298
L+ L+ T L P+R++ S R R+ I+ N+DK + +
Sbjct: 92 LDVLNFTPLNNRPIRIMYSH------------RDPSIRKSGQGNIFIKNLDKAIDHKALH 139
Query: 299 NFFESI 304
+ F S
Sbjct: 140 DTFSSF 145
>Medtr3g102040.1 | polyadenylate-binding protein | HC |
chr3:47012095-47003272 | 20130731
Length = 622
Score = 53.5 bits (127), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%), Gaps = 2/85 (2%)
Query: 176 EKRDEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILRFA-FVEF-TDE 233
E D+ +Y+ ++D VT+E+L+ LF G V C++ DP I R + FV F T E
Sbjct: 298 ETVDKFYGANLYLKNLDDSVTDEKLSELFSEFGTVTSCKILRDPQGISRGSGFVAFSTPE 357
Query: 234 EGARTALNLSGTMLGYYPLRVLPSK 258
E R ++G M+ PL V P++
Sbjct: 358 EATRALAEMNGKMVAGKPLYVAPAQ 382
Score = 52.4 bits (124), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 65/133 (48%), Gaps = 16/133 (12%)
Query: 179 DEMIRRTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNS--ILRFAFVEFTDEEGA 236
+++ ++YV D+D VT+ QL LF GQVV R+C D S L + +V F++ A
Sbjct: 19 NQLTTTSLYVGDLDHDVTDSQLYDLFNQIGQVVSVRICRDLASQQSLGYGYVNFSNPHDA 78
Query: 237 RTALN-LSGTMLGYYPLRVLPSKTAIAPVNPTFLPRSEDEREMCSRTIYCTNIDKKLTQS 295
A++ L+ T L P+R++ S R R+ + I+ N+D+ +
Sbjct: 79 AKAMDVLNFTPLNNKPIRIMYSH------------RDPSVRKSGAANIFIKNLDRAIDHK 126
Query: 296 DVKNFFESICGEV 308
+ + F SI G +
Sbjct: 127 ALYDTF-SIFGNI 138
>Medtr4g074930.3 | RNA recognition motif, a.k.a. RRM, RBD protein |
HC | chr4:28567458-28562229 | 20130731
Length = 600
Score = 48.1 bits (113), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 62/137 (45%), Gaps = 8/137 (5%)
Query: 184 RTVYVSDIDQLVTEEQLAALFLNCGQVVDCRVCGDPNSILR-FAFVEFTDEEGARTALNL 242
+T++V ++ V + F +CG+VVD R D + F VEF E A++AL +
Sbjct: 341 KTLFVGNLSFSVQRSDIEKFFQDCGEVVDVRFSSDEEGRFKGFGHVEFASAEAAQSALEM 400
Query: 243 SGTMLGYYPLR--VLPSKTAIAPVNPTFLPRSEDEREMCSRTIYCTNIDKKLTQSDVK-- 298
+G L +R + + A P N + R S+T++ DK L + +++
Sbjct: 401 NGQELLQRAVRLDLARERGAFTPNNNSNYSAQSGGRGQ-SQTVFVRGFDKNLGEDEIRAK 459
Query: 299 --NFFESICGEVQRLRL 313
F CGE R+ +
Sbjct: 460 LMEHFGGTCGEPTRVSI 476