Miyakogusa Predicted Gene
- Lj6g3v0497370.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0497370.1 Non Characterized Hit- tr|I1JG17|I1JG17_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.42349
PE,76.53,0,CRM,RNA-binding, CRM domain; no description,RNA-binding,
CRM domain; seg,NULL; YhbY-like,RNA-binding,CUFF.57922.1
(481 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits)   Value
AT3G25440.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain p...    381   e-106
AT3G25440.2 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain p...    379   e-105
AT2G28480.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain p...    144   1e-34
AT4G13070.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain p...    133   2e-31
AT3G27550.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain p...    119   7e-27
AT3G23070.1 | Symbols: ATCFM3A, CFM3A | CRM family member 3A | c...   112   6e-25
AT4G14510.1 | Symbols: ATCFM3B, CFM3B | CRM family member 3B | c...   107   2e-23
AT3G18390.1 | Symbols: EMB1865 | CRS1 / YhbY (CRM) domain-contai...   106   3e-23
AT4G29750.1 | Symbols: | CRS1 / YhbY (CRM) domain-containing pr...    104   1e-22
AT3G01370.1 | Symbols: ATCFM2, CFM2 | CRM family member 2 | chr3...    99   5e-21
AT5G16180.1 | Symbols: CRS1, ATCRS1 | ortholog of maize chloropl...    82   9e-16
>AT3G25440.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain
protein | chr3:9223998-9225505 FORWARD LENGTH=444
Length = 444
Score = 381 bits (978), Expect = e-106, Method: Compositional matrix adjust.
Identities = 215/438 (49%), Positives = 281/438 (64%), Gaps = 35/438 (7%)
Query: 54 SSSPIYRCYQ---------HLVCGQAQEPTFGHRHDLKPSAFIGHLSFHSGPFLKCNDKM 104
SSSP++ + ++VC + + + +F+ SFHS P + D
Sbjct: 31 SSSPLFSSFNSMNRTISSCNIVCKVLHKESLTR--PMWNVSFLRSSSFHSTPARETGDDD 88
Query: 105 VEQLQDSEKSPSDSCADNAKVQRKKLKGKRAVVRWLXXXXXXXXXEYERMTXXXXXXXXX 164
+ + ++S DSC + + K KRAVVRWL E+ERMT
Sbjct: 89 ISKSENSSSQDGDSCTKLKRKKLKG---KRAVVRWLKFFRWKKKKEFERMTSEEKILNKL 145
Query: 165 XXXXXXXXXXXXXLKKIEPTETSEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQ 224
+KK+EP+E++E THDPEILTPEEHF++LKMGLKCKNYVPVGRRGIYQ
Sbjct: 146 RKARKKEERLMETMKKLEPSESAETTHDPEILTPEEHFYYLKMGLKCKNYVPVGRRGIYQ 205
Query: 225 GVILNMHLHWKKHQTLKVIIKTFSSEEVKEIATELARLTGGIVLDIHEENTIIMYRGKNY 284
GVILNMHLHWKKHQTL+V+IKTF+ +EVKEIA ELARLTGGIVLD+HE NTIIMYRGKNY
Sbjct: 206 GVILNMHLHWKKHQTLQVVIKTFTPDEVKEIAVELARLTGGIVLDVHEGNTIIMYRGKNY 265
Query: 285 SQPPTEIMSPRITLPRKKALDKSIYRDGLRAVRRYIPRLEQELEILRAQIQSTGESTTEA 344
QPPTEIMSPRITLPRKKALDKS RD LRAVR+YIPRLEQEL++L+AQ ++ + T
Sbjct: 266 VQPPTEIMSPRITLPRKKALDKSKCRDALRAVRKYIPRLEQELQLLQAQAETKRDYTNVK 325
Query: 345 AEGIQKIDGASVESGGISNLEPQNSDKLRELI-NGNMGCPEDETDIESGLDSCSDTDKLS 403
+ Q + S++L+++I +++ + E+GL+ +D+D LS
Sbjct: 326 VDDNQ-----------------ERSEELKKIIERSEECLEDEQEEDEAGLELATDSD-LS 367
Query: 404 DIFETDSDVDDLVKEEKPLYLDEFDNFPEQSDGESDDFEEHLRQVSNSENMKKDVDSPKF 463
DIFETDS+++D K E+PL+L+EF+ FP ++ E +DF + + S E D SP F
Sbjct: 368 DIFETDSELED-AKTERPLFLEEFEKFPAINNREDEDFGDLGKAKSEGEENDDD-KSPNF 425
Query: 464 DEVDRIFLRAASFLKKKR 481
DEVD++FLRAA LKKKR
Sbjct: 426 DEVDKMFLRAAFLLKKKR 443
>AT3G25440.2 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain
protein | chr3:9224294-9225505 FORWARD LENGTH=380
Length = 380
Score = 379 bits (973), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/398 (52%), Positives = 267/398 (67%), Gaps = 24/398 (6%)
Query: 85 AFIGHLSFHSGPFLKCNDKMVEQLQDSEKSPSDSCADNAKVQRKKLKGKRAVVRWLXXXX 144
+F+ SFHS P + D + + ++S DSC + + K KRAVVRWL
Sbjct: 5 SFLRSSSFHSTPARETGDDDISKSENSSSQDGDSCTKLKRKKLKG---KRAVVRWLKFFR 61
Query: 145 XXXXXEYERMTXXXXXXXXXXXXXXXXXXXXXXLKKIEPTETSEITHDPEILTPEEHFFF 204
E+ERMT +KK+EP+E++E THDPEILTPEEHF++
Sbjct: 62 WKKKKEFERMTSEEKILNKLRKARKKEERLMETMKKLEPSESAETTHDPEILTPEEHFYY 121
Query: 205 LKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTLKVIIKTFSSEEVKEIATELARLTG 264
LKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTL+V+IKTF+ +EVKEIA ELARLTG
Sbjct: 122 LKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTLQVVIKTFTPDEVKEIAVELARLTG 181
Query: 265 GIVLDIHEENTIIMYRGKNYSQPPTEIMSPRITLPRKKALDKSIYRDGLRAVRRYIPRLE 324
GIVLD+HE NTIIMYRGKNY QPPTEIMSPRITLPRKKALDKS RD LRAVR+YIPRLE
Sbjct: 182 GIVLDVHEGNTIIMYRGKNYVQPPTEIMSPRITLPRKKALDKSKCRDALRAVRKYIPRLE 241
Query: 325 QELEILRAQIQSTGESTTEAAEGIQKIDGASVESGGISNLEPQNSDKLRELI-NGNMGCP 383
QEL++L+AQ ++ + T + Q + S++L+++I
Sbjct: 242 QELQLLQAQAETKRDYTNVKVDDNQ-----------------ERSEELKKIIERSEECLE 284
Query: 384 EDETDIESGLDSCSDTDKLSDIFETDSDVDDLVKEEKPLYLDEFDNFPEQSDGESDDFEE 443
+++ + E+GL+ +D+D LSDIFETDS+++D K E+PL+L+EF+ FP ++ E +DF +
Sbjct: 285 DEQEEDEAGLELATDSD-LSDIFETDSELED-AKTERPLFLEEFEKFPAINNREDEDFGD 342
Query: 444 HLRQVSNSENMKKDVDSPKFDEVDRIFLRAASFLKKKR 481
+ S E D SP FDEVD++FLRAA LKKKR
Sbjct: 343 LGKAKSEGEENDDD-KSPNFDEVDKMFLRAAFLLKKKR 379
>AT2G28480.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain
protein | chr2:12176642-12178031 REVERSE LENGTH=372
Length = 372
Score = 144 bits (364), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 74/152 (48%), Positives = 102/152 (67%), Gaps = 2/152 (1%)
Query: 178 LKKIEPTETSEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKH 237
LK+ E + P +T EE F+ KMG K NYVP+GRRG++ GVILNMHLHWKKH
Sbjct: 146 LKRYEVAKVQGPEVRPHEITGEERFYLKKMGQKRSNYVPIGRRGVFGGVILNMHLHWKKH 205
Query: 238 QTLKVIIKTFSSEEVKEIATELARLTGGIVLDIHEENTIIMYRGKNYSQPPTEIMSPRIT 297
+T+KVI +V++ A ELA+L+GG+ ++I ++TII YRGK Y QP ++MSP T
Sbjct: 206 ETVKVICNNSKPGQVQQYAEELAKLSGGVPVNIIGDDTIIFYRGKGYVQP--QVMSPIDT 263
Query: 298 LPRKKALDKSIYRDGLRAVRRYIPRLEQELEI 329
L +K+A +KS Y L +VR +I E+ELE+
Sbjct: 264 LSKKRAYEKSKYEQSLESVRHFIAIAEKELEL 295
>AT4G13070.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain
protein | chr4:7621780-7623051 REVERSE LENGTH=343
Length = 343
Score = 133 bits (335), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 96/144 (66%), Gaps = 3/144 (2%)
Query: 178 LKKIEPTETSEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKH 237
L+K + ++ +DPE LT EE + + G K KN+V VGRRG++ GV+LN+HLHWKKH
Sbjct: 164 LRKYDVPKSPAEPYDPESLTEEEQHYLKRTGEKRKNFVLVGRRGVFGGVVLNLHLHWKKH 223
Query: 238 QTLKVIIKTFSSE-EVKEIATELARLTGGIVLDIHEENTIIMYRGKNYSQPPTEIMSPRI 296
+T+KVI K + +V E A ELARL+ GIV+D+ NTI++YRGKNY +P E+MSP
Sbjct: 224 ETVKVICKPCNKPGQVHEYAEELARLSKGIVIDVKPNNTIVLYRGKNYVRP--EVMSPVD 281
Query: 297 TLPRKKALDKSIYRDGLRAVRRYI 320
TL + KAL+K Y L +I
Sbjct: 282 TLSKDKALEKYRYEQSLEHTSEFI 305
>AT3G27550.1 | Symbols: | RNA-binding CRS1 / YhbY (CRM) domain
protein | chr3:10208010-10209899 REVERSE LENGTH=491
Length = 491
Score = 119 bits (297), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 70/153 (45%), Positives = 101/153 (66%), Gaps = 2/153 (1%)
Query: 178 LKKIEPTETSEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKH 237
LKK + E HDPE+ T E+ F K+G K KNYVPVG RG++ GV+ NMH+HWK H
Sbjct: 74 LKKYDLPELPSPVHDPELFTSEQVQAFKKIGFKNKNYVPVGVRGVFGGVVQNMHMHWKFH 133
Query: 238 QTLKVIIKTFSSEEVKEIATELARLTGGIVLDIHEENTIIMYRGKNYSQPPTEIMSPRIT 297
+T++V F E++KE+A+ +ARL+GG+V++IH TIIM+RG+NY QP I P T
Sbjct: 134 ETVQVCCDNFPKEKIKEMASMIARLSGGVVINIHNVKTIIMFRGRNYRQPKNLI--PVNT 191
Query: 298 LPRKKALDKSIYRDGLRAVRRYIPRLEQELEIL 330
L ++KAL K+ + L + + I + EQ+L +
Sbjct: 192 LTKRKALFKARFEQALESQKLNIKKTEQQLRRM 224
>AT3G23070.1 | Symbols: ATCFM3A, CFM3A | CRM family member 3A |
chr3:8203548-8207243 FORWARD LENGTH=881
Length = 881
Score = 112 bits (280), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 109/182 (59%), Gaps = 9/182 (4%)
Query: 181 IEPTETSEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTL 240
++P E E DPE +T EE F F K+GLK K ++ +GRRG++ G + NMHLHWK + +
Sbjct: 626 LKPAEQRE---DPESITDEERFMFRKLGLKMKAFLLLGRRGVFDGTVENMHLHWKYRELV 682
Query: 241 KVIIKTFSSEEVKEIATELARLTGGIVLDIHEEN---TIIMYRGKNYSQPPTEIMSPRIT 297
K+I+K + + VK++A L +GGI++ I + II+YRG++Y +P ++ P+
Sbjct: 683 KIIVKAKTFDGVKKVALALEAESGGILVSIDKVTKGYAIIVYRGQDYKRPT--MLRPKNL 740
Query: 298 LPRKKALDKSIYRDGLRAVRRYIPRLEQELEILRAQIQSTGESTTEAAEGI-QKIDGASV 356
L ++KAL +SI + ++I ++ + + LRA+I+ + T + E + K+D A
Sbjct: 741 LTKRKALARSIELQRREGLLKHISTMQAKAKQLRAEIEQMEKVTDKGDEELYNKLDMAYA 800
Query: 357 ES 358
S
Sbjct: 801 SS 802
>AT4G14510.1 | Symbols: ATCFM3B, CFM3B | CRM family member 3B |
chr4:8337390-8341057 REVERSE LENGTH=907
Length = 907
Score = 107 bits (266), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 100/162 (61%), Gaps = 6/162 (3%)
Query: 178 LKKIEPT-ETSEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKK 236
L K+E + + +E D E +T EE F F K+GLK K ++ +GRRG++ G + NMHLHWK
Sbjct: 646 LAKVEESLKPAEQRTDLEGITEEERFMFQKLGLKMKAFLLLGRRGVFDGTVENMHLHWKY 705
Query: 237 HQTLKVIIKTFSSEEVKEIATELARLTGGIVLD---IHEENTIIMYRGKNYSQPPTEIMS 293
+ +K+++K + E +++A L +GGI++ I + +I+YRGK+Y +P T +
Sbjct: 706 RELIKILVKAKTLEGAQKVAMALEAESGGILVSVDKISKGYAVIVYRGKDYKRPTT--LR 763
Query: 294 PRITLPRKKALDKSIYRDGLRAVRRYIPRLEQELEILRAQIQ 335
P+ L ++KAL +S+ A+ ++I ++ E LRA+I+
Sbjct: 764 PKNLLTKRKALARSLELQKREALIKHIEAIQTRSEQLRAEIE 805
>AT3G18390.1 | Symbols: EMB1865 | CRS1 / YhbY (CRM)
domain-containing protein | chr3:6313572-6317584 FORWARD
LENGTH=848
Length = 848
Score = 106 bits (265), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 96/157 (61%), Gaps = 5/157 (3%)
Query: 191 HDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTLKVIIKTFSSE 250
+D E+++ EE F K+GLK K Y+P+G RG++ GVI NMHLHWK + +K+I K +
Sbjct: 648 YDQEVISEEERAMFRKVGLKMKAYLPIGIRGVFDGVIENMHLHWKHRELVKLISKQKNQA 707
Query: 251 EVKEIATELARLTGGIVLDIHEEN---TIIMYRGKNYSQPPTEIMSPRITLPRKKALDKS 307
V+E A L +GG+++ I + +I YRGKNY +P + + PR L + KAL +S
Sbjct: 708 FVEETARLLEYESGGVLVAIEKVPKGFALIYYRGKNYRRPIS--LRPRNLLTKAKALKRS 765
Query: 308 IYRDGLRAVRRYIPRLEQELEILRAQIQSTGESTTEA 344
I A+ ++I LE+ +E +++Q+ S S +E+
Sbjct: 766 IAMQRHEALSQHISELERTIEQMQSQLTSKNPSYSES 802
>AT4G29750.1 | Symbols: | CRS1 / YhbY (CRM) domain-containing
protein | chr4:14569728-14572962 FORWARD LENGTH=841
Length = 841
Score = 104 bits (260), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 87/148 (58%), Gaps = 5/148 (3%)
Query: 187 SEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTLKVIIKT 246
SE+ D EI+T EE + K+GL ++ +GRR +Y G I NMHLHWK + +KVI++
Sbjct: 638 SELPTDSEIITEEERLLYRKIGLSMDPFLLLGRREVYDGTIENMHLHWKHRELVKVIVRG 697
Query: 247 FSSEEVKEIATELARLTGGIVLDIHEE---NTIIMYRGKNYSQPPTEIMSPRITLPRKKA 303
S +VK IA L +GG+++ + + II+YRGKNY P + P L RKKA
Sbjct: 698 KSLPQVKHIAISLEAESGGVLVSVDKTMKGYAIILYRGKNYQMPFR--LRPSNLLTRKKA 755
Query: 304 LDKSIYRDGLRAVRRYIPRLEQELEILR 331
+SI A++ ++ LE+ +E+L+
Sbjct: 756 FARSIELQRREALKYHVADLEERIELLK 783
>AT3G01370.1 | Symbols: ATCFM2, CFM2 | CRM family member 2 |
chr3:139033-143477 FORWARD LENGTH=1011
Length = 1011
Score = 99.4 bits (246), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 158/341 (46%), Gaps = 44/341 (12%)
Query: 178 LKKIEPTETSEITH-DPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKK 236
L +E E+ +++ D E +T +E + K+GLK K ++ +GRRG++ G I NMHLHWK
Sbjct: 560 LADLENRESPQLSDIDKEGITNDEKYMLRKIGLKMKPFLLLGRRGVFDGTIENMHLHWKY 619
Query: 237 HQTLKVIIKTFSSEEVKEIATELARLTGGIVLDIH---EENTIIMYRGKNYSQPPTEIMS 293
+ +K+I +S E ++A L +GGI++ + + II+YRGKNY +P + +
Sbjct: 620 RELVKIICNEYSIEAAHKVAEILEAESGGILVAVEMVSKGYAIIVYRGKNYERP--QCLR 677
Query: 294 PRITLPRKKALDKSIYRDGLRAVRRYIPRLEQELEILRAQI---------QSTGEST--- 341
P+ L +++AL +S+ ++++ ++ +L +E L Q+ S GES+
Sbjct: 678 PQTLLSKREALKRSVEAQRRKSLKLHVLKLSNNIEELNRQLVEDSATNETWSDGESSNMM 737
Query: 342 ---------TEAAEGIQKIDGA-----SVESGGISNLEPQNSDKLRELINGNMGCPEDET 387
TE + +KI+ SV S G N E + ++ L + EDE+
Sbjct: 738 VEEETENQHTEPEKAREKIELGYSSDLSVPSSGEENWEDDSEGEVDPLTTSSQEYQEDES 797
Query: 388 DIESGL----DSCSDTDKLSDIFETDS-DVDDLVKEEKP--LYLDEFDNFPEQSDGESDD 440
+ S +S T LS ET S + P +L+ P S G
Sbjct: 798 ESASSQRHEGNSLDSTANLSVFAETGSANASSFHDRSLPHNSFLNANRKLPGSSTGSGSQ 857
Query: 441 FEEHLRQVSNSENMKKDVDSPKFDEVDRIFLRAASFLKKKR 481
+ S ++ + D+ + +R+ LR + KKR
Sbjct: 858 ISALRERKSENDGLVTDLSN-----RERLILRKQALKMKKR 893
Score = 50.1 bits (118), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 46/92 (50%), Gaps = 1/92 (1%)
Query: 196 LTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTLKVIIKTFSSEEVKEI 255
L P E +G++ + +G+ GI +G++ +H W+ + +K+ + S +K
Sbjct: 166 LPPAELRRLRTVGIRLTKKLKIGKAGITEGIVNGIHERWRTTEVVKIFCEDISRMNMKRT 225
Query: 256 ATELARLTGGIVLDIHEENTIIMYRGKNYSQP 287
L TGG+V+ + I++YRG NY P
Sbjct: 226 HDVLETKTGGLVI-WRSGSKILLYRGVNYQYP 256
>AT5G16180.1 | Symbols: CRS1, ATCRS1 | ortholog of maize chloroplast
splicing factor CRS1 | chr5:5279884-5282898 FORWARD
LENGTH=720
Length = 720
Score = 82.0 bits (201), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 88/158 (55%), Gaps = 4/158 (2%)
Query: 187 SEITHDPEILTPEEHFFFLKMGLKCKNYVPVGRRGIYQGVILNMHLHWKKHQTLKVIIKT 246
SE D EILT EE ++GLK + + +GRRG++ GV+ +H HWK + KVI
Sbjct: 563 SEGDDDIEILTNEERECLRRIGLKMNSSLVLGRRGVFFGVMEGLHQHWKHREVAKVITMQ 622
Query: 247 FSSEEVKEIATELARLTGGIVLDIH---EENTIIMYRGKNYSQPPTEIMSPRITLPRKKA 303
V A L + G+++ I E + I++YRGKNY +P +++M+ + L ++KA
Sbjct: 623 KLFSRVVYTAKALETESNGVLISIEKLKEGHAILIYRGKNYKRPSSKLMAQNL-LTKRKA 681
Query: 304 LDKSIYRDGLRAVRRYIPRLEQELEILRAQIQSTGEST 341
L +S+ L +++ + + E+ +E L+ + + +S
Sbjct: 682 LQRSVVMQRLGSLKFFAYQRERAIEDLKVSLVNLQDSA 719