Miyakogusa Predicted Gene
- Lj2g3v0933930.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v0933930.1 Non Characterized Hit- tr|I1N5J5|I1N5J5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.39497
PE,77.82,0,RNA-binding domain, RBD,NULL; coiled-coil,NULL; RRM_6,NULL;
no description,Nucleotide-binding, alpha,CUFF.35797.1
(916 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

Medtr6g004970.1 | zinc finger CCCH domain protein | HC | chr6:53...   1295    0.0
Medtr2g449870.1 | RNA recognition motif, a.k.a. RRM, RBD protein...    417    e-116
Medtr2g449950.1 | RNA recognition motif, a.k.a. RRM, RBD protein...    174    4e-43
Medtr2g449940.1 | hypothetical protein | LC | chr2:22032429-2203...    139    1e-32
Medtr2g449970.1 | hypothetical protein | LC | chr2:22043000-2204...     75    3e-13
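Note on the Score and E columns: the two are linked by the standard relation E = m * n * 2^(-S'), where S' is the bit score, m the query length and n the database length. The sketch below is only a rough illustration using the raw lengths printed in this report (916 query letters, 21,947,249 database letters). BLAST itself uses shortened "effective" lengths, so the reported values are expected to be somewhat smaller than this approximation. The function name approx_evalue is hypothetical and not part of any BLAST tool.

```python
import math


def approx_evalue(bit_score: float, query_len: int, db_len: int) -> float:
    """Approximate E = m * n * 2**(-S') from raw, uncorrected lengths.

    Assumption: no effective-length (edge-effect) correction is applied,
    so this overestimates the E-value that BLAST prints.
    """
    return query_len * db_len * math.pow(2.0, -bit_score)


# Values copied from this report; the bit score is that of the weakest hit.
estimate = approx_evalue(75.1, 916, 21_947_249)
print(f"approximate E-value: {estimate:.1e}")
```

For the weakest hit (75.1 bits) this gives a value of the same order of magnitude as the reported 3e-13, which is what the uncorrected formula should be expected to achieve.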
>Medtr6g004970.1 | zinc finger CCCH domain protein | HC |
chr6:534944-543221 | 20130731
Length = 910
Score = 1295 bits (3352), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 654/922 (70%), Positives = 732/922 (79%), Gaps = 20/922 (2%)
Query: 1 MELKVSSPKLESVAPSDCLSDPEEKEVSXXXXXXXXXXXXXXXXXSQSLERDVSDPVISR 60
MELK SSPK ESV PSDC SDPEE EVS SQSLERDVSDPVI+R
Sbjct: 1 MELKASSPKPESVVPSDCASDPEETEVSDDDDDDRNHKHRKKEDRSQSLERDVSDPVINR 60
Query: 61 PFRKRNKNFGNRPPFKGNESLSFETLKAYGDAATDKEFYSKFEXXXXXXXXXXXXXLDMS 120
PF+K +KNFGNR PF+ NES++FETL+ Y DA TDK+FYSKF+ DM+
Sbjct: 61 PFKKCHKNFGNRHPFRENESMAFETLRTYNDATTDKDFYSKFDRRRPGMTSGPRMPFDMN 120
Query: 121 QRLRTNQSFTVDPGAGRGRGRESGFWNQRESRLSSMDVASQIVQQRSIPSSLYTGRGLPN 180
QR+R NQ F DPGAGRGRGRESGFWNQRESR SS+DVASQ+VQQ I +LYTGRGLPN
Sbjct: 121 QRIRPNQLFAGDPGAGRGRGRESGFWNQRESRFSSIDVASQMVQQGPIHPALYTGRGLPN 180
Query: 181 VSNAQNASWNTFGLIPAVPNGGMDMLHPLGLQGTLRQPMNSSLNVNIPRQRCRDFEERGY 240
+SNAQNASWNTFGL+PAVPNGG+DMLHP+GLQGTLR P+NSSLNVNIPRQRCRDFEERG+
Sbjct: 181 ISNAQNASWNTFGLLPAVPNGGLDMLHPMGLQGTLRPPINSSLNVNIPRQRCRDFEERGF 240
Query: 241 CLRGDMCPMEHGVNRIVIEDVQGLSQFNLPVSLPSAPLIGAPAGSGSHHSVNASTTSVNS 300
CLRGDMCPMEHGVNRIV+EDVQ LSQFNLPVSL SA L GAP GSGS HSVN ST S+NS
Sbjct: 241 CLRGDMCPMEHGVNRIVVEDVQSLSQFNLPVSLTSAHLTGAPTGSGSLHSVNNSTASMNS 300
Query: 301 KCIPG--KKSVVGDDGMPVDSAYPGLGCTSGADLYDPDQPLWNNSGLESSNALLSIQSSK 358
KC PG KS+V D G +D AYPG GCTSGADLYDPDQPLWN+ GLE
Sbjct: 301 KCKPGIISKSIVSDVGSSMDGAYPGPGCTSGADLYDPDQPLWNDRGLE------------ 348
Query: 359 IDETELISSEALNSDYPVGTTRTSVNLQGSSSSVWARMNSSRNRFDMKEKTNSMISSFHY 418
ID+ E +SS+A +S PV TRTSV+LQG+SSSVW R+ S+NRFD KEK+N +SSFH+
Sbjct: 349 IDDAEPMSSDAPDSVCPVEATRTSVSLQGASSSVWGRIGGSKNRFDTKEKSNPTMSSFHF 408
Query: 419 PQNQLKEDNDELVGSRSTSCQGKQIIADDTDPRSMETLLKAQAFNMRNIRKPSQKALCTL 478
P NQ KEDNDELVG S S QGKQIIADD PR+ E LKAQ +MRNIRKPSQKAL TL
Sbjct: 409 PDNQPKEDNDELVGCHSASSQGKQIIADDAIPRAFEASLKAQ-IDMRNIRKPSQKALRTL 467
Query: 479 FVSGIPQRSNKRETLLTHFKKFGEVIDIHIPMNSDRAFVQFSKREEAEAALKSPDAVMGN 538
FV+GIP +SN+R+ LL HFKKFGEVIDI+IP+NS+RAFVQFSKREEAEAAL++PDAVMGN
Sbjct: 468 FVNGIPHKSNRRDALLAHFKKFGEVIDIYIPLNSERAFVQFSKREEAEAALRAPDAVMGN 527
Query: 539 RFIKLFWANRDCV--RNDCTASGNGVIVTPRGQAPAFVPSHPVVTDRRKDIHQTDASKTI 596
RFIKL+WANRDC+ N ++SGNG IVTPRGQ P FVPSHPV TDRRKDIHQ DAS+T
Sbjct: 528 RFIKLWWANRDCIPSENTSSSSGNGAIVTPRGQ-PTFVPSHPVATDRRKDIHQPDASRTT 586
Query: 597 FEVPSPSDQPKHIIAGGPKAPPPSQKKYENLEHLKEELRKKQEMLDQKRNEFKRQLSKLE 656
FE SPSD K +IA PK PPP Q+K ENLEHLKE+LRKKQEMLDQKRNEFKRQL+KLE
Sbjct: 587 FEESSPSDPSKLVIADAPKVPPPLQRKLENLEHLKEQLRKKQEMLDQKRNEFKRQLNKLE 646
Query: 657 KQATGLKGEIVTEHAAKRPKTSMTTDVAKLTSPQSSDADPGMTSLHAEATTDRNKQLVST 716
KQATG KGE VTE AKRPKTSM +DVAKL SPQSSDAD GM+S AE D+NKQL ++
Sbjct: 647 KQATGPKGEAVTEQPAKRPKTSMASDVAKLASPQSSDADIGMSSSQAETAVDKNKQLANS 706
Query: 717 VSQSPKASTTMRVLEPTGLNQPIQPFVPVNRYKLDNRPTAFRIITPLPDGLANVASLKEH 776
VSQSPK ST + EP GL Q IQ VPVNRYKLDNRPTAFRII PLP GLANVA+L+EH
Sbjct: 707 VSQSPKPSTPRKPHEPAGLKQSIQSLVPVNRYKLDNRPTAFRIIPPLPVGLANVAALEEH 766
Query: 777 FLPYGELSSVELVDVQVDDSSQQEAHINFTTRRAAERAFINGKCWKDHNLEFMWLASANS 836
FLPYGELS+VEL DVQV+DSS+QEA +NFTTR AAE+AF GKCWKDHNL+FMWL NS
Sbjct: 767 FLPYGELSAVELEDVQVNDSSEQEARLNFTTRGAAEQAFTKGKCWKDHNLKFMWLTPTNS 826
Query: 837 SNATGSREPSLSA-PKEPLD-RDDHSEEKLGNAVNQEAAVSDGERKNSENENGLKVMEME 894
NAT SRE SLSA P EPLD + +SEEK N+ N EA VSDGE K+SE +N L+ M+ E
Sbjct: 827 GNATVSRERSLSAPPSEPLDTTNSNSEEKSRNSANHEAIVSDGEHKDSETKNDLENMKTE 886
Query: 895 PGEDLQCSPRQVSSAKQSPEGN 916
EDLQC+ QVSSAKQSPE N
Sbjct: 887 QDEDLQCTTSQVSSAKQSPENN 908
>Medtr2g449870.1 | RNA recognition motif, a.k.a. RRM, RBD protein |
LC | chr2:22009059-22002716 | 20130731
Length = 599
Score = 417 bits (1071), Expect = e-116, Method: Compositional matrix adjust.
Identities = 227/381 (59%), Positives = 265/381 (69%), Gaps = 45/381 (11%)
Query: 169 PSSLYTGRGLPNVSNAQNASWNTFGLIPAVPNGGMDMLHPLGLQGTLRQPMNSSLNVNIP 228
P LY G LPN+SNAQN S NTFGL+PAVPNGG+DMLH +GLQGTLR P++SSLNVNIP
Sbjct: 72 PFILYVG--LPNISNAQNVSCNTFGLLPAVPNGGLDMLHQMGLQGTLRTPIDSSLNVNIP 129
Query: 229 RQRCRDFEERGYCLRGDMCPMEHGVNRIVIEDVQGLSQFNLPVSLPSAPLIGAPAGSGSH 288
QRCRDFEE G+CLRGDMCPMEHGVNRIV+EDVQ + PS ++
Sbjct: 130 CQRCRDFEECGFCLRGDMCPMEHGVNRIVVEDVQSF------IVQPSCFTYKCTPNWSTY 183
Query: 289 ----HSVNASTTSVNSKCIPG--KKSVVGDDGMPVDSAYPGLGCTSGADLYDPDQPLWNN 342
SVN T S+NSKC PG KS+V D G+P+D AYPG GCTSGADLYDPDQPLWN+
Sbjct: 184 CIWITSVNNLTASMNSKCKPGIISKSIVSDVGLPMDGAYPGPGCTSGADLYDPDQPLWND 243
Query: 343 SGLESSNALLSIQSSKIDETELISSEALNSDYPVGTTRTSVNLQGSSSSVWARMNSSRNR 402
GLESSNALL++QSSKID+ E +SS+A N P TRTS S +
Sbjct: 244 RGLESSNALLNMQSSKIDDAEPMSSDAPNRVCPSEATRTS---------------GSLHD 288
Query: 403 FDMKEKTNSMISSFHYPQNQLKEDNDELVGSRSTSCQGKQIIADDTDPRSMETLLKAQAF 462
F KEK+N ++SSFH+P NQ KEDNDEL DD PR+ + K
Sbjct: 289 FTPKEKSNPIVSSFHFPDNQSKEDNDEL--------------EDDAIPRAFKASQKP-LI 333
Query: 463 NMRNIRKPSQKALCTLFVSGIPQRSNKRETLLTHFKKFGEVIDIHIPMNSDRAFVQFSKR 522
+ R+I KPSQKAL TLFV+GIP +SN RE L HFKKFGEVI+ +IPMNS+RAFVQFSKR
Sbjct: 334 DTRSICKPSQKALHTLFVNGIPHKSNGREA-LAHFKKFGEVINFYIPMNSERAFVQFSKR 392
Query: 523 EEAEAALKSPDAVMGNRFIKL 543
EEAEAAL++PD+VMGNRFI L
Sbjct: 393 EEAEAALRTPDSVMGNRFINL 413
>Medtr2g449950.1 | RNA recognition motif, a.k.a. RRM, RBD protein |
LC | chr2:22037929-22038884 | 20130731
Length = 175
Score = 174 bits (440), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 95/172 (55%), Positives = 117/172 (68%), Gaps = 18/172 (10%)
Query: 446 DDTDPRSME---TLLKAQAFNMRNIRKPSQKALCTLFVSGIPQRSNKRETLLTHFKKFGE 502
DD +P S + ++ + +A ++RK SQK L TLFV+GIP RSN++E LL HFK GE
Sbjct: 19 DDAEPMSSDAPNSVFQVEA-TRTSVRKLSQKGLYTLFVNGIPHRSNRKEALLAHFKMLGE 77
Query: 503 VIDIHIPMNSDRAFVQFSKREEAEAALKSPDAVMGNRFIKLFWANRDCVRNDCTASGNGV 562
VI+I+IPMNS+RAFVQFS REEAEA+L++ D VMGNRFIKL GNG
Sbjct: 78 VINIYIPMNSERAFVQFSNREEAEASLRARDVVMGNRFIKL-------------CKGNGA 124
Query: 563 IVTPRGQAPAFVPSHPVVTDRRKDIHQTDASKTIFEVPSPSDQPKHIIAGGP 614
IVT RG+ P FVPSHPV TDRRKD HQ++AS+ E SPS K +IA P
Sbjct: 125 IVTLRGKPPTFVPSHPVGTDRRKDNHQSEASRITKE-SSPSKLSKSVIADAP 175
>Medtr2g449940.1 | hypothetical protein | LC |
chr2:22032429-22032112 | 20130731
Length = 105
Score = 139 bits (350), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 69/103 (66%), Positives = 79/103 (76%)
Query: 553 NDCTASGNGVIVTPRGQAPAFVPSHPVVTDRRKDIHQTDASKTIFEVPSPSDQPKHIIAG 612
+ + GNG IVTPR Q P FVPSH V TDRRKDIHQT+AS+TIFE SPSD K +IA
Sbjct: 3 HQLPSRGNGAIVTPRRQPPTFVPSHLVGTDRRKDIHQTEASRTIFEESSPSDASKLVIAD 62
Query: 613 GPKAPPPSQKKYENLEHLKEELRKKQEMLDQKRNEFKRQLSKL 655
PK PPP ++ ENLEHLKE+L KK EMLDQKRN+FK QL+KL
Sbjct: 63 APKVPPPLHEELENLEHLKEQLHKKWEMLDQKRNKFKHQLNKL 105
>Medtr2g449970.1 | hypothetical protein | LC |
chr2:22043000-22043467 | 20130731
Length = 87
Score = 75.1 bits (183), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 31/40 (77%), Positives = 39/40 (97%)
Query: 512 SDRAFVQFSKREEAEAALKSPDAVMGNRFIKLFWANRDCV 551
S++AFVQFSKREEAEA+L++P+AVMGN FIKL+WANRDC+
Sbjct: 27 SEQAFVQFSKREEAEASLRTPEAVMGNHFIKLWWANRDCI 66
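Note on the Identities, Positives and Gaps lines: each percentage is the stated count divided by the alignment length. In this report the printed percentages appear to be truncated rather than rounded (for example, 654/922 is about 70.9% but is shown as 70%). The following minimal sketch parses one such line and recomputes the truncated percentage so the two can be compared. The regular expression and the helper name check_stats are assumptions made for illustration, based only on the line layout seen above.

```python
import re

# Matches e.g. "Identities = 654/922 (70%)"; layout assumed from this report.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(line: str) -> dict:
    """Map each label to (count, length, reported_pct, truncated_pct)."""
    result = {}
    for label, num, den, pct in STAT.findall(line):
        count, length = int(num), int(den)
        # Integer division truncates toward zero for these positive values.
        result[label] = (count, length, int(pct), (100 * count) // length)
    return result


line = "Identities = 654/922 (70%), Positives = 732/922 (79%), Gaps = 20/922 (2%)"
for label, (count, length, reported, truncated) in check_stats(line).items():
    print(f"{label}: {count}/{length} reported {reported}%, recomputed {truncated}%")
```

Lines that omit the Gaps field, as in the last two alignments, are handled as well, because only the fields that are present are matched.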