Miyakogusa Predicted Gene
- Lj0g3v0218509.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0218509.1 Non Characterized Hit- tr|I1N1V5|I1N1V5_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,60.64,0,seg,NULL;
coiled-coil,NULL; BHLH FAMILY PROTEIN,NULL; STRUCTURAL MAINTENANCE OF
CHROMOSOMES SMC FAMI,CUFF.14123.1
(876 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr8g471150.1 | basic helix loop helix protein, putative | HC ... 829 0.0
Medtr2g009620.1 | viral A-type inclusion protein, putative | HC ... 281 3e-75
Medtr2g009620.2 | viral A-type inclusion protein, putative | HC ... 281 3e-75
Medtr5g032080.1 | viral A-type inclusion protein, putative | HC ... 213 1e-54
>Medtr8g471150.1 | basic helix loop helix protein, putative | HC |
chr8:28762436-28766018 | 20130731
Length = 767
Score = 829 bits (2141), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/721 (64%), Positives = 556/721 (77%), Gaps = 51/721 (7%)
Query: 5 IYEELDEAKAEIERLKSELRAKIVSLENLKKSHYAQENQIQEAKLKAENLDQELLQKESA 64
+YEELDEAK+EI++LK+ELRAK SLENLK+S AQ NQ QEAK K+E LDQELLQK
Sbjct: 4 VYEELDEAKSEIDKLKAELRAKSDSLENLKRSLNAQVNQTQEAKSKSEKLDQELLQKADE 63
Query: 65 IAEAKQLWEDLKGNLNEQASIIKHLTAANDKLRVDCDEKFNNWENEKNGLLLALDEANEK 124
IAEAK L+E LKGNL EQ SIIKHL AANDKLRVD DEK WENEK GL+LAL+EAN+K
Sbjct: 64 IAEAKSLYESLKGNLKEQESIIKHLKAANDKLRVDFDEKIKMWENEKRGLVLALEEANDK 123
Query: 125 AESQEQQIYLYRQEIQCLKDSLSVSQKKCLENEKNFKASNELRERDDLFQKIEDEKRKVE 184
AE+Q+QQ+ YR+EI+ LK LSVS++KC E++K K+S EL ERD +FQK+E+EK K+E
Sbjct: 124 AENQDQQVCRYRKEIESLKSCLSVSKQKCSESQKKLKSSKELSERDGMFQKLEEEKVKLE 183
Query: 185 DQLKWKKEQFKHLEEAHEKLRDEFRSCKKEWEMLKSTLLDEISSLEIKLDSQDRISENLQ 244
DQLKWKKEQFKHLEEA+EKL+ +F+S KKEWEM KSTLLDEISSLE KL+SQ RISE+LQ
Sbjct: 184 DQLKWKKEQFKHLEEAYEKLKGQFKSSKKEWEMEKSTLLDEISSLETKLESQIRISEDLQ 243
Query: 245 HQLQTCHQALAHVESQKKRLEVEVSDFKEKLDNASSEYQDARLQLDCLNSQCDSDIADLR 304
HQLQTCHQALAHVESQKKRLEVEVSDF+ +LDNA SEY DARLQLDCLNS D DI DLR
Sbjct: 244 HQLQTCHQALAHVESQKKRLEVEVSDFRLQLDNAGSEYHDARLQLDCLNSDRDKDIVDLR 303
Query: 305 YSLKANEAYHKELKYRIEKLEQENQELRMSLKEFQEAQIQEAGGSYSQSK--PKRRNLEQ 362
YSLK EA+ KE KY++EKLEQENQELRMSL+E QE+QIQ AG YSQSK K +NLEQ
Sbjct: 304 YSLKTKEAHIKEAKYQMEKLEQENQELRMSLRELQESQIQ-AGAYYSQSKLRTKLKNLEQ 362
Query: 363 THSECSSNFKAKEAEWNSQLEQLKGDLNSCRSELETKIAAVEELQKELERSHSLTIEMKL 422
TH EC+ KA+EAEW+S++EQL G LN+C+SELE KIAAVEELQ ELE SH + +E +L
Sbjct: 363 THKECALTLKAREAEWDSRIEQLTGQLNTCQSELEAKIAAVEELQMELESSHLIVVETRL 422
Query: 423 VNEEMSVMLLVLKNGLSEAQEKLANHKNEMDLFNKEREEKIFQLMQQLEMKDAALISANK 482
+NEEMSVMLLVLK G+SEAQ +LAN+K+EMDL NKE+E +IFQLM+QLEMKD +LISA K
Sbjct: 423 LNEEMSVMLLVLKQGISEAQLRLANYKDEMDLLNKEKEREIFQLMKQLEMKDDSLISAQK 482
Query: 483 SINEECEREACLMRQVESCESKIELQHS-------------LQDELDKHKEMLEESSMYQ 529
+NEE E+ CLMR++ES S ELQ S LQ+ELD++KEMLEES+ Q
Sbjct: 483 GLNEEREKAECLMRKIESFGSSKELQRSLQNEPESYGCNKELQNELDRYKEMLEESTRCQ 542
Query: 530 LILKETVLQMECYLKEQMKEVHDALDSTLIELDERICERNEMEFELQIWKSIVERMKNDL 589
IL+E VLQ+EC KEQ++E H+ALD + ELDERICER+EMEFELQIWKSI++R+KNDL
Sbjct: 543 RILEEKVLQIECESKEQLRETHEALDIAINELDERICERSEMEFELQIWKSILDRLKNDL 602
Query: 590 EENYVMRKELETSLLSQVDVCECLKEEKESLVYKLEEEEKRLDNVRYLQIIEEKDKAXXX 649
EE+++MRKELE SL++QVDV E +K+EK+ V +L++E L+
Sbjct: 603 EESHLMRKELEASLIAQVDVGESIKQEKDKAVEELQKEVFMLEQ---------------- 646
Query: 650 XXXXXXXXXXXSFRRELENVLITKGTMERIYE---------YEKENLIQLMKGKNMRIDE 700
SFRRE E+V+I K TMER E E+ENL+ ++G RI E
Sbjct: 647 ----------ESFRREFESVVIAKSTMERACELTGNATELSLERENLLAFVQGLYDRIYE 696
Query: 701 L 701
L
Sbjct: 697 L 697
>Medtr2g009620.1 | viral A-type inclusion protein, putative | HC |
chr2:1985911-1981880 | 20130731
Length = 909
Score = 281 bits (718), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 219/643 (34%), Positives = 364/643 (56%), Gaps = 42/643 (6%)
Query: 1 MDNNIYEELDEAKAEIERLKSELRAKIVSLENLKKSHYAQENQIQEAKLKAENLDQELLQ 60
M+++ + +LD AKAE+E+L++E R K +E+LK + + AE +EL
Sbjct: 1 MEDSSHTDLDYAKAELEKLRAECRVKTQQIESLKNDRARETTNL------AEKHARELDL 54
Query: 61 KESAIAEAKQLWEDLKGNLNEQASIIKHLTAANDKLRVDCDEKFNNWENEKNGLLLALDE 120
K I E K++ EDL+ +L E+ I HL + N+K+ E+ E + L+L LDE
Sbjct: 55 KSEEIYELKRINEDLESSLREKEKYIVHLNSENNKIEARFAERVFKLEGSNSELVLTLDE 114
Query: 121 ANEKAESQEQQIYLYRQEIQCLKDSLSVSQKKCLENEKNFKASNELRERDDLFQKIEDEK 180
+ E+ + +E+ LK SL ++KKC+E E+ K + ++ ++D+ ++E+E
Sbjct: 115 ITARNSCLEKNVCESSEEVSRLKSSLLAAEKKCIEAEERAKQAKTMKLKEDVIMQLEEEN 174
Query: 181 RKVEDQLKWKKEQFKHLEEAHEKLRDEFRSCKKEWEMLKSTLLDEISSLEIKLDSQDRIS 240
V+D++KW+ EQFKHLEEA++ L+D+F+ K+EWE +S L+ EISSL++ L+SQ R
Sbjct: 175 VTVQDKIKWRNEQFKHLEEAYQHLKDQFQLSKEEWEKERSLLVGEISSLQMSLNSQTRTL 234
Query: 241 ENLQHQLQTCHQALAHVESQKKRLEVEVSDFKEKLDNASSEYQDARLQLDCLNSQCDSDI 300
E LQ + + C+ ALA ES++K LE E+S+FK ++ + ++ + +++ L + + +I
Sbjct: 235 EGLQSRFEMCNHALACEESKRKLLEAEISEFKTSFEDVYGQCEEKKFEIEELTVRRNDEI 294
Query: 301 ADLRYSLKANEAYHKELKYRIEKLEQENQELRMSLKEFQEAQIQEAGGSYSQSK--PKRR 358
A+LR SL E KEL+ +I LEQ+NQE+ LKEF+EAQI+ AGG+ SK K R
Sbjct: 295 AELRNSLAEKEILVKELERKIVLLEQDNQEVGDLLKEFREAQIRGAGGNSMTSKLRNKLR 354
Query: 359 NLEQTHSECSSNFKAKEAEWNSQLEQLKGDLNSCRSELETKIAAVEELQKELERSHSLTI 418
LE+ H CSS K+KE++W+ Q+ +++ D+ +S L K + ELQ ELE +
Sbjct: 355 KLEEVHKNCSSVLKSKESQWDCQVAKMEADVIGYQSALTNKEQEIRELQIELENCYCAI- 413
Query: 419 EMKLVNEEMSVMLLVLKN--GLSEAQEKLANHKNEMDLFNKEREEKIFQLMQQLEMKDAA 476
EE + LL+ K+ +++A K + + +E + I +QL +KD +
Sbjct: 414 ------EENHIELLIFKSVLAVADAYSKSFGTETGKAVCVEENGDTILNFSEQLRLKDNS 467
Query: 477 LISANKSINEECEREACLMRQVESCESKIELQHSLQDELDKHKEMLEESSMYQLILKETV 536
L + + Q L++E + K+ LEESS QLILKE +
Sbjct: 468 LKTMAQK------------------------QFLLEEEFEHQKKCLEESSAGQLILKEQL 503
Query: 537 LQMECYLKEQMKEVHDALDSTLIELDERICERNEMEFELQIWKSIVERMKNDLEENYVMR 596
LQME LK + K +AL+ E+ + E + ++ E + WKS VE ++ +E
Sbjct: 504 LQMENTLKHERKVSFEALEMLKHEMASKNDELSRLDCEARHWKSTVETLRVSYQEIQGTC 563
Query: 597 KELETSLLSQVDVCECLKEEKESLVYKLEEEEKRLDNVRYLQI 639
KE+ETSLLS+ + LK E ++L+ ++++E+ ++++ LQI
Sbjct: 564 KEMETSLLSRDANEQALKLENKNLLCIVKDQERDTEDLQ-LQI 605
>Medtr2g009620.2 | viral A-type inclusion protein, putative | HC |
chr2:1985974-1981800 | 20130731
Length = 909
Score = 281 bits (718), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 219/643 (34%), Positives = 364/643 (56%), Gaps = 42/643 (6%)
Query: 1 MDNNIYEELDEAKAEIERLKSELRAKIVSLENLKKSHYAQENQIQEAKLKAENLDQELLQ 60
M+++ + +LD AKAE+E+L++E R K +E+LK + + AE +EL
Sbjct: 1 MEDSSHTDLDYAKAELEKLRAECRVKTQQIESLKNDRARETTNL------AEKHARELDL 54
Query: 61 KESAIAEAKQLWEDLKGNLNEQASIIKHLTAANDKLRVDCDEKFNNWENEKNGLLLALDE 120
K I E K++ EDL+ +L E+ I HL + N+K+ E+ E + L+L LDE
Sbjct: 55 KSEEIYELKRINEDLESSLREKEKYIVHLNSENNKIEARFAERVFKLEGSNSELVLTLDE 114
Query: 121 ANEKAESQEQQIYLYRQEIQCLKDSLSVSQKKCLENEKNFKASNELRERDDLFQKIEDEK 180
+ E+ + +E+ LK SL ++KKC+E E+ K + ++ ++D+ ++E+E
Sbjct: 115 ITARNSCLEKNVCESSEEVSRLKSSLLAAEKKCIEAEERAKQAKTMKLKEDVIMQLEEEN 174
Query: 181 RKVEDQLKWKKEQFKHLEEAHEKLRDEFRSCKKEWEMLKSTLLDEISSLEIKLDSQDRIS 240
V+D++KW+ EQFKHLEEA++ L+D+F+ K+EWE +S L+ EISSL++ L+SQ R
Sbjct: 175 VTVQDKIKWRNEQFKHLEEAYQHLKDQFQLSKEEWEKERSLLVGEISSLQMSLNSQTRTL 234
Query: 241 ENLQHQLQTCHQALAHVESQKKRLEVEVSDFKEKLDNASSEYQDARLQLDCLNSQCDSDI 300
E LQ + + C+ ALA ES++K LE E+S+FK ++ + ++ + +++ L + + +I
Sbjct: 235 EGLQSRFEMCNHALACEESKRKLLEAEISEFKTSFEDVYGQCEEKKFEIEELTVRRNDEI 294
Query: 301 ADLRYSLKANEAYHKELKYRIEKLEQENQELRMSLKEFQEAQIQEAGGSYSQSK--PKRR 358
A+LR SL E KEL+ +I LEQ+NQE+ LKEF+EAQI+ AGG+ SK K R
Sbjct: 295 AELRNSLAEKEILVKELERKIVLLEQDNQEVGDLLKEFREAQIRGAGGNSMTSKLRNKLR 354
Query: 359 NLEQTHSECSSNFKAKEAEWNSQLEQLKGDLNSCRSELETKIAAVEELQKELERSHSLTI 418
LE+ H CSS K+KE++W+ Q+ +++ D+ +S L K + ELQ ELE +
Sbjct: 355 KLEEVHKNCSSVLKSKESQWDCQVAKMEADVIGYQSALTNKEQEIRELQIELENCYCAI- 413
Query: 419 EMKLVNEEMSVMLLVLKN--GLSEAQEKLANHKNEMDLFNKEREEKIFQLMQQLEMKDAA 476
EE + LL+ K+ +++A K + + +E + I +QL +KD +
Sbjct: 414 ------EENHIELLIFKSVLAVADAYSKSFGTETGKAVCVEENGDTILNFSEQLRLKDNS 467
Query: 477 LISANKSINEECEREACLMRQVESCESKIELQHSLQDELDKHKEMLEESSMYQLILKETV 536
L + + Q L++E + K+ LEESS QLILKE +
Sbjct: 468 LKTMAQK------------------------QFLLEEEFEHQKKCLEESSAGQLILKEQL 503
Query: 537 LQMECYLKEQMKEVHDALDSTLIELDERICERNEMEFELQIWKSIVERMKNDLEENYVMR 596
LQME LK + K +AL+ E+ + E + ++ E + WKS VE ++ +E
Sbjct: 504 LQMENTLKHERKVSFEALEMLKHEMASKNDELSRLDCEARHWKSTVETLRVSYQEIQGTC 563
Query: 597 KELETSLLSQVDVCECLKEEKESLVYKLEEEEKRLDNVRYLQI 639
KE+ETSLLS+ + LK E ++L+ ++++E+ ++++ LQI
Sbjct: 564 KEMETSLLSRDANEQALKLENKNLLCIVKDQERDTEDLQ-LQI 605
>Medtr5g032080.1 | viral A-type inclusion protein, putative | HC |
chr5:13767234-13769396 | 20130731
Length = 720
Score = 213 bits (541), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/237 (52%), Positives = 156/237 (65%), Gaps = 14/237 (5%)
Query: 207 EFRSCKKEWEMLKSTLLDEISSLEIKLDSQDRISENLQHQLQTCHQALAHVESQKKRLEV 266
+F+ C KE E KSTLLDEIS L+ KLDS ++S++LQHQL C Q LAH ESQ+K +EV
Sbjct: 95 QFKPCTKECESEKSTLLDEISFLKSKLDSHIKVSQDLQHQLHMCKQLLAHEESQRKSIEV 154
Query: 267 EVSDFKEKLDNASSEYQDARLQLDCLNSQCDSDIADLRYSLKANEAYHKELKYRIEKLEQ 326
EV D K K + LNSQ D DI DLR +LK E Y+KE KY EKLEQ
Sbjct: 155 EVLDLKSKSEG--------------LNSQKDKDIEDLRKALKIQEVYYKESKYSNEKLEQ 200
Query: 327 ENQELRMSLKEFQEAQIQEAGGSYSQSKPKRRNLEQTHSECSSNFKAKEAEWNSQLEQLK 386
ENQ+LR SL+E QE+Q A S S + R L++TH EC FKA++ EW+ QLEQ+
Sbjct: 201 ENQQLRKSLRELQESQDARASYSISMLRSNLRGLQKTHRECVKIFKARQVEWSFQLEQMS 260
Query: 387 GDLNSCRSELETKIAAVEELQKELERSHSLTIEMKLVNEEMSVMLLVLKNGLSEAQE 443
++++ R LE K A +E+L+KELE S S IEM L+NEEM VMLLVLK G+SE E
Sbjct: 261 DNIDNYRYALEVKAATIEKLKKELECSQSFNIEMMLLNEEMFVMLLVLKEGISEHNE 317
Score = 77.8 bits (190), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/71 (61%), Positives = 52/71 (73%)
Query: 3 NNIYEELDEAKAEIERLKSELRAKIVSLENLKKSHYAQENQIQEAKLKAENLDQELLQKE 62
NN+YEELDEAK EI++LK+ELR K S +NLK+SH Q NQIQEA KAE L+QELLQK
Sbjct: 2 NNVYEELDEAKGEIKKLKAELRGKKRSYKNLKRSHDVQVNQIQEAISKAEKLEQELLQKA 61
Query: 63 SAIAEAKQLWE 73
A Q+ E
Sbjct: 62 DEFIYADQIHE 72