Miyakogusa Predicted Gene
- Lj4g3v2804810.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2804810.1 Non Characterized Hit- tr|I1K4M6|I1K4M6_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.8711 PE=,91.9,0,no
description,Armadillo-like helical; ARMADILLO/BETA-CATENIN REPEAT
FAMILY PROTEIN / U-BOX DOMAIN-C,CUFF.51694.1
(917 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr8g091470.1 | cellulose synthase-interactive protein | HC | ... 1571 0.0
Medtr3g087800.1 | cellulose synthase-interactive protein | HC | ... 1464 0.0
Medtr1g029790.1 | armadillo/beta-catenin-like repeat protein | H... 876 0.0
>Medtr8g091470.1 | cellulose synthase-interactive protein | HC |
chr8:38165718-38156457 | 20130731
Length = 2216
Score = 1571 bits (4067), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 788/901 (87%), Positives = 818/901 (90%)
Query: 2 QIDNKEDNSMWICALLLAILFQDRDVIRAHPTTKSVPALASLLKSEESANKYFAAQAISS 61
QID KEDNSMWICALLLAILFQDRD+IRAH T KSVPALA+LLKSEESANKYFAAQ+I+S
Sbjct: 1120 QIDYKEDNSMWICALLLAILFQDRDIIRAHSTIKSVPALANLLKSEESANKYFAAQSIAS 1179
Query: 62 LVCNGSRGTLLSVXXXXXXXXXXXXXXXXDTDIQDLLELSEEFSLVRYPDQVALERLFRV 121
LVCNGSRGTLLSV DTDIQ+LLELSEEFSLVRYPDQVALE+LFRV
Sbjct: 1180 LVCNGSRGTLLSVTNSGAASGLISLLGCADTDIQELLELSEEFSLVRYPDQVALEKLFRV 1239
Query: 122 DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALGILTHLGRDCLSNKIIMVESGALEAL 181
DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALG+LT LGRDC NK +MVESGALEAL
Sbjct: 1240 DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALGLLTQLGRDCPPNKTVMVESGALEAL 1299
Query: 182 NKYLSLSPQDATEEAATDLLGILFSSVEIRKHESAFGAVAQLVAVLRLGGRSARYSAAKA 241
KYLSL PQDATEEAATDLLGILFSS +IRKH+SAFGAV QLVAVLRLGGR ARYSAAKA
Sbjct: 1300 TKYLSLGPQDATEEAATDLLGILFSSADIRKHDSAFGAVNQLVAVLRLGGRGARYSAAKA 1359
Query: 242 LESLFSADHIRNAEIARQAFQPLVEILNTGSEREQHAAIAALIGLLSENPSRALAVADVE 301
LESLFSADHIRNA+IARQA QPLVEILNTGSEREQHAAIAAL+GLLSENPSRALAVADVE
Sbjct: 1360 LESLFSADHIRNADIARQAVQPLVEILNTGSEREQHAAIAALVGLLSENPSRALAVADVE 1419
Query: 302 MNAVDVLCRILSSNCSMDLKGDAAELCCALFGNTRIRSTMGAARCVEPLVSLLATEFSPA 361
MNAVDVL RILSSNCSMDLKGDAAELCCALFGNTRIRST AARCVEPLVSLLATEFSPA
Sbjct: 1420 MNAVDVLYRILSSNCSMDLKGDAAELCCALFGNTRIRSTAAAARCVEPLVSLLATEFSPA 1479
Query: 362 QHSVVRALDRLVDDEQLAELVAAHGAVVPLVSLLSGSNYILHEATSRALVKLGKDRPACK 421
HSVVRALDRLVDDEQLAELVAAHGAV+PLVSLLSG NY+LHEA SRALVKLGKDRPACK
Sbjct: 1480 HHSVVRALDRLVDDEQLAELVAAHGAVIPLVSLLSGRNYVLHEAISRALVKLGKDRPACK 1539
Query: 422 MEMVKAGVIESILDILTDAPDYLIAAFAELLRILTNNATIAKGPSAAKVVEPLFLLLKRD 481
MEMVKAGVIESILDIL +APDYL+AAFAELLRILTNNATIAKGP AAKVVEPLFLLL R
Sbjct: 1540 MEMVKAGVIESILDILQEAPDYLLAAFAELLRILTNNATIAKGPRAAKVVEPLFLLLARH 1599
Query: 482 DFGPDGQHSALQVLVNILEHPQCRSDYSLTSRQVIEPLIPLLDSPIPAVQQLVAXXXXXX 541
DF PDGQHSA+QVLVNILEHPQCR+DYSLT +VIEPLIPLLDSPI VQQL A
Sbjct: 1600 DFVPDGQHSAMQVLVNILEHPQCRADYSLTPHKVIEPLIPLLDSPISVVQQLAAELLSHL 1659
Query: 542 XXXXXXQKDPVTQQVIGPLIRVLGSGIHILQQRAVKALVSIALTWPNEIAKEGGVIEISK 601
QKDPVT +VI PLIR+LGSGI +LQQRA+KALVSIALTWPNEIAKEGGV+EIS+
Sbjct: 1660 LLEEHLQKDPVTPKVIAPLIRILGSGIPLLQQRALKALVSIALTWPNEIAKEGGVVEISR 1719
Query: 602 VILQADPSLPHALWESAASVLSSILQFSSEFYLEVPIAVLVRLLRSGSESTVIGALNALL 661
VILQADPSLPHALWESAASVLSSILQFSSEFYLEVP+AVLVRLLRSGSESTVIGALNALL
Sbjct: 1720 VILQADPSLPHALWESAASVLSSILQFSSEFYLEVPVAVLVRLLRSGSESTVIGALNALL 1779
Query: 662 VLESDDRFSAEAMAESGAVEALLELLRSHQCEETAARLLEVLLNNVKIRETKVTKSAILP 721
VLESDD SAEAMAESGA+EALLELLRSHQCEETAARLLEVLLNNVKIRETKVTKSAILP
Sbjct: 1780 VLESDDGTSAEAMAESGAIEALLELLRSHQCEETAARLLEVLLNNVKIRETKVTKSAILP 1839
Query: 722 LSQYLLDPXXXXXXXXXXXXXXXGDLFQNEGLTRTGDAVSACRALVNVLEDNPTEEMKVV 781
LSQYLLDP GDLFQNE L R+GDAVSACRALVNVLED PTEEMKVV
Sbjct: 1840 LSQYLLDPQTQAQHARLLATLALGDLFQNEALARSGDAVSACRALVNVLEDQPTEEMKVV 1899
Query: 782 AICALQNLVMYSRSNKRAVAEAGGVQVVLDLIGSSEPETSVQAAMFIKLLFSNNTIQEYA 841
AICALQNLVMYSR N+RAVAEA GVQVVLDLIGSS PETSVQAAMFIKLLFSNNTIQEYA
Sbjct: 1900 AICALQNLVMYSRPNRRAVAEASGVQVVLDLIGSSNPETSVQAAMFIKLLFSNNTIQEYA 1959
Query: 842 SSETVRAITATIEKDLWASGTVNEEYLKALNSLFSNFPRLRATEPATLSIPHLVTSLKTG 901
SSETVRAITATIEKDLWASGTVNEEYLKALNSLFSNFPRLRATEPATLSIPHLVT+LKTG
Sbjct: 1960 SSETVRAITATIEKDLWASGTVNEEYLKALNSLFSNFPRLRATEPATLSIPHLVTALKTG 2019
Query: 902 S 902
S
Sbjct: 2020 S 2020
>Medtr3g087800.1 | cellulose synthase-interactive protein | HC |
chr3:39790305-39779771 | 20130731
Length = 2152
Score = 1464 bits (3789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 739/903 (81%), Positives = 793/903 (87%), Gaps = 2/903 (0%)
Query: 2 QIDNKEDNSMWICALLLAILFQDRDVIRAHPTTKSVPALASLLKSEESANKYFAAQAISS 61
QID+KED+SMWICA+LLAILFQDRD+IRAH T KS+PALA+LLKSEESANKYFAAQ+I+S
Sbjct: 1065 QIDDKEDSSMWICAMLLAILFQDRDIIRAHATMKSIPALANLLKSEESANKYFAAQSIAS 1124
Query: 62 LVCNGSRGTLLSVXXXXXXXXXXXXXXXXDTDIQDLLELSEEFSLVRYPDQVALERLFRV 121
LVCNGSRGTLLSV D DI+DLLELS EFSLV +PDQVALERLFRV
Sbjct: 1125 LVCNGSRGTLLSVANSGVAGGLISLLGCADVDIRDLLELSNEFSLVPFPDQVALERLFRV 1184
Query: 122 DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALGILTHLGRDCLSNKIIMVESGALEAL 181
DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALGILT L RDC SNKI+MVESGA+EAL
Sbjct: 1185 DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALGILTQLARDCPSNKIVMVESGAIEAL 1244
Query: 182 NKYLSLSPQDATEEAATDLLGILFSSVEIRKHESAFGAVAQLVAVLRLGGRSARYSAAKA 241
KYLSL PQDA EEAATDLLGILFS+ EIR+HESAFGAV QLVAVLRLGGR+ARYSAAKA
Sbjct: 1245 TKYLSLGPQDAIEEAATDLLGILFSNAEIRRHESAFGAVTQLVAVLRLGGRAARYSAAKA 1304
Query: 242 LESLFSADHIRNAEIARQAFQPLVEILNTGSEREQHAAIAALIGLLSENPSRALAVADVE 301
LESLFSAD+IRNAE ARQA QPLVEILNTG EREQHAAI+AL+ LLSENP+RALAVADVE
Sbjct: 1305 LESLFSADNIRNAETARQAVQPLVEILNTGLEREQHAAISALVKLLSENPARALAVADVE 1364
Query: 302 MNAVDVLCRILSSNCSMDLKGDAAELCCALFGNTRIRSTMGAARCVEPLVSLLATEFSPA 361
NA+DVLC+ILSS CSMDLKGDAAELCC LFGNTRIRST+ AARCVEPLVSLL TEFSPA
Sbjct: 1365 TNAIDVLCKILSSGCSMDLKGDAAELCCVLFGNTRIRSTVAAARCVEPLVSLLVTEFSPA 1424
Query: 362 QHSVVRALDRLVDDEQLAELVAAHGAVVPLVSLLSGSNYILHEATSRALVKLGKDRPACK 421
HSVVRALDRLV DE+LA+LV A GAV+PLV LL G+N++LHEA SRALVKLGKDRP+CK
Sbjct: 1425 HHSVVRALDRLVGDEKLADLVVARGAVIPLVGLLFGTNFVLHEAISRALVKLGKDRPSCK 1484
Query: 422 MEMVKAGVIESILDILTDAPDYLIAAFAELLRILTNNATIAKGPSAAKVVEPLFLLLKRD 481
MEMVKAGVIE ILDIL +APDYL AAFAELLRILTNNA+IAKG SAAKVVEPLF LL R
Sbjct: 1485 MEMVKAGVIECILDILHEAPDYLCAAFAELLRILTNNASIAKGSSAAKVVEPLFFLLTRQ 1544
Query: 482 DFGPDGQHSALQVLVNILEHPQCRSDYSLTSRQVIEPLIPLLDSPIPAVQQLVAXXXXXX 541
+FGP+GQHS LQVLVNILEHPQCR+DY LTS Q IEPLIPLLDS VQQLVA
Sbjct: 1545 EFGPNGQHSVLQVLVNILEHPQCRADYRLTSHQTIEPLIPLLDSQTDTVQQLVAELLSHL 1604
Query: 542 XXXXXXQKDPVTQQVIGPLIRVLGSGIHILQQRAVKALVSIALTWPNEIAKEGGVIEISK 601
QKDPVTQQVIGPL+RVLGSG+ ILQQRA+KALVSIA WPNEIAKEGGVIEISK
Sbjct: 1605 LLEEDLQKDPVTQQVIGPLVRVLGSGMQILQQRALKALVSIADVWPNEIAKEGGVIEISK 1664
Query: 602 VILQADPSLPHALWESAASVLSSILQFSSEFYLEVPIAVLVRLLRSGSESTVIGALNALL 661
VILQADPS+PH LWESAASVL+SILQFSSEFYLE+P+AVLVRLL+SGSEST+ GALNALL
Sbjct: 1665 VILQADPSIPHVLWESAASVLASILQFSSEFYLEIPVAVLVRLLQSGSESTISGALNALL 1724
Query: 662 VLESDDR--FSAEAMAESGAVEALLELLRSHQCEETAARLLEVLLNNVKIRETKVTKSAI 719
VLESDD +AEAMAESGA+EALLELL SHQCE+TAARLLEVLLNNVKIRETKVTKSAI
Sbjct: 1725 VLESDDETSAAAEAMAESGAIEALLELLTSHQCEDTAARLLEVLLNNVKIRETKVTKSAI 1784
Query: 720 LPLSQYLLDPXXXXXXXXXXXXXXXGDLFQNEGLTRTGDAVSACRALVNVLEDNPTEEMK 779
LPLSQYLLDP GDLFQNE L RT DA SAC ALVNVL D PTEEMK
Sbjct: 1785 LPLSQYLLDPQTQAQQARLLATLALGDLFQNESLARTADAASACHALVNVLIDQPTEEMK 1844
Query: 780 VVAICALQNLVMYSRSNKRAVAEAGGVQVVLDLIGSSEPETSVQAAMFIKLLFSNNTIQE 839
VVAICALQNLVM+SR+NKRAVAEA GVQV+LDLIGSS+PETSVQAAMFIKLLFSN+TIQE
Sbjct: 1845 VVAICALQNLVMHSRANKRAVAEASGVQVILDLIGSSDPETSVQAAMFIKLLFSNHTIQE 1904
Query: 840 YASSETVRAITATIEKDLWASGTVNEEYLKALNSLFSNFPRLRATEPATLSIPHLVTSLK 899
YA SETVRAITA IEKDLWA+G VN+EYLKALNSLFSNF LRATEPATLSIPHLVTSLK
Sbjct: 1905 YAVSETVRAITAAIEKDLWATGAVNDEYLKALNSLFSNFAHLRATEPATLSIPHLVTSLK 1964
Query: 900 TGS 902
+GS
Sbjct: 1965 SGS 1967
>Medtr1g029790.1 | armadillo/beta-catenin-like repeat protein | HC |
chr1:10323565-10313898 | 20130731
Length = 2132
Score = 876 bits (2264), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/904 (51%), Positives = 632/904 (69%)
Query: 2 QIDNKEDNSMWICALLLAILFQDRDVIRAHPTTKSVPALASLLKSEESANKYFAAQAISS 61
Q + ++ +WI AL LAILF+D +++ + T +P++A LL+SEE +KYFAAQA++S
Sbjct: 1040 QAEYEDTEGIWISALFLAILFKDANIVLSPATMNIIPSIALLLRSEEVIDKYFAAQALAS 1099
Query: 62 LVCNGSRGTLLSVXXXXXXXXXXXXXXXXDTDIQDLLELSEEFSLVRYPDQVALERLFRV 121
LVCNG+RG L++ ++D+ +L+ LSEEFSLV+ PDQV L+ LF +
Sbjct: 1100 LVCNGNRGINLAIANSGAVAGLITIIGHIESDMPNLVTLSEEFSLVQNPDQVVLDHLFEI 1159
Query: 122 DDIRVGATSRKAIPALVDLLKPIPDRPGAPFLALGILTHLGRDCLSNKIIMVESGALEAL 181
+D+R+G+T+RK+IP LVDLL+PIP+RP AP +A+ +L + +NK ++ E+GALEAL
Sbjct: 1160 EDVRLGSTARKSIPLLVDLLRPIPERPNAPPVAVRLLISIADGSDANKSLLAEAGALEAL 1219
Query: 182 NKYLSLSPQDATEEAATDLLGILFSSVEIRKHESAFGAVAQLVAVLRLGGRSARYSAAKA 241
NKYLSLSPQD+TE A ++LL ILF + ++ KHE++ ++ QL+AVLRLG R+ARYSAA+A
Sbjct: 1220 NKYLSLSPQDSTETAISELLRILFFNSDLIKHEASTSSLNQLIAVLRLGSRNARYSAARA 1279
Query: 242 LESLFSADHIRNAEIARQAFQPLVEILNTGSEREQHAAIAALIGLLSENPSRALAVADVE 301
L LF AD+IR++E+A+QA QPLV++LNT S EQ AA+ ALI L S + S+A DVE
Sbjct: 1280 LHELFDADYIRDSELAKQAIQPLVDMLNTTSGSEQEAALLALIKLTSGDSSKACVFTDVE 1339
Query: 302 MNAVDVLCRILSSNCSMDLKGDAAELCCALFGNTRIRSTMGAARCVEPLVSLLATEFSPA 361
N ++ L +ILSS +++LK AA LC ALFGN++IR+ A+ C++PL+SL+ + + A
Sbjct: 1340 GNPLESLYKILSSASTLELKTHAAHLCYALFGNSKIRANPVASDCIKPLISLMQSGYETA 1399
Query: 362 QHSVVRALDRLVDDEQLAELVAAHGAVVPLVSLLSGSNYILHEATSRALVKLGKDRPACK 421
V A DRL++DEQ EL AA+ V LV L+SG NY L EAT L+KLGKDR K
Sbjct: 1400 IEYGVCAFDRLLEDEQQVELAAAYNVVDLLVGLVSGENYQLIEATISVLIKLGKDRTPSK 1459
Query: 422 MEMVKAGVIESILDILTDAPDYLIAAFAELLRILTNNATIAKGPSAAKVVEPLFLLLKRD 481
+MVKAG+I++ L + AP L + AEL RILTN++ IA+ AA++VEPLF +L R
Sbjct: 1460 HDMVKAGIIDNCLKLFQSAPSSLCSTIAELFRILTNSSAIARSSDAAEIVEPLFNVLLRR 1519
Query: 482 DFGPDGQHSALQVLVNILEHPQCRSDYSLTSRQVIEPLIPLLDSPIPAVQQLVAXXXXXX 541
DF GQHSALQ LVNILE PQ + LT QVIEPLI L+SP +QQL
Sbjct: 1520 DFNIWGQHSALQALVNILEKPQSLASLKLTPSQVIEPLISFLESPSQDIQQLGTELLSHL 1579
Query: 542 XXXXXXQKDPVTQQVIGPLIRVLGSGIHILQQRAVKALVSIALTWPNEIAKEGGVIEISK 601
Q+D T+ + PL+R+ G GI LQQ A+KAL I+ +WP +A GG+ E++K
Sbjct: 1580 LAQEHFQQDITTKNAVVPLVRLAGIGILNLQQTAIKALEKISTSWPKAVADAGGIFELAK 1639
Query: 602 VILQADPSLPHALWESAASVLSSILQFSSEFYLEVPIAVLVRLLRSGSESTVIGALNALL 661
VI+Q DP PHALWESAA VLS++L+ ++++Y +VP+ VLV+LL S +ST+ ALNAL
Sbjct: 1640 VIIQDDPQPPHALWESAALVLSNVLRSNADYYFKVPVVVLVKLLHSTLDSTISLALNALA 1699
Query: 662 VLESDDRFSAEAMAESGAVEALLELLRSHQCEETAARLLEVLLNNVKIRETKVTKSAILP 721
V E SAE M E+GA++ALL L+RSHQCEE +A LLE L NNV++RE KV+K AI P
Sbjct: 1700 VFERSYGSSAEEMMEAGAMDALLHLIRSHQCEEASAGLLETLFNNVRVREMKVSKYAIAP 1759
Query: 722 LSQYLLDPXXXXXXXXXXXXXXXGDLFQNEGLTRTGDAVSACRALVNVLEDNPTEEMKVV 781
LSQYLLDP G+L Q+E R D+VSACRAL+++LED PTEEM +V
Sbjct: 1760 LSQYLLDPLTRSQSGKLLAALALGNLSQHERHARASDSVSACRALISILEDQPTEEMTMV 1819
Query: 782 AICALQNLVMYSRSNKRAVAEAGGVQVVLDLIGSSEPETSVQAAMFIKLLFSNNTIQEYA 841
AICALQN VM SR+N+RAVAEAGG+ V+ +L+ E + QAA+ IK LFS +T+ EY
Sbjct: 1820 AICALQNFVMNSRTNRRAVAEAGGILVIQELLLFPNTEVAGQAALLIKFLFSTHTLLEYV 1879
Query: 842 SSETVRAITATIEKDLWASGTVNEEYLKALNSLFSNFPRLRATEPATLSIPHLVTSLKTG 901
S+E +R++TA +E++LW++ T+NEE LK L+ +F NFP+L +E ATL IPHLV +LK+G
Sbjct: 1880 SNELIRSLTAALERELWSTATINEEVLKTLHVIFVNFPKLHTSEAATLCIPHLVGALKSG 1939
Query: 902 SXXA 905
S A
Sbjct: 1940 SEVA 1943