Miyakogusa Predicted Gene
- Lj0g3v0091309.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0091309.1 Non Characterized Hit- tr|I1JNN2|I1JNN2_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.20503
PE,84.44,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; COLON CANCER-ASSOCIATED
PROTEIN MIC1,NULL; Mic1,Colon cancer-ass,CUFF.4992.1
(735 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr7g093740.1 | colon cancer-associated Mic1-like protein | HC... 1192 0.0
Medtr8g106830.1 | hypothetical protein | HC | chr8:45101834-4510... 65 2e-10
>Medtr7g093740.1 | colon cancer-associated Mic1-like protein | HC |
chr7:37298063-37290218 | 20130731
Length = 730
Score = 1192 bits (3084), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 587/734 (79%), Positives = 630/734 (85%), Gaps = 5/734 (0%)
Query: 1 MSGKASTSKPNIGMSGSDGLSHAYIQYPPLRCNIPGSRDLFYDDGNKLLLSPTADQIFSW 60
MS KA+TSKP IG+ GSDGLSHAYIQYPPLRCN+P S LFYDDGNKLLLSP ADQ+FSW
Sbjct: 1 MSRKATTSKPTIGLRGSDGLSHAYIQYPPLRCNVPESGGLFYDDGNKLLLSPAADQVFSW 60
Query: 61 KVSPFDPFVGPSTDSISEGPVIAIRYSLDMKVIAIQRSNHEIQLWDKETGEFFSHKCKPE 120
KV FDP GP+TDSISEGP+IAIRYSLD KVIAIQRS EIQ WD+ET E FSHKCKPE
Sbjct: 61 KVGIFDPLTGPTTDSISEGPIIAIRYSLDTKVIAIQRSGQEIQFWDRETAETFSHKCKPE 120
Query: 121 SESILGFFWTDSQQCDIVLVKTSGLDMYAYSSASKSLQLVETKKLTVSWYVYTHESRLVL 180
SESILGFFWTDS+QCDIV+VKT+GLD+ AY S SKSLQLVETKKL VSWYVYTHESRLVL
Sbjct: 121 SESILGFFWTDSRQCDIVIVKTNGLDLCAYKSESKSLQLVETKKLNVSWYVYTHESRLVL 180
Query: 181 LASGMQCKTFHGFQISSADIVRLPRFEMVMAKSEANSKPVLAVEDIFIVTVYGRIYCLQV 240
LASGMQCKTFHGFQISSADIVRLPRFEMVMAKSEANSKPVLA EDIFIVTVYGRIYCLQV
Sbjct: 181 LASGMQCKTFHGFQISSADIVRLPRFEMVMAKSEANSKPVLAAEDIFIVTVYGRIYCLQV 240
Query: 241 DRVAMLLHLYRLYRDAVIQQGSLPIYSSRIAVSVVDSVLLIHQVDAKVVILYDLFIDSRA 300
DRVAMLLH YRLYRDAVIQQGSLPIYSSRIA SVVD+VLLIHQVDAKVVILYDLF DSRA
Sbjct: 241 DRVAMLLHSYRLYRDAVIQQGSLPIYSSRIAGSVVDNVLLIHQVDAKVVILYDLFADSRA 300
Query: 301 PISAPLPLLLRGFPXXXXXXXXXXXXXXXXDGNVVRSHEAVTYADTWIFLVPDLVCDVAN 360
PISAPLPLLLRGFP DGNV SHEAVTYAD+WIFLVPDLVCDVAN
Sbjct: 301 PISAPLPLLLRGFPRSSSSSQFSGRESESSDGNVASSHEAVTYADSWIFLVPDLVCDVAN 360
Query: 361 KIIWKFNLDLDAISASNSEVPSVLEFLQRRKLEANKAKQLCLDITRTLILEHRPVSVVSK 420
K++WKFNLDL+AISASNS+VPS+L+FLQRRKLEANKAKQLCL IT+TLILE RPV VV+K
Sbjct: 361 KLLWKFNLDLEAISASNSDVPSILDFLQRRKLEANKAKQLCLGITQTLILERRPVPVVAK 420
Query: 421 AINVLVTSYAHSIKTGSYLKVLKPERTPASGVPNSSADVSATETGATGKSIISETTARID 480
AINVLV+SY+HSIKT SYLK LKPE SG NS ADVS E A GKSII E+TAR+D
Sbjct: 421 AINVLVSSYSHSIKTCSYLKGLKPEMPLNSGAQNSDADVSTIERDAIGKSIIHESTARVD 480
Query: 481 SGSLIKAXXXXXXXXXXANPNRNSEEAIVGGTVNNGNSLITEAHPDHDMXXXXXXXXXXX 540
S +L N NS+EA VGG+VNN NS EAH + M
Sbjct: 481 SETL-----DSEDESHFTNLEHNSKEAYVGGSVNNENSPSNEAHSSYVMQSSLLSVQEES 535
Query: 541 XXTSAAISPDEMYSFVFSPVDEEMVGDPSYLVAIIIEFLYSANLEKVRVHPNLYVLIVQL 600
TSAAISPDEMY+FVFSPVDEEMVGDPSYLVAIIIEFL+SANLEK+RV PNLYVLI+QL
Sbjct: 536 QLTSAAISPDEMYNFVFSPVDEEMVGDPSYLVAIIIEFLHSANLEKIRVLPNLYVLIIQL 595
Query: 601 LARNERYAELGLFVINKILEPSKEVALQLRESGRQDTQTRKLGLDMLRQLGLHHDYVLLL 660
L RNERYAELGLFV+NKILEPSKEVALQL ESGRQ+TQTRKLGLDMLRQLGLH+DYV+LL
Sbjct: 596 LVRNERYAELGLFVVNKILEPSKEVALQLLESGRQNTQTRKLGLDMLRQLGLHNDYVVLL 655
Query: 661 VQDGYYLEALRYARKYRVDTIRPSLFLEAAFVSNDSQHLAAVLRFFTDFLPGFGNTSEYN 720
VQDGYYLEALRYARKY+VDTIRPSLFLEAAFVSNDSQHLAAVLRFFTDFLPGF NT+E+N
Sbjct: 656 VQDGYYLEALRYARKYKVDTIRPSLFLEAAFVSNDSQHLAAVLRFFTDFLPGFKNTAEHN 715
Query: 721 RYYRILNELNSSLT 734
RY+RILNE+NSS+T
Sbjct: 716 RYHRILNEMNSSMT 729
>Medtr8g106830.1 | hypothetical protein | HC |
chr8:45101834-45101998 | 20130731
Length = 54
Score = 65.1 bits (157), Expect = 2e-10, Method: Composition-based stats.
Identities = 33/49 (67%), Positives = 37/49 (75%), Gaps = 4/49 (8%)
Query: 208 MVMAKSEANSKPVLAVEDIFIVTVYGRIYCLQVDRVAMLLHLYRLYRDA 256
MV+AKSEANSK VL EDIFI+ + CLQVDRVAMLLH Y LY D+
Sbjct: 1 MVVAKSEANSKAVLTPEDIFIIYCH----CLQVDRVAMLLHFYMLYHDS 45