Miyakogusa Predicted Gene
- Lj1g3v5020850.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v5020850.1 Non Characterized Hit- tr|I1NCF0|I1NCF0_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.16169
PE,87.3,0,TRANSLATIONAL ACTIVATOR GCN1-RELATED,NULL; TRANSLATIONAL
ACTIVATOR GCN1-RELATED,Proteasome component,CUFF.33880.1
(2631 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr7g116425.1 | translational activator GCN1-like protein | HC... 4423 0.0
>Medtr7g116425.1 | translational activator GCN1-like protein | HC |
chr7:48052719-48022334 | 20130731
Length = 2632
Score = 4423 bits (11471), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 2191/2597 (84%), Positives = 2343/2597 (90%), Gaps = 4/2597 (0%)
Query: 21 QRLRIFHREIPALLNSSPPDD--SPELASLLTDTIFRTVPIYDDRRSRKAVDDVIIKALT 78
QRLRIF RE+PA LNSS D S ELASLLTD IFRTV IYDDRRSRKAVDDVI+K+L+
Sbjct: 22 QRLRIFQREVPAFLNSSSTSDEMSTELASLLTDIIFRTVAIYDDRRSRKAVDDVIVKSLS 81
Query: 79 GGGAVFMKTFAGALVQNMEKQVKFQSHVGSYRXXXXXXXXXXKSQFAEVSKNALCRVASA 138
G VFMKTFA ALVQ+MEKQ+K QSHVG YR KS+F+ VSKNALCRVAS
Sbjct: 82 G--TVFMKTFAAALVQSMEKQLKSQSHVGCYRLLSWSCLLLSKSKFSTVSKNALCRVASG 139
Query: 139 QASLLKLVLQRSFHEKQACKKKFFRLFDQSPDICKVYVLGLKNGQIPYKDXXXXXXXXXX 198
QASLL LV +RSF E++ACKKK F LF + PDI KVYV +KNG IPYKD
Sbjct: 140 QASLLNLVWRRSFRERRACKKKIFHLFKELPDIYKVYVQEVKNGSIPYKDSPELLLLLLE 199
Query: 199 XXXXXXXXXREFKPAFLDIYVNAILNAKEKPGKSLIEAFLPLYLQMAHEDLQNIVMPSSI 258
EFK AFLDIYVNAIL+AK KPGKSLIEAF PLYLQM+HED IV+P+++
Sbjct: 200 FSTRSSSLFGEFKSAFLDIYVNAILSAKAKPGKSLIEAFHPLYLQMSHEDFGTIVLPAAV 259
Query: 259 KMLKRNPEIVLESVGILLKSVDLDLSKYAAEILSVVLVQARHADEGRRDGALAIVGSLSK 318
KMLKRNPEIVLESVGILLKSV LDLSKYAAEILSVVLVQARHADEGRRD AL IV +LS+
Sbjct: 260 KMLKRNPEIVLESVGILLKSVKLDLSKYAAEILSVVLVQARHADEGRRDVALDIVKNLSQ 319
Query: 319 KSSNPDALDTMFNAIKAVIKGSEGRLAFPYQRVGMVNAIQELSNAPDGKYISSLSNTICD 378
KSSNPDALD MFNAIK+VIKGSEGRLAFPYQRVGMVNAIQELSNAPDGKY+ +LS TICD
Sbjct: 320 KSSNPDALDIMFNAIKSVIKGSEGRLAFPYQRVGMVNAIQELSNAPDGKYLINLSQTICD 379
Query: 379 FLLSYYKDDGNEEVKIVILSAIASWAVRSTDIIHEGLVSFLASGLKEKETLRRGFLRSLH 438
FLLS YKDDGNEEVKI LSAIASWA +ST+II E LVSF ASGLKEKE LRRGFLRSL
Sbjct: 380 FLLSCYKDDGNEEVKIATLSAIASWADKSTNIIQESLVSFFASGLKEKEILRRGFLRSLR 439
Query: 439 AICKNTDAVLKMSTLLGPLVQLVKTGFTKAVQRLDGIYALLLVGKIAAVDIKAEEILVRE 498
AICKN DAVLKMS LL PLVQLVKTGFTKAVQRLDGIYALLLVGKIAAVDIKAEEILV+E
Sbjct: 440 AICKNADAVLKMSPLLVPLVQLVKTGFTKAVQRLDGIYALLLVGKIAAVDIKAEEILVKE 499
Query: 499 KIWTLVSQNEPSLVPISMASKLSVEDSMACVDLLEVLLLEHSQRILSNFSVKLLLQLMIF 558
KIW +SQNEPSL+PISMASKL+VEDS+AC+DLLEVLLLEH QR LSNFSV LLQL+IF
Sbjct: 500 KIWATISQNEPSLIPISMASKLAVEDSIACIDLLEVLLLEHLQRTLSNFSVTSLLQLVIF 559
Query: 559 FICHLRWDIRRKAYDVARKIITSSPQLSGDLFLEFSKYLSLVGDKILALRLSDSDISLDP 618
FICH RWDIRR A +VA++IITS PQLS D+ EFSKYL+LV +K+ ALR+SD+DISLDP
Sbjct: 560 FICHPRWDIRRIACNVAKRIITSVPQLSEDILSEFSKYLNLVEEKVSALRISDTDISLDP 619
Query: 619 QIPFIPSVEVLVKALLIISPEAMKLAPDSFVRIILCSHHPCVLGSAKRDAVWKRLSKCLQ 678
Q+PFIPSVEVLVKALLI+SP AMK+APDSFVRIILCSHHPCV+GSAKRDAVWKRL KCLQ
Sbjct: 620 QVPFIPSVEVLVKALLIMSPAAMKVAPDSFVRIILCSHHPCVVGSAKRDAVWKRLCKCLQ 679
Query: 679 THGFDVIDIVSANVVNLVQVLLGPLGLKSANPLEQQAAISSLSTLMLIIPGDIYTEFEEH 738
THGFDVIDIV+ANV+N VQVLLGP+GL+SANPLEQ+AAISSLS LM IIPGD YTEFE+H
Sbjct: 680 THGFDVIDIVAANVINFVQVLLGPMGLRSANPLEQEAAISSLSNLMSIIPGDTYTEFEKH 739
Query: 739 LRNLPERFSHDMLSENDIQIFNTPEGMLSSEQGVYVAESVAAKNTKQAKGRFRMYDDEDD 798
L NLPERFSH+ LSENDIQIF+TPEGMLS+EQG+YVAESVA KNTKQAKGRFRMY +ED
Sbjct: 740 LLNLPERFSHNALSENDIQIFHTPEGMLSTEQGIYVAESVAFKNTKQAKGRFRMYGEEDG 799
Query: 799 LDHARSNHSMKRDQPIRETAGAGKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXSVRDKVC 858
LDH +SNHSMKRDQP RE AGAGK+ S+RD+V
Sbjct: 800 LDHTQSNHSMKRDQPSREAAGAGKKDSGKTTKKADKGKTAKEEARESLLKEEASIRDRVR 859
Query: 859 EIQKNLSLMLRTLGDMAIANSVFAHSKLPSMVKFVEPLLRSPIVSDEAFETMVKLSRCIA 918
EIQKNLSLMLRTLG+MAIANS+FAHS+LPSMVKFVEPLLRSPIVSDEAFET+V LSRC A
Sbjct: 860 EIQKNLSLMLRTLGNMAIANSIFAHSRLPSMVKFVEPLLRSPIVSDEAFETLVMLSRCTA 919
Query: 919 PPLCEWALDISTALRLIVTDEVHLLLDLVPSAAEEEVNGRPSLGLFERILDGLSTSCKSG 978
PLC+WALDISTALRL+VTDEVHLLLDLVPS AEE+VN +PS GLFERI+DGLSTSCKSG
Sbjct: 920 SPLCDWALDISTALRLVVTDEVHLLLDLVPSVAEEQVNQKPSHGLFERIIDGLSTSCKSG 979
Query: 979 ALPVDSFSFVFPIMERILLSSKKTKFHDDVLRLFYLHLDPHLPLPRIRMLSALYHVLGVV 1038
ALPVDSF+FVFPIMERILL SKKTKFHDDVLRL YLH+D HLPLPR+RMLS LYH L VV
Sbjct: 980 ALPVDSFTFVFPIMERILLCSKKTKFHDDVLRLIYLHMDAHLPLPRVRMLSVLYHALSVV 1039
Query: 1039 PAYQSSIGPALNELSLGLQPDEVASALYGVYSKDVHVRMACLNAVRCIPAVANRSLPQNI 1098
PAY++SIGPALNELSLG QPDEVASALYGVY+KDVHVRMACLNAV+CIPAV++RSLPQN
Sbjct: 1040 PAYKASIGPALNELSLGFQPDEVASALYGVYAKDVHVRMACLNAVKCIPAVSSRSLPQNT 1099
Query: 1099 EVATSLWIALHDPEKSIAQVAEDIWDHYGFDFGTDFSGIFKALSHVNYNVRXXXXXXXXX 1158
EVATS+WIALHDPEKS+A+VAEDIWDHYGFDFGTDFSGIFKALSHVNYNVR
Sbjct: 1100 EVATSIWIALHDPEKSVAEVAEDIWDHYGFDFGTDFSGIFKALSHVNYNVRLAAAEALAA 1159
Query: 1159 XXDEYPDSIHECLSTLFSLYIRDMGIGDDNLDAGWLGRQGIALALHSAADVLRTKDLPIV 1218
DE+PD I E LSTLFSLYIRDMGIG+DN+DAGWLGRQG+ALALHSAADVLRTKDLP+V
Sbjct: 1160 ALDEHPDLIQESLSTLFSLYIRDMGIGNDNVDAGWLGRQGVALALHSAADVLRTKDLPVV 1219
Query: 1219 MTFLISRALADPNADVRGRMINAGILIIDKSGKDNVSLLFPIFENYLNKTAPDEEKYDLV 1278
MTFLISRALAD NADVRGRMIN+GILIIDK+GKDNVSLLFPIFENYLNKTAPDEE+YDLV
Sbjct: 1220 MTFLISRALADLNADVRGRMINSGILIIDKNGKDNVSLLFPIFENYLNKTAPDEEQYDLV 1279
Query: 1279 REGVVIFTGALAKHLAKDDPKVHAVVDKLLDVINTPSEAVQRAVSACLSPLMQSKQDDAA 1338
REGVVIFTGALAKHLAKDDPKVHAVVDKLLDV+NTPSE+VQRAVSACLSPLMQSKQD+A
Sbjct: 1280 REGVVIFTGALAKHLAKDDPKVHAVVDKLLDVLNTPSESVQRAVSACLSPLMQSKQDEAD 1339
Query: 1339 ALVTRLLDQLLKSEKYGERRGAAFGLAGVVKGFGISCLKKYRIVIILQEGLVDRNSAKSR 1398
LVTRLLDQ++KSEKYGERRGAAFGLAGVVKGFG+SCLKKY+IVIILQE L +RNSAKSR
Sbjct: 1340 TLVTRLLDQMMKSEKYGERRGAAFGLAGVVKGFGLSCLKKYKIVIILQECLAERNSAKSR 1399
Query: 1399 EGALLGFECLCEILGRLFEPYVIKMLPLLLVSFSDQXXXXXXXXXXXXXXMMSQLSAQGV 1458
EGALLGFECLCE LG+LFEPYVI+MLPLLLVSFSDQ MMSQLSAQGV
Sbjct: 1400 EGALLGFECLCETLGKLFEPYVIQMLPLLLVSFSDQVAAVREAAECAARAMMSQLSAQGV 1459
Query: 1459 KLVLPSLLKGLEDKAWRTKQSSVQLLGAMAYCAPQQLSQCLPKIVPKLTEVLTDTHPKVQ 1518
KLVLPSLLKGLEDKAWRTKQSSVQLLGAMAYCAPQQLSQCLPKIVPKLTEVLTD+HPKVQ
Sbjct: 1460 KLVLPSLLKGLEDKAWRTKQSSVQLLGAMAYCAPQQLSQCLPKIVPKLTEVLTDSHPKVQ 1519
Query: 1519 SAGQMALQQVGSVIKNPEISALVPTLLKGLSDPNEYTKYSLDILLQTTFVNSIDAPSLAL 1578
SAGQ ALQQVGSVIKNPEI+ALVPTLLKGLSDPNE+TKYSLDILLQTTFVNSIDAPSLAL
Sbjct: 1520 SAGQTALQQVGSVIKNPEIAALVPTLLKGLSDPNEHTKYSLDILLQTTFVNSIDAPSLAL 1579
Query: 1579 LVPIVHRGLRERSADTKKRASQIVGNMCSLVTEATDMIPYIGLLLPEVKKVLVDPIPEVR 1638
LVPIVHRGLR RSADTKKRASQIVGNMCSLVTE DMIPYIGLLLPEVKKVLVDPIPEVR
Sbjct: 1580 LVPIVHRGLRVRSADTKKRASQIVGNMCSLVTEPKDMIPYIGLLLPEVKKVLVDPIPEVR 1639
Query: 1639 SVAARAIGSLIGGMGEENFPDLVPWLFDTLKSDNSNVERSGAAQGLSEVLAALGIGFFEH 1698
SVAARAIGSLIGGMGE+NFPDLVPWLF+TLKSDNSNVERSGAAQGLSEVLAALG+ FFEH
Sbjct: 1640 SVAARAIGSLIGGMGEDNFPDLVPWLFETLKSDNSNVERSGAAQGLSEVLAALGVEFFEH 1699
Query: 1699 VLPDIIRNCSHQKASVRDGYLTLFKFLPRSLGVQFQNYLSQVLPAILDGLADENESVRDA 1758
V PDIIRNCSHQKASVRDGYLTLFK+LPRSLGVQFQ YL QVLPAILDGLADENESVRDA
Sbjct: 1700 VFPDIIRNCSHQKASVRDGYLTLFKYLPRSLGVQFQKYLPQVLPAILDGLADENESVRDA 1759
Query: 1759 ALGAGHVLVEHYAATSLPLLLPVVEDGIFNDNWRIRQSSVELLGDLLFKVAGTSGKALLE 1818
ALGAGHVLVEHYA TSLPLLLP VEDGI ND+WRIRQSSVELLGDLLFKVAGTSGKALLE
Sbjct: 1760 ALGAGHVLVEHYATTSLPLLLPAVEDGIINDSWRIRQSSVELLGDLLFKVAGTSGKALLE 1819
Query: 1819 GGSDDEGSSTEAHGRAIIEVLGYSKRNEVLAALYMVRTDVSLSVRQAALHVWKTIVANTP 1878
GGSDDEGSSTEAHGRAIIEVLG KRNE+LAALYMVR DVSLSVRQAALHVWKTIVANTP
Sbjct: 1820 GGSDDEGSSTEAHGRAIIEVLGREKRNEILAALYMVRADVSLSVRQAALHVWKTIVANTP 1879
Query: 1879 KTLREIMPVLMDTLIASLASSSSERRQVAGRSLGELVRKLGERVLPLIIPILSRGLNDPD 1938
KTLREIMPVLMDTLIASLAS+SSERRQVAGRSLGELV KLGERVLPLIIPILS+GL+DPD
Sbjct: 1880 KTLREIMPVLMDTLIASLASASSERRQVAGRSLGELVGKLGERVLPLIIPILSQGLSDPD 1939
Query: 1939 SSKRQGVCSGLSEVMASAGKSQLLTFMNDLILTIRTALCDSVPEVRESAGLAFSTLYKSA 1998
SS+RQGVCSGLSEVMASAGKSQL+TFM DLI TIRTALCDS P VRESAGLAFSTLYKSA
Sbjct: 1940 SSRRQGVCSGLSEVMASAGKSQLMTFMTDLIPTIRTALCDSEPAVRESAGLAFSTLYKSA 1999
Query: 1999 GLQAIDEIVPTLLHALEDDRTSDTALDGLKQILSVRTSAVLPHIFPKLVHPPLSAFHAHA 2058
G+QAIDEIVPTLLHALEDD+TSDTALDGLKQILSVRTSAVLPHI PKLVHPPLSAF+AHA
Sbjct: 2000 GMQAIDEIVPTLLHALEDDKTSDTALDGLKQILSVRTSAVLPHILPKLVHPPLSAFNAHA 2059
Query: 2059 LGALADVAGPGLDFHLGTVLPPLLSAMGSDDKEVQTSAKEAAETVVSVIDEEGIEPLISE 2118
LGALA+VAGPGLDFHLGTVLPPLLSAM D+EVQTSAK+AAETVV VIDEEG+EPLISE
Sbjct: 2060 LGALAEVAGPGLDFHLGTVLPPLLSAMSDVDQEVQTSAKKAAETVVLVIDEEGVEPLISE 2119
Query: 2119 LVKGVSDSQATVRRSSSYLIGYFLKNSKLYLVDEAPNMISTLIILLSDPDTSTVSVAWEA 2178
L+KGVSDSQA +RRSSSYLIGYF KNSKLYLVDEAPNMISTLI+LLSDPD+STV+VAWEA
Sbjct: 2120 LLKGVSDSQAAIRRSSSYLIGYFFKNSKLYLVDEAPNMISTLIVLLSDPDSSTVTVAWEA 2179
Query: 2179 LSRVIISVPKEVLPSYIKLVRDAISTSRDKERRKRKGGPILIPGFCLPKALQPILPIFLQ 2238
LSRVI+SVPKEVLPSYIKLVRDA+S+SRDKERRK+KGGP+LIPGFCLPK+LQPILPIFLQ
Sbjct: 2180 LSRVIMSVPKEVLPSYIKLVRDAVSSSRDKERRKKKGGPVLIPGFCLPKSLQPILPIFLQ 2239
Query: 2239 GLISGSAELREQAALGLGELIEVTSEQSLKEFVIPITGPLIRIIGDRFPWQVKSAILSTL 2298
GLISGSAELREQAALGLGELIEV EQSLKE VIPITGPLIRIIGDRFPWQVKSAILSTL
Sbjct: 2240 GLISGSAELREQAALGLGELIEVAGEQSLKEVVIPITGPLIRIIGDRFPWQVKSAILSTL 2299
Query: 2299 TIMIRKGGISLKPFLPQLQTTFVKCLQDSTRTIRXXXXXXXXXXXXXXTRVDPLVSDLLS 2358
TIMIRKGGISLKPFLPQLQTTFVKCLQD+TRTIR TRVDPLVSDLLS
Sbjct: 2300 TIMIRKGGISLKPFLPQLQTTFVKCLQDNTRTIRSGAAVALGMLSGLNTRVDPLVSDLLS 2359
Query: 2359 TLQGSDGGVREAILTALKGVMKHAGKNVSSAVRDRAYSVLKDLIHHDDERVRMYAARILG 2418
+LQGSDGGVREAIL+ALKGV+KHAGKNVSSAV R YSVLKDLIHHDD+RVR+YAA ILG
Sbjct: 2360 SLQGSDGGVREAILSALKGVLKHAGKNVSSAVSSRIYSVLKDLIHHDDDRVRVYAASILG 2419
Query: 2419 ILTQYLEDVQLTELIQELSSLANSPSWSPRHGSILTISSLFHHNPVPIFSSPLFPTIVDC 2478
+LTQYLE VQ TELIQE++SLANSP+W PRHGSILTISSL + NP PIFSS LF T+VDC
Sbjct: 2420 VLTQYLEAVQFTELIQEVTSLANSPNWPPRHGSILTISSLLYRNPAPIFSSSLFQTVVDC 2479
Query: 2479 LRVTLKDEKFPLRETSTKALGRLLLYRAQVDPPDTLLYKDVLSLLVSSTHDESSEVRRRA 2538
LR LKDEKFPLRE+STKALGRLLLYRAQ DP DT+LYKDVLSLLV+ST DESSEVRRRA
Sbjct: 2480 LRDALKDEKFPLRESSTKALGRLLLYRAQEDPSDTVLYKDVLSLLVTSTRDESSEVRRRA 2539
Query: 2539 LSAIKAVAKANPSAIMLHGTIVGPAIAECLKDASTPVRLAAERCAVHALQLTKGSENVQA 2598
LSAIKAVAKANPSAIM HGT++GPA+AECLKDA+TPVRLAAERCA+HA QLTKGSENVQA
Sbjct: 2540 LSAIKAVAKANPSAIMSHGTVIGPALAECLKDANTPVRLAAERCAIHAFQLTKGSENVQA 2599
Query: 2599 AQKYITGLDARRLSKLP 2615
QKYITGLDARRLSK P
Sbjct: 2600 VQKYITGLDARRLSKFP 2616