Miyakogusa Predicted Gene
- Lj3g3v2994570.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2994570.1 Non Chatacterized Hit- tr|I1LKS4|I1LKS4_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.53931
PE,78,0,coiled-coil,NULL; FAMILY NOT NAMED,NULL; seg,NULL,CUFF.45105.1
(627 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G22450.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 326 4e-89
AT4G29790.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 134 2e-31
AT2G19390.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 128 1e-29
>AT5G22450.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G19390.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:7437145-7442856 REVERSE LENGTH=1154
Length = 1154
Score = 326 bits (835), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 232/635 (36%), Positives = 341/635 (53%), Gaps = 94/635 (14%)
Query: 1 MECIFASISLDDASYLKQQLNIAEEFNKNLSHMFGTDHDILGVVINNKTTQGSEERKRSH 60
M+ IFA++++DD +K QLN A+E +K+LS ++ILG+ K + +
Sbjct: 603 MDHIFAAVNVDDMQNMKDQLNFAQELDKSLSDAILDGYNILGL----KLPKAVHRPGVGN 658
Query: 61 CDEESTKCDALDG----KNEMGRLDKVTPLFQRLLCALIEEDESEESYHQSEGKNISRQC 116
D + G + +M +L++ TPL++R+L ALIEED+ EE + GKN+S
Sbjct: 659 VDYSGPTSSCVSGLSFERLDMRKLNESTPLYKRVLSALIEEDDGEEVVQFNGGKNLSLHY 718
Query: 117 ASDDSHCGSCNQIDFEPKDRDRMDSEVESKVDLQIQKNCMLDRLSCDKSTTSNTFRYPNT 176
ASDDSHCGSC ID E ++RDRM+ EVES D Q K+ + DR S ++S SN FR
Sbjct: 719 ASDDSHCGSCTYIDTEFRERDRMEFEVESSGDFQTPKSGLFDRFSSERSVVSNPFRNGGM 778
Query: 177 SSSLQSTGVWQGDEELSLSDITLTSEICSNDLDQLQPAEISVPSFPSPDGPYXXXXXXXX 236
S S+ S W GD++LS SD L +E SN L QLQ E+++P+FP D Y
Sbjct: 779 SISVHSNEQWIGDDDLSHSDAALGNETYSNSLGQLQAREVNIPNFPVSDTQYQLMSLDER 838
Query: 237 XXXXXXXIGLYPEILPDLAEEDEAISQDIVKLEKELYEQNGRKKKNLDIIDRSIQKGRDM 296
IG++PE +PDLAE E +S D+++L++ +Y++ KKK L+ + +IQKG+D+
Sbjct: 839 LLLELQSIGVFPEAMPDLAE--ETMSTDVMELKEGIYQEILNKKKKLEKLIITIQKGKDV 896
Query: 297 ERRNIEQAAFDHLTEMAYRKRLACRGSKNSKGAVHKVPKQVALAFLKRTLGRCKRYEEAD 356
E+R IE A D L E A++KR+ACRGSK +K V+KV +QVAL F++RT+ RC+++EE
Sbjct: 897 EKRKIEHLAMDQLVETAHKKRMACRGSKAAK--VNKVTRQVALGFIRRTVARCRKFEETG 954
Query: 357 ISCFSEPTLQNIMFALPSRESDAQPGDCIVSGTASNTCYKAS-PQIEVRKSGAVSSTSEK 415
SCFS+P LQ+I+F+ PS +DA+ + SGTASNT + S Q E + SGAVSST
Sbjct: 955 FSCFSDPALQDILFSSPS--NDAKSSENGGSGTASNTLNEPSNHQAEAKGSGAVSST--- 1009
Query: 416 YDVQIDHADRGLVDSFQGSIHSSEQASSKNGSVFIKEKKREMLVNVVVSGSSSRASKFDG 475
K+RE L++ V+ +SS+ + G
Sbjct: 1010 -------------------------------------KRREALIDDVIGCASSKVTTSKG 1032
Query: 476 AV---PGGVKGKRSERDRNQSRDQTRQNSIPRAGRLSLDSSQNENKPKAKPKQKNTAYGH 532
+ GG +GKRSER+ D R + P+ + +++ N+++ T H
Sbjct: 1033 SAVLSGGGAQGKRSERE-----DGFRNKNKPKPKENNNNNNGNQSR-------STTTSTH 1080
Query: 533 DRFMEPKESACLPIHGSSLSVVGNQDTSQAKESA-DLGNLPLPDLGSIEEFGVSGELGGP 591
P A G+S V + D + E+ D L DL I+E
Sbjct: 1081 -----PTGPAS---RGASNRGVTSGDGAVDDEAPIDFSKLAFRDLDEIDE---------Q 1123
Query: 592 QDLSSWLNNFDDDGLQEDDCIGL-EIPMDDLSDLM 625
DL +W +GLQ+ D GL E+PMDDLS +
Sbjct: 1124 ADLGNWF-----EGLQDIDTAGLDEVPMDDLSFMF 1153
>AT4G29790.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G19390.1); Has
538 Blast hits to 357 proteins in 124 species: Archae -
0; Bacteria - 74; Metazoa - 109; Fungi - 58; Plants -
105; Viruses - 2; Other Eukaryotes - 190 (source: NCBI
BLink). | chr4:14584228-14590123 FORWARD LENGTH=1211
Length = 1211
Score = 134 bits (336), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 134/407 (32%), Positives = 198/407 (48%), Gaps = 69/407 (16%)
Query: 244 IGLYPEILPDLAE-EDEAISQDIVKLEKELYEQNGRKKKNLDIIDRSIQKGRDMERRN-- 300
IG+ + +P ++ EDE I DI LE+ + E +KK D+++R ++ +M+ R
Sbjct: 850 IGICLDPMPSISNVEDEGIVDDIKTLEEAICEVVSKKK---DMLNRLLKPALEMKERQEK 906
Query: 301 -IEQAAFDHLTEMAYRKRLACR--GSKNSKGAVHKVPKQVALAFLKRTLGRCKRYEEADI 357
E+ ++ L EMAY K A R S + K + K+ KQ A AF+KRTL RC+++EE
Sbjct: 907 EFERLGYEKLIEMAYEKSKASRRHHSASGKSSATKISKQAAFAFVKRTLERCRQFEETGK 966
Query: 358 SCFSEPTLQNIMFA-LPSRESDAQPGDCIVSGTASNTCYKASPQIEVRKSGAVSSTSEKY 416
SCFSE T +NI+ A L E + + I+S + T + P + + ++ ++E +
Sbjct: 967 SCFSESTFKNIIIAGLTQFEDNPTDKEDILSAS---TLMGSQPSSSL--ALPMTQSTENH 1021
Query: 417 DVQIDHADRGLVDSFQGSIHSSEQASSKNGSVFIKEKKREMLVNVVVSGSSSRASKFDGA 476
++A R D S + KKRE+L++ V G
Sbjct: 1022 ANSSENALREGRDEMMWSN---------------RMKKRELLLDDV------------GG 1054
Query: 477 VP--GGVKGKRSERDRNQS--RDQTRQNSIPRAGRLSLDSSQNENKPKAKPKQKNTAYGH 532
P KGKRSERDR+ +R S + GR +L +++ E K K KP+QK T
Sbjct: 1055 KPLSSSTKGKRSERDRDGKGQASSSRGGSTNKIGRPALVNAKGERKSKTKPRQKTTP--- 1111
Query: 533 DRFMEPKESACLPI---HGSSLSVVGNQDTSQAK--------ESADLGNLPLPD-LGSIE 580
M S C+ I +SLS N + S+ E DL +L +PD LG +
Sbjct: 1112 ---MFSSSSTCVNIVEQTRTSLSKTTNSNNSEYSNLETLDESEPLDLSHLQIPDGLGGPD 1168
Query: 581 EFGVSGELGGPQDLSSWLNNFDDDGLQEDDCIGLEIPMDDLSDLMLM 627
+F DLSSWLN DD DD +GL+IPMDDLSDL +M
Sbjct: 1169 DFDTQA-----GDLSSWLNIDDDALPDTDDLLGLQIPMDDLSDLNMM 1210
>AT2G19390.1 | Symbols: | unknown protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G29790.1); Has
203 Blast hits to 188 proteins in 60 species: Archae - 0;
Bacteria - 11; Metazoa - 24; Fungi - 34; Plants - 93;
Viruses - 0; Other Eukaryotes - 41 (source: NCBI BLink).
| chr2:8390136-8396477 REVERSE LENGTH=1211
Length = 1211
Score = 128 bits (322), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 196/394 (49%), Gaps = 49/394 (12%)
Query: 244 IGLYPEILPDLAE-EDEAISQDIVKLEKELYEQNGRKKKNLDIIDRSIQKGRDMERRNIE 302
+G+ +++P ++ EDE I+ +I KLE+ + + +KK+ +D + + + ++++ + ++
Sbjct: 854 LGISIDLMPSISNVEDEGIADEIKKLEEAICNEGSKKKEIVDRLLKPAIEMKELQEKELD 913
Query: 303 QAAFDHLTEMAYRKRLACRGSKNSKG--AVHKVPKQVALAFLKRTLGRCKRYEEADISCF 360
Q ++ L EMAY K A R N+ G + +K+ KQ ALAF++RTL RC ++E+ SCF
Sbjct: 914 QLGYEKLIEMAYEKSKASRRHHNAGGKNSNNKISKQAALAFVRRTLERCHQFEKTGKSCF 973
Query: 361 SEPTLQNIMFALPSRESDAQPGDCIVS---GTASNTCYKASPQIEVRKSGAVSSTSEKYD 417
SEP ++++ A A D ++ T+++T + P + + SE Y
Sbjct: 974 SEPEIKDMFIA-----GLATAEDTLMDKEYNTSTSTPMGSQPSSSL---ALIGQNSENYA 1025
Query: 418 VQID--HADRGLVDSFQGSIHSSEQASSKNGSVFI-KEKKREMLVNVVVSGSSSRASKFD 474
D ++ L+ EQ + K + + + KKRE+L++ V G+
Sbjct: 1026 KSSDVLPSENALL----------EQTTGKEDTAWSNRVKKRELLLDDVGIGTQ------- 1068
Query: 475 GAVPGGVKGKRSERDRNQSRDQTRQNSIPRAGRLSLDSSQNENKPKAKPKQKNTAYGHDR 534
+ KGKRS+RDR+ + + + GR SL +++ E K KAKPKQK T
Sbjct: 1069 --LSSNTKGKRSDRDRDGKGQASSRGGTNKIGRPSLSNAKGERKTKAKPKQKTTQISPSV 1126
Query: 535 FMEPKESACLPIHGSSLSVVGNQDTSQAKESA-DLGNLPLPD-LGSIEEFGVSGELGGPQ 592
+ + LP + S N + + E DL L +PD LG + P
Sbjct: 1127 RVPEQPKPSLPKPNEANSEYNNLEALEETEPILDLSQLQIPDGLGDFD--------AQPG 1178
Query: 593 DLSSWLNNFDDDGLQEDDCIGLEIPMDDLSDLML 626
D++SW N DD+ ++ D L IP DD+S+L +
Sbjct: 1179 DINSWF-NMDDE--EDFDMTELGIPTDDISELNI 1209