Miyakogusa Predicted Gene
- Lj1g3v1454060.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1454060.1 Non Characterized Hit- tr|K3ZAB6|K3ZAB6_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si023487,56.52,9e-18,coiled-coil,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.27346.1
(492 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT3G48860.2 | Symbols: | unknown protein; INVOLVED IN: biologic...    464 e-131
AT3G48860.1 | Symbols: | unknown protein; INVOLVED IN: biologic...    443 e-124
AT5G23700.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    373 e-103
AT5G13260.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    358 4e-99
AT4G25070.1 | Symbols: | unknown protein; EXPRESSED IN: culture...    332 4e-91
AT4G25070.2 | Symbols: | unknown protein; EXPRESSED IN: culture...    330 1e-90
AT4G08630.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    238 9e-63
>AT3G48860.2 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G23700.1); Has 12429 Blast
hits to 9751 proteins in 897 species: Archae - 180;
Bacteria - 1190; Metazoa - 6552; Fungi - 1361; Plants -
886; Viruses - 50; Other Eukaryotes - 2210 (source: NCBI
BLink). | chr3:18117619-18121853 FORWARD LENGTH=577
Length = 577
Score = 464 bits (1193), Expect = e-131, Method: Compositional matrix adjust.
Identities = 243/378 (64%), Positives = 280/378 (74%), Gaps = 11/378 (2%)
Query: 121 NRSPSPALGRNFVEHTPA-VRSTSAGRXXXXXXXXXXX-----XXXXXXTLRTPMLVPPI 174
RSPSPALGRNF E P+ VRS SAGR +++TP+ +PP+
Sbjct: 126 TRSPSPALGRNFAEQVPSSVRSASAGRPSMSARSTTPTPIPNLMPPSRVSVKTPVSIPPL 185
Query: 175 DPPTNRSREKRFTPDITIRQLNSKDTGDQREASALRDELDMLQEENENILEKLRQAEEKR 234
DPPT RSR+KRF D+ +NSK+ GDQREASALRDELDMLQEENEN+LEKLR+AEEKR
Sbjct: 186 DPPT-RSRDKRFFADVP--SVNSKEKGDQREASALRDELDMLQEENENVLEKLRRAEEKR 242
Query: 235 QEVEARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQSQNERDEDVTALR 294
E EARA+ELEKQVASLGEGVSLEAKLLSRKEAALRQREAAL AKQ ++ +DE++ +LR
Sbjct: 243 VEAEARAKELEKQVASLGEGVSLEAKLLSRKEAALRQREAALNVAKQKKSGKDEEIVSLR 302
Query: 295 VEIQNLKDDXXXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVVLKRCWLARYWGLAVK 354
E++NLKD+ LRTMTQRMILT +EMEEVVLKRCWLARYWGLAV+
Sbjct: 303 SELENLKDEATTAAERLQEAESEAKSLRTMTQRMILTQDEMEEVVLKRCWLARYWGLAVQ 362
Query: 355 HGICADISQSKHEHWSSLAPLPFELVISAGQKAKEESWNKSADGPDRSKLVRDLNDLAGE 414
HGICADI+ S+ EHWS LAPLPFELV SA QKAKE SW+K G DRSK RDL+DL GE
Sbjct: 363 HGICADIAPSRQEHWSKLAPLPFELVTSAAQKAKELSWDKG--GNDRSKAARDLSDLTGE 420
Query: 415 GNIESMLSVEMGLRELASLKVEDAVVLALAQHRRPNLVRQSVLDSKAPGDAKYXXXXXXX 474
GNIESMLSVEMGLRELASLKVEDAVVL AQ R+ +LVR +V DSK G++++
Sbjct: 421 GNIESMLSVEMGLRELASLKVEDAVVLIFAQQRKLSLVRHTVSDSKGHGESRFIDAYELG 480
Query: 475 XXXXXDVLFKEAWLTYFW 492
DV FK+AWL YFW
Sbjct: 481 EAEQEDVAFKQAWLMYFW 498
>AT3G48860.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G23700.1); Has 12232 Blast
hits to 9546 proteins in 892 species: Archae - 172;
Bacteria - 1174; Metazoa - 6487; Fungi - 1343; Plants -
856; Viruses - 50; Other Eukaryotes - 2150 (source: NCBI
BLink). | chr3:18117619-18120865 FORWARD LENGTH=494
Length = 494
Score = 443 bits (1140), Expect = e-124, Method: Compositional matrix adjust.
Identities = 233/353 (66%), Positives = 269/353 (76%), Gaps = 11/353 (3%)
Query: 121 NRSPSPALGRNFVEHTPA-VRSTSAGRXXXXXXXXXXX-----XXXXXXTLRTPMLVPPI 174
RSPSPALGRNF E P+ VRS SAGR +++TP+ +PP+
Sbjct: 126 TRSPSPALGRNFAEQVPSSVRSASAGRPSMSARSTTPTPIPNLMPPSRVSVKTPVSIPPL 185
Query: 175 DPPTNRSREKRFTPDITIRQLNSKDTGDQREASALRDELDMLQEENENILEKLRQAEEKR 234
DPPT RSR+KRF D+ +NSK+ GDQREASALRDELDMLQEENEN+LEKLR+AEEKR
Sbjct: 186 DPPT-RSRDKRFFADVP--SVNSKEKGDQREASALRDELDMLQEENENVLEKLRRAEEKR 242
Query: 235 QEVEARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQSQNERDEDVTALR 294
E EARA+ELEKQVASLGEGVSLEAKLLSRKEAALRQREAAL AKQ ++ +DE++ +LR
Sbjct: 243 VEAEARAKELEKQVASLGEGVSLEAKLLSRKEAALRQREAALNVAKQKKSGKDEEIVSLR 302
Query: 295 VEIQNLKDDXXXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVVLKRCWLARYWGLAVK 354
E++NLKD+ LRTMTQRMILT +EMEEVVLKRCWLARYWGLAV+
Sbjct: 303 SELENLKDEATTAAERLQEAESEAKSLRTMTQRMILTQDEMEEVVLKRCWLARYWGLAVQ 362
Query: 355 HGICADISQSKHEHWSSLAPLPFELVISAGQKAKEESWNKSADGPDRSKLVRDLNDLAGE 414
HGICADI+ S+ EHWS LAPLPFELV SA QKAKE SW+K G DRSK RDL+DL GE
Sbjct: 363 HGICADIAPSRQEHWSKLAPLPFELVTSAAQKAKELSWDKG--GNDRSKAARDLSDLTGE 420
Query: 415 GNIESMLSVEMGLRELASLKVEDAVVLALAQHRRPNLVRQSVLDSKAPGDAKY 467
GNIESMLSVEMGLRELASLKVEDAVVL AQ R+ +LVR +V DSK G++++
Sbjct: 421 GNIESMLSVEMGLRELASLKVEDAVVLIFAQQRKLSLVRHTVSDSKGHGESRF 473
>AT5G23700.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G48860.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:7992851-7996420 FORWARD LENGTH=573
Length = 573
Score = 373 bits (958), Expect = e-103, Method: Compositional matrix adjust.
Identities = 225/411 (54%), Positives = 266/411 (64%), Gaps = 50/411 (12%)
Query: 117 VARPNRSPSPALGRNFVEHTPAVRSTSAGRXXXXXXXXXXXXXXX----XXTLRTPMLVP 172
AR NRS SPA+GRN E +VRS+S GR +L+ P+++P
Sbjct: 97 FARRNRSHSPAIGRNITEQVTSVRSSSTGRPSTFSRSSTPNASPLWMPPKASLKPPVIIP 156
Query: 173 PIDPPTNRSREKRFTPDITIRQLNSKDTGDQREASALRDELDMLQEENENILEKLRQAEE 232
PID + + R++R+ D+ R +NS+D G QREASALRDE+DMLQEENE +LEKL +AEE
Sbjct: 157 PIDH-SFKDRDQRYFGDVP-RLVNSRDKGYQREASALRDEVDMLQEENEIVLEKLHRAEE 214
Query: 233 KRQEVEARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQSQNERDEDVTA 292
R+ EARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAA + ++ + E+V +
Sbjct: 215 MREAAEARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAANEKKDGKKEEVVS 274
Query: 293 LRVEIQNLKDDXXXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVVLKRCWLARYWGLA 352
LR EIQ LKD+ LR MTQRM+LT +EMEEV LKRCWLARYWGLA
Sbjct: 275 LRSEIQILKDEAETAAECLQEAESEAKALRIMTQRMVLTQDEMEEVALKRCWLARYWGLA 334
Query: 353 VKHGICADISQSKHEHWSSLAPLPFELVISAGQKAKEESWNKSADGPDRSKLVRDLNDLA 412
V+HGICADI+ S+HE WS+LAPLPFELVISA QK K+ D+SK R L+DL
Sbjct: 335 VQHGICADIAPSRHEKWSALAPLPFELVISAAQKTKD----------DQSKTARFLSDLP 384
Query: 413 GEGNIESMLSVEMGLRELASLKVEDAVVLALAQHRRPNLVRQSVLDSKAPGD-------- 464
GEGNIESMLSVEMGLRELASLKVEDAV+LA AQ R P+LVRQ DSK G+
Sbjct: 385 GEGNIESMLSVEMGLRELASLKVEDAVMLAFAQKRTPSLVRQ---DSKGHGELSFVESYG 441
Query: 465 -------AKYXXXXXX----------------XXXXXXDVLFKEAWLTYFW 492
A+Y DV FK+AWL YFW
Sbjct: 442 KRRESKHAQYIISAVKLDEILTMLSHFSNAEIKEGEQEDVAFKQAWLMYFW 492
>AT5G13260.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G48860.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:4243164-4246677 FORWARD LENGTH=537
Length = 537
Score = 358 bits (920), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 195/334 (58%), Positives = 234/334 (70%), Gaps = 15/334 (4%)
Query: 172 PPIDPPTNRSR------------EKRFTPDITIRQLNSKDTGDQREASALRDELDMLQEE 219
PP+ P R++ EKR DI N KD+ DQ EASALRDELDMLQEE
Sbjct: 146 PPVPPSKLRNQTTNPLPVATPKTEKRVLADIG--HFNGKDSKDQHEASALRDELDMLQEE 203
Query: 220 NENILEKLRQAEEKRQEVEARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAA 279
N++ILEKLR +EK +E EAR RELEKQV SLGEGVSLEAKLLSRKEAALRQREAALK A
Sbjct: 204 NDSILEKLRLEDEKCKEAEARVRELEKQVTSLGEGVSLEAKLLSRKEAALRQREAALKDA 263
Query: 280 KQSQNERDEDVTALRVEIQNLKDDXXXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVV 339
+Q+++ + + TALR +++ K + LRTMT RMILT +EMEEVV
Sbjct: 264 RQNRDGTNRETTALRSQVETAKLETAAIVAQLQGAESEVNGLRTMTHRMILTQKEMEEVV 323
Query: 340 LKRCWLARYWGLAVKHGICADISQSKHEHWSSLAPLPFELVISAGQKA-KEESWNKSADG 398
LKRCWLARYWGLA ++GIC+DI+ SK+E+WSSLAPLPFE+V+SAGQKA +E +S +
Sbjct: 324 LKRCWLARYWGLASRYGICSDIATSKYEYWSSLAPLPFEIVLSAGQKAKEESWEKESEEN 383
Query: 399 PDRSKLVRDLNDLAGEGNIESMLSVEMGLRELASLKVEDAVVLALAQHRRPNLVRQSVLD 458
RS+LV+D+NDL GEGNIESMLSVEMGL+EL SLKVE A+ + LAQ R N R S ++
Sbjct: 384 EKRSQLVQDINDLTGEGNIESMLSVEMGLKELTSLKVEVAITITLAQLRLANTTRLSDIE 443
Query: 459 SKAPGDAKYXXXXXXXXXXXXDVLFKEAWLTYFW 492
K+PG K DVLFKEAWLTYFW
Sbjct: 444 LKSPGGPKITETLELSQEESEDVLFKEAWLTYFW 477
>AT4G25070.1 | Symbols: | unknown protein; EXPRESSED IN: cultured
cell; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT3G48860.2); Has 14837 Blast hits
to 10961 proteins in 1163 species: Archae - 189;
Bacteria - 1924; Metazoa - 7665; Fungi - 1127; Plants -
653; Viruses - 80; Other Eukaryotes - 3199 (source: NCBI
BLink). | chr4:12872482-12876468 FORWARD LENGTH=765
Length = 765
Score = 332 bits (851), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 180/311 (57%), Positives = 226/311 (72%), Gaps = 5/311 (1%)
Query: 184 KRFTPDITIRQLNSKDTGDQREASALRDELDMLQEENENILEKLRQAEEKRQEVEARARE 243
KR+ P + NS D REASALRDELDMLQEEN+NI++KL++AEE+R+ EARA+E
Sbjct: 381 KRYHPANILAPNNSNQQEDDREASALRDELDMLQEENDNIMDKLQRAEERREAAEARAKE 440
Query: 244 LEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQSQNERDEDVTALRVEIQNLKDD 303
LEKQVASLGEG + + KLL RKEAALRQREAAL+AA+Q ++ R+ + AL E Q+LKD+
Sbjct: 441 LEKQVASLGEGANFDVKLLKRKEAALRQREAALRAAEQKRDGRNRETNALSSEFQSLKDE 500
Query: 304 XXXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVVLKRCWLARYWGLAVKHGICADISQ 363
LRTM R IL+ EEMEEVVLKRCWLARYW LAV+HGIC DIS
Sbjct: 501 AEKSTEQLQEVEAEIKSLRTMIHRTILSQEEMEEVVLKRCWLARYWELAVQHGICEDIST 560
Query: 364 SKHEHWSSLAPLPFELVISAGQKAKEESWNKSADGPDR--SKLVRDLNDLAGEGNIESML 421
S++EHWS+LAPLP E+V+SA QK+ E+SW G DR SK++ + +DL GEGNIESML
Sbjct: 561 SRYEHWSALAPLPSEVVLSAAQKS-EDSWQTG--GSDRTWSKVISNFSDLNGEGNIESML 617
Query: 422 SVEMGLRELASLKVEDAVVLALAQHRRPNLVRQSVLDSKAPGDAKYXXXXXXXXXXXXDV 481
+VE GLRE+ASLKVEDAV+LAL+++R+ N+ RQ+V D + G+ K+ D+
Sbjct: 618 AVETGLREIASLKVEDAVMLALSRYRQTNVARQAVTDPRVQGEPKFSETFELSHDEQQDI 677
Query: 482 LFKEAWLTYFW 492
LFKEAWL YFW
Sbjct: 678 LFKEAWLLYFW 688
>AT4G25070.2 | Symbols: | unknown protein; EXPRESSED IN: cultured
cell; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT3G48860.2); Has 30201 Blast hits
to 17322 proteins in 780 species: Archae - 12; Bacteria
- 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr4:12872482-12876468 FORWARD LENGTH=767
Length = 767
Score = 330 bits (846), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 179/310 (57%), Positives = 225/310 (72%), Gaps = 5/310 (1%)
Query: 185 RFTPDITIRQLNSKDTGDQREASALRDELDMLQEENENILEKLRQAEEKRQEVEARAREL 244
R+ P + NS D REASALRDELDMLQEEN+NI++KL++AEE+R+ EARA+EL
Sbjct: 384 RYHPANILAPNNSNQQEDDREASALRDELDMLQEENDNIMDKLQRAEERREAAEARAKEL 443
Query: 245 EKQVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQSQNERDEDVTALRVEIQNLKDDX 304
EKQVASLGEG + + KLL RKEAALRQREAAL+AA+Q ++ R+ + AL E Q+LKD+
Sbjct: 444 EKQVASLGEGANFDVKLLKRKEAALRQREAALRAAEQKRDGRNRETNALSSEFQSLKDEA 503
Query: 305 XXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVVLKRCWLARYWGLAVKHGICADISQS 364
LRTM R IL+ EEMEEVVLKRCWLARYW LAV+HGIC DIS S
Sbjct: 504 EKSTEQLQEVEAEIKSLRTMIHRTILSQEEMEEVVLKRCWLARYWELAVQHGICEDISTS 563
Query: 365 KHEHWSSLAPLPFELVISAGQKAKEESWNKSADGPDR--SKLVRDLNDLAGEGNIESMLS 422
++EHWS+LAPLP E+V+SA QK+ E+SW G DR SK++ + +DL GEGNIESML+
Sbjct: 564 RYEHWSALAPLPSEVVLSAAQKS-EDSWQTG--GSDRTWSKVISNFSDLNGEGNIESMLA 620
Query: 423 VEMGLRELASLKVEDAVVLALAQHRRPNLVRQSVLDSKAPGDAKYXXXXXXXXXXXXDVL 482
VE GLRE+ASLKVEDAV+LAL+++R+ N+ RQ+V D + G+ K+ D+L
Sbjct: 621 VETGLREIASLKVEDAVMLALSRYRQTNVARQAVTDPRVQGEPKFSETFELSHDEQQDIL 680
Query: 483 FKEAWLTYFW 492
FKEAWL YFW
Sbjct: 681 FKEAWLLYFW 690
>AT4G08630.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G48860.2); Has 1487 Blast hits to 747 proteins
in 184 species: Archae - 0; Bacteria - 56; Metazoa -
305; Fungi - 197; Plants - 180; Viruses - 3; Other
Eukaryotes - 746 (source: NCBI BLink). |
chr4:5506998-5511959 REVERSE LENGTH=845
Length = 845
Score = 238 bits (606), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 149/384 (38%), Positives = 213/384 (55%), Gaps = 67/384 (17%)
Query: 168 PMLVPPIDP------PTNRSREKRFTPDITIRQLNSKDTGDQREASALRDELDMLQEENE 221
P+ + P+ P PTN ++KRF+ D+ N ++ G QR SAL+DE+DMLQEENE
Sbjct: 397 PISLKPVTPAFQSNTPTNLRKDKRFSMDLG-SSGNLRELGSQRSTSALQDEVDMLQEENE 455
Query: 222 NILEKLRQAEEKRQEVEARARELEKQVASLGEGVSLEAKLLSRKEAALRQREAALKAAKQ 281
++LEKLR AE+K +E +ARA++LEKQV LGEGV+++A+LLSR+ + L K+
Sbjct: 456 SLLEKLRLAEDKCEEADARAKQLEKQVEILGEGVTMDARLLSRQASVL------FNFWKR 509
Query: 282 SQNERDEDVTALRVEIQNLKDDXXXXXXXXXXXXXXXXXLRTMTQRMILTHEEMEEVVLK 341
+ + + K+ L+T+T+R+ILT EEMEEVVLK
Sbjct: 510 GSSTTERGCFENCITKSWRKEGRSSSLDQLHEVELELNSLKTVTKRLILTQEEMEEVVLK 569
Query: 342 RCWLARYWGLAVKHGICADISQSKHEHWSSLAPLPFELVISAGQKAKEE----------- 390
RCWL+RYWGL V+HGI DI+ KHE+WSS APLP E+V+SAGQ+A++
Sbjct: 570 RCWLSRYWGLCVRHGIQPDIAGGKHEYWSSFAPLPLEIVLSAGQRARDGVSQCNIFHLAA 629
Query: 391 -------------------SWNKSADGP--DRSKLVRDLNDLAGEGNIESMLSVEMGLRE 429
S +++A+ +R K +++L + +GEGN+E+M+ VE GLRE
Sbjct: 630 EISLELFGIVLTSLVLTLWSPHQAANNTYGEREKSLQNLQETSGEGNLENMIWVEKGLRE 689
Query: 430 LAS--------------------LKVEDAVVLALAQHRRPNLVRQSVLDS-KAPGDAKYX 468
LAS LKV++AV +AQ+RR + V D K P D ++
Sbjct: 690 LASLKNQSSVIQETDLKYDSLRCLKVQEAVAFVMAQNRRNTSSKFFVSDEVKMPMDGQF- 748
Query: 469 XXXXXXXXXXXDVLFKEAWLTYFW 492
DV FK+AWL+YFW
Sbjct: 749 EAFELSDEEVEDVNFKQAWLSYFW 772
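
The per-hit statistics lines in this report ("Score = ...", "Identities = ...") can be read programmatically. The following is a minimal sketch, not part of BLAST itself; the helper names parse_evalue and parse_hsp are hypothetical, and the regular expressions assume the line layout shown above. One detail worth handling: this legacy BLAST version prints an E-value of 1e-131 as "e-131", which Python's float() rejects unless a leading "1" is added.

```python
import re

# Patterns for the two statistics lines as they appear in this report.
# Assumption: the layout matches legacy BLASTP 2.2.x text output.
STATS = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\),\s*"
    r"Positives\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
    r"(?:,\s*Gaps\s*=\s*(\d+)/(\d+)\s*\((\d+)%\))?"
)


def parse_evalue(text):
    """Convert an E-value string to float, accepting the short form 'e-131'."""
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text  # 'e-131' means 1e-131
    return float(text)


def parse_hsp(score_line, ident_line):
    """Extract score, E-value and identity counts from one alignment header."""
    s = STATS.search(score_line)
    i = IDENT.search(ident_line)
    if s is None or i is None:
        raise ValueError("unrecognized statistics line")
    identities, align_len = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": parse_evalue(s.group(3)),
        "identities": identities,
        "align_len": align_len,
        # Unrounded percentage; the report itself prints a whole number.
        "pct_identity": 100.0 * identities / align_len,
        "positives": int(i.group(4)),
        "gaps": int(i.group(7)) if i.group(7) else 0,
    }


# Usage with the top hit (AT3G48860.2) from the report above.
top_hit = parse_hsp(
    "Score = 464 bits (1193), Expect = e-131, Method: Compositional matrix adjust.",
    "Identities = 243/378 (64%), Positives = 280/378 (74%), Gaps = 11/378 (2%)",
)
```

Computing the percentage from the raw counts avoids relying on the whole-number value printed in parentheses.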