Miyakogusa Predicted Gene
- Lj0g3v0273049.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0273049.1 tr|Q10N85|Q10N85_ORYSJ Expressed protein OS=Oryza
sativa subsp. japonica GN=LOC_Os03g17050 PE=4
SV=1,43.01,0.0000000000004,seg,NULL,gene.g21217.t1.1
(523 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
AT5G13660.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 213 2e-55
AT5G13660.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 213 2e-55
AT5G59830.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 204 1e-52
AT5G59830.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 204 1e-52
AT3G53680.1 | Symbols: | Acyl-CoA N-acyltransferase with RING/F... 122 4e-28
AT2G37520.1 | Symbols: | Acyl-CoA N-acyltransferase with RING/F... 112 5e-25
>AT5G13660.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G59830.2); Has 135 Blast hits to 126 proteins
in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 135; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr5:4405094-4406983 FORWARD
LENGTH=536
Length = 536
Score = 213 bits (543), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 145/404 (35%), Positives = 201/404 (49%), Gaps = 71/404 (17%)
Query: 184 YDRGASNSVSNPHAYEGGDNSISMSL--PYNKGDASILFTGDAYDRTNNNLLT------- 234
Y +G S S ++ G S ++S ++ D S + G +T+ N +
Sbjct: 134 YGQGISTSFETAPSFNSGQESTTLSFGQTFSNTDRSFILPGQFASKTDGNFIRNFNNEGV 193
Query: 235 ----MGQTYNEGDRNL--PIHAIYKEICHTISMDQGFSMVDSNVMSIPQAYNKAQNNSM- 287
+G Y++GD N+ H + K + + +SM Q D N+ S+ +YNK Q N M
Sbjct: 194 GVVPIGDYYDKGDENVLSTFHPLEKGVENFLSMGQSLQKADCNIFSVSSSYNKGQENFMP 253
Query: 288 -LSNHLFSEVENGT-----------IAMGSTDHQRENDMPFVSHSYNKG----------- 324
LS E + T ++ G + +M F+ S +
Sbjct: 254 LLSCEQVPEYDFMTESNYHNENANALSAGQSSFTEGGEMTFMVSSQERAGQSNDQIRRED 313
Query: 325 -ESTIISFGGCDDDDPT-SSLFLSNYGLLMGQAP--SHKPEAVNANE-FVISSSNLPSST 379
S +SFG C + SS+ +SN P + P + A E N P ++
Sbjct: 314 DRSETLSFGDCQKETAMGSSVRVSNNYENFSHDPAITKDPLHIEAEENMSFECRNPPYAS 373
Query: 380 AQTSALETENVPQTRDEIKVSKKATSNNFPSNVRSLLSTGMLDGVSVKYKAWSRE----- 434
+ L VP+ +D K +KK ++N FPSNV+SLLSTG+ DGV+VKY +WSRE
Sbjct: 374 PRVDTLL---VPKIKD-TKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSRERNLKG 429
Query: 435 ------------------VINAYEFERHAGCKTKHPNNHIYFENGKTIYGVVQELRSTPQ 476
V+NAYEFE+HA CKTKHPNNHIYFENGKTIYGVVQEL++TPQ
Sbjct: 430 MIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQ 489
Query: 477 NMLFEVIQTITGSPINQKSFRVWKESFLAAARELQRICGKDEVT 520
LF+ IQ +TGS IN K+F WK S+ A ELQRI GKD+VT
Sbjct: 490 EKLFDAIQNVTGSDINHKNFNTWKASYHVARLELQRIYGKDDVT 533
>AT5G13660.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G59830.2);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr5:4405094-4406983
FORWARD LENGTH=537
Length = 537
Score = 213 bits (542), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 145/405 (35%), Positives = 201/405 (49%), Gaps = 72/405 (17%)
Query: 184 YDRGASNSVSNPHAYEGGDNSISMSL--PYNKGDASILFTGDAYDRTNNNLLT------- 234
Y +G S S ++ G S ++S ++ D S + G +T+ N +
Sbjct: 134 YGQGISTSFETAPSFNSGQESTTLSFGQTFSNTDRSFILPGQFASKTDGNFIRNFNNEGV 193
Query: 235 ----MGQTYNEGDRNL--PIHAIYKEICHTISMDQGFSMVDSNVMSIPQAYNKAQNNSM- 287
+G Y++GD N+ H + K + + +SM Q D N+ S+ +YNK Q N M
Sbjct: 194 GVVPIGDYYDKGDENVLSTFHPLEKGVENFLSMGQSLQKADCNIFSVSSSYNKGQENFMP 253
Query: 288 -LSNHLFSEVENGT-----------IAMGSTDHQRENDMPFVSHSYNKG----------- 324
LS E + T ++ G + +M F+ S +
Sbjct: 254 LLSCEQVPEYDFMTESNYHNENANALSAGQSSFTEGGEMTFMVSSQERAGQSNDQIRRED 313
Query: 325 -ESTIISFGGCDDDDPT-SSLFLSNYGLLMGQAP--SHKPEAVNANE-FVISSSNLPSST 379
S +SFG C + SS+ +SN P + P + A E N P ++
Sbjct: 314 DRSETLSFGDCQKETAMGSSVRVSNNYENFSHDPAITKDPLHIEAEENMSFECRNPPYAS 373
Query: 380 AQTSALETENVPQTRDEIKVSKKATSNNFPSNVRSLLSTGMLDGVSVKYKAWSRE----- 434
+ L VP+ +D K +KK ++N FPSNV+SLLSTG+ DGV+VKY +WSRE
Sbjct: 374 PRVDTLL---VPKIKD-TKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQRNLK 429
Query: 435 -------------------VINAYEFERHAGCKTKHPNNHIYFENGKTIYGVVQELRSTP 475
V+NAYEFE+HA CKTKHPNNHIYFENGKTIYGVVQEL++TP
Sbjct: 430 GMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTP 489
Query: 476 QNMLFEVIQTITGSPINQKSFRVWKESFLAAARELQRICGKDEVT 520
Q LF+ IQ +TGS IN K+F WK S+ A ELQRI GKD+VT
Sbjct: 490 QEKLFDAIQNVTGSDINHKNFNTWKASYHVARLELQRIYGKDDVT 534
>AT5G59830.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G13660.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:24105423-24107071 FORWARD LENGTH=425
Length = 425
Score = 204 bits (519), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 121/265 (45%), Positives = 158/265 (59%), Gaps = 43/265 (16%)
Query: 288 LSNHLFSEVENGTIAMGSTDHQRENDMPFV-----SHSYNKGESTIISFGGCDDDDPTSS 342
L H ++E+ +I S REN+ ++ H Y +S I+FG +D+ S
Sbjct: 167 LEGHSQRKIESSSIQACS----RENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGS 222
Query: 343 LFLSNYGLLMGQAPSHKPEAVNANEFVISSSNLPSSTAQTSALETE-------NVPQTRD 395
SN ++G S+ + + + V S T+ E + ++P+T+
Sbjct: 223 T--SN---VVGNYQSYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKA 277
Query: 396 EIKVSKKATSNNFPSNVRSLLSTGMLDGVSVKYKAWSRE--------------------- 434
E K SKK S +FPSNVRSL+STGMLDGV VKY + SRE
Sbjct: 278 EAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFT 337
Query: 435 -VINAYEFERHAGCKTKHPNNHIYFENGKTIYGVVQELRSTPQNMLFEVIQTITGSPINQ 493
V+NAY FERHAGCKTKHPNNHIYFENGKTIY +VQELR+TP+++LF+VIQT+ GSPINQ
Sbjct: 338 KVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQ 397
Query: 494 KSFRVWKESFLAAARELQRICGKDE 518
K+FR+WKESF AA RELQRI GK+E
Sbjct: 398 KAFRIWKESFQAATRELQRIYGKEE 422
>AT5G59830.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G13660.2); Has 174 Blast hits to 139 proteins
in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr5:24105423-24107071 FORWARD
LENGTH=425
Length = 425
Score = 204 bits (519), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 121/265 (45%), Positives = 158/265 (59%), Gaps = 43/265 (16%)
Query: 288 LSNHLFSEVENGTIAMGSTDHQRENDMPFV-----SHSYNKGESTIISFGGCDDDDPTSS 342
L H ++E+ +I S REN+ ++ H Y +S I+FG +D+ S
Sbjct: 167 LEGHSQRKIESSSIQACS----RENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGS 222
Query: 343 LFLSNYGLLMGQAPSHKPEAVNANEFVISSSNLPSSTAQTSALETE-------NVPQTRD 395
SN ++G S+ + + + V S T+ E + ++P+T+
Sbjct: 223 T--SN---VVGNYQSYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKA 277
Query: 396 EIKVSKKATSNNFPSNVRSLLSTGMLDGVSVKYKAWSRE--------------------- 434
E K SKK S +FPSNVRSL+STGMLDGV VKY + SRE
Sbjct: 278 EAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFT 337
Query: 435 -VINAYEFERHAGCKTKHPNNHIYFENGKTIYGVVQELRSTPQNMLFEVIQTITGSPINQ 493
V+NAY FERHAGCKTKHPNNHIYFENGKTIY +VQELR+TP+++LF+VIQT+ GSPINQ
Sbjct: 338 KVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQ 397
Query: 494 KSFRVWKESFLAAARELQRICGKDE 518
K+FR+WKESF AA RELQRI GK+E
Sbjct: 398 KAFRIWKESFQAATRELQRIYGKEE 422
>AT3G53680.1 | Symbols: | Acyl-CoA N-acyltransferase with
RING/FYVE/PHD-type zinc finger domain |
chr3:19892863-19897412 REVERSE LENGTH=841
Length = 841
Score = 122 bits (307), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 88/153 (57%), Gaps = 23/153 (15%)
Query: 374 NLPSSTAQTSALETENVPQTRDEIKVSKKATSNNFPSNVRSLLSTGMLDGVSVKYKAWS- 432
+LP T NV + +K+ KK S NF SNV+ LL TG+LDG VKY + S
Sbjct: 148 DLPMIQEHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLSTSA 207
Query: 433 -RE---------------------VINAYEFERHAGCKTKHPNNHIYFENGKTIYGVVQE 470
RE V+ AYEFERHAG KTKHPNNHIY ENG+ +Y V+QE
Sbjct: 208 ARELQGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQE 267
Query: 471 LRSTPQNMLFEVIQTITGSPINQKSFRVWKESF 503
LR P ++L EVI+ + GS ++++ F+ WK SF
Sbjct: 268 LRIAPPDVLEEVIRKVAGSALSEEGFQAWKGSF 300
>AT2G37520.1 | Symbols: | Acyl-CoA N-acyltransferase with
RING/FYVE/PHD-type zinc finger domain |
chr2:15745033-15749615 REVERSE LENGTH=829
Length = 829
Score = 112 bits (281), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 108/208 (51%), Gaps = 38/208 (18%)
Query: 334 CDDDDPTSSLFLSNYGLLMGQAPSHKPEAVNANEFVISSSNLPSSTAQTSAL-------- 385
C D S +S+ ++G + + + V + FV+ SST T
Sbjct: 83 CSGSDFGSEETVSDDASVVGSSQTEQSSDVLPSRFVLEIPKHLSSTGITKITFKLSKPKK 142
Query: 386 ETENVPQTRDE------IKV-SKKATSNNFPSNVRSLLSTGMLDGVSVKYKAWS------ 432
E +++P +D +K+ KK S ++PSNV+ LL TG+L+G VKY +
Sbjct: 143 EFDDLPLIKDHTWDAGVVKMPKKKIVSLSYPSNVKKLLETGILEGARVKYISTPPVRQLL 202
Query: 433 -----------------REVINAYEFERHAGCKTKHPNNHIYFENGKTIYGVVQELRSTP 475
+V++AYEFE+HAG KT+HPNNHI+ EN + +Y +VQEL++ P
Sbjct: 203 GIIHSGGYLCGCTTCNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAP 262
Query: 476 QNMLFEVIQTITGSPINQKSFRVWKESF 503
+ +L EVI+ + GS +N++ R WK SF
Sbjct: 263 RVVLEEVIRNVAGSALNEEGLRAWKASF 290