Miyakogusa Predicted Gene
- Lj0g3v0244399.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0244399.1 Non Characterized Hit- tr|F6HXW8|F6HXW8_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,42.86,0.000000000002,DUF4220,Domain of unknown function DUF4220;
SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.15968.1
(361 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT5G45540.1 | Symbols: | Protein of unknown function (DUF594) |...    166   2e-41
AT5G45480.1 | Symbols: | Protein of unknown function (DUF594) |...    142   3e-34
AT5G45530.1 | Symbols: | Protein of unknown function (DUF594) |...    135   6e-32
AT5G45460.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    124   8e-29
AT5G45470.1 | Symbols: | Protein of unknown function (DUF594) |...    118   8e-27
AT4G19090.1 | Symbols: | Protein of unknown function (DUF594) |...     74   2e-13
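
The hit summary above can be read into (identifier, bit score, E-value) records. The following is a minimal sketch, assuming each summary line starts with the subject identifier followed by " |" and ends with the bit score and the E-value as the last two whitespace-separated fields. The names `SUMMARY`, `HIT_RE` and `parse_hits` are illustrative and not part of BLAST.

```python
import re

# Hit-summary lines taken from the report above.
SUMMARY = """\
AT5G45540.1 | Symbols: | Protein of unknown function (DUF594) |... 166 2e-41
AT5G45480.1 | Symbols: | Protein of unknown function (DUF594) |... 142 3e-34
AT5G45530.1 | Symbols: | Protein of unknown function (DUF594) |... 135 6e-32
AT5G45460.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 124 8e-29
AT5G45470.1 | Symbols: | Protein of unknown function (DUF594) |... 118 8e-27
AT4G19090.1 | Symbols: | Protein of unknown function (DUF594) |... 74 2e-13
"""

# Assumed layout: identifier, " |", free-text description, bit score, E-value.
# E-values printed without a leading digit (e.g. "e-100") are not handled here.
HIT_RE = re.compile(r"^(\S+)\s+\|.*\s(\d+(?:\.\d+)?)\s+(\d\S*)$")


def parse_hits(text):
    """Return a list of (subject_id, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_RE.match(line.strip())
        if match:
            subject_id, bits, evalue = match.groups()
            hits.append((subject_id, float(bits), float(evalue)))
    return hits


if __name__ == "__main__":
    for subject_id, bits, evalue in parse_hits(SUMMARY):
        print(subject_id, bits, evalue)
```

Because the regular expression anchors on the last two fields, it tolerates either single or multiple spaces between columns.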
>AT5G45540.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18458294-18460705 REVERSE LENGTH=803
Length = 803
Score = 166 bits (420), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 165/334 (49%), Gaps = 45/334 (13%)
Query: 1 MEDNALWSRRLFSYIAEVVLATYLFLRSWTNSDLNILAIPIFIVGIVKIGERIWVLWSAS 60
+EDN LW R LFS + + V Y+ L S N L I +F+ G++K ER L+SAS
Sbjct: 111 LEDNELWDRHLFSLVCQAVATVYVILLSIPNRLLTPTLI-MFVGGVIKYVERTAALFSAS 169
Query: 61 SQQFKESLFPDPDPGPNYARYMEAYISASLEGFKVEVQGLNETPPAGGSSGNPIHTYNAV 120
+FK+S+ DPDPG NYA+ ME Y + +V + + P G GN V
Sbjct: 170 LDKFKDSMLDDPDPGANYAKLMEEYEARKKMNMPTDVIVVKD--PEKGREGN-----TPV 222
Query: 121 AEGNNIIPLPETDTYGPAITVKIAHKFLRISKLLFADLILSSQDVTESRSCLLNGNGKDV 180
N + L ++ A+K+ I K L DLI ++Q+ ESR ++
Sbjct: 223 RPDNELTALQ---------VIQYAYKYFNIFKGLIVDLIFTNQERDESRKFFDKLTAEEA 273
Query: 181 FEVMEIELGFMNDLFYTKAGVIYSYIGSFLRFITLSCNISVLCAFFSIEKDQYPKVDVFI 240
++E+ELG + D +TKA +++++ G+ RFI L C ++ LC F +KDQY DV +
Sbjct: 274 LRIIEVELGLIYDCLFTKAEILHNWTGAVFRFIALGCLVASLCLFKMNKKDQYDGFDVVL 333
Query: 241 TGVLLLGAITLELYSVILHLFSDWTMLWLSM------HKNKVTNKGISLIQFFKS----- 289
T LL+ I L+ ++++ SDWT+ L K+ +T++ ++ I FK+
Sbjct: 334 TYALLICGIALDSIALLMFCVSDWTIARLRKLKEDLEEKDTLTDRVLNWILDFKTLRWKR 393
Query: 290 -----------------KRWSGSIGQFNLISFCL 306
+RWS + +NLI FCL
Sbjct: 394 SKCSQDGHQVLNRNFMFRRWSEYVHAYNLIGFCL 427
>AT5G45480.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18426296-18428929 REVERSE LENGTH=877
Length = 877
Score = 142 bits (358), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 140/278 (50%), Gaps = 17/278 (6%)
Query: 1 MEDNALWSRRLFSYIAEVVLATYLFLRSWTNSDLNILAIPIFIVGIVKIGERIWVLWSAS 60
+EDN LW R L + V Y+ L+S N+ + + +F G++K ER L+ AS
Sbjct: 111 LEDNELWLRHLLGLFFQSVATVYVLLQSLPNALWKPILL-VFATGVIKYVERTLALYLAS 169
Query: 61 SQQFKESLFPDPDPGPNYARYMEAYISASLEGFKVEVQGLNETPPAGGSSGNPIHTYNAV 120
+FK+S+ PDPGPNYA+ ME Y A+ + K+ Q + G +P
Sbjct: 170 LDKFKDSMIQRPDPGPNYAKLMEEY--AAKKDMKMPTQIIK----VGEPEKDP------- 216
Query: 121 AEGNNIIPLPETDTYGPAITVKIAHKFLRISKLLFADLILSSQDVTESRSCLLNGNGKDV 180
+ P+ D + P ++ A+K+ I K L DLI + Q ES+ + ++
Sbjct: 217 ---RDDAPVKPPDGFTPLNILQYAYKYFNIFKGLVVDLIFTFQQRAESKRFFDSLKAEEA 273
Query: 181 FEVMEIELGFMNDLFYTKAGVIYSYIGSFLRFITLSCNISVLCAFFSIEKDQYPKVDVFI 240
++E+EL F+ YTKA +++++IG RFI L C + L F K Y DV +
Sbjct: 274 LRILEVELNFIYAALYTKAEILHNWIGFLFRFIALGCLAAALRIFQYKSKKDYSGFDVGL 333
Query: 241 TGVLLLGAITLELYSVILHLFSDWTMLWLSMHKNKVTN 278
T LLLG I L+ ++I+ SDWT + L K++V +
Sbjct: 334 TYALLLGGIALDCIALIMFCASDWTFVRLRKMKDEVDD 371
>AT5G45530.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18454316-18457222 REVERSE LENGTH=798
Length = 798
Score = 135 bits (339), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 156/356 (43%), Gaps = 71/356 (19%)
Query: 1 MEDNALWSRRLFSYIAEVVLATYLFLRSWTNSDLNILAIPI---FIVGIVKIGERIWVLW 57
+EDNALW R LF +++ + Y ++S N +L PI FI G +K ER L+
Sbjct: 110 LEDNALWQRHLFGLVSQALAGVYAVVQSLEN----VLWPPITLLFITGTIKYVERTRALY 165
Query: 58 SASSQQFKESLFPDPDPGPNYARYMEAYISASLEGFKVEV-----QGLNETPPAGGSSGN 112
SAS +FK+ + D G NYA+ ME + S + E+ +E PP
Sbjct: 166 SASLDKFKDRMLQRADAGSNYAKLMEEFASRKMSNLPTEIFLTDEPDKHERPPT------ 219
Query: 113 PIHTYNAVAEGNNIIPLPETDTYGPAITVKIAHKFLRISKLLFADLILSSQDVTESRSCL 172
+ P+ D I V+ KF K L DLI S ++ ESR
Sbjct: 220 --------------LVKPDRDLTDLEI-VQYGFKFFNTFKGLVVDLIFSFRERDESRDFF 264
Query: 173 LNGNGKDVFEVMEIELGFMNDLFYTKAGVIYSYIGSFLRFITLSCNISVLCAFF-----S 227
+ ++E ELGF+ + YTK ++++ IG+ R I+ S+L +FF
Sbjct: 265 KELKPGEALRIIETELGFLYESMYTKTAILHTGIGTLFRLISFG---SLLSSFFVFHRRP 321
Query: 228 IEKDQYPKVDVFITGVLLLGAITLELYSVILHLFSDWTMLWLSMHKNKVTNKGISLIQFF 287
++ + + DV IT VL + I L+L S+++ L SDWT L K+ K S+ F
Sbjct: 322 LKSEDFHGADVVITYVLFIVGIALDLASMVIFLLSDWTFAVLRNLKDDPEEKSTSIDSLF 381
Query: 288 K-----------------------------SKRWSGSIGQFNLISFCLLKAKKQRL 314
++RWSG+I FN I FC LKAK R+
Sbjct: 382 NWFLEFRKPRWKKHTCNGNQTHEVLSTGFFTRRWSGTIYGFNFIGFC-LKAKVSRI 436
>AT5G45460.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: Protein of unknown function
(DUF594) (TAIR:AT5G45470.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:18417154-18419265 REVERSE LENGTH=703
Length = 703
Score = 124 bits (312), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 154/338 (45%), Gaps = 32/338 (9%)
Query: 1 MEDNALWSRRLFSYIAEVVLATYLFLRSWTNSDLNILAIPIFIVGIVKIGERIWVLWSAS 60
+EDNALW R +F + + + Y+ L+S NS L + + +FI G +K ER L+SAS
Sbjct: 111 LEDNALWLRNVFGLVFQAIAGVYVVLQSLPNS-LWVTILLVFISGTIKYLERTTALYSAS 169
Query: 61 SQQFKESLFPDPDPGPNYARYMEAYISASLEGFKVEVQGLNETPPAGGSSGNPIHTYNAV 120
+F++S+ PDPGPNYA+ ME Y + ++ ++E P H A
Sbjct: 170 LDKFRDSMIQGPDPGPNYAKLMEEYKAKKEAKLPTKIILIDE-PDKEHRPKKLEHPSLAS 228
Query: 121 AEGNNIIPLPETDTYGPAITVKIAHKFLRISKLLFADLILSSQDVTESRSCLLN-GNGKD 179
+ E Y A+KF K L +LI S ++ +S N + ++
Sbjct: 229 ETKRKELTHLEIAQY--------AYKFFNTFKGLVVNLIFSFRERDQSIEIFQNLEDPEE 280
Query: 180 VFEVMEIELGFMNDLFYTKAGVIYSYIGSFLRFITLSCNISVLCAFFSIEKD--QYPKVD 237
++EIELGF+ D +TK V+++ +G+ R + ++ F I + D
Sbjct: 281 ALRIIEIELGFLYDALFTKNAVLHTVLGTVSRVVASGSLVAAFIIFHKISNKGRDFHGAD 340
Query: 238 VFITGVLLLGAITLELYSVILHLFSDWTMLWLSMHKNKVTNKGISLIQFFKSKRWSGSIG 297
V IT +L + L+ S++L LFSDWT LS K+ +FF
Sbjct: 341 VVITYILFAVGLVLDFISILLFLFSDWTCAALSSLKDDPDEPLSWKDRFFN--------- 391
Query: 298 QFNLISFCLLKAKKQRLKIGHRYIKGFE---KKGAKYC 332
CLL+ +K R K+ + KG K+G K C
Sbjct: 392 -------CLLEFRKLRWKMQECHNKGEHKCTKEGEKPC 422
>AT5G45470.1 | Symbols: | Protein of unknown function (DUF594) |
chr5:18422164-18424764 REVERSE LENGTH=866
Length = 866
Score = 118 bits (295), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 132/268 (49%), Gaps = 13/268 (4%)
Query: 1 MEDNALWSRRLFSYIAEVVLATYLFLRSWTNSDLNILAIPIFIVGIVKIGERIWVLWSAS 60
+EDNALW R +F + + + Y+ + S NS L ++ + +F+ G +K ER L+SAS
Sbjct: 111 LEDNALWLRHVFGLVFQAIAGVYVVVMSLPNS-LWVVIVLVFVSGTIKYLERTTALYSAS 169
Query: 61 SQQFKESLFPDPDPGPNYARYMEAYISASLEGFKVEVQGLNETPPAGGSSGNPIHTYNAV 120
+F++S+ PDPGPNYA+ ME Y + ++ ++E P H A+
Sbjct: 170 LDKFRDSMIQAPDPGPNYAKLMEEYKAKKEARLPTKIVLIDE-PDKENRPKKLEHP--AL 226
Query: 121 AEGNNIIPLPETDTYGPAITVKIAHKFLRISKLLFADLILSSQDVTESRSCLLNGNG-KD 179
A L + + V+ A+KF K L +LI S ++ ES N N ++
Sbjct: 227 ASKKRKKDLTDLE------IVQYAYKFFNTFKGLVVNLIFSFRERDESLEIFENLNDPEE 280
Query: 180 VFEVMEIELGFMNDLFYTKAGVIYSYIGSFLRFITLSCNISVLCAFFSI--EKDQYPKVD 237
++EIELGF+ D +TK ++++ IG+ R ++ F + + D
Sbjct: 281 ALRIIEIELGFLYDALFTKIAILHTGIGTVSRVFASGTLVAAFIIFHKKPNKGTDFHGAD 340
Query: 238 VFITGVLLLGAITLELYSVILHLFSDWT 265
V +T L + L+ S++L LFSDWT
Sbjct: 341 VVVTYTLFAVGLVLDFISILLFLFSDWT 368
>AT4G19090.1 | Symbols: | Protein of unknown function (DUF594) |
chr4:10449900-10452757 FORWARD LENGTH=751
Length = 751
Score = 73.9 bits (180), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 143/346 (41%), Gaps = 68/346 (19%)
Query: 1 MEDNALWSRRLFSYIAEVVLATYLFLRSWTNSDLNILAIPIFIVGIVKIGERIWVLWSAS 60
+EDNALW+R + + + Y+ ++S N L+++ + +FI G K ER L+ AS
Sbjct: 105 LEDNALWNRHFLGLVFQALAGVYVVVQSLPNV-LSVIILLLFIAGTSKYLERTIALYLAS 163
Query: 61 SQQFKESLFPDPDPGPNYARYMEAYISASLEGFKVEVQGLNETPPAGGSSGNPIHTYNAV 120
S +++ S+ + +Y + + L+ H
Sbjct: 164 SDKYRNSMLQASNSRFDYTD---------------QTRDLDMDTKLASEMNMKEH----- 203
Query: 121 AEGNNIIPLPETDTYGPAITVKIAHKFLRISKLLFADLILSSQDVTESRSCLLNGNGKD- 179
G P P + + HK L ++L L +D ES++ KD
Sbjct: 204 -RGQ---PKP--------LKLLQPHKELTHLEILQYAFFLELRD--ESKAFFSALQLKDE 249
Query: 180 VFEVMEIELGFMNDLFYTKAGVIYSYIGSFLRFITLSCNISVLCAFFSIEK--DQYPKVD 237
F ++E EL F+ + YTK V++S++G RFI+L +S + ++ K D
Sbjct: 250 AFCIIEAELDFIYEGLYTKGSVLHSWVGLVSRFISLGSLLSAFTIYHYRHNKIQEFHKAD 309
Query: 238 VFITGVLLLGAITLELYSVILHLFSDWTMLWLSMHKNKVTNKG------ISLIQFFKS-- 289
+ IT L L I L++ S+ + + SDWT L+ K+ + ++ I F K
Sbjct: 310 IVITYTLFLVGIALDVISIHMFMVSDWTTAILAKLKDDPDERYSGKDHILNWILFLKRPK 369
Query: 290 ---------------------KRWSGSIGQFNLISFCLLKAKKQRL 314
+RW+GSI N +++ +KA +R+
Sbjct: 370 WKWQTCREGDQQEVLNTPFLLRRWTGSITMLNFLTYS-MKADTERI 414