Miyakogusa Predicted Gene
- Lj4g3v2551470.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2551470.1 Non Characterized Hit- tr|I1MYE7|I1MYE7_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,74.83,0,SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; A_thal_3515: uncharacterized
plant-specific domain,,CUFF.51117.1
(277 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                 Score   E
Sequences producing significant alignments:                     (bits) Value
AT1G67330.1 | Symbols: | Protein of unknown function (DUF579) |... 327 5e-90
AT1G27930.1 | Symbols: | Protein of unknown function (DUF579) |... 326 1e-89
AT1G71690.1 | Symbols: | Protein of unknown function (DUF579) |... 211 5e-55
AT4G09990.1 | Symbols: | Protein of unknown function (DUF579) |... 200 1e-51
AT1G09610.1 | Symbols: | Protein of unknown function (DUF579) |... 199 1e-51
AT1G33800.1 | Symbols: | Protein of unknown function (DUF579) |... 199 1e-51
AT2G15440.1 | Symbols: | Protein of unknown function (DUF579) |... 159 2e-39
AT5G67210.1 | Symbols: | Protein of unknown function (DUF579) |... 155 2e-38
AT3G50220.1 | Symbols: | Protein of unknown function (DUF579) |... 151 4e-37
AT4G24910.1 | Symbols: | Protein of unknown function (DUF579) |... 135 4e-32
>AT1G67330.1 | Symbols: | Protein of unknown function (DUF579) |
chr1:25214118-25214993 FORWARD LENGTH=291
Length = 291
Score = 327 bits (839), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 166/284 (58%), Positives = 197/284 (69%), Gaps = 18/284 (6%)
Query: 7 LPERRWFLGLAIVGLIGAVLFIATAITASDRRF-MCQLAPGIIKTRTQTGEDYNPTPIQL 65
L ER WFL +A+ GLIG + I + I A+D +C A K + Y TPIQL
Sbjct: 13 LLERPWFLAVALAGLIGGAMLITSFIRATDNTLSLCSTA----KNTAASIAKYTATPIQL 68
Query: 66 RAILHYATSRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFL 125
++I+HYATS PQQS EI IS +VLK P NFLVFGLG DSLMWAS+NPGG T+FL
Sbjct: 69 QSIVHYATSHTVPQQSFEEISISLNVLKER-LPCNFLVFGLGRDSLMWASLNPGGTTVFL 127
Query: 126 EEDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACSPNKAF-LRGNRACRL 184
EEDP+W++ VLKD P LRAH V+YRT L EA LLS+ ++EP C P KAF +R N C L
Sbjct: 128 EEDPEWIEAVLKDAPSLRAHHVQYRTHLSEAGRLLSTYKNEPMCLPAKAFPIRYNEKCPL 187
Query: 185 ALENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSGVTHVFLHD-- 242
AL +LPDE Y+TEWDLIM+DAPKGY+ EAPGRMAA+FS+A+MARNRKG G THVFLHD
Sbjct: 188 ALTSLPDEFYDTEWDLIMVDAPKGYFPEAPGRMAAIFSSAIMARNRKGDGTTHVFLHDVN 247
Query: 243 -------AGEFLCKKNLVKGVGRLWHFQIPPSYNRTD--AKSFC 277
A EFLC+K V VGRLWHF+IP + N TD FC
Sbjct: 248 RKVENAFANEFLCEKYKVNSVGRLWHFEIPNAANMTDQPGDRFC 291
>AT1G27930.1 | Symbols: | Protein of unknown function (DUF579) |
chr1:9731510-9732379 REVERSE LENGTH=289
Length = 289
Score = 326 bits (836), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 158/269 (58%), Positives = 191/269 (71%), Gaps = 11/269 (4%)
Query: 9 ERRWFL-GLAIVGLIGAVLFIATAITASDRRFMCQLAPGIIKTRTQTGEDYNPTPIQLRA 67
E+RW + G+ + GL+G L + I A+D DY TPIQL+A
Sbjct: 8 EKRWIITGVLLAGLVGGALLFTSFIRAADETLFLCSTASAKSRAVAAAADYEATPIQLQA 67
Query: 68 ILHYATSRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFLEE 127
I+HYATS V PQQ+++EI ISF++LK + P+NFLVFGLG DSLMWAS+NP G TLFLEE
Sbjct: 68 IVHYATSNVVPQQNLAEISISFNILKKLA-PANFLVFGLGRDSLMWASLNPRGKTLFLEE 126
Query: 128 DPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACSPNKAFLRGNRACRLALE 187
D +W Q V KD P LRAH VRYRTQL++A +LL S ++EP C P K++LRGN C+LAL
Sbjct: 127 DLEWFQKVTKDSPFLRAHHVRYRTQLQQADSLLRSYKTEPKCFPAKSYLRGNEKCKLALT 186
Query: 188 NLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSGVTHVFLHD----- 242
LPDE Y+TEWDL+M+DAPKGY+AEAPGRMAA+FSAAVMARNRK GVTHVFLHD
Sbjct: 187 GLPDEFYDTEWDLLMVDAPKGYFAEAPGRMAAIFSAAVMARNRKKPGVTHVFLHDVNRRV 246
Query: 243 ----AGEFLCKKNLVKGVGRLWHFQIPPS 267
A EFLC+K V GRLWHF IPP+
Sbjct: 247 EKTFAEEFLCRKYRVNAAGRLWHFAIPPA 275
>AT1G71690.1 | Symbols: | Protein of unknown function (DUF579) |
chr1:26947806-26949064 FORWARD LENGTH=295
Length = 295
Score = 211 bits (537), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 165/287 (57%), Gaps = 20/287 (6%)
Query: 1 MKKRYPLPERRWFLGLAIVGLIGAVLFIATAITASDRRFMCQ-------LAPGIIKTRTQ 53
M P P + + V L+ LF T S Q ++ ++ Q
Sbjct: 1 MTTYLPFPMTTRLILSSFVSLVVLTLFFITRTGFSPSSSFHQPLNNTLRISTSSTGSKLQ 60
Query: 54 TGEDYNPTPIQL-RAILHYATSRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLM 112
+ N P L A++HYA+S VTPQQ++SEI ++ L+ P NFLVFGLGHDSLM
Sbjct: 61 SPRSCNKIPPSLADALVHYASSNVTPQQTLSEISVTKKELEKKS-PCNFLVFGLGHDSLM 119
Query: 113 WASMNPGGNTLFLEEDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACSPN 172
WA++N GG T+FL+ED W+ + + P L ++ VRY+T++R+A AL+++++ C
Sbjct: 120 WATLNHGGRTIFLDEDESWIHQIAEKFPSLESYHVRYKTKVRDAEALMAATKDREECRRV 179
Query: 173 KAFLRGNRACRLALENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKG 232
LR + C LAL+ LP+ VYETEWDLIM+DAP G++ EAPGRM+A+++A ++AR RK
Sbjct: 180 STDLRVS-TCELALKGLPEVVYETEWDLIMVDAPTGFHEEAPGRMSAIYTAGMIARRRKD 238
Query: 233 -SGVTHVFLHDAG---------EFLCKKNLVKGVGRLWHFQIPPSYN 269
T VF+HD EFLC+ + K GRL HF +P N
Sbjct: 239 EEETTAVFVHDVDRKVEDEFSMEFLCRDYMTKQEGRLRHFTVPSHRN 285
>AT4G09990.1 | Symbols: | Protein of unknown function (DUF579) |
chr4:6259110-6260064 REVERSE LENGTH=290
Length = 290
Score = 200 bits (508), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 140/220 (63%), Gaps = 14/220 (6%)
Query: 67 AILHYATSRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFLE 126
A++HY TS +TPQQ+ E+ +S VL P NFLVFGLGHDSLMWAS+N GG TLFLE
Sbjct: 68 ALVHYVTSEITPQQTFDEVSVSKRVLDKKS-PCNFLVFGLGHDSLMWASLNHGGRTLFLE 126
Query: 127 EDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACSPNKAFLRGNRACRLAL 186
ED W++TV K P L ++ V Y T++++++ L+ R+E + + + C L+L
Sbjct: 127 EDEAWIETVTKKFPNLESYHVVYDTKVKDSNKLMELKRTEDCKAVSDP---RDSKCALSL 183
Query: 187 ENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSGVTHVFLHDAGE- 245
+ P +VYET+WD+IM+DAP GY+ EAPGRM+A+++A ++ARNR G T VF+HD
Sbjct: 184 KGFPADVYETQWDVIMVDAPTGYHDEAPGRMSAIYTAGLLARNRYDGGETDVFVHDINRP 243
Query: 246 --------FLCKKNLVKGVGRLWHFQIPPSYNRTDAKSFC 277
FLC + + GRL HF I PS+ + FC
Sbjct: 244 VEDEFSVAFLCGGYMKEQQGRLRHFNI-PSHRASFGTPFC 282
>AT1G09610.1 | Symbols: | Protein of unknown function (DUF579) |
chr1:3111789-3112637 FORWARD LENGTH=282
Length = 282
Score = 199 bits (507), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 145/228 (63%), Gaps = 24/228 (10%)
Query: 66 RAILHYATSRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFL 125
+A++HY+TS +TPQQ++ EI +S VL P NFLVFGLGHDSLMW+S+N GG T+FL
Sbjct: 62 QALIHYSTSVITPQQTLKEIAVSSRVLGKKS-PCNFLVFGLGHDSLMWSSLNYGGRTVFL 120
Query: 126 EEDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACS----PNKAFLRGNRA 181
EED W++ + + P L ++ V Y +++ +A L+ + P C+ P +
Sbjct: 121 EEDEAWIKQIKRRFPMLESYHVTYDSKVNQADNLIEVGKG-PECTAIGDPRYSM------ 173
Query: 182 CRLALENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSGVTHVFLH 241
C+LAL+ LP E+YET WDLIM+DAP GYY EAPGRM A+++A +MARNRK G T VF+H
Sbjct: 174 CQLALKGLPAEIYETGWDLIMVDAPTGYYDEAPGRMTAIYTAGMMARNRKQGGETDVFVH 233
Query: 242 DAGE---------FLCKKNLVKGVGRLWHFQIPPSYNRTDAKS---FC 277
D FLC+ + K GRL HF IP + ++++S FC
Sbjct: 234 DVNREIEDKFSKAFLCEGYMKKQEGRLRHFIIPSYRDGSESESNRPFC 281
>AT1G33800.1 | Symbols: | Protein of unknown function (DUF579) |
chr1:12261480-12262456 FORWARD LENGTH=297
Length = 297
Score = 199 bits (507), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 139/220 (63%), Gaps = 15/220 (6%)
Query: 67 AILHYATSRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFLE 126
A++HY TS VTPQQ+ E+ +S VL P NFLVFGLGHDSLMWAS+N GG TLF+E
Sbjct: 76 ALVHYVTSNVTPQQTFDEVSVSKRVLDKKS-PCNFLVFGLGHDSLMWASLNHGGRTLFIE 134
Query: 127 EDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACSPNKAFLRGNRACRLAL 186
ED W+ V K P L ++ V Y T+++++ L+ RSE S + N C LAL
Sbjct: 135 EDQAWIAIVTKKFPNLESYHVVYDTKVKDSDKLMELGRSEECRSVSDP---RNSKCDLAL 191
Query: 187 ENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSGVTHVFLHD---- 242
++ P + YET+WDLIM+DAP GY+ EAPGRM+A+++A ++ARNR+ G T VF+HD
Sbjct: 192 KDFPADFYETKWDLIMVDAPTGYHEEAPGRMSAIYTAGLLARNRE-DGETDVFVHDVNRP 250
Query: 243 -----AGEFLCKKNLVKGVGRLWHFQIPPSYNRTDAKSFC 277
+ FLCK + + GRL HF IP R + FC
Sbjct: 251 VEDEFSATFLCKGYMREQNGRLRHFTIPSHRARA-GRPFC 289
>AT2G15440.1 | Symbols: | Protein of unknown function (DUF579) |
chr2:6743792-6744781 REVERSE LENGTH=329
Length = 329
Score = 159 bits (403), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 137/228 (60%), Gaps = 18/228 (7%)
Query: 65 LRAILHYATSRVTPQQSVSEIKISF--DVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNT 122
L A+LHY TS P S+S +++S +++ S G N L+FGL H+SL+W S+N G T
Sbjct: 79 LAALLHY-TSSSPPNTSMSFLELSTISNIIHSHGPACNLLIFGLTHESLLWRSINFQGRT 137
Query: 123 LFLEEDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPACSPNKAFLRGNRAC 182
+F++E P V + PG+ A+ V Y T++ +A LL ++ P C P + L + C
Sbjct: 138 VFVDESPYSVSKFEQSNPGVEAYEVVYSTKVSQAKKLLGYYKTRPECRPVQNLLFSD--C 195
Query: 183 RLALENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRK---GSGVTHVF 239
+L + +LP+ VYE +WD+I+ID P+GY +++PGRMA +F++AV+A+++ + T V
Sbjct: 196 KLGINDLPNFVYEIDWDVILIDGPRGYASDSPGRMAPIFTSAVLAKSKDFGTKTKKTDVL 255
Query: 240 LHDAG---------EFLCKKNLVKGVGRLWHFQIPPSYNRTD-AKSFC 277
+H+ G EFLC++NL++ VG L HF + + R FC
Sbjct: 256 VHEFGRKIERVYSEEFLCEENLIEVVGDLGHFVVAAAEERESYGDGFC 303
>AT5G67210.1 | Symbols: | Protein of unknown function (DUF579) |
chr5:26819019-26819972 FORWARD LENGTH=317
Length = 317
Score = 155 bits (393), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 147/269 (54%), Gaps = 20/269 (7%)
Query: 10 RRW---FLGLAIVGLIGAVLFIATAITASDRRFMCQLAPGIIKTRTQTGEDYNPTPIQLR 66
R W F+ + + +L+ +I +S + ++ + T PT +
Sbjct: 26 RLWLLAFVSFFTIAFLLTLLYTTDSIISS-KNNSATVSSAVNSAVTTATISQLPT-TAIN 83
Query: 67 AILHYAT-SRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFL 125
A+LHYA+ S + S E+K DVL+ P N LVFGL H++L+W S+N G T+F+
Sbjct: 84 AMLHYASRSNDSYHMSYGEMKSISDVLRRCSPPCNLLVFGLTHETLLWKSLNHNGRTVFI 143
Query: 126 EEDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSR--SEPACSPNKAFLRGNRACR 183
EE+ + + P + V+Y T+ REA L+S+ + + C P + L + C+
Sbjct: 144 EENRYYAAYFEEIHPEIEVFDVQYTTKAREARELVSAVKEAARNECRPVQNLLFSD--CK 201
Query: 184 LALENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSG-VTHVFLHD 242
L L +LP+ VY+ +WD+I++D P+G + PGRM+++F+AAV+AR++KG THVF+HD
Sbjct: 202 LGLNDLPNHVYDVDWDVILVDGPRGDGGDVPGRMSSIFTAAVLARSKKGGNPKTHVFVHD 261
Query: 243 ---------AGEFLCKKNLVKGVGRLWHF 262
EFLC++NLV+ L H+
Sbjct: 262 YYRDVERLCGDEFLCRENLVESNDLLAHY 290
>AT3G50220.1 | Symbols: | Protein of unknown function (DUF579) |
chr3:18617672-18618640 REVERSE LENGTH=322
Length = 322
Score = 151 bits (382), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 133/228 (58%), Gaps = 21/228 (9%)
Query: 65 LRAILHYAT-SRVTPQQSVSEIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTL 123
+ A+LHYA+ S + S E+K DVL+ P N LVFGL H++L+W S+N G T+
Sbjct: 89 INALLHYASRSNDSFHMSYGEMKSISDVLRRCAPPCNLLVFGLTHETLLWKSLNHNGRTV 148
Query: 124 FLEEDPKWVQTVLKDEPGLRAHTVRYRTQLREAHALLSSSRSEPA--CSPNKAFLRGNRA 181
F+EE+ + + P + V+Y T+ EA L+++++ C P + L +
Sbjct: 149 FIEENRYYAAYFEEIHPEIDVFDVQYTTKAHEAGELVTAAKEAAGNECRPVQNLLFSD-- 206
Query: 182 CRLALENLPDEVYETEWDLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRK-GSGVTHVFL 240
C+L L +LP+ VY+ +WD+I +D P+G E PGRM+++F+AAV+AR++K G+ THVF+
Sbjct: 207 CKLGLNDLPNHVYDVDWDVIFVDGPRGDAHEGPGRMSSIFTAAVLARSKKGGTPKTHVFV 266
Query: 241 HD---------AGEFLCKKNLVKGVGRLWHFQIPPSYNRTDAKS--FC 277
HD EFLC++NLV+ L H+ + ++ D S FC
Sbjct: 267 HDYYRDVERLCGDEFLCRENLVESNDLLAHYVL----DKMDKNSTKFC 310
>AT4G24910.1 | Symbols: | Protein of unknown function (DUF579) |
chr4:12817954-12818901 REVERSE LENGTH=315
Length = 315
Score = 135 bits (339), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 120/208 (57%), Gaps = 19/208 (9%)
Query: 84 EIKISFDVLKSMGRPSNFLVFGLGHDSLMWASMNPGGNTLFLEEDPKWVQTVLKDE---P 140
E+K+ D + P N LVFG LM +S+N G T+ LE++P + + K E
Sbjct: 105 ELKLLSDTVTRRS-PCNILVFGFAPQYLMLSSINTRGITVILEDEPAKIM-IPKAEVNPN 162
Query: 141 GLRAHTVRY-RTQLREAHALLSSSRSEPACSPN-KAFLRGNRACRLALENLPDEVYETEW 198
R ++++Y + ++R A+ LL +R+ PAC+PN +G+ C+L L +LP +V+ T+W
Sbjct: 163 NTRIYSLKYHQMEVRNAYNLLQHARANPACAPNMNNQHQGSSDCKLELRDLPQQVHNTKW 222
Query: 199 DLIMIDAPKGYYAEAPGRMAAVFSAAVMARNRKGSGVTHVFLHDAG---------EFLCK 249
D+I++D P+G E PGRM ++++AAV+AR + T VF+HD EFLC+
Sbjct: 223 DVIVVDGPRGDDLETPGRMGSIYTAAVLARKGSSNSTTDVFVHDVHRTAEKWLSWEFLCQ 282
Query: 250 KNLVKGVGRLWHFQIPPSYNRTDAKSFC 277
+NLV G W F+I +++A FC
Sbjct: 283 ENLVSAKGTFWKFRI---KRQSNASRFC 307