Miyakogusa Predicted Gene
- Lj3g3v0461990.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0461990.1 Non Chatacterized Hit- tr|I1KZR5|I1KZR5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.7756
PE=,81.03,0,MOSC,Molybdenum cofactor sulfurase, C-terminal; SUBFAMILY
NOT NAMED,NULL; MOLYBDOPTERIN COFACTOR
SUL,NODE_3463_length_2872_cov_23.393454.path1.1
(803 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G16540.1 | Symbols: SIR3, LOS5, ABA3, ATABA3, ACI2 | molybden... 1014 0.0
AT5G51920.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent tr... 94 3e-19
AT1G30910.1 | Symbols: | Molybdenum cofactor sulfurase family p... 81 3e-15
AT5G44720.1 | Symbols: | Molybdenum cofactor sulfurase family p... 80 8e-15
AT4G37100.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent tr... 79 2e-14
AT2G23520.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent tr... 76 8e-14
AT5G66950.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent tr... 75 1e-13
>AT1G16540.1 | Symbols: SIR3, LOS5, ABA3, ATABA3, ACI2 | molybdenum
cofactor sulfurase (LOS5) (ABA3) | chr1:5659465-5665201
FORWARD LENGTH=819
Length = 819
Score = 1014 bits (2622), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/820 (60%), Positives = 616/820 (75%), Gaps = 26/820 (3%)
Query: 6 EEFLREFGEHYGYPNGPKTIDQIRATEFKRL-QDTVYLDHAGATLYSEMQMESVFKDLTS 64
E FL+EFG++YGYP+GPK I +IR TEFKRL + VYLDHAG+TLYSE+QME +FKD TS
Sbjct: 2 EAFLKEFGDYYGYPDGPKNIQEIRDTEFKRLDKGVVYLDHAGSTLYSELQMEYIFKDFTS 61
Query: 65 NVYGNPHSQSDSSSATSEIVSAARQQVLKYCNASPKDYTCIFTSGATAALKLVGEAFPWS 124
NV+GNPHSQSD SSATS++++ AR QVL+Y NASP+DY+C+FTSGATAALKLVGE FPW+
Sbjct: 62 NVFGNPHSQSDISSATSDLIADARHQVLEYFNASPEDYSCLFTSGATAALKLVGETFPWT 121
Query: 125 CNSNFMYTMENHNSVLGIREYALGQGATAVSVDIEEDVHP--RITRETVSTKISLHQEQR 182
+SNF+YTMENHNSVLGIREYAL QGA+A +VDIEE + ++T S K+ Q
Sbjct: 122 QDSNFLYTMENHNSVLGIREYALAQGASACAVDIEEAANQPGQLTNSGPSIKVKHRAVQM 181
Query: 183 RKVVGLPEGEPTGDVYNLFAFPSECNFSGLRFDLDLVNIIKEDS---------SRNKQWL 233
R L + E G+ YNLFAFPSECNFSGLRF+LDLV ++KE++ S++K+W+
Sbjct: 182 RNTSKLQKEESRGNAYNLFAFPSECNFSGLRFNLDLVKLMKENTETVLQGSPFSKSKRWM 241
Query: 234 VLIDAAKGCATMPPDLSKYPADFVAISFYKLFGYPTGLGALIVQNDAAKLLKKTYFSGGT 293
VLIDAAKGCAT+PPDLS+YPADFV +SFYKLFGYPTGLGAL+V+NDAAKLLKKTYFSGGT
Sbjct: 242 VLIDAAKGCATLPPDLSEYPADFVVLSFYKLFGYPTGLGALLVRNDAAKLLKKTYFSGGT 301
Query: 294 VAASIADIDFVKRREGIEELFEDGTVSFLSIASIRQGFKILDSLTVSAISRHTTSLALYA 353
VAASIADIDFVKRRE +EE FEDG+ SFLSIA+IR GFK+L SLT SAI HTTSL++Y
Sbjct: 302 VAASIADIDFVKRRERVEEFFEDGSASFLSIAAIRHGFKLLKSLTPSAIWMHTTSLSIYV 361
Query: 354 RKMLLALRHENGSNLCILYGRHNSMGHRYEMGPIVSFNLKRPDGSWYGYREVEKLASLSG 413
+K L ALRH NG+ +C+LYG N ++ GP V+FNLKRPDGSW+GY EVEKLASLSG
Sbjct: 362 KKKLQALRHGNGAAVCVLYGSENLELSSHKSGPTVTFNLKRPDGSWFGYLEVEKLASLSG 421
Query: 414 IQLRTGCFCNPGACAKYLGLSHVDLITNTEAGHVCWDDHDIINGKPVGAVRISFGYMSTY 473
IQLRTGCFCNPGACAKYL LSH +L +N EAGH+CWDD+D+INGKP GAVR+SFGYMST+
Sbjct: 422 IQLRTGCFCNPGACAKYLELSHSELRSNVEAGHICWDDNDVINGKPTGAVRVSFGYMSTF 481
Query: 474 DDAKKFVDFVASSFMSPQNHIEHGNQLKG---------LEKGVLDASYYLKSITIYPIKS 524
+DAKKF+DF+ SSF SP +G + G LE S+YLKSIT+YPIKS
Sbjct: 482 EDAKKFIDFIISSFASPPKKTGNGTVVSGRFPQLPSEDLESKESFPSHYLKSITVYPIKS 541
Query: 525 CGGFSPKSWPLSDNGLKHDREWVLKSLSGEILSLKKVPEMGFISPFIDLSQGMLFVESPR 584
C GFS WPL GL HDREW+++ L+GEIL+ KKVPEM I FIDL +G+L VES R
Sbjct: 542 CAGFSVIRWPLCRTGLLHDREWMVQGLTGEILTQKKVPEMSLIKTFIDLEEGLLSVESSR 601
Query: 585 CKERLQIRLES-VYDGEIEDIQLYGQRYKVYNYSTETNAWFSEAVGKPCTLLRYSSSNHD 643
C+++L IR++S Y+ ++ + + N T N WF+ A+G+ C LLRYSSS
Sbjct: 602 CEDKLHIRIKSDSYNPRNDEFDSHANILENRNEETRINRWFTNAIGRQCKLLRYSSSTSK 661
Query: 644 FVLKKTKGAVTCRDARSAVSFANEAQXXXXXXXXXXDLNRRLNSDSGVQKSIGGTTMQVN 703
L + K CRD S ++FANEAQ DLNRRL + K ++N
Sbjct: 662 DCLNRNKSPGLCRDLESNINFANEAQFLLISEESVADLNRRLEAKDEDYKR---AHEKLN 718
Query: 704 ASRFRPNLVVFGGRPYDEDEWSDIRIGNKYFRSLGGCNRCQIINLALSDGQVQKSKEPLA 763
RFRPNLV+ GG PY ED+W ++IG+ +F SLGGCNRCQ+IN++ G V+KS EPL
Sbjct: 719 PHRFRPNLVISGGEPYGEDKWKTVKIGDNHFTSLGGCNRCQMINISNEAGLVKKSNEPLT 778
Query: 764 TLASYRRVKGKIVFGILLKYDGEQQQGESWLHAGQDVHPE 803
TLASYRRVKGKI+FG LL+Y+ ++++ + W+ G++V+P+
Sbjct: 779 TLASYRRVKGKILFGTLLRYEIDEKR-QCWIGVGEEVNPD 817
>AT5G51920.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent
transferases superfamily protein |
chr5:21097202-21098914 REVERSE LENGTH=570
Length = 570
Score = 94.4 bits (233), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 119/499 (23%), Positives = 205/499 (41%), Gaps = 69/499 (13%)
Query: 17 GYPNGPKT--IDQIRATEFKRLQDTVY--LDHAGATLYSEMQM---------------ES 57
YPN T ID++R+ + L + Y LD+ G LYS Q+ ES
Sbjct: 82 AYPNYSDTYKIDRLRSDHYFHLGLSHYTCLDYIGIGLYSYSQLLNYDPSTYQISSSLSES 141
Query: 58 VFKDLTSNVYGNPHSQSDSSSATSEIVSAARQQVLKYCNASPKDYTCIFTSGATAALKLV 117
F ++ + + +E + +++++ + S +DY+ +FT+ T+A +LV
Sbjct: 142 PFFSVSPKIGNLKEKLLNDGGQETEFEYSMKRRIMGFLKISEEDYSMVFTANRTSAFRLV 201
Query: 118 GEAFPWSCNSNFM--YTMENHNSVLGIREYALGQGATAVSVDIEEDVHPRITRETVSTKI 175
E++P++ + Y E+ +V I + +GA + + PR +
Sbjct: 202 AESYPFNSKRKLLTVYDYESE-AVSEINRVSEKRGAKVAAAEFS---WPR---------L 248
Query: 176 SLHQEQRRKVVGLPEGEPTGDVYNLFAFPSECNFSGLRFDLDLVNIIKEDSSRNKQWLVL 235
L + RK+V + ++ FP +G R+ +++ +E+ W V+
Sbjct: 249 KLCSSKLRKLVTAGKNGSKTKKKGIYVFPLHSRVTGSRYPYLWMSVAQENG-----WHVM 303
Query: 236 IDAAKGCATMPPD-----LSKYPADFVAISFYKLFG-YPTGLGALIVQNDAAKLLKKTYF 289
IDA C P D LS Y DF+ SFYK+FG P+G G L V+ +L+
Sbjct: 304 IDA---CGLGPKDMDSFGLSIYNPDFMVCSFYKVFGENPSGFGCLFVKKSTISILES--- 357
Query: 290 SGGTVAASIADIDFVKRREGIE----ELFEDGTVSFLSIASIRQGFKILDSLTVSAISRH 345
S G ++ D +E + + T SF S + G +DSL + A
Sbjct: 358 STGPGMINLVPTDNPISLHALEINRTQTDSEETYSFSSSVEYK-GLDHVDSLGLVATGNR 416
Query: 346 TTSLALYARKMLLALRHENGSNLCILYGRHNSMGHRYEMGPIVSFNLKRPDGSWYGYREV 405
+ L + L L+H S L +YG + + GP V+FNL G V
Sbjct: 417 SRCLINWLVSALYKLKHSTTSRLVKIYGPKVN----FNRGPAVAFNLFNHKGEKIEPFIV 472
Query: 406 EKLASLSGIQLRTGCFCNPGACAKYLGLSHVDLITNTEAGHVCWDDHDIINGKPVGAVRI 465
+KLA S I L N Y G+ D + + + D+ + + +
Sbjct: 473 QKLAECSNISLGKSFLKNILFQEDYEGVK--DRVFEKKR------NRDVDEPR-ISVLTA 523
Query: 466 SFGYMSTYDDAKKFVDFVA 484
+ G+++ ++D K FVA
Sbjct: 524 ALGFLANFEDVYKLWIFVA 542
>AT1G30910.1 | Symbols: | Molybdenum cofactor sulfurase family
protein | chr1:11000912-11002801 FORWARD LENGTH=318
Length = 318
Score = 80.9 bits (198), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 123/289 (42%), Gaps = 52/289 (17%)
Query: 514 LKSITIYPIKSCGGFSPKSWPLSDNGLKHDREWVLKSLSGEILSLKKVPEMGFISPFIDL 573
+ S+ +YPIKSC G S L+ G + DR W++ + G L+ + P++ I +++
Sbjct: 21 VSSLFVYPIKSCRGISLSQAALTPTGFRWDRNWLIVNSKGRGLTQRVEPKLSLIE--VEM 78
Query: 574 SQ------------GMLFVESPRCKERLQIRL---ESVYDGEIEDIQLYGQRYKVYNYST 618
+ + V +P + L++ L + + DG + ++ +
Sbjct: 79 PKHAFGEDWEPEKSSNMVVRAPGM-DALKVSLAKPDKIADG----VSVWEWSGSALDEGE 133
Query: 619 ETNAWFSEAVGKPCTLLRYSSSNHDFVLKKTKGAVTCRDARSAVSFANEAQXXXXXXXXX 678
E + WF+ VGKPC L+R++S+ +T+ F++
Sbjct: 134 EASQWFTNFVGKPCRLVRFNSA------YETRPVDPNYAPGHIAMFSDMYPFLLISQGSL 187
Query: 679 XDLNRRLNSDSGVQKSIGGTTMQVNASRFRPNLVVFGGRPYDEDEWSDIRIGNKYFRSLG 738
LN+ L V +RFRPN+ V G P+ ED W++I I F +
Sbjct: 188 DSLNKLLKE-------------PVPINRFRPNIFVDGCEPFAEDLWTEILINGFTFHGVK 234
Query: 739 GCNRCQIINLALSDGQVQKSKEPLATLASYRRVK---------GKIVFG 778
C+RC++ ++ G +EP+ TL ++R K GKI FG
Sbjct: 235 LCSRCKVPTISQETG--IGGQEPIETLRTFRSDKVLQPKSKPHGKIYFG 281
>AT5G44720.1 | Symbols: | Molybdenum cofactor sulfurase family
protein | chr5:18043086-18045275 FORWARD LENGTH=308
Length = 308
Score = 79.7 bits (195), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 134/308 (43%), Gaps = 49/308 (15%)
Query: 508 LDASYYLKSITIYPIKSCGGFSPKSWPLSDNGLKHDREWVLKSLSGEILSLK-------- 559
++ ++S+ IYPIKSC G S ++ G + DR W++ + G + +
Sbjct: 1 MEEGLKIQSLVIYPIKSCRGISVPQATVTHTGFQWDRYWLVVNYKGRAYTQRVEPTLALV 60
Query: 560 --KVPEMGFISPFIDLSQGMLFVESPRCKERLQI---RLESVYDGEIEDIQLYGQRYKVY 614
++P+ F+ + + +L + +P L+I R SV +G + ++ +
Sbjct: 61 ESELPKEAFLEDWEPTNDSLLVIRAPGMSP-LKIPLTRPSSVAEG----VSMWEWSGSAF 115
Query: 615 NYSTETNAWFSEAVGKPCTLLRYSSSNHDFVLKKTKGAVTCRDARSAVSFANEAQXXXXX 674
+ E WFS+ +GK L+R+ N D +T+ + A + +F +
Sbjct: 116 DEGEEAAKWFSDYLGKQSRLVRF---NKD---TETRPSPPEFAAGYSTTFMDMFPFLVAS 169
Query: 675 XXXXXDLNRRLNSDSGVQKSIGGTTMQVNASRFRPNLVVFGGRPYDEDEWSDIRIGNKYF 734
LN L V +RFRPN++V P+ ED W +I+I + F
Sbjct: 170 QGSLDHLNTLLPE-------------PVPINRFRPNILVDNCDPFGEDLWDEIKINDLVF 216
Query: 735 RSLGGCNRCQIINLALSDGQVQKSKEPLATLASYR---------RVKGKIVFG--ILLKY 783
+ + C+RC++ + G + K+ EP TL +R + +GK+ FG ++ +
Sbjct: 217 QGVRLCSRCKVPTVNQETGVMGKA-EPTETLMKFRSDNVLMPDKKPRGKVFFGKEMVWNW 275
Query: 784 DGEQQQGE 791
+ +GE
Sbjct: 276 NLTNTEGE 283
>AT4G37100.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent
transferases superfamily protein |
chr4:17479173-17481863 REVERSE LENGTH=896
Length = 896
Score = 78.6 bits (192), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/289 (25%), Positives = 131/289 (45%), Gaps = 36/289 (12%)
Query: 4 EKEEFLREFGEHYGYPNGPKTIDQIRATEFKRLQDT---VYLDHAGATLYSEMQMESVFK 60
E E L +F Y + IDQ+R+ E+ L + V LD+ G L+S +Q +
Sbjct: 114 ELREALTKFLSMYPKYQASEKIDQLRSDEYSHLSSSASKVCLDYCGFGLFSYVQTLHYWD 173
Query: 61 DLT---SNVYGNPHSQSDSSSATSEIVSA-ARQQVLKYCNASPKDYTCIFTSGATAALKL 116
T S + N + + A S V + +++ Y N +Y +FT +A +L
Sbjct: 174 TCTFSLSEITANLSNHALYGGAESGTVEHDIKTRIMDYLNIPENEYGLVFTVSRGSAFRL 233
Query: 117 VGEAFPWSCNSNFMYTMENH--NSVLGIREYALGQGATAVSVDIEEDVHPRITRETVSTK 174
+ E++P+ N + TM +H SV + + A +GA A + + P + + K
Sbjct: 234 LAESYPFQSNKRLL-TMFDHESQSVNWMAQTAREKGAKAYNAWFK---WPTLKLCSTDLK 289
Query: 175 ISLHQEQRRKVVGLPEGEPTGDVYNLFAFPSECNFSGLRFDLDLVNIIKEDSSRNKQWLV 234
L ++R+K LF FP++ +G ++ + + +++ W V
Sbjct: 290 KRLSYKKRKK---------KDSAVGLFVFPAQSRVTGTKYSYQWMALAQQN-----HWHV 335
Query: 235 LIDAAKGCATMPPD-----LSKYPADFVAISFYKLFGY-PTGLGALIVQ 277
L+DA + P D LS + +F+ SFY++FG+ PTG G L+++
Sbjct: 336 LLDAG---SLGPKDMDSLGLSLFRPEFIITSFYRVFGHDPTGFGCLLIK 381
>AT2G23520.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent
transferases superfamily protein |
chr2:10014256-10016943 REVERSE LENGTH=895
Length = 895
Score = 76.3 bits (186), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 126/268 (47%), Gaps = 34/268 (12%)
Query: 23 KTIDQIRATEFKRLQDT-VYLDHAGATLYSEMQMESVFKDLT---SNVYGNPHSQSDSSS 78
+ +DQ+R+ E+ L D+ V LD+ G L+S +Q + T S + N + +
Sbjct: 131 EKVDQLRSDEYGHLLDSKVCLDYCGFGLFSYVQTLHYWDSCTFSLSEITANLSNHALYGG 190
Query: 79 ATSEIVSA-ARQQVLKYCNASPKDYTCIFTSGATAALKLVGEAFPWSCNSNFMYTMENH- 136
A V + +++ Y N +Y +FT +A +L+ E++P+ N + TM +H
Sbjct: 191 AEIGTVEHDLKTRIMDYLNIPESEYGLVFTGSRGSAFRLLAESYPFHTNKRLL-TMFDHE 249
Query: 137 -NSVLGIREYALGQGATAVSVDIEEDVHPRITRETVSTKISLHQEQRRKVVGLPEGEPTG 195
SV + + A +GA A + + P + + K L ++R+K
Sbjct: 250 SQSVNWMAQTAREKGAKAYNAWFK---WPTLKLCSTDLKKRLSHKKRKK---------KD 297
Query: 196 DVYNLFAFPSECNFSGLRFDLDLVNIIKEDSSRNKQWLVLIDAAKGCATMPPD-----LS 250
LF FP++ +G ++ + + ++++ W VL+DA + P D LS
Sbjct: 298 SAVGLFVFPAQSRVTGSKYSYQWMALAQQNN-----WHVLLDAG---SLGPKDMDSLGLS 349
Query: 251 KYPADFVAISFYKLFGY-PTGLGALIVQ 277
+ +F+ SFYK+FG+ PTG G L+++
Sbjct: 350 LFRPEFIITSFYKVFGHDPTGFGCLLIK 377
>AT5G66950.1 | Symbols: | Pyridoxal phosphate (PLP)-dependent
transferases superfamily protein |
chr5:26733333-26735945 FORWARD LENGTH=870
Length = 870
Score = 75.5 bits (184), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 34/295 (11%)
Query: 4 EKEEFLREFGEHYGYPNGPKTIDQIRATEFKRLQ-DTVYLDHAGATLYSEMQMESVFKDL 62
E EE L F Y + +D++R E+ L V LD+ G L+S +Q +
Sbjct: 110 ELEEALTIFLTMYPKYQSSEKVDELRNDEYFHLSLPKVCLDYCGFGLFSYLQTVHYWDTC 169
Query: 63 T---SNVYGNPHSQSDSSSATS-EIVSAARQQVLKYCNASPKDYTCIFTSGATAALKLVG 118
T S + N + + A I + +++ Y N +Y +FT +A KL+
Sbjct: 170 TFSLSEISANLSNHAIYGGAEKGSIEHDIKIRIMDYLNIPENEYGLVFTVSRGSAFKLLA 229
Query: 119 EAFPWSCNSNFMYTMENH--NSVLGIREYALGQGATAVSVDIEEDVHPRITRETVSTKIS 176
E++P+ N + TM +H SV + + A +GA S + P + ++ K
Sbjct: 230 ESYPFHTNKKLL-TMFDHESQSVSWMGQCAKEKGAKVGSAWFK---WPTLRLCSMDLKKE 285
Query: 177 LHQEQRRKVVGLPEGEPTGDVYNLFAFPSECNFSGLRFDLDLVNIIKEDSSRNKQWLVLI 236
+ +++RK + TG LF FP + +G ++ + + ++++ W VL+
Sbjct: 286 ILSKKKRK-----KDSATG----LFVFPVQSRVTGSKYSYQWMALAQQNN-----WHVLL 331
Query: 237 DAAKGCATMPPD-----LSKYPADFVAISFYKLFGY-PTGLGALIVQNDAAKLLK 285
DA A P D LS + DF+ SFY++FGY PTG G L+++ L+
Sbjct: 332 DAG---ALGPKDMDSLGLSLFRPDFIITSFYRVFGYDPTGFGCLLIKKSVISCLQ 383