Miyakogusa Predicted Gene
- Lj4g3v2604780.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2604780.2 Non Chatacterized Hit- tr|D7LGY0|D7LGY0_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,27.42,4e-18,seg,NULL; DUF688,Protein of unknown function
DUF688,CUFF.51314.2
(569 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G29240.1 | Symbols: | Protein of unknown function (DUF688) |... 115 6e-26
AT2G30990.3 | Symbols: | Protein of unknown function (DUF688) |... 113 3e-25
AT2G30990.2 | Symbols: | Protein of unknown function (DUF688) |... 96 8e-20
AT2G30990.1 | Symbols: | Protein of unknown function (DUF688) |... 96 8e-20
AT4G18630.1 | Symbols: | Protein of unknown function (DUF688) |... 91 3e-18
AT5G45850.1 | Symbols: | Protein of unknown function (DUF688) |... 89 1e-17
AT2G34170.2 | Symbols: | Protein of unknown function (DUF688) |... 70 4e-12
AT2G34170.1 | Symbols: | Protein of unknown function (DUF688) |... 70 4e-12
>AT1G29240.1 | Symbols: | Protein of unknown function (DUF688) |
chr1:10217023-10218924 REVERSE LENGTH=577
Length = 577
Score = 115 bits (289), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 121/397 (30%), Positives = 169/397 (42%), Gaps = 85/397 (21%)
Query: 2 MNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEKQQAIPHYKSDTTL------------ 49
M ERKLN + PL+S RR T+ +RN K Y S T+
Sbjct: 1 MEERKLNFSVPLLSTRRMQKTA--GVSVRRN---KSNNFTDYDSKTSECPSVPVLVPYMM 55
Query: 50 ---DQVTEPVTVPFNWEHIPGKPKGND--------------------------------- 73
D+VTEP +VPF WE PG+ KGND
Sbjct: 56 GLDDEVTEPASVPFTWEQAPGRLKGNDFKPQVCVLMKEEEQVFTPCLPPGKAVDANMTRL 115
Query: 74 ----GSIKNVEXXXXXXAYSDALETLSSTESFSMKCSVSGISGLDNLDANKCETGTSSTD 129
G +SDAL+TLS +SFS S+SG+S ++ K D
Sbjct: 116 QSSKGKQVEESDDDEDDVFSDALDTLSPKDSFSFNNSISGVSEYGGVETKK------PLD 169
Query: 130 KEAQDFMMSRFLTAANAMTLHPPQHVS-RKKSVLVEQPR-DFTKLVRVEKESFVNRHITD 187
+++DFMMSRFL AA AMT+ + S RK S + +P +LV EK+ NR+
Sbjct: 170 AQSRDFMMSRFLPAAKAMTVEQSHYASNRKPSTFMAEPTIQIRELVPGEKQQTPNRYDVS 229
Query: 188 IVPXXXX------XXXXXXXXXXXXXXXAHMSAKGCGLFPRSCIRNSLCLLNQVPGTKMV 241
IVP A++S +GCG+ P+ C ++SL +LN VPG K
Sbjct: 230 IVPSYYYHQDIDDEESKVGEEDDEVSEYAYLSKRGCGMLPQLCFKDSLGMLNTVPGFKAK 289
Query: 242 NQFPMFS-SYEVGKSKENSHIHSSRPAPAIKKAWDAIHKSKSSSGAAPPDMHGISSFRSN 300
+ P+ S S++ KS + + + R K A D++ K K S P +H SN
Sbjct: 290 HNSPITSPSHDQVKSSKVAQL-KYRFQSVKKLALDSVSKHKLSGKVHSP-VHP-----SN 342
Query: 301 ESKRYAYSGELKQVGRISS-FRYSRTIAASGTSSFRS 336
K + S + R SS +R+SR + S FRS
Sbjct: 343 GKKFNSESNLISAAKRSSSPYRHSRCM-----SPFRS 374
Score = 51.6 bits (122), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 42/73 (57%), Gaps = 8/73 (10%)
Query: 500 WLCRALPLVSLKNSFPYSNQGTPLHAKRQG---NRRASSFTKWETIVKTSNMHHDHAHSS 556
WLC LP V+ + + S + P + ++Q N R TKWETIVKTS MH DH S
Sbjct: 510 WLCHNLPSVTSQIT---SRRYHPFNPQKQDLTENYRNG--TKWETIVKTSYMHRDHIRYS 564
Query: 557 QELVIYKSQHSKS 569
+ELV + S SK+
Sbjct: 565 EELVAHTSHQSKT 577
>AT2G30990.3 | Symbols: | Protein of unknown function (DUF688) |
chr2:13187776-13189761 REVERSE LENGTH=563
Length = 563
Score = 113 bits (283), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/277 (32%), Positives = 125/277 (45%), Gaps = 49/277 (17%)
Query: 1 MMNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEKQQAI-----PHYKSDTTLDQVTEP 55
MM E++L+ N PL+S+RR + TS ++E K + + + P YKSD V P
Sbjct: 7 MMEEKQLDFNRPLISIRRPTQTS--ESESKTRSFDSVTNMIPPSPPVYKSDIKSGPVRNP 64
Query: 56 VTVPFNWEHIPGKPKGN------------------DGSIKNVE--XXXXXXAYSDALETL 95
TVPF WEH PGKPK G + VE +T+
Sbjct: 65 GTVPFQWEHKPGKPKDERKPVLQSFVEPHFVPKLPPGRERVVELGRKPESTGADHQTKTV 124
Query: 96 SSTESFSMK-------CSVSGISGLDNLDANKCETGTSSTDKEAQDFMMSRFLTAANAMT 148
SS++ + ++ +VSG SGLD GT STD++ QD MM RFL AA A+T
Sbjct: 125 SSSDKYLVEDAKSNSSRAVSGASGLDGSGILVEPFGTLSTDRQTQDLMMGRFLPAAKALT 184
Query: 149 LHPPQHVSRKKSVLVEQPRDFTKLV--RVEKESFVNRHITDIVPXXXXXXXXXXXXXXXX 206
P H++RK E + K +VE+ + RH D
Sbjct: 185 SESPPHLARKPPKPEEPVKQLMKKKQNKVEQNPYRFRHSPD-------------QEEEDG 231
Query: 207 XXXAHMSAKGCGLFPRSCIRNSLCLLNQVPGTKMVNQ 243
+ M++ CGL P+ C+R+SL LLN VP +M Q
Sbjct: 232 NISSMMASGVCGLLPQLCLRSSLGLLNPVPSVRMQQQ 268
>AT2G30990.2 | Symbols: | Protein of unknown function (DUF688) |
chr2:13187776-13189761 REVERSE LENGTH=593
Length = 593
Score = 95.5 bits (236), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 79/159 (49%), Gaps = 16/159 (10%)
Query: 88 YSDALETLSSTESFSMKCS-VSGISGLDNLDANKCETGTSSTDKEAQDFMMSRFLTAANA 146
Y DA +TLS TESF CS VSG SGLD GT STD++ QD MM RFL AA A
Sbjct: 153 YLDATDTLSRTESFFFNCSAVSGASGLDGSGILVEPFGTLSTDRQTQDLMMGRFLPAAKA 212
Query: 147 MTLHPPQHVSRKKSVLVEQPRDFTKLV--RVEKESFVNRHITDIVPXXXXXXXXXXXXXX 204
+T P H++RK E + K +VE+ + RH D
Sbjct: 213 LTSESPPHLARKPPKPEEPVKQLMKKKQNKVEQNPYRFRHSPD-------------QEEE 259
Query: 205 XXXXXAHMSAKGCGLFPRSCIRNSLCLLNQVPGTKMVNQ 243
+ M++ CGL P+ C+R+SL LLN VP +M Q
Sbjct: 260 DGNISSMMASGVCGLLPQLCLRSSLGLLNPVPSVRMQQQ 298
Score = 58.9 bits (141), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 44/75 (58%), Gaps = 7/75 (9%)
Query: 1 MMNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEKQQAI-----PHYKSDTTLDQVTEP 55
MM E++L+ N PL+S+RR + TS ++E K + + + P YKSD V P
Sbjct: 7 MMEEKQLDFNRPLISIRRPTQTS--ESESKTRSFDSVTNMIPPSPPVYKSDIKSGPVRNP 64
Query: 56 VTVPFNWEHIPGKPK 70
TVPF WEH PGKPK
Sbjct: 65 GTVPFQWEHKPGKPK 79
>AT2G30990.1 | Symbols: | Protein of unknown function (DUF688) |
chr2:13187776-13189761 REVERSE LENGTH=593
Length = 593
Score = 95.5 bits (236), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 79/159 (49%), Gaps = 16/159 (10%)
Query: 88 YSDALETLSSTESFSMKCS-VSGISGLDNLDANKCETGTSSTDKEAQDFMMSRFLTAANA 146
Y DA +TLS TESF CS VSG SGLD GT STD++ QD MM RFL AA A
Sbjct: 153 YLDATDTLSRTESFFFNCSAVSGASGLDGSGILVEPFGTLSTDRQTQDLMMGRFLPAAKA 212
Query: 147 MTLHPPQHVSRKKSVLVEQPRDFTKLV--RVEKESFVNRHITDIVPXXXXXXXXXXXXXX 204
+T P H++RK E + K +VE+ + RH D
Sbjct: 213 LTSESPPHLARKPPKPEEPVKQLMKKKQNKVEQNPYRFRHSPD-------------QEEE 259
Query: 205 XXXXXAHMSAKGCGLFPRSCIRNSLCLLNQVPGTKMVNQ 243
+ M++ CGL P+ C+R+SL LLN VP +M Q
Sbjct: 260 DGNISSMMASGVCGLLPQLCLRSSLGLLNPVPSVRMQQQ 298
Score = 58.9 bits (141), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 44/75 (58%), Gaps = 7/75 (9%)
Query: 1 MMNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEKQQAI-----PHYKSDTTLDQVTEP 55
MM E++L+ N PL+S+RR + TS ++E K + + + P YKSD V P
Sbjct: 7 MMEEKQLDFNRPLISIRRPTQTS--ESESKTRSFDSVTNMIPPSPPVYKSDIKSGPVRNP 64
Query: 56 VTVPFNWEHIPGKPK 70
TVPF WEH PGKPK
Sbjct: 65 GTVPFQWEHKPGKPK 79
>AT4G18630.1 | Symbols: | Protein of unknown function (DUF688) |
chr4:10257178-10258734 FORWARD LENGTH=479
Length = 479
Score = 90.5 bits (223), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 120/284 (42%), Gaps = 53/284 (18%)
Query: 2 MNERKLNINAPLMSMRRYSGT--SPSKTEQKRNTLE----------KQQAIPHYKSDTTL 49
M +KLN+ APL S+RR S+ E K+ T+ KQ+ D +
Sbjct: 1 MEGKKLNLYAPLPSIRRIPSMIERSSELENKKTTITRPELRSCPSTKQETPVFVLPDQSF 60
Query: 50 DQVTEPVTVPFNWEHIPGKPKGNDGSI-----KNVEXXXXXXAYSDALETLSSTESFSMK 104
D +TEP ++PF WE IPGKPK + ++ + L+T+SS SFS+
Sbjct: 61 DHLTEPASIPFMWEQIPGKPKDDMATLIQESGLLETDDEEEDEVDEDLDTVSSNVSFSVN 120
Query: 105 CSVSGISGLDNL-DANKCETGTSSTDKEAQDFMMSRFLTAANAMTLHPPQ-HVSRKKSVL 162
CS SG+S ++ + + C+ + +E+ D MMSRFL AA AM L Q H S S
Sbjct: 121 CSTSGVSEIEKTGERSDCDEKS----RESLDLMMSRFLPAAKAMALQTHQKHQSSYNS-- 174
Query: 163 VEQPRDFTKLVRVEKESFVNRHITDIVPXXXXXXXXXXXXXXXXXXX------------- 209
EQ KL+ +E+ V R + +V
Sbjct: 175 SEQ-----KLITQNREALVARQRSQLVAEHEHFAIVQSLYDDLNIDDDTEDDENDDDGDD 229
Query: 210 ----------AHMSAKGCGLFPRSCIRNSLCLLNQVPGTKMVNQ 243
A ++ K CG PR C +NS LN VP + N+
Sbjct: 230 YGHVDHKIYPAEVTKKACGFLPRLCAKNSFKFLNPVPLDSVSNR 273
Score = 49.7 bits (117), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 39/68 (57%), Gaps = 5/68 (7%)
Query: 500 WLCRAL-PLVSLKNSFPYSNQGTPLHAKRQGNRRASSFTKWETIVKTSNMHHDHAHSSQE 558
WL R L P V N PY + K+ N+ TKWETIVKTS +H+DHA SQE
Sbjct: 413 WLGRTLLPPV---NPRPYGVVLGQVGIKKL-NQEVLESTKWETIVKTSYVHNDHARYSQE 468
Query: 559 LVIYKSQH 566
L++Y S+
Sbjct: 469 LIVYPSRQ 476
>AT5G45850.1 | Symbols: | Protein of unknown function (DUF688) |
chr5:18599998-18601435 REVERSE LENGTH=445
Length = 445
Score = 88.6 bits (218), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 86/178 (48%), Gaps = 27/178 (15%)
Query: 2 MNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEK-----------------QQAIPHYK 44
M E+KLN++AP +S+RR T P + NT +K + +
Sbjct: 1 MEEKKLNLDAPFLSVRRVP-TKPDDPNESENTKKKTMTTRRREIKDSCQETENEILIRLL 59
Query: 45 SDTTLDQVTEPVTVPFNWEHIPGKPKGNDGSIKNVEXXXXXXAYSDALETLSSTESFSMK 104
D + + V EP +VPF WE PGKPK ++ + A+E +SST SFS+
Sbjct: 60 QDQSFEHVMEPSSVPFKWEQTPGKPKDKKNLVEESDLIK-------AMEMVSSTASFSVN 112
Query: 105 CSVSGISGLDNLDANKCETGTSSTDK--EAQDFMMSRFLTAANAMTLHPPQHVSRKKS 160
CS SG+S +N + S D E +D +MSRFL AA A++L + SR K+
Sbjct: 113 CSSSGVSEFENNGDGDRSSNVSRDDVILEYRDLIMSRFLPAAEAISLKMKKEASRVKA 170
>AT2G34170.2 | Symbols: | Protein of unknown function (DUF688) |
chr2:14428298-14430051 FORWARD LENGTH=523
Length = 523
Score = 70.1 bits (170), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 111/247 (44%), Gaps = 33/247 (13%)
Query: 2 MNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEKQQAIPHYKSDTTLDQVTEPVTV--- 58
M E++LN +APL+S RR K++ + ++ +P +D + + V+V
Sbjct: 1 MAEKQLNFDAPLLSTRR----------MKKSAISVRRNMPKQLTDESKTSESLSVSVLVQ 50
Query: 59 PFNWEHIPGKPKGNDGSIKNVEXXXXXXAYSDALETLSSTESFSMKCSVSGISGLDNLDA 118
+ +P P + + E +SDAL+TLS +K S+SG G++ +
Sbjct: 51 DMGLDQVPDSPLESKHVEEEAEESEDDDVFSDALDTLS------LKQSISGGGGVEAMKP 104
Query: 119 NKCETGTSSTDKEAQDFMMSRFLTAANAMTL-HPPQHVSRKKSV-LVEQP-RDFTKLVRV 175
S E FM+ RFL AA ++TL PPQ+ +++ + L+ +P R +V
Sbjct: 105 --------SMPSEDPQFMLDRFLPAAKSLTLEQPPQYSWKRQPLPLMSEPMRQIRDIVPA 156
Query: 176 EKESFVNRHITDIVPXXXX--XXXXXXXXXXXXXXXAHMSAKGCGLF-PRSCIRNSLCLL 232
E + R+ + P ++S +GCG+ P+ C +NSL +L
Sbjct: 157 ENRATPTRYESSFTPSYYQDIDDEESEEDSDDDEVSEYLSKRGCGMMSPQICFKNSLGML 216
Query: 233 NQVPGTK 239
+ V G K
Sbjct: 217 SSVNGLK 223
>AT2G34170.1 | Symbols: | Protein of unknown function (DUF688) |
chr2:14428298-14430051 FORWARD LENGTH=523
Length = 523
Score = 70.1 bits (170), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 111/247 (44%), Gaps = 33/247 (13%)
Query: 2 MNERKLNINAPLMSMRRYSGTSPSKTEQKRNTLEKQQAIPHYKSDTTLDQVTEPVTV--- 58
M E++LN +APL+S RR K++ + ++ +P +D + + V+V
Sbjct: 1 MAEKQLNFDAPLLSTRR----------MKKSAISVRRNMPKQLTDESKTSESLSVSVLVQ 50
Query: 59 PFNWEHIPGKPKGNDGSIKNVEXXXXXXAYSDALETLSSTESFSMKCSVSGISGLDNLDA 118
+ +P P + + E +SDAL+TLS +K S+SG G++ +
Sbjct: 51 DMGLDQVPDSPLESKHVEEEAEESEDDDVFSDALDTLS------LKQSISGGGGVEAMKP 104
Query: 119 NKCETGTSSTDKEAQDFMMSRFLTAANAMTL-HPPQHVSRKKSV-LVEQP-RDFTKLVRV 175
S E FM+ RFL AA ++TL PPQ+ +++ + L+ +P R +V
Sbjct: 105 --------SMPSEDPQFMLDRFLPAAKSLTLEQPPQYSWKRQPLPLMSEPMRQIRDIVPA 156
Query: 176 EKESFVNRHITDIVPXXXX--XXXXXXXXXXXXXXXAHMSAKGCGLF-PRSCIRNSLCLL 232
E + R+ + P ++S +GCG+ P+ C +NSL +L
Sbjct: 157 ENRATPTRYESSFTPSYYQDIDDEESEEDSDDDEVSEYLSKRGCGMMSPQICFKNSLGML 216
Query: 233 NQVPGTK 239
+ V G K
Sbjct: 217 SSVNGLK 223