Miyakogusa Predicted Gene
- Lj5g3v2298170.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2298170.1 Non Characterized Hit- tr|K4B0A6|K4B0A6_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,40.27,5e-19,seg,NULL; DUF1685,Protein of unknown function
DUF1685; FAMILY NOT NAMED,NULL,CUFF.57335.1
(129 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments:  Score (bits)  E value
AT1G05870.4 | Symbols: | Protein of unknown function (DUF1685) ... 163 3e-41
AT1G05870.3 | Symbols: | Protein of unknown function (DUF1685) ... 163 3e-41
AT1G05870.2 | Symbols: | Protein of unknown function (DUF1685) ... 163 3e-41
AT1G05870.1 | Symbols: | Protein of unknown function (DUF1685) ... 163 3e-41
AT2G31560.2 | Symbols: | Protein of unknown function (DUF1685) ... 160 2e-40
AT2G31560.1 | Symbols: | Protein of unknown function (DUF1685) ... 160 2e-40
AT2G43340.1 | Symbols: | Protein of unknown function (DUF1685) ... 155 6e-39
AT3G22690.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein of... 92 1e-19
AT4G33985.1 | Symbols: | Protein of unknown function (DUF1685) ... 80 4e-16
AT3G04700.1 | Symbols: | Protein of unknown function (DUF1685) ... 76 7e-15
AT3G04710.3 | Symbols: TPR10 | ankyrin repeat family protein | c... 75 2e-14
AT1G08790.1 | Symbols: | Protein of unknown function (DUF1685) ... 74 2e-14
AT5G28690.1 | Symbols: | Protein of unknown function (DUF1685) ... 70 3e-13
AT3G50350.2 | Symbols: | Protein of unknown function (DUF1685) ... 69 9e-13
AT2G15590.2 | Symbols: | Protein of unknown function (DUF1685) ... 67 3e-12
AT3G62070.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 60 5e-10
AT2G15610.1 | Symbols: | Protein of unknown function (DUF1685) ... 57 4e-09
AT2G15590.1 | Symbols: | Protein of unknown function (DUF1685) ... 56 5e-09
AT2G46940.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 55 1e-08
AT3G50350.1 | Symbols: | Protein of unknown function (DUF1685) ... 55 1e-08
AT4G01670.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 4e-08
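Each summary line above ends with the bit score and the E value, preceded by a truncated description that starts with the TAIR identifier. A minimal Python sketch for splitting such a line into fields follows. The function name `parse_summary_line` is hypothetical and not part of BLAST, and the handling of E values printed without a leading digit (such as `e-100`) is an assumption about legacy output that does not occur in this report.

```python
def parse_summary_line(line):
    """Split one hit-summary line into (hit_id, bit_score, e_value).

    Hypothetical helper for illustration; assumes the last two
    whitespace-separated fields are the bit score and the E value.
    """
    description, bits, evalue = line.rsplit(None, 2)
    # The identifier is everything before the first "|" separator.
    hit_id = description.split("|", 1)[0].strip()
    # Assumption: some legacy reports print very small E values as "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return hit_id, float(bits), float(evalue)


if __name__ == "__main__":
    example = ("AT1G05870.4 | Symbols: | Protein of unknown function "
               "(DUF1685) ... 163 3e-41")
    print(parse_summary_line(example))
```

Splitting from the right avoids having to interpret the description, which may itself contain numbers, pipes, and an ellipsis.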
>AT1G05870.4 | Symbols: | Protein of unknown function (DUF1685) |
chr1:1772454-1773228 REVERSE LENGTH=189
Length = 189
Score = 163 bits (412), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)
Query: 13 ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
E K ++LLEGYVE A DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51 ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110
Query: 63 RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
NTLPALELCYSMS KF IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170
Query: 111 KARLKFWAQAVACTVKLCS 129
KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189
>AT1G05870.3 | Symbols: | Protein of unknown function (DUF1685) |
chr1:1772454-1773228 REVERSE LENGTH=189
Length = 189
Score = 163 bits (412), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)
Query: 13 ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
E K ++LLEGYVE A DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51 ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110
Query: 63 RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
NTLPALELCYSMS KF IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170
Query: 111 KARLKFWAQAVACTVKLCS 129
KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189
>AT1G05870.2 | Symbols: | Protein of unknown function (DUF1685) |
chr1:1772454-1773228 REVERSE LENGTH=189
Length = 189
Score = 163 bits (412), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)
Query: 13 ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
E K ++LLEGYVE A DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51 ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110
Query: 63 RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
NTLPALELCYSMS KF IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170
Query: 111 KARLKFWAQAVACTVKLCS 129
KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189
>AT1G05870.1 | Symbols: | Protein of unknown function (DUF1685) |
chr1:1772454-1773228 REVERSE LENGTH=189
Length = 189
Score = 163 bits (412), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/139 (61%), Positives = 95/139 (68%), Gaps = 22/139 (15%)
Query: 13 ENKNKKLLLEGYVEEA----------DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
E K ++LLEGYVE A DL RSKSLTDDDLE+L+GC+DLGFGFSYDEIPEL
Sbjct: 51 ERKKSQVLLEGYVETASSSSVDDQKDDLTRSKSLTDDDLEDLRGCLDLGFGFSYDEIPEL 110
Query: 63 RNTLPALELCYSMSHKFXXXXXXXXXXXXX------------XXIANWKISSPGDHPEDV 110
NTLPALELCYSMS KF IANWKISSPGD+P+DV
Sbjct: 111 CNTLPALELCYSMSQKFLDDKQNKSPETSSVEDCPSPPLVTATPIANWKISSPGDNPDDV 170
Query: 111 KARLKFWAQAVACTVKLCS 129
KARLK+WAQAVACTV+LCS
Sbjct: 171 KARLKYWAQAVACTVQLCS 189
>AT2G31560.2 | Symbols: | Protein of unknown function (DUF1685) |
chr2:13436611-13437312 FORWARD LENGTH=202
Length = 202
Score = 160 bits (405), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 83/130 (63%), Positives = 92/130 (70%), Gaps = 13/130 (10%)
Query: 13 ENKNKKLLLEGYV--EEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALE 70
E K ++LLEGY ++ DL R+KSLTDDDLEELKGC+DLGFGFSYDEIPEL NTLPALE
Sbjct: 73 EKKKSQVLLEGYALDDQDDLTRAKSLTDDDLEELKGCLDLGFGFSYDEIPELCNTLPALE 132
Query: 71 LCYSMSHKF-----------XXXXXXXXXXXXXXXIANWKISSPGDHPEDVKARLKFWAQ 119
LCYSMS KF IANWKISSPGD P+DVKARLK+WAQ
Sbjct: 133 LCYSMSQKFLDDKQQNHHKSQEEDDSSPPPTTTAPIANWKISSPGDDPDDVKARLKYWAQ 192
Query: 120 AVACTVKLCS 129
VACTV+LCS
Sbjct: 193 TVACTVRLCS 202
>AT2G31560.1 | Symbols: | Protein of unknown function (DUF1685) |
chr2:13436611-13437312 FORWARD LENGTH=202
Length = 202
Score = 160 bits (405), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 83/130 (63%), Positives = 92/130 (70%), Gaps = 13/130 (10%)
Query: 13 ENKNKKLLLEGYV--EEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALE 70
E K ++LLEGY ++ DL R+KSLTDDDLEELKGC+DLGFGFSYDEIPEL NTLPALE
Sbjct: 73 EKKKSQVLLEGYALDDQDDLTRAKSLTDDDLEELKGCLDLGFGFSYDEIPELCNTLPALE 132
Query: 71 LCYSMSHKF-----------XXXXXXXXXXXXXXXIANWKISSPGDHPEDVKARLKFWAQ 119
LCYSMS KF IANWKISSPGD P+DVKARLK+WAQ
Sbjct: 133 LCYSMSQKFLDDKQQNHHKSQEEDDSSPPPTTTAPIANWKISSPGDDPDDVKARLKYWAQ 192
Query: 120 AVACTVKLCS 129
VACTV+LCS
Sbjct: 193 TVACTVRLCS 202
>AT2G43340.1 | Symbols: | Protein of unknown function (DUF1685) |
chr2:18007769-18008416 FORWARD LENGTH=189
Length = 189
Score = 155 bits (392), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 81/128 (63%), Positives = 92/128 (71%), Gaps = 15/128 (11%)
Query: 17 KKLLLEGYVEEA----DLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALELC 72
+LLEGYV ++ DL R+KSLTDDDLEELKGCVDLGFGF+Y+EIPEL NTLPALELC
Sbjct: 62 SNVLLEGYVVDSAVNDDLKRTKSLTDDDLEELKGCVDLGFGFNYEEIPELCNTLPALELC 121
Query: 73 YSMSHKFXXX-----------XXXXXXXXXXXXIANWKISSPGDHPEDVKARLKFWAQAV 121
YSMS KF IA+WKISSPGD+P+DVKARLKFWAQAV
Sbjct: 122 YSMSQKFIDQDHHHHSSSSPEKKSSVLDSPVSPIASWKISSPGDNPDDVKARLKFWAQAV 181
Query: 122 ACTVKLCS 129
ACTV+LC+
Sbjct: 182 ACTVRLCT 189
>AT3G22690.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1685 (InterPro:IPR012881),
Pentatricopeptide repeat (InterPro:IPR002885); BEST
Arabidopsis thaliana protein match is: Tetratricopeptide
repeat (TPR)-like superfamily protein
(TAIR:AT2G29760.1); Has 49784 Blast hits to 14716
proteins in 280 species: Archae - 2; Bacteria - 10;
Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0;
Other Eukaryotes - 904 (source: NCBI BLink). |
chr3:8021347-8024534 REVERSE LENGTH=938
Length = 938
Score = 91.7 bits (226), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/85 (60%), Positives = 56/85 (65%), Gaps = 4/85 (4%)
Query: 36 LTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXI 95
LTDDDLE LKGC+DLGFGF+YDEIP L TLPALELCYSMS K I
Sbjct: 843 LTDDDLEVLKGCLDLGFGFNYDEIPALCKTLPALELCYSMSQK---NLDDKHTPSLQLPI 899
Query: 96 ANWKISSPGDHPEDVKARLKFWAQA 120
+ S D+P+DVKARLK WAQA
Sbjct: 900 GRSLVPSC-DNPDDVKARLKCWAQA 923
>AT4G33985.1 | Symbols: | Protein of unknown function (DUF1685) |
chr4:16288301-16288857 REVERSE LENGTH=154
Length = 154
Score = 80.1 bits (196), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 69/136 (50%), Gaps = 22/136 (16%)
Query: 2 EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGF---SYDE 58
EAW RK+G + L RSKS+TD+DLEELKGC++LGFGF S D
Sbjct: 29 EAWLRKKGKQSLGR--------------LGRSKSVTDEDLEELKGCIELGFGFEPDSPDL 74
Query: 59 IPELRNTLPALELCYSMSHKFXXXXXXXX-----XXXXXXXIANWKISSPGDHPEDVKAR 113
P L TLPAL L +++ ++ ++ I GD PE +K R
Sbjct: 75 DPRLSETLPALGLYCAVNKQYSSRLSRTSSLSSIASEGENSNSSTTIVDQGDDPETMKLR 134
Query: 114 LKFWAQAVACTVKLCS 129
LK WAQ VAC+VK S
Sbjct: 135 LKQWAQVVACSVKQFS 150
>AT3G04700.1 | Symbols: | Protein of unknown function (DUF1685) |
chr3:1276948-1277607 FORWARD LENGTH=191
Length = 191
Score = 75.9 bits (185), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 48/139 (34%), Positives = 72/139 (51%), Gaps = 18/139 (12%)
Query: 3 AWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
AW R+ + KKLL +G + +L +LTD+DL ELKG ++LGFGF+ + +L
Sbjct: 51 AWERRRRQMIMIQEKKLLHKGASD--NLCIQANLTDEDLNELKGSIELGFGFNEEAGQKL 108
Query: 63 RNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIA----------------NWKISSPGDH 106
NTLPAL+L ++++ + + + KI PGD
Sbjct: 109 CNTLPALDLYFAVNRQLSPLPSPSSSRSSSASASTFSYSIPCSPKKTDSDSVKILCPGDD 168
Query: 107 PEDVKARLKFWAQAVACTV 125
P+ +K RL+ WAQAVAC+V
Sbjct: 169 PQQMKQRLRHWAQAVACSV 187
>AT3G04710.3 | Symbols: TPR10 | ankyrin repeat family protein |
chr3:1276948-1280942 FORWARD LENGTH=680
Length = 680
Score = 74.7 bits (182), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 72/139 (51%), Gaps = 18/139 (12%)
Query: 3 AWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPEL 62
AW R+ + KKLL +G + +L +LTD+DL ELKG ++LGFGF+ + +L
Sbjct: 51 AWERRRRQMIMIQEKKLLHKGASD--NLCIQANLTDEDLNELKGSIELGFGFNEEAGQKL 108
Query: 63 RNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIA----------------NWKISSPGDH 106
NTLPAL+L ++++ + + + KI PGD
Sbjct: 109 CNTLPALDLYFAVNRQLSPLPSPSSSRSSSASASTFSYSIPCSPKKTDSDSVKILCPGDD 168
Query: 107 PEDVKARLKFWAQAVACTV 125
P+ +K RL+ WAQAVAC++
Sbjct: 169 PQQMKQRLRHWAQAVACSI 187
>AT1G08790.1 | Symbols: | Protein of unknown function (DUF1685) |
chr1:2811989-2812646 FORWARD LENGTH=190
Length = 190
Score = 73.9 bits (180), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 25/133 (18%)
Query: 13 ENKNKKLL--LEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPELRNTLPALE 70
E + +++L LE + E D LTD+DL ELKG ++LGFGF+ ++ L TLPAL+
Sbjct: 58 ERRRRQMLHHLEKHNEGGD-----DLTDEDLSELKGSIELGFGFNEEQGQHLTTTLPALD 112
Query: 71 LCYSMSHKFXXXXXXXXXXXXXXXIA------------------NWKISSPGDHPEDVKA 112
L ++++ + + K+ SPGD P+ VK
Sbjct: 113 LYFAVTRQISPVSTPGSGGSSSSSRPTSLGDRSSSFGSPISDSDSLKVMSPGDDPQQVKT 172
Query: 113 RLKFWAQAVACTV 125
RL+ WAQAVAC+V
Sbjct: 173 RLRHWAQAVACSV 185
>AT5G28690.1 | Symbols: | Protein of unknown function (DUF1685) |
chr5:10723033-10723702 FORWARD LENGTH=192
Length = 192
Score = 70.5 bits (171), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 69/142 (48%), Gaps = 25/142 (17%)
Query: 2 EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGFSYDEIPE 61
E RR+ QE K K V E D S LTD+DL ELKG ++LGFGFS + +
Sbjct: 54 EKRRRQMLKIQEKKQK------SVSEND-NDSPDLTDEDLRELKGSIELGFGFSEEAGQK 106
Query: 62 LRNTLPALELCYSMSHKFXXXXXXXXXX------------------XXXXXIANWKISSP 103
L NTLPAL+L ++++ + + KI P
Sbjct: 107 LCNTLPALDLYFAVNRQLSPLPSPSSSNGGDGSLSSTSVSSSSIPCSPKTDSDSLKILCP 166
Query: 104 GDHPEDVKARLKFWAQAVACTV 125
GD+P+ VK RL+ WAQAVAC++
Sbjct: 167 GDNPQQVKQRLRHWAQAVACSL 188
>AT3G50350.2 | Symbols: | Protein of unknown function (DUF1685) |
chr3:18672164-18673665 FORWARD LENGTH=171
Length = 171
Score = 68.9 bits (167), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
Query: 30 LARSKSLTDDDLEELKGCVDLGFGFSYDEI--PELRNTLPALELCYSMSHKFXXXXXXX- 86
L R KSLTD+DL+ELK +LGFGF E P L NTLPALEL +++ +
Sbjct: 63 LRRGKSLTDEDLDELKASFELGFGFGSPENADPRLSNTLPALELYFAVQKSYNDAVSNKS 122
Query: 87 ----XXXXXXXXIANWKISSPGDHPEDVKARLKFWAQAVACTV 125
+ + D P+ VK +LK WA+ VACTV
Sbjct: 123 TTSSSSLSDGDTSPHHTVYQTSDDPQTVKTKLKQWARVVACTV 165
>AT2G15590.2 | Symbols: | Protein of unknown function (DUF1685) |
chr2:6801950-6802506 FORWARD LENGTH=155
Length = 155
Score = 67.4 bits (163), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 59/107 (55%), Gaps = 7/107 (6%)
Query: 30 LARSKSLTDDDLEELKGCVDLGFGF---SYDEIPELRNTLPALELCYSMSHKFXXXXXXX 86
L RSKS+T+DD+EELKGC +LGFGF S D P L +T+PAL+L ++ ++
Sbjct: 40 LPRSKSVTNDDIEELKGCFELGFGFETESPDLNPRLSHTIPALDLYCAVHRQYSNHLSRT 99
Query: 87 XXXXXXXXIAN----WKISSPGDHPEDVKARLKFWAQAVACTVKLCS 129
++N I GD + +K +LK WA+ V +V+ S
Sbjct: 100 SSFASDHEVSNSNNITTIVDKGDDRKTMKQKLKQWAKVVGFSVRHSS 146
>AT3G62070.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G46940.1); Has 137 Blast hits to 135 proteins
in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 137; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:22983710-22984482 REVERSE
LENGTH=228
Length = 228
Score = 59.7 bits (143), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 41 LEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIANWKI 100
L+E+K C DLGF EL +P + F IA W+I
Sbjct: 150 LQEVKACRDLGF--------ELEVPVPGRISVSTTGSNFDTQTSSGGDSP----IATWRI 197
Query: 101 SSPGDHPEDVKARLKFWAQAVA 122
SSPGD P++VKARLK WAQAVA
Sbjct: 198 SSPGDDPKEVKARLKVWAQAVA 219
>AT2G15610.1 | Symbols: | Protein of unknown function (DUF1685) |
chr2:6806033-6806686 FORWARD LENGTH=153
Length = 153
Score = 57.0 bits (136), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 19/81 (23%)
Query: 2 EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKS-LTDDDLEELKGCVDLGFGFSYDEI- 59
EAW R + H A L RSKS +T+DD+EEL+GC DLGFGF D +
Sbjct: 27 EAWLRMKKKH--------------PSARLHRSKSCVTNDDIEELRGCFDLGFGFEPDSLD 72
Query: 60 --PELRNTLPALELCYSMSHK 78
P L T+PAL+L YS H+
Sbjct: 73 FNPSLSKTIPALDL-YSAIHR 92
>AT2G15590.1 | Symbols: | Protein of unknown function (DUF1685) |
chr2:6801950-6802403 FORWARD LENGTH=125
Length = 125
Score = 56.2 bits (134), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/73 (49%), Positives = 45/73 (61%), Gaps = 17/73 (23%)
Query: 2 EAWRRKEGDHQENKNKKLLLEGYVEEADLARSKSLTDDDLEELKGCVDLGFGF---SYDE 58
EAW RK+ K + L L L RSKS+T+DD+EELKGC +LGFGF S D
Sbjct: 26 EAWLRKK------KKRPLDL--------LPRSKSVTNDDIEELKGCFELGFGFETESPDL 71
Query: 59 IPELRNTLPALEL 71
P L +T+PAL+L
Sbjct: 72 NPRLSHTIPALDL 84
>AT2G46940.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G62070.1); Has 143 Blast hits to 141 proteins
in 14 species: Archae - 0; Bacteria - 0; Metazoa - 4;
Fungi - 0; Plants - 139; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:19286658-19287505 REVERSE
LENGTH=252
Length = 252
Score = 55.5 bits (132), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 42/82 (51%), Gaps = 20/82 (24%)
Query: 41 LEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIANWKI 100
EE+K C DLGF ++P S S++ IANW+I
Sbjct: 185 FEEVKACRDLGFEL---DVPGR----------VSGSNR-------ETSSGGNSPIANWRI 224
Query: 101 SSPGDHPEDVKARLKFWAQAVA 122
SSPGD P++VKARLK WAQAVA
Sbjct: 225 SSPGDDPKEVKARLKMWAQAVA 246
>AT3G50350.1 | Symbols: | Protein of unknown function (DUF1685) |
chr3:18672906-18673706 FORWARD LENGTH=181
Length = 181
Score = 55.1 bits (131), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 35/52 (67%), Gaps = 2/52 (3%)
Query: 30 LARSKSLTDDDLEELKGCVDLGFGFSYDEI--PELRNTLPALELCYSMSHKF 79
L R KSLTD+DL+ELK +LGFGF E P L NTLPALEL +++ +
Sbjct: 67 LRRGKSLTDEDLDELKASFELGFGFGSPENADPRLSNTLPALELYFAVQKSY 118
>AT4G01670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G62070.1); Has 141 Blast hits to 139 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 138; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr4:711464-712511 REVERSE
LENGTH=249
Length = 249
Score = 53.1 bits (126), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 40/83 (48%), Gaps = 18/83 (21%)
Query: 40 DLEELKGCVDLGFGFSYDEIPELRNTLPALELCYSMSHKFXXXXXXXXXXXXXXXIANWK 99
DLEE+K C DLGF L + YS S +N +
Sbjct: 172 DLEEVKACKDLGF------------ELEPGRVSYSGS------TVDTSSGGNSPISSNHR 213
Query: 100 ISSPGDHPEDVKARLKFWAQAVA 122
ISSPGD P+DVKARLK WAQAVA
Sbjct: 214 ISSPGDDPKDVKARLKAWAQAVA 236
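The "Identities", "Positives" and "Gaps" lines in the alignments above each give a count, the alignment length, and a percentage. The Python sketch below extracts these values and recomputes the percentage. The names `STATS_RE` and `parse_alignment_stats` are hypothetical, and the regular expression assumes the exact line layout seen in this report. For the values shown here, the reported percentages match integer truncation rather than rounding (for example, 86/139 is about 61.9% and is reported as 61%).

```python
import re

# Matches e.g. "Identities = 86/139 (61%)"; layout assumed from this report.
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    stats = {}
    for name, count, length, percent in STATS_RE.findall(line):
        stats[name] = (int(count), int(length), int(percent))
    return stats


def truncated_percent(count, length):
    """Integer percentage, truncated toward zero."""
    return 100 * count // length


if __name__ == "__main__":
    line = ("Identities = 86/139 (61%), Positives = 95/139 (68%), "
            "Gaps = 22/139 (15%)")
    for name, (count, length, reported) in parse_alignment_stats(line).items():
        print(name, count, length, reported, truncated_percent(count, length))
```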