Miyakogusa Predicted Gene
- Lj6g3v0365870.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0365870.1 Non Characterized Hit- tr|K3YW08|K3YW08_SETIT
Uncharacterized protein OS=Setaria italica
GN=Si018454,35.22,5e-19,A_tha_TIGR01569: plant integral membrane
protein T,Uncharacterised protein family UPF0497,
trans-mem,CUFF.57734.1
(213 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
AT1G17200.1 | Symbols: | Uncharacterised protein family (UPF049...     211  4e-55
AT3G14380.1 | Symbols: | Uncharacterised protein family (UPF049...     144  5e-35
AT1G17200.2 | Symbols: | Uncharacterised protein family (UPF049...     130  6e-31
AT5G54980.1 | Symbols: | Uncharacterised protein family (UPF049...      99  2e-21
AT4G16442.1 | Symbols: | Uncharacterised protein family (UPF049...      75  3e-14
AT2G35760.1 | Symbols: | Uncharacterised protein family (UPF049...      69  3e-12
AT4G15630.1 | Symbols: | Uncharacterised protein family (UPF049...      65  5e-11
AT2G27370.1 | Symbols: | Uncharacterised protein family (UPF049...      63  2e-10
AT1G03700.1 | Symbols: | Uncharacterised protein family (UPF049...      62  4e-10
AT4G03540.1 | Symbols: | Uncharacterised protein family (UPF049...      58  5e-09
AT4G25830.1 | Symbols: | Uncharacterised protein family (UPF049...      55  4e-08
AT1G14160.1 | Symbols: | Uncharacterised protein family (UPF049...      54  6e-08
AT4G15620.1 | Symbols: | Uncharacterised protein family (UPF049...      54  1e-07
AT5G06200.1 | Symbols: | Uncharacterised protein family (UPF049...      49  2e-06
AT5G15290.1 | Symbols: | Uncharacterised protein family (UPF049...      49  2e-06
AT2G36100.1 | Symbols: | Uncharacterised protein family (UPF049...      48  6e-06
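The summary table above lists one hit per line: the subject identifier, a truncated description, the bit score, and the E-value. A minimal sketch of pulling those three fields out of such a line follows. The names `HIT_LINE` and `parse_hit_line` are hypothetical and not part of BLAST, and the pattern assumes E-values carry a leading digit, as they do in this report.

```python
import re

# Hypothetical pattern for one summary line, e.g.
# "AT1G17200.1 | Symbols: | Uncharacterised ... 211 4e-55".
# Assumption: the last two whitespace-separated tokens are the
# bit score and the E-value, and the E-value starts with a digit.
HIT_LINE = re.compile(
    r"^(?P<subject>\S+)\s+\|.*\s"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d*\.?\d+(?:e[-+]?\d+)?)\s*$"
)


def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the line is not a hit line."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    return (
        match.group("subject"),
        float(match.group("bits")),
        float(match.group("evalue")),
    )


if __name__ == "__main__":
    example = "AT1G17200.1 | Symbols: | Uncharacterised protein family (UPF049... 211 4e-55"
    print(parse_hit_line(example))
```

Because the fields are matched from the right-hand end of the line, the sketch does not depend on how many spaces separate the columns. For anything beyond a quick look, a dedicated BLAST parser is likely a safer choice than a hand-written pattern.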
>AT1G17200.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:5878493-5879871 FORWARD LENGTH=204
Length = 204
Score = 211 bits (536), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 97/156 (62%), Positives = 123/156 (78%)
Query: 36 LRSLETFLRLLPIGLCVSALVLMLKNSQENDYGSVDYTDLGPFRYLVHANGICAGYSLLS 95
+R+ ET LRL P+GLCV+ALV+MLK+S+ N++GS+ Y++L FRYLVHANGICAGYSLLS
Sbjct: 30 IRTAETMLRLAPVGLCVAALVVMLKDSETNEFGSISYSNLTAFRYLVHANGICAGYSLLS 89
Query: 96 AVFVAVPRPSPTMSRAWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTTWSSACGSF 155
A A+PR S TM R WTFF LDQ+LTY++L AGA S EVLYL GD+ TWS AC S+
Sbjct: 90 AAIAAMPRSSSTMPRVWTFFCLDQLLTYLVLAAGAVSAEVLYLAYNGDSAITWSDACSSY 149
Query: 156 GRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKY 191
G FC++ TA+ ITF V Y++LS+ISSYKLFT++
Sbjct: 150 GGFCHRATASVIITFFVVCFYIVLSLISSYKLFTRF 185
>AT3G14380.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr3:4803175-4803799 REVERSE LENGTH=178
Length = 178
Score = 144 bits (363), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 78/158 (49%), Positives = 108/158 (68%), Gaps = 2/158 (1%)
Query: 35 TLRSLETFLRLLPIGLCVSALVLMLKNSQENDYGSVDYTDLGPFRYLVHANGICAGYSLL 94
T +S E LR+ + L ++ LV+M+KNS N++GSV Y+++G F YLV ANG+CA YSLL
Sbjct: 19 TEKSAEAVLRVASMALSITGLVIMIKNSISNEFGSVSYSNIGAFMYLVSANGVCAAYSLL 78
Query: 95 SAVFVAVPRPSP-TMSRAWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTTWSSACG 153
SA+ + P P + + T FLLDQV+TY++L AGA S E +YL G+ TWSSAC
Sbjct: 79 SAL-AILALPCPISKVQVRTLFLLDQVVTYVVLAAGAVSAETVYLAYYGNIPITWSSACD 137
Query: 154 SFGRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKY 191
S+G FC+ + TFV L Y+LLS+ISSY+LFT++
Sbjct: 138 SYGSFCHNALISVVFTFVVSLLYMLLSLISSYRLFTRF 175
>AT1G17200.2 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:5878668-5879871 FORWARD LENGTH=135
Length = 135
Score = 130 bits (327), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 62/102 (60%), Positives = 76/102 (74%)
Query: 90 GYSLLSAVFVAVPRPSPTMSRAWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTTWS 149
GYSLLSA A+PR S TM R WTFF LDQ+LTY++L AGA S EVLYL GD+ TWS
Sbjct: 15 GYSLLSAAIAAMPRSSSTMPRVWTFFCLDQLLTYLVLAAGAVSAEVLYLAYNGDSAITWS 74
Query: 150 SACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKY 191
AC S+G FC++ TA+ ITF V Y++LS+ISSYKLFT++
Sbjct: 75 DACSSYGGFCHRATASVIITFFVVCFYIVLSLISSYKLFTRF 116
>AT5G54980.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:22315799-22316560 FORWARD LENGTH=194
Length = 194
Score = 99.0 bits (245), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 97/159 (61%), Gaps = 4/159 (2%)
Query: 35 TLRSLETFLRLLPIGLCVSALVLMLKNSQEN-DYGSVDYTDLGPFRYLVHANGICAGYSL 93
+L+ +++ LRL + L V+ + L + N + N DYG+++Y + +Y+V + I A Y+L
Sbjct: 28 SLKIIDSCLRLSVVPLSVATIWLTVTNHESNPDYGNLEYNSIMGLKYMVGVSAISAIYAL 87
Query: 94 LSAVFVAVPRPSPTMSRAWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTTWSSACG 153
LS V V + +S+AW FF+ DQVL Y++ + A +TE++YL+ KGD TWS C
Sbjct: 88 LSTVSSWV---TCLVSKAWLFFIPDQVLAYVMTTSVAGATEIVYLLNKGDKIVTWSEMCS 144
Query: 154 SFGRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKYG 192
S+ +C+KLT + + ++ LS+IS+Y+ F+ +
Sbjct: 145 SYPHYCSKLTIALGLHVFVLFFFLFLSVISAYRAFSPFD 183
>AT4G16442.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr4:9272042-9272970 REVERSE LENGTH=182
Length = 182
Score = 75.1 bits (183), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 83/165 (50%), Gaps = 8/165 (4%)
Query: 36 LRSLETFLRLLPIGLCVSALVLMLKNSQENDYGSVD----YTDLGPFRYLVHANGICAGY 91
+R E LR + AL+L++ +++ ++ YTD+ +LV ANGI A Y
Sbjct: 8 MRLTELLLRCSISVFALLALILVVTDTEVKLIFTIKKTAKYTDMKAVVFLVVANGIAAVY 67
Query: 92 SLLSAV--FVAVPRPSPTMSR--AWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTT 147
SLL +V V + S+ AW FF DQ + Y+ + A AA+ E + +G+
Sbjct: 68 SLLQSVRCVVGTMKGKVLFSKPLAWAFFSGDQAMAYLNVAAIAATAESGVIAREGEEDLQ 127
Query: 148 WSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKYG 192
W C +G+FCN++ + +A + V +S IS++ LF YG
Sbjct: 128 WMRVCTMYGKFCNQMAIGVSSALLASIAMVFVSCISAFSLFRLYG 172
>AT2G35760.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:15032059-15033094 FORWARD LENGTH=201
Length = 201
Score = 68.6 bits (166), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 78/165 (47%), Gaps = 8/165 (4%)
Query: 36 LRSLETFLRLLPIGLCVSALVLMLKNSQENDY----GSVDYTDLGPFRYLVHANGICAGY 91
+R E LR L L + A +L+ + Q + +TD+ LV NGI AGY
Sbjct: 27 VRVTELILRCLVCVLALVAAILIATDVQVREIFMIQKKAKFTDMKALVLLVVVNGIAAGY 86
Query: 92 SLLSAV--FVAVPRPSPTMSR--AWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTT 147
SL+ AV V + + S+ AW F DQ + Y+ + AA+ + + G+
Sbjct: 87 SLVQAVRCVVGLMKGRVLFSKPLAWAIFFGDQAVAYLCVAGVAAAAQSAAFAKLGEPELQ 146
Query: 148 WSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKYG 192
W C +G+FCN++ A A + VL+S IS++ +F YG
Sbjct: 147 WMKICNMYGKFCNQVGEGIASALFACIGMVLISCISAFGVFRLYG 191
>AT4G15630.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr4:8917527-8918683 FORWARD LENGTH=190
Length = 190
Score = 64.7 bits (156), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 57/113 (50%)
Query: 75 LGPFRYLVHANGICAGYSLLSAVFVAVPRPSPTMSRAWTFFLLDQVLTYIILGAGAASTE 134
L F Y + AN I GY+ +S V V + + + S + D ++ ++ + A+
Sbjct: 77 LSAFVYNISANAIACGYTAISIVIVMISKGKRSKSLLMAVLIGDLMMVALLFSSTGAAGA 136
Query: 135 VLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKL 187
+ + G+ W CG FG+FCN+ + AIT +A + ++LL ++ + KL
Sbjct: 137 IGLMGRHGNKHVMWKKVCGVFGKFCNQAAVSVAITLIASVVFMLLVVLDALKL 189
>AT2G27370.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:11708628-11709905 REVERSE LENGTH=221
Length = 221
Score = 62.8 bits (151), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 3/119 (2%)
Query: 71 DYTDLGPFRYLVHANGICAGYSLLSAVF--VAVPRPSPTMSRAWTFFLLDQVLTYIILGA 128
DYTDL V N I GY LS F V + RP R + L D V+ + L A
Sbjct: 100 DYTDLPTMSSFVIVNSIVGGYLTLSLPFSIVCILRPLAVPPRLF-LILCDTVMMGLTLMA 158
Query: 129 GAASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKL 187
+AS ++YL G++ + W C FG FC + +F+A + L I+S++ L
Sbjct: 159 ASASAAIVYLAHNGNSSSNWLPVCQQFGDFCQGTSGAVVASFIAATLLMFLVILSAFAL 217
>AT1G03700.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:921038-921844 FORWARD LENGTH=164
Length = 164
Score = 61.6 bits (148), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 71/139 (51%), Gaps = 12/139 (8%)
Query: 54 ALVLMLKNSQENDYGSV----DYTDLGPFRYLVHANGICAGYSLLSAVFVAVPRPSPTMS 109
A++ M+ + + + + + Y+DL F+Y V AN I YS L +F+ P S
Sbjct: 24 AVIAMITSRERSSFFVISLVAKYSDLAAFKYFVIANAIVTVYSFL-VLFL------PKES 76
Query: 110 RAWTFFL-LDQVLTYIILGAGAASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAI 168
W F + LD ++T ++ + +A+ V + ++G+A W CG RFC+++T
Sbjct: 77 LLWKFVVVLDLMVTMLLTSSLSAAVAVAQVGKRGNANAGWLPICGQVPRFCDQITGALIA 136
Query: 169 TFVAVLCYVLLSIISSYKL 187
VA++ YV L I S + +
Sbjct: 137 GLVALVLYVFLLIFSIHHV 155
>AT4G03540.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr4:1570042-1571483 FORWARD LENGTH=164
Length = 164
Score = 57.8 bits (138), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 8/103 (7%)
Query: 71 DYTDLGPFRYLVHANGICAGYSLLSAVFVAVPRPSPTMSRAWTFFL-LDQVLTYIILGAG 129
YTD+ F+Y V AN + + YS L +F+ P S W F + LD V+T ++ +
Sbjct: 45 KYTDMAAFKYFVIANAVVSVYSFL-VLFL------PKESLLWKFVVVLDLVMTMLLTSSL 97
Query: 130 AASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAITFVA 172
+A+ V + +KG+A W CG +FC+++T FVA
Sbjct: 98 SAALAVAQVGKKGNANAGWLPICGQVPKFCDQITGALIAGFVA 140
>AT4G25830.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr4:13133727-13134790 FORWARD LENGTH=175
Length = 175
Score = 55.1 bits (131), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 8/163 (4%)
Query: 36 LRSLETFLRLLPIGLCVSALVLMLKNSQENDYGSVDYTDLGPFRYLV------HANGICA 89
LR E LRL + + L+ +SQ + + FRYL+ + + + A
Sbjct: 4 LRETEVILRLCIVFFLLLTSCLIGLDSQTKEIAYIHKN--VSFRYLLALEAELYIDVVVA 61
Query: 90 GYSLLSAVFVAVPRPSPTMSRAWTFFLLDQVLTYIILGAGAASTEVLYLMEKGDAGTTWS 149
Y+L+ T + W +LLDQ Y++ +A+ + L+ G W
Sbjct: 62 AYNLVQLGLGWYNVEQKTSNPKWFSYLLDQTAAYVVFAGTSAAAQHSLLVVTGSRELQWM 121
Query: 150 SACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKLFTKYG 192
C F RFC ++ + + ++A VLLS IS++ LF Y
Sbjct: 122 KWCYKFTRFCFQMGSAIILNYIAAALMVLLSSISAFNLFRLYS 164
>AT1G14160.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:4840798-4841660 REVERSE LENGTH=209
Length = 209
Score = 54.3 bits (129), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 3/120 (2%)
Query: 70 VDYTDLGPFRYLVHANGICAGYSLLSAVFVAVPRPSPTMSRAWTFFLL--DQVLTYIILG 127
V Y+DL + V AN I GY +LS + V++ T ++ LL D V+ ++
Sbjct: 88 VKYSDLPTLMFFVVANAISGGYLVLS-LPVSIFHIFSTQAKTSRIILLVVDTVMLALVSS 146
Query: 128 AGAASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKL 187
+A+T +YL +G+ W C F FC +++ + +F AV+ +L+ I S+ L
Sbjct: 147 GASAATATVYLAHEGNTTANWPPICQQFDGFCERISGSLIGSFCAVILLMLIVINSAISL 206
>AT4G15620.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr4:8913935-8915531 FORWARD LENGTH=190
Length = 190
Score = 53.5 bits (127), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/113 (23%), Positives = 52/113 (46%)
Query: 75 LGPFRYLVHANGICAGYSLLSAVFVAVPRPSPTMSRAWTFFLLDQVLTYIILGAGAASTE 134
L F Y + N I GY+ +S + + R + L D V+ ++ A++
Sbjct: 77 LSAFVYNISVNAIACGYTAISIAILMISRGRRSKKLLMVVLLGDLVMVALLFSGTGAASA 136
Query: 135 VLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKL 187
+ + G+ W CG FG+FC++ + +T +A + ++ L ++ + KL
Sbjct: 137 IGLMGLHGNKHVMWKKVCGVFGKFCHRAAPSLPLTLLAAVVFMFLVVLDAIKL 189
>AT5G06200.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:1877333-1878116 FORWARD LENGTH=202
Length = 202
Score = 49.3 bits (116), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 53/118 (44%), Gaps = 3/118 (2%)
Query: 72 YTDLGPFRYLVHANGICAGYSLLSAVF--VAVPRPSPTMSRAWTFFLLDQVLTYIILGAG 129
Y DL F++ V A I AGY +LS F V + RP R +LD + A
Sbjct: 83 YDDLPTFQFFVVAIAIVAGYLVLSLPFSVVTIVRPLAVAPR-LLLLVLDTAALALDTAAA 141
Query: 130 AASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLTATTAITFVAVLCYVLLSIISSYKL 187
+A+ ++YL G+ T W C FG FC K + F +V +L +IS L
Sbjct: 142 SAAAAIVYLAHNGNTNTNWLPICQQFGDFCQKTSGAVVSAFASVTFLAILVVISGVSL 199
>AT5G15290.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:4967094-4967846 FORWARD LENGTH=187
Length = 187
Score = 49.3 bits (116), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/97 (24%), Positives = 47/97 (48%), Gaps = 3/97 (3%)
Query: 71 DYTDLGPFRYLVHANGICAGYSLLSAV--FVAVPRPSPTMSRAWTFFLLDQVLTYIILGA 128
+Y DL + V AN + +GY +LS FV + + +R +LD + ++
Sbjct: 66 EYNDLPALTFFVVANAVVSGYLILSLTLAFVHIVKRKTQNTRIL-LIILDVAMLGLLTSG 124
Query: 129 GAASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLTAT 165
+++ ++YL G+ T W + C F FC +++ +
Sbjct: 125 ASSAAAIVYLAHNGNNKTNWFAICQQFNSFCERISGS 161
>AT2G36100.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:15159744-15160669 REVERSE LENGTH=206
Length = 206
Score = 47.8 bits (112), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 3/94 (3%)
Query: 72 YTDLGPFRYLVHANGICAGYSLLSAVF--VAVPRPSPTMSRAWTFFLLDQVLTYIILGAG 129
Y DL F+Y V A + A Y +LS F V++ RP R + D ++ + A
Sbjct: 87 YDDLPAFQYFVIAVAVVASYLVLSLPFSIVSIVRPHAVAPR-LILLICDTLVVTLNTSAA 145
Query: 130 AASTEVLYLMEKGDAGTTWSSACGSFGRFCNKLT 163
AA+ + YL G+ T W C FG FC ++
Sbjct: 146 AAAASITYLAHNGNQSTNWLPICQQFGDFCQNVS 179