Miyakogusa Predicted Gene
- Lj2g3v1277660.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1277660.1 tr|Q259U2|Q259U2_ORYSA H0913C04.2 protein
OS=Oryza sativa GN=H0913C04.2 PE=3 SV=1,51.35,3e-19,A_tha_TIGR01569:
plant integral membrane protein T,Uncharacterised protein family
UPF0497, trans-mem,CUFF.36657.1
(186 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
AT5G15290.1 | Symbols: | Uncharacterised protein family (UPF049...     155  2e-38
AT2G36100.1 | Symbols: | Uncharacterised protein family (UPF049...     138  3e-33
AT2G27370.1 | Symbols: | Uncharacterised protein family (UPF049...     135  1e-32
AT5G06200.1 | Symbols: | Uncharacterised protein family (UPF049...     130  5e-31
AT3G11550.1 | Symbols: | Uncharacterised protein family (UPF049...     126  7e-30
AT1G14160.1 | Symbols: | Uncharacterised protein family (UPF049...     124  4e-29
AT4G03540.1 | Symbols: | Uncharacterised protein family (UPF049...      55  2e-08
AT5G44550.1 | Symbols: | Uncharacterised protein family (UPF049...      52  2e-07
AT1G03700.1 | Symbols: | Uncharacterised protein family (UPF049...      51  4e-07
>AT5G15290.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:4967094-4967846 FORWARD LENGTH=187
Length = 187
Score = 155 bits (391), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 72/166 (43%), Positives = 100/166 (60%)
Query: 1 MKGGSIEIGEVSKGASQRKGMKRGLSIMDFILRIXXXXXXXXXXXXXXXXEESLPFVTNF 60
MK G EI E SKG + M R ++I++FILRI E+LPF T F
Sbjct: 1 MKSGQAEIMETSKGIQKSGLMSRRIAILEFILRIVAFFNTIGSAILMGTTHETLPFFTQF 60
Query: 61 MQFRAEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXXX 120
++F+AEY+DLP+ FFV+AN++V GYL+LSL L+ HIV+ +R+LLI+ D M
Sbjct: 61 IRFQAEYNDLPALTFFVVANAVVSGYLILSLTLAFVHIVKRKTQNTRILLIILDVAMLGL 120
Query: 121 XXXXXXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASGSVVGSY 166
HNGN+K NWF ICQQFN++C++ SGS++GS+
Sbjct: 121 LTSGASSAAAIVYLAHNGNNKTNWFAICQQFNSFCERISGSLIGSF 166
>AT2G36100.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:15159744-15160669 REVERSE LENGTH=206
Length = 206
Score = 138 bits (347), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 86/149 (57%)
Query: 16 SQRKGMKRGLSIMDFILRIXXXXXXXXXXXXXXXXEESLPFVTNFMQFRAEYDDLPSFVF 75
+ R G KRGL+I DF+LR+ EE+LPF T F+QF+A YDDLP+F +
Sbjct: 36 ASRGGAKRGLAIFDFLLRLAAIAVTIGAASVMYTAEETLPFFTQFLQFQAGYDDLPAFQY 95
Query: 76 FVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXXXXXXXXXXXXXXXXXX 135
FV+A ++V YLVLSL S+ IVR AV R++L++ DT++
Sbjct: 96 FVIAVAVVASYLVLSLPFSIVSIVRPHAVAPRLILLICDTLVVTLNTSAAAAAASITYLA 155
Query: 136 HNGNSKANWFPICQQFNNYCQQASGSVVG 164
HNGN NW PICQQF ++CQ S +VV
Sbjct: 156 HNGNQSTNWLPICQQFGDFCQNVSTAVVA 184
>AT2G27370.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:11708628-11709905 REVERSE LENGTH=221
Length = 221
Score = 135 bits (341), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 65/147 (44%), Positives = 86/147 (58%)
Query: 20 GMKRGLSIMDFILRIXXXXXXXXXXXXXXXXEESLPFVTNFMQFRAEYDDLPSFVFFVLA 79
G KRG++I DF+LR+ EE+LPF T F+QF+A+Y DLP+ FV+
Sbjct: 54 GWKRGVAIFDFVLRLIAAITAMAAAAKMATTEETLPFFTQFLQFQADYTDLPTMSSFVIV 113
Query: 80 NSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXXXXXXXXXXXXXXXXXXHNGN 139
NS+V GYL LSL S+ I+R AV R+ LI+ DTVM HNGN
Sbjct: 114 NSIVGGYLTLSLPFSIVCILRPLAVPPRLFLILCDTVMMGLTLMAASASAAIVYLAHNGN 173
Query: 140 SKANWFPICQQFNNYCQQASGSVVGSY 166
S +NW P+CQQF ++CQ SG+VV S+
Sbjct: 174 SSSNWLPVCQQFGDFCQGTSGAVVASF 200
>AT5G06200.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:1877333-1878116 FORWARD LENGTH=202
Length = 202
Score = 130 bits (327), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/166 (43%), Positives = 97/166 (58%), Gaps = 1/166 (0%)
Query: 1 MKGGSIEIGEVSKGASQRKGMKRGLSIMDFILRIXXXXXXXXXXXXXXXXEESLPFVTNF 60
+KG + +G +++ + G KRGLSI DF+LR+ +E+LPF T F
Sbjct: 18 IKGKAPLLG-LARDHTGSGGYKRGLSIFDFLLRLAAIVAALAAAATMGTSDETLPFFTQF 76
Query: 61 MQFRAEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXXX 120
+QF A YDDLP+F FFV+A ++V GYLVLSL SV IVR AV R+LL+V DT
Sbjct: 77 LQFEASYDDLPTFQFFVVAIAIVAGYLVLSLPFSVVTIVRPLAVAPRLLLLVLDTAALAL 136
Query: 121 XXXXXXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASGSVVGSY 166
HNGN+ NW PICQQF ++CQ+ SG+VV ++
Sbjct: 137 DTAAASAAAAIVYLAHNGNTNTNWLPICQQFGDFCQKTSGAVVSAF 182
>AT3G11550.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr3:3638262-3639052 FORWARD LENGTH=204
Length = 204
Score = 126 bits (317), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 69/161 (42%), Positives = 89/161 (55%), Gaps = 1/161 (0%)
Query: 1 MKGGSIEIGEV-SKGASQRKGMKRGLSIMDFILRIXXXXXXXXXXXXXXXXEESLPFVTN 59
MKG + IG S G RGL+I DF+LR+ +E+LPF T
Sbjct: 18 MKGKAPLIGVARDHTTSGSGGYNRGLAIFDFLLRLAAIVAALAAAATMGTSDETLPFFTQ 77
Query: 60 FMQFRAEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXX 119
F+QF A YDDLP+F FFV+A +LV GYLVLSL +SV I+R A R+LL+V DT +
Sbjct: 78 FLQFEASYDDLPTFQFFVIAMALVGGYLVLSLPISVVTILRPLATAPRLLLLVLDTGVLA 137
Query: 120 XXXXXXXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASG 160
H+GN NW PICQQF ++CQ++SG
Sbjct: 138 LNTAAASSAAAISYLAHSGNQNTNWLPICQQFGDFCQKSSG 178
>AT1G14160.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:4840798-4841660 REVERSE LENGTH=209
Length = 209
Score = 124 bits (310), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 98/187 (52%), Gaps = 6/187 (3%)
Query: 6 IEIGEVSKGASQ------RKGMKRGLSIMDFILRIXXXXXXXXXXXXXXXXEESLPFVTN 59
IE GE S+ + + + + +G+S++ F+LR+ ES+ ++
Sbjct: 23 IEAGETSRSSRKLITFEPKLVINKGISVLGFVLRLFAVFGTIGSALAMGTTHESVVSLSQ 82
Query: 60 FMQFRAEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXX 119
+ + +Y DLP+ +FFV+AN++ GYLVLSL +S+FHI + A SR++L+V DTVM
Sbjct: 83 LVLLKVKYSDLPTLMFFVVANAISGGYLVLSLPVSIFHIFSTQAKTSRIILLVVDTVMLA 142
Query: 120 XXXXXXXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASGSVVGSYXXXXXXXXXXXXX 179
H GN+ ANW PICQQF+ +C++ SGS++GS+
Sbjct: 143 LVSSGASAATATVYLAHEGNTTANWPPICQQFDGFCERISGSLIGSFCAVILLMLIVINS 202
Query: 180 XXXXSRH 186
SRH
Sbjct: 203 AISLSRH 209
>AT4G03540.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr4:1570042-1571483 FORWARD LENGTH=164
Length = 164
Score = 55.5 bits (132), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 47/106 (44%), Gaps = 7/106 (6%)
Query: 61 MQFRAEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXXX 120
+ A+Y D+ +F +FV+AN++V Y L L L ++ ++V D VM
Sbjct: 40 ISLEAKYTDMAAFKYFVIANAVVSVYSFLVLFLPKESLLWK-------FVVVLDLVMTML 92
Query: 121 XXXXXXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASGSVVGSY 166
GN+ A W PIC Q +C Q +G+++ +
Sbjct: 93 LTSSLSAALAVAQVGKKGNANAGWLPICGQVPKFCDQITGALIAGF 138
>AT5G44550.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:17942100-17943174 REVERSE LENGTH=197
Length = 197
Score = 52.0 bits (123), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 51/106 (48%), Gaps = 2/106 (1%)
Query: 62 QFRAEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLI-VFDTVMXXX 120
F A++D P+FVFFV+AN++V + +L + L +F + R+L + + D +
Sbjct: 59 TFTAKFDHTPAFVFFVVANAMVSFHNLLMIALQIFG-GKMEFTGFRLLSVAILDMLNVTL 117
Query: 121 XXXXXXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASGSVVGSY 166
NGN A W IC +F YC +G+++ ++
Sbjct: 118 ISAAANAAAFMAEVGKNGNKHARWDKICDRFATYCDHGAGALIAAF 163
>AT1G03700.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:921038-921844 FORWARD LENGTH=164
Length = 164
Score = 51.2 bits (121), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 7/100 (7%)
Query: 65 AEYDDLPSFVFFVLANSLVCGYLVLSLILSVFHIVRSSAVKSRVLLIVFDTVMXXXXXXX 124
A+Y DL +F +FV+AN++V Y L L L + S + V +V D ++
Sbjct: 44 AKYSDLAAFKYFVIANAIVTVYSFLVLFLP-----KESLLWKFV--VVLDLMVTMLLTSS 96
Query: 125 XXXXXXXXXXXHNGNSKANWFPICQQFNNYCQQASGSVVG 164
GN+ A W PIC Q +C Q +G+++
Sbjct: 97 LSAAVAVAQVGKRGNANAGWLPICGQVPRFCDQITGALIA 136