Miyakogusa Predicted Gene
- Lj0g3v0217659.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0217659.1 tr|B9HBX1|B9HBX1_POPTR Predicted protein
OS=Populus trichocarpa GN=POPTRDRAFT_561467 PE=3
SV=1,52.88,2e-18,SUBFAMILY NOT NAMED,NULL; NITRATE, FROMATE, IRON
DEHYDROGENASE,NULL; A_tha_TIGR01569: plant integral,CUFF.14060.1
(197 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G11550.1 | Symbols: | Uncharacterised protein family (UPF049... 160 4e-40
AT5G06200.1 | Symbols: | Uncharacterised protein family (UPF049... 150 6e-37
AT2G36100.1 | Symbols: | Uncharacterised protein family (UPF049... 143 9e-35
AT2G27370.1 | Symbols: | Uncharacterised protein family (UPF049... 129 2e-30
AT5G15290.1 | Symbols: | Uncharacterised protein family (UPF049... 107 6e-24
AT1G14160.1 | Symbols: | Uncharacterised protein family (UPF049... 87 6e-18
AT3G06390.1 | Symbols: | Uncharacterised protein family (UPF049... 50 1e-06
AT1G17200.1 | Symbols: | Uncharacterised protein family (UPF049... 50 1e-06
>AT3G11550.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr3:3638262-3639052 FORWARD LENGTH=204
Length = 204
Score = 160 bits (406), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 85/174 (48%), Positives = 102/174 (58%), Gaps = 5/174 (2%)
Query: 2 STTIDMPGSSKAAKAGKPVLV-----TTPSRPGGWKKGVAIMDFXXXXXXXXXXXXXXXT 56
ST ID+P S +A GK L+ T S GG+ +G+AI DF T
Sbjct: 5 STFIDVPAESSSAMKGKAPLIGVARDHTTSGSGGYNRGLAIFDFLLRLAAIVAALAAAAT 64
Query: 57 MGLSDQTLPXXXXXXXXEASYDSFTTFQFFVITMALVAGYLVLSLPLSIVAIVRPHAAGP 116
MG SD+TLP EASYD TFQFFVI MALV GYLVLSLP+S+V I+RP A P
Sbjct: 65 MGTSDETLPFFTQFLQFEASYDDLPTFQFFVIAMALVGGYLVLSLPISVVTILRPLATAP 124
Query: 117 RLFLIILDTVFXXXXXXXXXXXXXIVYLAHNGNQDTNWIAICNQFGDFCAQTSG 170
RL L++LDT I YLAH+GNQ+TNW+ IC QFGDFC ++SG
Sbjct: 125 RLLLLVLDTGVLALNTAAASSAAAISYLAHSGNQNTNWLPICQQFGDFCQKSSG 178
>AT5G06200.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:1877333-1878116 FORWARD LENGTH=202
Length = 202
Score = 150 bits (379), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 82/172 (47%), Positives = 101/172 (58%), Gaps = 3/172 (1%)
Query: 2 STTIDMPGSSKAAKAGK-PVL--VTTPSRPGGWKKGVAIMDFXXXXXXXXXXXXXXXTMG 58
S +D+P S + GK P+L + GG+K+G++I DF TMG
Sbjct: 5 SIAVDVPAESSSVIKGKAPLLGLARDHTGSGGYKRGLSIFDFLLRLAAIVAALAAAATMG 64
Query: 59 LSDQTLPXXXXXXXXEASYDSFTTFQFFVITMALVAGYLVLSLPLSIVAIVRPHAAGPRL 118
SD+TLP EASYD TFQFFV+ +A+VAGYLVLSLP S+V IVRP A PRL
Sbjct: 65 TSDETLPFFTQFLQFEASYDDLPTFQFFVVAIAIVAGYLVLSLPFSVVTIVRPLAVAPRL 124
Query: 119 FLIILDTVFXXXXXXXXXXXXXIVYLAHNGNQDTNWIAICNQFGDFCAQTSG 170
L++LDT IVYLAHNGN +TNW+ IC QFGDFC +TSG
Sbjct: 125 LLLVLDTAALALDTAAASAAAAIVYLAHNGNTNTNWLPICQQFGDFCQKTSG 176
>AT2G36100.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:15159744-15160669 REVERSE LENGTH=206
Length = 206
Score = 143 bits (360), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 79/176 (44%), Positives = 93/176 (52%), Gaps = 9/176 (5%)
Query: 2 STTIDMPGSSKAAK--------AGKPVLVTTPSRPGGWKKGVAIMDFXXXXXXXXXXXXX 53
STTID+ S K A K V SR GG K+G+AI DF
Sbjct: 5 STTIDVGEPSTVTKSSSHVVKDAKKKGFVAVASR-GGAKRGLAIFDFLLRLAAIAVTIGA 63
Query: 54 XXTMGLSDQTLPXXXXXXXXEASYDSFTTFQFFVITMALVAGYLVLSLPLSIVAIVRPHA 113
M +++TLP +A YD FQ+FVI +A+VA YLVLSLP SIV+IVRPHA
Sbjct: 64 ASVMYTAEETLPFFTQFLQFQAGYDDLPAFQYFVIAVAVVASYLVLSLPFSIVSIVRPHA 123
Query: 114 AGPRLFLIILDTVFXXXXXXXXXXXXXIVYLAHNGNQDTNWIAICNQFGDFCAQTS 169
PRL L+I DT+ I YLAHNGNQ TNW+ IC QFGDFC S
Sbjct: 124 VAPRLILLICDTLVVTLNTSAAAAAASITYLAHNGNQSTNWLPICQQFGDFCQNVS 179
>AT2G27370.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr2:11708628-11709905 REVERSE LENGTH=221
Length = 221
Score = 129 bits (323), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 82/168 (48%)
Query: 28 PGGWKKGVAIMDFXXXXXXXXXXXXXXXTMGLSDQTLPXXXXXXXXEASYDSFTTFQFFV 87
P GWK+GVAI DF M +++TLP +A Y T FV
Sbjct: 52 PAGWKRGVAIFDFVLRLIAAITAMAAAAKMATTEETLPFFTQFLQFQADYTDLPTMSSFV 111
Query: 88 ITMALVAGYLVLSLPLSIVAIVRPHAAGPRLFLIILDTVFXXXXXXXXXXXXXIVYLAHN 147
I ++V GYL LSLP SIV I+RP A PRLFLI+ DTV IVYLAHN
Sbjct: 112 IVNSIVGGYLTLSLPFSIVCILRPLAVPPRLFLILCDTVMMGLTLMAASASAAIVYLAHN 171
Query: 148 GNQDTNWIAICNQFGDFCAQTSGXXXXXXXXXXXXXXXXXMSALVLGK 195
GN +NW+ +C QFGDFC TSG +SA L +
Sbjct: 172 GNSSSNWLPVCQQFGDFCQGTSGAVVASFIAATLLMFLVILSAFALKR 219
>AT5G15290.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr5:4967094-4967846 FORWARD LENGTH=187
Length = 187
Score = 107 bits (267), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 77/159 (48%), Gaps = 3/159 (1%)
Query: 15 KAGKPVLVTTP---SRPGGWKKGVAIMDFXXXXXXXXXXXXXXXTMGLSDQTLPXXXXXX 71
K+G+ ++ T + G + +AI++F MG + +TLP
Sbjct: 2 KSGQAEIMETSKGIQKSGLMSRRIAILEFILRIVAFFNTIGSAILMGTTHETLPFFTQFI 61
Query: 72 XXEASYDSFTTFQFFVITMALVAGYLVLSLPLSIVAIVRPHAAGPRLFLIILDTVFXXXX 131
+A Y+ FFV+ A+V+GYL+LSL L+ V IV+ R+ LIILD
Sbjct: 62 RFQAEYNDLPALTFFVVANAVVSGYLILSLTLAFVHIVKRKTQNTRILLIILDVAMLGLL 121
Query: 132 XXXXXXXXXIVYLAHNGNQDTNWIAICNQFGDFCAQTSG 170
IVYLAHNGN TNW AIC QF FC + SG
Sbjct: 122 TSGASSAAAIVYLAHNGNNKTNWFAICQQFNSFCERISG 160
>AT1G14160.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:4840798-4841660 REVERSE LENGTH=209
Length = 209
Score = 87.4 bits (215), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 62/139 (44%)
Query: 32 KKGVAIMDFXXXXXXXXXXXXXXXTMGLSDQTLPXXXXXXXXEASYDSFTTFQFFVITMA 91
KG++++ F MG + +++ + Y T FFV+ A
Sbjct: 45 NKGISVLGFVLRLFAVFGTIGSALAMGTTHESVVSLSQLVLLKVKYSDLPTLMFFVVANA 104
Query: 92 LVAGYLVLSLPLSIVAIVRPHAAGPRLFLIILDTVFXXXXXXXXXXXXXIVYLAHNGNQD 151
+ GYLVLSLP+SI I A R+ L+++DTV VYLAH GN
Sbjct: 105 ISGGYLVLSLPVSIFHIFSTQAKTSRIILLVVDTVMLALVSSGASAATATVYLAHEGNTT 164
Query: 152 TNWIAICNQFGDFCAQTSG 170
NW IC QF FC + SG
Sbjct: 165 ANWPPICQQFDGFCERISG 183
>AT3G06390.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr3:1938913-1939707 REVERSE LENGTH=199
Length = 199
Score = 50.1 bits (118), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 51/114 (44%), Gaps = 5/114 (4%)
Query: 57 MGLSDQT----LPXXXXXXXXEASYDSFTTFQFFVITMALVAGYLVLSLPLSIVAIVRPH 112
M SDQT LP A ++ F +FV+ + + + Y ++S +SI +++P
Sbjct: 56 MVTSDQTEMTQLPGVSSPAPVSAEFNDSPAFIYFVVALVVASFYALISTLVSISLLLKPE 115
Query: 113 -AAGPRLFLIILDTVFXXXXXXXXXXXXXIVYLAHNGNQDTNWIAICNQFGDFC 165
A ++L LD V + Y+A GN++ W ICN + FC
Sbjct: 116 FTAQFSIYLASLDMVMLGILASATGTAGGVAYIALKGNEEVGWNKICNVYDKFC 169
>AT1G17200.1 | Symbols: | Uncharacterised protein family (UPF0497)
| chr1:5878493-5879871 FORWARD LENGTH=204
Length = 204
Score = 49.7 bits (117), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 3/96 (3%)
Query: 76 SYDSFTTFQFFVITMALVAGYLVLSLPLSIVAIVRPHAAGPRLF-LIILDTVFXXXXXXX 134
SY + T F++ V + AGY +LS +I A+ R + PR++ LD +
Sbjct: 65 SYSNLTAFRYLVHANGICAGYSLLSA--AIAAMPRSSSTMPRVWTFFCLDQLLTYLVLAA 122
Query: 135 XXXXXXIVYLAHNGNQDTNWIAICNQFGDFCAQTSG 170
++YLA+NG+ W C+ +G FC + +
Sbjct: 123 GAVSAEVLYLAYNGDSAITWSDACSSYGGFCHRATA 158