Miyakogusa Predicted Gene
- Lj4g3v0684090.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0684090.1 Non Chatacterized Hit- tr|A9P8G1|A9P8G1_POPTR
Putative uncharacterized protein OS=Populus
trichocarp,26.78,3e-18,seg,NULL,CUFF.47908.1
(356 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G26650.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 416 e-116
AT1G69430.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 406 e-113
AT4G19950.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 72 7e-13
AT5G61340.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 64 1e-10
AT1G31130.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 59 4e-09
AT5G44860.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 54 2e-07
AT2G18680.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 50 3e-06
>AT1G26650.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G69430.1); Has 205 Blast hits to 204 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 205; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:9210335-9211342 FORWARD
LENGTH=335
Length = 335
Score = 416 bits (1068), Expect = e-116, Method: Compositional matrix adjust.
Identities = 200/311 (64%), Positives = 248/311 (79%), Gaps = 1/311 (0%)
Query: 39 EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
+ S NAL+ILRET+RILR+N A M T +LICPVSA+LL N VD+S+V LT++L+L
Sbjct: 21 QFHSSNALEILRETVRILRYNLGALMLTTAVLICPVSALLLPNFLVDQSLVNKLTVKLLL 80
Query: 99 VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKFDV 158
VA++SGLPL+ +K SCQ+FAE +SSA CFP++ T+ LLSKAAVVYSVDC+YSR+ D+
Sbjct: 81 VAKSSGLPLQPFVKHSCQKFAETAVSSAMCFPVFITVSLLSKAAVVYSVDCSYSREVVDI 140
Query: 159 SKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXXXX 218
SKF VI+ K WR+++ TY+W C +++GC T FCV LVA CS+ SVLGFSPD
Sbjct: 141 SKFLVILQKIWRRVVFTYVWICILIVGCFTFFCVLLVAICSSFSVLGFSPDFNVYGAMLV 200
Query: 219 XXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGLLIFLGSTIGTAF 278
FSVVFANAIIICN A+VI+VLEDVSG A++R+S LIKGQ QVGLL+FLGST+G AF
Sbjct: 201 GLAFSVVFANAIIICNTAIVISVLEDVSGLGALMRASDLIKGQIQVGLLMFLGSTLGLAF 260
Query: 279 VEGLFEHRVKTLSYGDGSSRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCR-SFSMENSEG 337
VEGLF+HRVK +SYGDGSSR+WEGPLLV+M+SFV LIDSMMSAVFYFSCR +SME S G
Sbjct: 261 VEGLFDHRVKKVSYGDGSSRLWEGPLLVLMYSFVTLIDSMMSAVFYFSCRVYYSMEASRG 320
Query: 338 EGNSILETMAI 348
E I+ET+ +
Sbjct: 321 ETQPIMETVTV 331
>AT1G69430.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G26650.1); Has 216 Blast hits to 215 proteins
in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr1:26098025-26099077 FORWARD
LENGTH=350
Length = 350
Score = 406 bits (1043), Expect = e-113, Method: Compositional matrix adjust.
Identities = 200/301 (66%), Positives = 249/301 (82%)
Query: 37 NYEIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRL 96
+++ SMNAL+ILRET+RILR+N AFM I +LLICPVSA+LL N+ VD+S+V +LT+RL
Sbjct: 35 DHKFHSMNALEILRETVRILRYNLGAFMLIALLLICPVSAILLPNLLVDQSVVNSLTVRL 94
Query: 97 MLVARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKF 156
+LV+++SGLPL ++ SCQ+F+E +SSA CFPL+ TL LLS+AAVVYSVDCTYSRKK
Sbjct: 95 LLVSKSSGLPLLPFVRNSCQKFSETAVSSAMCFPLFITLSLLSRAAVVYSVDCTYSRKKV 154
Query: 157 DVSKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXX 216
V+KF VI+ + W++++ TY+W CT+++ C+T FCVFLVA CS+ VLGFSPD
Sbjct: 155 VVTKFVVIMQRLWKRLVITYLWICTVIVVCLTSFCVFLVAVCSSFYVLGFSPDFNAYGAI 214
Query: 217 XXXXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGLLIFLGSTIGT 276
FSVVFANAIIICN +VI++LEDVSG A++R+S LIKGQTQVGLLIFLGSTIG
Sbjct: 215 LVGLVFSVVFANAIIICNTTIVISILEDVSGPGALVRASDLIKGQTQVGLLIFLGSTIGL 274
Query: 277 AFVEGLFEHRVKTLSYGDGSSRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCRSFSMENSE 336
FVEGLFEHRVK+LSYGDGSSR+WEGPLLV+M+SFVVLID+MMSAVFYFSCRS+SME E
Sbjct: 275 TFVEGLFEHRVKSLSYGDGSSRLWEGPLLVVMYSFVVLIDTMMSAVFYFSCRSYSMEAVE 334
Query: 337 G 337
Sbjct: 335 A 335
>AT4G19950.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44860.1); Has 338 Blast hits to 330 proteins
in 72 species: Archae - 2; Bacteria - 94; Metazoa - 7;
Fungi - 0; Plants - 232; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr4:10809977-10810942 FORWARD
LENGTH=321
Length = 321
Score = 72.0 bits (175), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 136/300 (45%), Gaps = 20/300 (6%)
Query: 39 EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
E+Q +N ILRE+ I +++ F IT+ LI P+S +L++ + I+
Sbjct: 7 ELQFLNKRGILRESTSIPQYSLKTFYLITLTLIFPLSFAILAHSLFTQPIL--------- 57
Query: 99 VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYA-TLLLLSKAAVVYSVDCTYSRKKFD 157
A+ P + Q +++ C+ ++ LLS AAVV++V Y+ K
Sbjct: 58 -AQIDTYPQAD--QSQLQHEWTVLLVFQFCYIIFLFAFSLLSTAAVVFTVASLYTGKPVS 114
Query: 158 VSKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXXX 217
S I +++ T++W +++ T+F +FLV A+ +
Sbjct: 115 FSSTMSAIPLVLKRLFITFLWVSLLMLAYNTVFLIFLVTLIVAVDLQNVVLAVFSLVVIF 174
Query: 218 XXXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGLLIFLGSTIGTA 277
V+ A + ++A V++VLE + G AM +S L+KG+T + + +
Sbjct: 175 VLFLVVHVYMTA--LWHLASVVSVLEPIYGLAAMKKSYELLKGKTLMACSMVFIYLVHCG 232
Query: 278 FVEGLFEHRVKTLSYGDGS---SRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCRSFSMEN 334
F+ G+F V + GD +R+ G LV + V LI ++ +VFY+ C+SF +
Sbjct: 233 FIAGVFGAVV--VRGGDDYGIFARIVAGGFLVGVLVIVNLIGLLVQSVFYYVCKSFHHQE 290
>AT5G61340.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G26650.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:24662064-24663044 REVERSE LENGTH=326
Length = 326
Score = 64.3 bits (155), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 129/274 (47%), Gaps = 22/274 (8%)
Query: 73 PVSAVLLSNVTVDESIVKNLTIRLMLVARTSGL-PLRSIIKQSCQRFAEMVISSASCFPL 131
P SA LL + S L +RL ++ R +G + ++ + SS P
Sbjct: 35 PFSAGLLLSQPFFSSSSSTLHMRLNMLFRGAGFSSSHDFFNILSLKLSQTLSSSLFTLPF 94
Query: 132 YATLLLLSKAAVVYSVDCTYSRKKFDVSKFCVIIAKFWRKILSTYMWACTIVIGC-ITLF 190
T LLLSKA V+ + +S V F+ ++L TY+ ++ + F
Sbjct: 95 SLTFLLLSKAYVIKLLSNNHSADSSSV---------FYLRLLKTYVCNFFFLLSANASAF 145
Query: 191 CVFLVAFCSALSVLGFSP-DXXXXXXXXXXXXFSVVFANAIIICNIAMVITVLEDVSGAQ 249
+F +A+ + L GFS + +S++ ANA +I N+A+V + G
Sbjct: 146 ALFFLAY-NTLEAFGFSSRNFYTFLSLSSAIIYSIIIANAFVISNLALVSSPSSSSGGYT 204
Query: 250 AMLRSSILIKGQTQVGLLIFLGSTIGTAFVEGLFEHRVKTLSYGDGS----SRMWEGPLL 305
+L++ +LI+G+ + + L + +G A VE LF++RV SY +G S EG +
Sbjct: 205 NILKACLLIRGRNSTAMALALPTNLGLAGVEALFQYRVMR-SYYNGDRDIISIALEGTFI 263
Query: 306 VIMHSFVVLIDSMMSAVFYFSCRSFSMENSEGEG 339
+++ +++D++++ +FY SC ++N E +
Sbjct: 264 AYLYALFLVLDTIVNFLFYQSC----IKNEEDQN 293
>AT1G31130.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G19950.1); Has 246 Blast hits to 244 proteins
in 29 species: Archae - 2; Bacteria - 16; Metazoa - 0;
Fungi - 0; Plants - 222; Viruses - 0; Other Eukaryotes -
6 (source: NCBI BLink). | chr1:11114963-11115928 REVERSE
LENGTH=321
Length = 321
Score = 59.3 bits (142), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 111/248 (44%), Gaps = 24/248 (9%)
Query: 39 EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
E+Q + +L+E+I I + + F IT+ I P+S +L++ + I+ L
Sbjct: 7 ELQFLTIPQLLQESISIKKRSPRTFYLITLSFIFPLSFAILAHSLFTQPILAKLD----- 61
Query: 99 VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKFDV 158
P S +S + ++I S LLS AAVV++V Y+ K
Sbjct: 62 ----KSDPPNS--DRSRHDWTVLLIFQFSYLIFLFAFSLLSTAAVVFTVASLYTGKPVSF 115
Query: 159 SKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSV--LGFSPDXXXXXXX 216
S I K ++++ T++W ++ +F VFLV AL + LG +
Sbjct: 116 SSTLSAIPKVFKRLFITFLWVALLMFAYNAVFFVFLVMLLVALDLNSLGLA---IVAGVI 172
Query: 217 XXXXXFSV-VFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGL-----LIFL 270
F V V+ A + ++ VI+VLE V G AM ++ L+KG+T++ + +FL
Sbjct: 173 ISVLYFGVHVYFTA--LWHLGSVISVLEPVYGIAAMRKAYELLKGKTKMAMGLIFVYLFL 230
Query: 271 GSTIGTAF 278
IG F
Sbjct: 231 CGLIGVVF 238
>AT5G44860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G19950.1); Has 233 Blast hits to 227 proteins
in 25 species: Archae - 0; Bacteria - 13; Metazoa - 1;
Fungi - 0; Plants - 216; Viruses - 0; Other Eukaryotes -
3 (source: NCBI BLink). | chr5:18110688-18111653 REVERSE
LENGTH=321
Length = 321
Score = 53.5 bits (127), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 130/297 (43%), Gaps = 16/297 (5%)
Query: 39 EIQSMNALDILRETIRILRFNSWAFMAITVLLICPVSAVLLSNVTVDESIVKNLTIRLML 98
E+Q +N ILRE+ I +F+ F IT+ LI P+S +L++ + I+
Sbjct: 7 ELQFLNIQGILRESTTIPKFSPKTFYLITLTLIFPLSFAILAHSLFTQPIL--------- 57
Query: 99 VARTSGLPLRSIIKQSCQRFAEMVISSASCFPLYATLLLLSKAAVVYSVDCTYSRKKFDV 158
A+ P K + + ++ L+A LLS AAVV++V Y+ K
Sbjct: 58 -AQLDATPPSDQSKTNHEWTLLLIYQFIYVIFLFA-FSLLSTAAVVFTVASLYTGKPVSF 115
Query: 159 SKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVAFCSALSVLGFSPDXXXXXXXXX 218
S I +++ T++W +++ ++F +FLV A+ + S
Sbjct: 116 SSTMSAIPLVLKRLFITFLWVSLMMLVYNSVFLLFLVVLIVAIDLQ--SVILAVFSMVVI 173
Query: 219 XXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRSSILIKGQTQVGL-LIFLGSTI-GT 276
F V ++A V++VLE + G AM +S L+ G+T + ++F+ + G
Sbjct: 174 FVLFLGVHVYMTAWWHLASVVSVLEPIYGIAAMKKSYELLNGRTNMACSMVFMYLALCGI 233
Query: 277 AFVEGLFEHRVKTLSYGDGSSRMWEGPLLVIMHSFVVLIDSMMSAVFYFSCRSFSME 333
+G +++ G LV + V L+ ++ +VFY+ C+SF +
Sbjct: 234 TAGVFGGVVVHGGDDFG-LFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVCKSFHHQ 289
>AT2G18680.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: male gametophyte, pollen tube;
EXPRESSED DURING: L mature pollen stage, M germinated
pollen stage; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G18690.1); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr2:8094780-8095643 FORWARD LENGTH=287
Length = 287
Score = 50.1 bits (118), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 64/126 (50%), Gaps = 8/126 (6%)
Query: 137 LLSKAAVVYSVDCTYSRKKFDVSKFCVIIAKFWRKILSTYMWACTIVIGCITLFCVFLVA 196
LLS +V++ T+ F++ F ++ K+W+ L T + +G LF F+V
Sbjct: 77 LLSTLVMVHASALTHKDDSFEIKDFPILTLKYWKGPLVTNFYIVLFSLGYWFLF--FIVL 134
Query: 197 FCSALSVLGFSP--DXXXXXXXXXXXXFSVVFANAIIICNIAMVITVLEDVSGAQAMLRS 254
F S++ FS D F+V + I+ N++MVI++LED G QA+ ++
Sbjct: 135 F----SIVFFSTKLDSLAAKSRALFIVFAVFESYLAIVWNLSMVISILEDTYGIQALGKA 190
Query: 255 SILIKG 260
+ ++KG
Sbjct: 191 AKIVKG 196