Miyakogusa Predicted Gene
- Lj5g3v1327750.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1327750.1 Non Chatacterized Hit- tr|Q10XM9|Q10XM9_TRIEI
Putative uncharacterized protein OS=Trichodesmium
eryt,25.07,2e-18,seg,NULL,CUFF.55537.1
(439 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G04550.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 377 e-105
AT5G28500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 369 e-102
AT5G28500.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 223 2e-58
>AT3G04550.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast
stroma, chloroplast; EXPRESSED IN: 22 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G28500.1); Has 110 Blast hits to 110 proteins
in 51 species: Archae - 0; Bacteria - 67; Metazoa - 1;
Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr3:1225961-1227310 FORWARD
LENGTH=449
Length = 449
Score = 377 bits (969), Expect = e-105, Method: Compositional matrix adjust.
Identities = 207/377 (54%), Positives = 252/377 (66%), Gaps = 15/377 (3%)
Query: 70 TLDIPGQIDILANRLGVWHEYAPLISSLIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
+LD G+I+ILA R+ +W EYAPLISSL +GFTPPTIEELTGI+ +EQNR IV AQVRD
Sbjct: 81 SLDSAGKIEILAGRMALWFEYAPLISSLYTDGFTPPTIEELTGISSIEQNRLIVGAQVRD 140
Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
S+LQS +P+L+S FD GGAELLY+I FI++ D KGA +LAR++KD
Sbjct: 141 SILQSIHEPELISAFDTGGAELLYEIRLLSTTQRVAAATFIIDRNIDSKGAQDLARAIKD 200
Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNP-EQRDSVLEQALRVAESEKARDA 248
+P+RRG+ GW FDYNLPGDCLSF+YYR SRE N+NP +QR S+L QAL VAESEKA++
Sbjct: 201 YPNRRGDVGWLDFDYNLPGDCLSFLYYRQSRE-NKNPSDQRTSMLLQALGVAESEKAKNR 259
Query: 249 IQKEL----KXXXXXXXXXXXXXXXXXXXXXXRLKIGEXXXXXXXXXLPVCNADEGGDAI 304
+ EL + RLK GE LPVC A+EG I
Sbjct: 260 LNTELYGDKEAEKEKEKKKKEEEVKAIRIPVVRLKFGEVAEATSVVVLPVCKAEEGEKKI 319
Query: 305 LEAPSECRTVGEFGVVVAEKGWEKWVVLPGWEPLVELGKGCVVVSFVDAR-VLPWKANRW 363
LEAP E G+F VV AEKGW++WVVLP W P+ +GKG V VSF D R VLPW
Sbjct: 320 LEAPMEIIAGGDFKVVEAEKGWKRWVVLPSWNPVAAIGKGGVAVSFRDDRKVLPWDG--- 376
Query: 364 YKEEPILVVADRSKREVGADDGFYLVKVEDDGVGLKVERGLGLKERGVSESLGSVILVVR 423
KEEP+LVVADR + V ADDG+YLV E+ GLK+E+G LK R V ESLG V+LVVR
Sbjct: 377 -KEEPLLVVADRVRNVVEADDGYYLVVAEN---GLKLEKGSDLKAREVKESLGMVVLVVR 432
Query: 424 PPREEDDG-QLSDEDWD 439
PPRE+DD Q S ++WD
Sbjct: 433 PPREDDDDWQTSHQNWD 449
>AT5G28500.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast
stroma, chloroplast; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G04550.1); Has 109 Blast hits to 109 proteins
in 49 species: Archae - 0; Bacteria - 67; Metazoa - 0;
Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes -
1 (source: NCBI BLink). | chr5:10477810-10479114 FORWARD
LENGTH=434
Length = 434
Score = 369 bits (947), Expect = e-102, Method: Compositional matrix adjust.
Identities = 200/372 (53%), Positives = 251/372 (67%), Gaps = 10/372 (2%)
Query: 70 TLDIPGQIDILANRLGVWHEYAPLISSLIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
+LD G+I++LA+RLG+W EYAPLISSL EGFTPP+IEELTGI+GVEQN IV AQVRD
Sbjct: 71 SLDTAGKIEVLADRLGLWFEYAPLISSLYTEGFTPPSIEELTGISGVEQNSLIVGAQVRD 130
Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
SL+QS P+L++ FD GAELLY+I +IV++ D KGA +LAR++KD
Sbjct: 131 SLVQSGAKPELIAAFDTNGAELLYEIRLLNTTQRVAAAEYIVDHGFDTKGAGDLARAIKD 190
Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNPEQRDSVLEQALRVAESEKARDAI 249
FP RRG+ G FDYNLPGDCLSFM YR SRE+ E R ++LEQAL A +EKA+ A+
Sbjct: 191 FPHRRGDVGLGDFDYNLPGDCLSFMLYRKSREHRSPSEIRTTLLEQALETAVTEKAKKAV 250
Query: 250 QKELKXXXXXXXXXXXXXXXXXXXXXXRLKIGEXXXXXXXXXLPVCNADEGGDAILEAPS 309
+EL RL+ GE LPVC A+EG + +LEAP
Sbjct: 251 LRELH-GESEEERVKEEEIKIIRVPVVRLRFGEVAGASSVVVLPVCKAEEGEEKLLEAPM 309
Query: 310 ECRTVGEFGVVVAEKGWEKWVVLPGWEPLVELGKGCVVVSFVDAR-VLPWKANRWYKEEP 368
E + GEFGVV AEK W +WVVLPGW+P+V + KG V VSF D R VLPW K E
Sbjct: 310 EFESGGEFGVVEAEKDWSRWVVLPGWDPVVAVRKG-VAVSFSDDREVLPWNG----KGEA 364
Query: 369 ILVVADRSKREVGADDGFYLVKVEDDGVGLKVERGLGLKERGVSESLGSVILVVRPPREE 428
I+VV DR K+ V AD+G+Y + V D G+K++RGL LKE+GV+ESLG V+LVVRPPR++
Sbjct: 365 IMVVIDREKKTVEADNGYYYLVVADG--GMKLDRGLVLKEKGVNESLGMVVLVVRPPRDD 422
Query: 429 DDG-QLSDEDWD 439
DD Q++DEDWD
Sbjct: 423 DDEWQINDEDWD 434
>AT5G28500.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G04550.1). | chr5:10477810-10478945 FORWARD
LENGTH=269
Length = 269
Score = 223 bits (569), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/185 (58%), Positives = 135/185 (72%)
Query: 70 TLDIPGQIDILANRLGVWHEYAPLISSLIREGFTPPTIEELTGITGVEQNRFIVAAQVRD 129
+LD G+I++LA+RLG+W EYAPLISSL EGFTPP+IEELTGI+GVEQN IV AQVRD
Sbjct: 71 SLDTAGKIEVLADRLGLWFEYAPLISSLYTEGFTPPSIEELTGISGVEQNSLIVGAQVRD 130
Query: 130 SLLQSNTDPDLVSFFDNGGAELLYQIXXXXXXXXXXXXXFIVENKSDGKGANELARSMKD 189
SL+QS P+L++ FD GAELLY+I +IV++ D KGA +LAR++KD
Sbjct: 131 SLVQSGAKPELIAAFDTNGAELLYEIRLLNTTQRVAAAEYIVDHGFDTKGAGDLARAIKD 190
Query: 190 FPSRRGEKGWESFDYNLPGDCLSFMYYRLSREYNRNPEQRDSVLEQALRVAESEKARDAI 249
FP RRG+ G FDYNLPGDCLSFM YR SRE+ E R ++LEQAL A +EKA+ A+
Sbjct: 191 FPHRRGDVGLGDFDYNLPGDCLSFMLYRKSREHRSPSEIRTTLLEQALETAVTEKAKKAV 250
Query: 250 QKELK 254
+EL
Sbjct: 251 LRELH 255