Miyakogusa Predicted Gene - Lj4g3v2826850.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2826850.1 Non Characterized Hit- tr|C5WTE9|C5WTE9_SORBI
Putative uncharacterized protein Sb01g029930
OS=Sorghu,37.16,4e-18,seg,NULL,CUFF.51741.1
(255 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
AT1G27990.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    278  2e-75
AT5G52420.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    102  2e-22
AT5G23920.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     93  2e-19
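Note: the summary table above can be read programmatically by treating the last two whitespace-separated fields of each row as the bit score and the E-value. The following is a minimal sketch, not part of the BLAST output. The function name `parse_hits` and the regular expression are illustrative assumptions, and the sample rows are copied from the table.

```python
import re

# Rows copied from the summary table of this report.
SUMMARY = """\
AT1G27990.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 278 2e-75
AT5G52420.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 102 2e-22
AT5G23920.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 93 2e-19
"""

# Fields: sequence id, free-text description, bit score, E-value.
# The description is matched lazily so that the final two tokens on the
# line are taken as the score and the E-value.
HIT_RE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)$")


def parse_hits(text):
    """Return a list of (id, bit_score, e_value) tuples (hypothetical helper)."""
    hits = []
    for line in text.splitlines():
        match = HIT_RE.match(line.strip())
        if match is None:
            continue  # skip blank lines and anything that is not a hit row
        ident, _desc, bits, evalue = match.groups()
        # Legacy BLAST can print very small E-values as e.g. "e-100";
        # float() needs a leading digit, so prepend one in that case.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append((ident, float(bits), float(evalue)))
    return hits


hits = parse_hits(SUMMARY)
for ident, bits, evalue in hits:
    print(ident, bits, evalue)
```

The summary table rounds bit scores to integers (the third hit is listed as 93 here and as 92.8 in its alignment header), so the per-alignment "Score =" lines are the better source when the exact value matters.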
>AT1G27990.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G52420.1); Has 86 Blast hits to 86 proteins in
15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 84; Viruses - 0; Other Eukaryotes - 2
(source: NCBI BLink). | chr1:9752799-9753919 REVERSE
LENGTH=271
Length = 271
Score = 278 bits (712), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 148/260 (56%), Positives = 184/260 (70%), Gaps = 15/260 (5%)
Query: 1 MSGVSLAMAGTDTNTNPKQQAASAAPVGSMNMMGSL-----------RVIEVQLVAFVLV 49
MSGVSLA+ G T+ + + AS++ G + M ++ RVIE+QLVAF+LV
Sbjct: 1 MSGVSLAV-GPRTDVD---KTASSSEKGRWSGMTAIGGGSGGLMGSLRVIELQLVAFILV 56
Query: 50 FSASGLVPLFDLLFPALTTIYLMALARFAFPSNVRGGPRQIIFHGSRGFQAYVVVGTTVG 109
FSASGLVP+ D+LFPA +IY++AL+R AFPS+ +F GS+ F+ YV+ GTT+G
Sbjct: 57 FSASGLVPILDMLFPAFASIYIIALSRLAFPSHGVSTASPEVFRGSKLFRLYVISGTTIG 116
Query: 110 LFLPLAYVLGGFGRGDELAVQSASPHLFLMSVQILTENVISGLSLFSPPVRALVPLMYTI 169
LFLPLAYVLGGF RGD+ AV+SA+PHLFL+S QILTENVISGLSLFSPPVRALVPL+YT+
Sbjct: 117 LFLPLAYVLGGFARGDDHAVRSATPHLFLLSCQILTENVISGLSLFSPPVRALVPLLYTV 176
Query: 170 RRIFVDVDWVQNVWLYKTLPQNALLKDKAWFWFGRXXXXXXXXXXXXXXCAFLIPRFLPR 229
RIFV + W ++VW K+LP NA WFWFGR FLIPRFLPR
Sbjct: 177 WRIFVIIGWSKDVWFNKSLPINATPNVVTWFWFGRYLALANLGYFGVNLLCFLIPRFLPR 236
Query: 230 AFKRYFQERDEIYAKEAEDK 249
AF++YF+ERDEI AK EDK
Sbjct: 237 AFEQYFRERDEILAKSQEDK 256
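Note: the E-values in this report can be sanity-checked against the bit scores with the standard approximation E ≈ m · n · 2^(-S), where S is the bit score, m the query length and n the database size in letters. The sketch below is not part of the BLAST output. It assumes the raw lengths printed in the header (255 and 14,482,855), whereas BLAST applies effective-length corrections, so the estimate is expected to agree only to within a small factor.

```python
# Values taken from this report's header and alignment headers.
QUERY_LEN = 255            # "(255 letters)"
DB_LETTERS = 14_482_855    # "14,482,855 total letters"

# (hit id, bit score, reported E-value)
REPORTED = [
    ("AT1G27990.1", 278.0, 2e-75),
    ("AT5G52420.1", 102.0, 2e-22),
    ("AT5G23920.1", 92.8, 2e-19),
]


def approx_evalue(bit_score, query_len=QUERY_LEN, db_letters=DB_LETTERS):
    """Rough E-value from a bit score, using raw (uncorrected) lengths."""
    return query_len * db_letters * 2.0 ** (-bit_score)


for ident, bits, reported in REPORTED:
    estimate = approx_evalue(bits)
    print(f"{ident}: estimate {estimate:.1e}, reported {reported:.0e}, "
          f"ratio {estimate / reported:.1f}")
```

Because raw lengths overstate the search space that BLAST actually uses, the estimates should come out somewhat larger than the reported values while staying within the same order of magnitude.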
>AT5G52420.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endoplasmic
reticulum; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G23920.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr5:21281817-21282545
FORWARD LENGTH=242
Length = 242
Score = 102 bits (254), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 103/204 (50%), Gaps = 9/204 (4%)
Query: 40 EVQLVAFVLVFSASGLVPLFDLLFPALTTIYLMALARFAFPSNV---RGGPRQIIFHGSR 96
++ ++A ++V SASGLV + D +F LT IY L++ FP + R P + ++
Sbjct: 44 QLNILAIIIVLSASGLVTIQDFIFTILTLIYFFFLSKLIFPPHNNPNRDAP--LTSSTNK 101
Query: 97 GFQAYVVVGTTVGLFLPLAYVLGGFGRGDELAVQSASPHLFLMSVQILTENVISGLSLFS 156
F+ YV VGL +P+ Y+ G D+ V +A+PH+FL++ QI E + + FS
Sbjct: 102 IFRIYVTAAGIVGLIIPICYIFEGIVEDDKNGVSAAAPHVFLLASQIFMEGLATMFG-FS 160
Query: 157 PPVRALVPLMYTIRRIFVDVDWVQNVWLYKTLPQNALLKDKAWFWFGRXXXXXXXXXXXX 216
P R LVP++Y RR+ V+W+ + + + + + + G+
Sbjct: 161 APARILVPIVYNARRVLTLVEWIMSEFSREDVTGTVSARR---MYAGKVLAAANLGIWSF 217
Query: 217 XXCAFLIPRFLPRAFKRYFQERDE 240
LIP +LPRAFKRY+ E
Sbjct: 218 NLFGVLIPVYLPRAFKRYYGSDKE 241
>AT5G23920.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane,
vacuole; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G52420.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:8073363-8074118 REVERSE
LENGTH=229
Length = 229
Score = 92.8 bits (229), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 102/206 (49%), Gaps = 9/206 (4%)
Query: 37 RVIEVQLVAFVLVFSASGLVPLFDLLFPALTTIYLMA-LARFAFPSNVRGGPRQIIFHGS 95
R ++ ++F+++ +A GLV + ++ F L IYL L+RFAFP +++ +
Sbjct: 29 RKRQLVFLSFMILLAAKGLVGIGEIAFVILCYIYLYEFLSRFAFPRKQTEQKKRLSNPKN 88
Query: 96 RGFQAYVVVGTTVGLFLPLAYVLGGFGRGDELAVQSASPHLFLMSVQILTENVISGLS-L 154
+ FQAY + +GL PL Y+ G RGD +A+PHLFL+S Q TE + G S
Sbjct: 89 KLFQAYFLATAIIGLLFPLCYIGDGIYRGDIHGAGAAAPHLFLLSGQAFTEPI--GFSDK 146
Query: 155 FSPPVRALVPLMYTIRRIFVDVDWVQNVWLYKTLPQNALLKDKAWFWFGRXXXXXXXXXX 214
+S P+ L P+ Y RRIF +DWV+ + P L + GR
Sbjct: 147 YSMPIGILGPVFYNARRIFALLDWVKAEFSDTQRPGGPL-----RLYGGRVIASVNTVMW 201
Query: 215 XXXXCAFLIPRFLPRAFKRYFQERDE 240
L+P FLPR+ + YF ++
Sbjct: 202 FYNLFGLLLPVFLPRSCEIYFSGDNK 227