Miyakogusa Predicted Gene
- Lj5g3v2288720.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2288720.1 Non Characterized Hit- tr|D7LJS1|D7LJS1_ARALL
Putative uncharacterized protein (Fragment)
OS=Arabido,38.66,3e-17,seg,NULL,CUFF.57189.1
(356 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                   Score       E
Sequences producing significant alignments:                       (bits)   Value
AT2G39370.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   157   9e-39
AT2G37380.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    69   5e-12
AT5G26230.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    52   5e-07
>AT2G39370.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G37380.1); Has 184 Blast hits to 178 proteins
in 53 species: Archae - 0; Bacteria - 58; Metazoa - 9;
Fungi - 0; Plants - 103; Viruses - 0; Other Eukaryotes -
14 (source: NCBI BLink). | chr2:16444280-16445266
REVERSE LENGTH=328
Length = 328
Score = 157 bits (398), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 149/369 (40%), Positives = 197/369 (53%), Gaps = 54/369 (14%)
Query: 1 MAATLLTCDVADDDYIDMEVSSFSNLCHSVTSHHLQHGEFEFHMSSIVP-EKEAITSPAD 59
MAA L CD ++DYIDMEV+SF+NL S++ EFEF MS + P E + TSPAD
Sbjct: 1 MAAYLERCDSVEEDYIDMEVTSFTNLVRKTLSNNYPR-EFEFQMSHLCPLEIDKTTSPAD 59
Query: 60 ELFYKGKLLPLHLPPRLQMVEKLLQNSNSPFEEEKNVFEEXXXXXXXXXXXXXXXXXXXF 119
ELFYKGKLLPLHLPPRLQMV+K+L++ F++E F
Sbjct: 60 ELFYKGKLLPLHLPPRLQMVQKILEDYT--FDDEF-----YSTPLATGTVTTPVTSNTPF 112
Query: 120 ESCNISPSDSCQVSRELKPEEYYSLDYLEDTTSGFVVENQKKSWTXX---XXXXXXXXXX 176
ESC +SP++SCQVS+EL PE+Y +LE + S + +KKSWT
Sbjct: 113 ESCTVSPAESCQVSKELNPEDY----FLEYSDSLEEDDEKKKSWTTKLRLMKQSSLGTKI 168
Query: 177 XASRAYLKSWFGKSGCSYETYATST--KVADEGSVSKAREILNKQAQVVKKNPYGQIQRQ 234
ASRAYL+S+FGK+ CS E+ S+ +VADE SV + + P+GQI+ +
Sbjct: 169 KASRAYLRSFFGKTSCSDESSCASSAARVADEDSVLRYSRV----------KPFGQIKTE 218
Query: 235 RYQPSNSNMRSYKEKTSEDRSNHHRRSFSVGIKXXXXXXXXXXXXXXXXXXXXXXXXXXY 294
R P K++++ S HRRSFSV ++
Sbjct: 219 R--P--------KKQSNGSVSGSHRRSFSVSMRRQAAKSSNNKSSNSLGFRP-------- 260
Query: 295 GCQSLKRCSSVNSEIENSIQGAIAHCKKSQQKK-------NASEVGLYSLPESRNSVCED 347
Q LKR +S +SEIENSIQGAI HCK+SQQ+K +EVG SL SR + +D
Sbjct: 261 -LQFLKRSTSSSSEIENSIQGAILHCKQSQQQKQKQKQYSTVNEVGFCSLSASRIAARDD 319
Query: 348 QERVVLCRG 356
QE + RG
Sbjct: 320 QEWAQMFRG 328
>AT2G37380.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G39370.1); Has 1284 Blast hits to 422 proteins
in 114 species: Archae - 0; Bacteria - 90; Metazoa -
125; Fungi - 151; Plants - 136; Viruses - 0; Other
Eukaryotes - 782 (source: NCBI BLink). |
chr2:15686828-15687793 FORWARD LENGTH=321
Length = 321
Score = 68.9 bits (167), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 150/371 (40%), Gaps = 87/371 (23%)
Query: 5 LLTCDVADDDYIDMEVSSFSNLCHSVTSHHL----------QHGEFEFHM-SSIVPEKEA 53
+L+ D DD YIDMEV+ S+ S +S Q EFEF M SS V E+
Sbjct: 12 VLSTD-GDDGYIDMEVNLSSSSSSSTSSSSFFSFPVTSSPPQSREFEFQMCSSAVASGES 70
Query: 54 ITSPADELFYKGKLLPLHLPPRLQMVEKLLQNSNSPFEEEKNVFEEXXXXXXXXXXXXXX 113
TSPADELFYKG+LLPLHLPPRL+MV+KLL S+S +
Sbjct: 71 TTSPADELFYKGQLLPLHLPPRLKMVQKLLLASSSSTAATETPI--------SPRAAADV 122
Query: 114 XXXXXFESCNISPSDSC--QVSRELKPEEYYSLDYLEDTTSGFVVENQK---KSWTXXXX 168
F SC I ++C ++S ELK F+ N+ SW+
Sbjct: 123 LSPRRFSSCEIGQDENCFFEISTELK---------------RFIESNENHLGNSWSKKIK 167
Query: 169 XXXXXXXXXASRAYLKSWFGKSGCSYETYATSTKVADEGSVSKAREILNKQAQVVKKNPY 228
ASRAY+K+ F K CS + + VS+ KKNP+
Sbjct: 168 HSSITQKLKASRAYIKALFSKQACSDSSEINPRFKIEPSKVSR------------KKNPF 215
Query: 229 GQIQRQRYQPSNSNMRSYKEKTSEDRSNHHRRSFSVGIKXXXXXX-----XXXXXXXXXX 283
SE+ HRRSFS I+
Sbjct: 216 --------------------VNSENPLLIHRRSFSGVIQRHSQAKCSTSSSSSSSASSLS 255
Query: 284 XXXXXXXXXXYGCQSLKRCSSVNSEIENSIQGAIAHCKKS--QQKKNASEVGLYSLPESR 341
Q+L R S+ + +NSI+GAI HCK+S +K N +E L S SR
Sbjct: 256 SSFSFGSNGSLDLQTLMRSSNAS---DNSIEGAIEHCKQSFTTRKSNVTESELCS---SR 309
Query: 342 NSV--CEDQER 350
SV C D ++
Sbjct: 310 TSVSTCGDLDK 320
>AT5G26230.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; Has 1807 Blast hits to 1807 proteins in
277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other
Eukaryotes - 339 (source: NCBI BLink). |
chr5:9173517-9174542 REVERSE LENGTH=341
Length = 341
Score = 52.4 bits (124), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 39 EFEFHMSSIVPEKEAIT-SPADELFYKGKLLPLHLPPRLQMVEKL 82
          EFEF++S I P K + +  PADELFYKG+LLPL L PRL +V  L
Sbjct: 25 EFEFNIS-ISPRKASSSLCPADELFYKGQLLPLQLSPRLSLVRTL 68