Miyakogusa Predicted Gene
- Lj0g3v0252839.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0252839.1 Non Chatacterized Hit- tr|B9T7V3|B9T7V3_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,46.88,2e-17,seg,NULL,CUFF.16590.1
(327 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G23850.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 125 3e-29
AT1G23840.1 | Symbols: | unknown protein; LOCATED IN: endomembr... 109 3e-24
AT1G23830.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 78 8e-15
>AT1G23850.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G23840.1); Has 47 Blast hits to 40 proteins in
5 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 47; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr1:8425981-8427045 REVERSE
LENGTH=354
Length = 354
Score = 125 bits (315), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 169/350 (48%), Gaps = 55/350 (15%)
Query: 21 DVLREAVMIYFRNLNFIIFTFLTSLPLLFIMLYFEIHLQEILLETSLILNQPHAHSLYHH 80
++L+ A + N+N ++F FL SLPL +++FE+ LQ + S L + + Y++
Sbjct: 11 NILKRATKLLCGNINLVLFLFLCSLPLFCFLIFFELSLQTTVSLASQYLVRQLTNWDYYY 70
Query: 81 GFNPDSIMRIFNMDYVLKLIHLGFIYMVPLHLFELGSAIFTIDLASKQQYSEEKKVTLNL 140
S++ + + LI +Y+ P L +L + + + SEE+ L
Sbjct: 71 VPQDASVLE----NLIPLLIQTFLLYLFPYGLIDLFTTTTIVSASWTVHTSEEEP--LRF 124
Query: 141 KEIIFQKPLDL--SNLRGTFVTSIYVLFLTTTHQLGLLWIVINY-HVFL----------- 186
+++ ++ +++ + L G +TS+YVL L+T G L++ NY H+
Sbjct: 125 GQLV-RRTVEICQNRLEGCLITSLYVLLLSTPVFFGFLFVATNYFHIISLTGSGENSYYY 183
Query: 187 -----KDLSCY-----------MLFFVICSL----VFAKVLRTCLEWSAMWNMSLVISVL 226
+D Y MLF + ++ +F +L +WSA WNM LV+SVL
Sbjct: 184 SINIEEDAEGYYRSSSVNSPVKMLFDAVLAMFHGAIFLGLLAMFSKWSAGWNMGLVVSVL 243
Query: 227 ER------VYGVDALALSVYFSRGCHRRGLFLMLIFFSWGHLLRLSC------HHLIGEQ 274
E +YG DAL LS + +G +RGL +ML+F + +R+ C G +
Sbjct: 244 EEEENGQSIYGTDALTLSSNYGKGHEKRGLQVMLVFLVFAIAMRMPCFCFKCTESSNGNR 303
Query: 275 GTWSGFYIQVGLFCLVNPLKWVVFMIYFHDCKERSLEKKTDEELGKDVKV 324
++ FY VGL C+ N +KWV ++++ DC+ LEKK D E+G K
Sbjct: 304 VLYTSFY--VGLICVGNMIKWVACVVFYEDCRTSVLEKKGDVEIGSKGKT 351
>AT1G23840.1 | Symbols: | unknown protein; LOCATED IN: endomembrane
system; EXPRESSED IN: 18 plant structures; EXPRESSED
DURING: 12 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G23850.1);
Has 53 Blast hits to 48 proteins in 5 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 53;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:8424321-8425337 REVERSE LENGTH=338
Length = 338
Score = 109 bits (273), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 156/340 (45%), Gaps = 49/340 (14%)
Query: 18 EVSDVLREAVMIYFRNLNFIIFTFLTSLPLLFIMLYFEIHLQEILLETSLILNQPHAHSL 77
V ++L+ A+ + F N+N +F FL SLPL +++FE+ LQ T++ L + L
Sbjct: 10 SVIELLKRALKLLFGNINLALFLFLCSLPLFCFLIFFELSLQ-----TTVSLASTYISKL 64
Query: 78 YHHGFNPDSIMRIFNMDYVLKLIHLGFIYMVPLHLFELGSAIFTIDLASKQQYSEEKKVT 137
+S + D + LI +Y P + +L + + +S SEE+ +
Sbjct: 65 V------NSEEDLSENDLLPWLIQTTLLYFFPYTILDLLTTTTIVAASSIAYTSEEEPLG 118
Query: 138 LNLKEIIFQKPLDLSNLRGTFVTSIYVLFLTTTHQLGLLWIVINYHVFLKDLSCYMLFF- 196
L L + L + + G +TS+YVL L+T+ LGL Y F +FF
Sbjct: 119 L-LYLVGRSFKLCQNKVGGCLITSLYVLLLSTSVFLGLFSGSTIYLYFASLTLEQQIFFN 177
Query: 197 ----------------------VICSLVFAKVLRTCLEWSAMWNMSLVISVLER------ 228
+I VF + +WSA WN+S+V+SVLE
Sbjct: 178 QAVVQDQRFLEQAVVLLDVVVVLIHGTVFIVLAAKFSKWSAGWNISMVVSVLEEEEDSKG 237
Query: 229 VYGVDALALSVYFSRGCHRRGLFLMLIFFSWGHLLRLSC------HHLIGEQGTWSGFYI 282
+YG AL+LS ++ RG +R ++ML+F + R+ C L G ++G Y
Sbjct: 238 IYGSSALSLSAWYLRGQEKRDFWMMLVFLVGALVTRMPCLYYKCSESLSGNGVLYTGLY- 296
Query: 283 QVGLFCLVNPLKWVVFMIYFHDCKERSLEKKTDEELGKDV 322
V L C+ N +KWV ++++HDC R L KK D E+G
Sbjct: 297 -VSLICVGNVVKWVSCVVWYHDCNTRVLRKKGDVEIGSKA 335
>AT1G23830.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 12 plant structures; EXPRESSED
DURING: 9 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT1G23840.1);
Has 57 Blast hits to 52 proteins in 7 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 57;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr1:8423003-8424040 REVERSE LENGTH=345
Length = 345
Score = 78.2 bits (191), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 99/345 (28%), Positives = 163/345 (47%), Gaps = 55/345 (15%)
Query: 18 EVSDVLREAVMIYFRNLNFIIFTFLTSLPLLFIMLYFEIHLQEILLETS------LILNQ 71
V ++L+ A+ + F N+N ++F L SLPL F +++FE+ LQ + TS LIL +
Sbjct: 10 SVIELLKRALKLLFGNINLLLFLCLCSLPLFFFLIFFELSLQTTVYLTSQFLWKLLILGE 69
Query: 72 PHAHSLYHHGFNPDSIMRIFNMDYVLKLIHLGFIYMVPLHLFELGSAIFTIDLASKQQYS 131
N ++ D + LI +Y P + +L + + +S S
Sbjct: 70 DLPE-------NDLILISEKKNDLISWLIQTFLLYFFPYTILDLLTTTTIVAASSIVYTS 122
Query: 132 EEKKVTLNLKEIIFQKPLDLSNLR--GTFVTSIYVLFLTT------------------TH 171
+E+ + L + ++ + + R G +TS+YVL +T T+
Sbjct: 123 KEEPLGL---LYLVERSIKICQNRVGGCLITSLYVLLWSTSVFLFFFLFFFLQFLSGSTN 179
Query: 172 QLGLLWIVINYHVF------LKDLSCYMLFFVICSLVFAKVLRTCLEWSAMWNMSLVISV 225
+ + ++ Y F L ++ + + C+L F + +WS+ WNM LV+SV
Sbjct: 180 YVSIPYLSREYKGFHYQPTGLFNVVVPLTLLMQCTL-FIVLTAKYSKWSSGWNMGLVVSV 238
Query: 226 LER------VYGVDALALSVYFSRGCHRRGLFLMLIFFSWGHLLRLSCHH----LIGEQG 275
LE +YG DAL+LS ++ +G +R L+LML+F +G R+ C + G
Sbjct: 239 LEEDEDGQGIYGGDALSLSGWYRKGHEKRDLWLMLMFLVFGLATRMPCLYSKCSASGNGV 298
Query: 276 TWSGFYIQVGLFCLVNPLKWVVFMIYFHDCKERSLEKKTDEELGK 320
++GFY VGL C+ N LKWV + +HDCK L KK D E K
Sbjct: 299 MYTGFY--VGLICVGNLLKWVTCLACYHDCKTMVLRKKRDVEQAK 341