Miyakogusa Predicted Gene
- Lj2g3v1730230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v1730230.1 Non Characterized Hit- tr|B9RPI9|B9RPI9_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,60.61,0.0000000000001,seg,NULL; coiled-coil,NULL,CUFF.37908.1
(426 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT3G50910.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   252   4e-67
AT5G66480.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   173   2e-43
>AT3G50910.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G66480.1); Has 76 Blast hits to 75 proteins in
28 species: Archae - 0; Bacteria - 10; Metazoa - 7;
Fungi - 2; Plants - 49; Viruses - 0; Other Eukaryotes -
8 (source: NCBI BLink). | chr3:18920189-18921999 FORWARD
LENGTH=447
Length = 447
Score = 252 bits (643), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 176/451 (39%), Positives = 241/451 (53%), Gaps = 51/451 (11%)
Query: 1 MPTFTAIALDTLLEXXXXXXXXXXXXXXEFH-------KLERTTSAPKSKAP------RP 47
MPTF+AIALD +LE KLE+ K K P RP
Sbjct: 1 MPTFSAIALDRMLEPGASTSVESVPSTTNLFYSKPPISKLEKG----KGKLPNERTVTRP 56
Query: 48 RLKPALYATPEVKPLQDAPSSSFPPSPYIVNHKCRGP-RLLKSSSEASVLS--HQNVNGD 104
+ PALYATP+ PL ++PSS P YI+NHK RGP RLLKSSSEA+V+S HQ +
Sbjct: 57 LMSPALYATPDAIPLPNSPSSFPPSP-YIINHKSRGPPRLLKSSSEANVVSSSHQKTLEE 115
Query: 105 QKVNDKELENVVSSSAGDLQVTFTKPEL-EDKQVNGVCG-------------------GE 144
+ + E + VS +F E+ ED NGV
Sbjct: 116 ETIT-AETDVKVSPRRRSTSFSFPITEVTEDDYSNGVHARTVGNYNFDGIVDGPVGNWSP 174
Query: 145 LDRSNGHREPENGSLTDVLLREKALA----LNSERDREIEDFFDTQDSLSFTSNTDGEDN 200
LD +G+ + E + + L R L+ + ++++ E EDF+D +S SFTSNTD E +
Sbjct: 175 LDGKSGNGKSELDNAANGLERVNGLSEPVPIKTDKESESEDFYDPGESASFTSNTDVEGD 234
Query: 201 AGTELSMKFSSPGGEFYDAWEELSSKGTPQKSTTYDVXXXXXXXXXXXXXXIEKRKQAEE 260
AG E S + ++P GEFYDAW+ELS+ Q S ++ IEKRKQ EE
Sbjct: 235 AGDESSQRLATPVGEFYDAWDELSTDSGMQSSVN-NIESELSEIRLSLLMEIEKRKQTEE 293
Query: 261 SLNNFRNQWESVRQGLCQAGIILPSDLSVVAEGEQPNHDPVQDLCQQIYVARFISNIVGR 320
+L + W+ +R+ + Q G+ +P D + N + ++L Q+ +ARF+S+ +GR
Sbjct: 294 ALEQMQIHWQRLREQMAQVGLFVPIDPTASTN----NMNLSEELRCQLEIARFVSDSLGR 349
Query: 321 GTVRAEVEKEMEAQLESKNFEITRLLERLRCYETMNREMSQRNQEAVEMAXXXXXXXXXX 380
G +AEVE EME+ LE+KNFEITRL +RL YE +NREMSQRNQEA+E+A
Sbjct: 350 GMAKAEVEMEMESMLETKNFEITRLSDRLHYYEAVNREMSQRNQEAIEVARRERQKRKKR 409
Query: 381 XXWIWGSLTTVIALSTAAIAWSYLPTGNGSS 411
WIWGS+ I L +AA+AWSY+P SS
Sbjct: 410 QRWIWGSIAATITLGSAALAWSYIPAAKPSS 440
>AT5G66480.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT3G50910.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:26544737-26546476 REVERSE LENGTH=444
Length = 444
Score = 173 bits (439), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 150/439 (34%), Positives = 218/439 (49%), Gaps = 35/439 (7%)
Query: 1 MPTFTAIALD-TLLEXXXXXXXXXXXXXXEFHKLERTTSAPKSKA-PRPRLKPALYATPE 58
MPTF+A AL +L + L + PK K RP++ P+LYAT +
Sbjct: 1 MPTFSAAALGRSLNSGTSLSSKFPSTLQSKPSILNDESKQPKEKTFTRPQMSPSLYATTK 60
Query: 59 VKPLQDAPSSSFPPSPYIVNHKCRGPRLLKSSSEASVLSHQNVNGDQKVNDKELENVVSS 118
P ++PSS P YI+NHK RGP L SE SH +G++K++ +S
Sbjct: 61 EIPHPNSPSSYPPSP-YIINHKARGPVLFNRDSEVDGPSHPITSGEEKISGNVDVEATAS 119
Query: 119 SAGDLQVTFTKPE-LEDKQVNGV----------------CGGELDRSNGHREPENGSL-- 159
+ ++F E + NGV G L+ +G R+ NG +
Sbjct: 120 LSKSTSLSFPITEAIAVDHTNGVHTQGIHERPVWDCSPPLGTFLNEKSG-RDISNGGIGS 178
Query: 160 ---TDVLLREKALA----LNSERDREIEDFFDTQDSLSFTSNTDGEDNAGTELSMKFSSP 212
T L + L + ++++ E E+F++ + +SFTSNT+ ED E S ++
Sbjct: 179 NNATSNLEWQSYLLEPVRIKADKELEPENFYNPGELVSFTSNTEVEDFERAESSHSLATH 238
Query: 213 GGEFYDAWEELSSKGTPQKSTTYDVXXXXXXXXXXXXXXIEKRKQAEESLNNFRNQWESV 272
GEFYDA +ELS+ Q S ++ IE+R+QAE +L + W +
Sbjct: 239 VGEFYDACDELSTDSGMQSSAN-NIESEVREMRLGLLMEIERRRQAEATLEQMQVHWRRL 297
Query: 273 RQGLCQAGIILPSDLSVVAEGEQPNHDPVQDLCQQIYVARFISNIVGRGTVRAEVEKEME 332
R L G+ LP D + Q + + +L Q+ V RF+S+ +G + EVE EME
Sbjct: 298 RDQLADVGMFLPLDPT----RSQYSMNLADELRCQLEVTRFVSDTLGSDLAKTEVEMEME 353
Query: 333 AQLESKNFEITRLLERLRCYETMNREMSQRNQEAVEMAXXXXXXXXXXXXWIWGSLTTVI 392
A+LE+KNFEITRL +RL YET+N+EMSQRNQEA+E+A WIWGS+ I
Sbjct: 354 AELEAKNFEITRLSDRLHYYETVNQEMSQRNQEAIEVARRDGQKRKRRQRWIWGSIAATI 413
Query: 393 ALSTAAIAWSYLPTGNGSS 411
L + +AWSYLP G SS
Sbjct: 414 TLGSGVLAWSYLPPGMLSS 432
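
The per-alignment statistics in the report above (the "Score = ..." and "Identities = ..." lines) can be pulled out and re-checked with a small script. What follows is a minimal sketch, not part of the BLAST distribution: the function name parse_hsp_stats and both regular expressions are illustrative assumptions based only on the line layout visible in this report, and other BLAST versions or output formats may differ.

```python
import re

# Assumed patterns for the two statistics lines that precede each
# alignment in this report; they are not taken from BLAST itself.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(score_line, counts_line):
    """Return bit score, raw score, E-value, and recomputed percentages.

    Hypothetical helper: each count is returned as a tuple
    (matches, alignment_length, percent_as_float).
    """
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not a Score line: " + score_line)
    evalue_text = m.group(3)
    # Legacy BLAST can print very small E-values as e.g. "e-100";
    # prefix a 1 so that float() accepts the string.
    if evalue_text[0] in "eE":
        evalue_text = "1" + evalue_text
    stats = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(evalue_text),
    }
    for name, num, den in COUNT_RE.findall(counts_line):
        stats[name.lower()] = (int(num), int(den), 100.0 * int(num) / int(den))
    return stats


if __name__ == "__main__":
    # The two statistics lines of the second hit (AT5G66480.1), copied verbatim.
    hsp = parse_hsp_stats(
        "Score = 173 bits (439), Expect = 2e-43, Method: Compositional matrix adjust.",
        "Identities = 150/439 (34%), Positives = 218/439 (49%), Gaps = 35/439 (7%)",
    )
    for key in ("identities", "positives", "gaps"):
        matches, length, percent = hsp[key]
        print(f"{key}: {matches}/{length} = {percent:.2f}%")
```

Recomputing the ratios for the second hit gives roughly 49.66% positives and 7.97% gaps, while the report prints 49% and 7%. This suggests that the percentages in this output are truncated rather than rounded, which matters when filtering hits on an identity threshold.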