Miyakogusa Predicted Gene
- Lj4g3v0975550.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v0975550.1 Non Characterized Hit- tr|I1ID01|I1ID01_BRADI
Uncharacterized protein OS=Brachypodium distachyon
GN=,32.08,8e-19,seg,NULL,CUFF.48469.1
(243 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT3G50910.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    105   4e-23
AT5G66480.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...     68   5e-12
>AT3G50910.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G66480.1); Has 76 Blast hits to 75 proteins in
28 species: Archae - 0; Bacteria - 10; Metazoa - 7;
Fungi - 2; Plants - 49; Viruses - 0; Other Eukaryotes -
8 (source: NCBI BLink). | chr3:18920189-18921999 FORWARD
LENGTH=447
Length = 447
Score = 105 bits (261), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 97/263 (36%), Positives = 144/263 (54%), Gaps = 32/263 (12%)
Query: 1 MPTFTAIAFDRLIEPGASKPAARRSASTSM--PGPNARKLGR-RTSEPTAPSVSKKPPPR 57
MPTF+AIA DR++EPGAS ++T++ P KL + + P +V+ R
Sbjct: 1 MPTFSAIALDRMLEPGASTSVESVPSTTNLFYSKPPISKLEKGKGKLPNERTVT-----R 55
Query: 58 PQLKPSLYATPEVTPLPDAPSSFPPSPYIVNHKRRGP-RLLKSFSEADV--QAKQEVHED 114
P + P+LYATP+ PLP++PSSFPPSPYI+NHK RGP RLLKS SEA+V + Q+ E+
Sbjct: 56 PLMSPALYATPDAIPLPNSPSSFPPSPYIINHKSRGPPRLLKSSSEANVVSSSHQKTLEE 115
Query: 115 ENXXXXXXXXXX------------XXXAGDLQVTVMNAEPVNEEQVTGALDTKLSSCNGS 162
E D ++A V G +D + + +
Sbjct: 116 ETITAETDVKVSPRRRSTSFSFPITEVTEDDYSNGVHARTVGNYNFDGIVDGPVGNWSPL 175
Query: 163 DLEHGCRENELSSSITNGTHVEKVGALN------SERDVESDDFFDPQDSMSVTSYTDGE 216
D + G ++EL ++ NG +E+V L+ ++++ ES+DF+DP +S S TS TD E
Sbjct: 176 DGKSGNGKSELDNA-ANG--LERVNGLSEPVPIKTDKESESEDFYDPGESASFTSNTDVE 232
Query: 217 DNTGTERGVKLSTPGGEFFDAWE 239
+ G E +L+TP GEF+DAW+
Sbjct: 233 GDAGDESSQRLATPVGEFYDAWD 255
>AT5G66480.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT3G50910.1); Has 30201 Blast hits to
17322 proteins in 780 species: Archae - 12; Bacteria -
1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:26544737-26546476 REVERSE LENGTH=444
Length = 444
Score = 68.2 bits (165), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 118/254 (46%), Gaps = 26/254 (10%)
Query: 1 MPTFTAIAFDRLIEPGASKPAARRSASTSMPGPNARKLGRRTSEPTAPSVSKKPPPRPQL 60
MPTF+A A R + G S + S S P L + +P +K RPQ+
Sbjct: 1 MPTFSAAALGRSLNSGTSLSSKFPSTLQSKPSI----LNDESKQP-----KEKTFTRPQM 51
Query: 61 KPSLYATPEVTPLPDAPSSFPPSPYIVNHKRRGPRLLKSFSEADVQAKQEVHEDENXXXX 120
PSLYAT + P P++PSS+PPSPYI+NHK RGP L SE D + +E
Sbjct: 52 SPSLYATTKEIPHPNSPSSYPPSPYIINHKARGPVLFNRDSEVDGPSHPITSGEEKISGN 111
Query: 121 XXXXXXXXXAGDLQVTVMNAEPVNEEQVTGA----------------LDTKLSSCNGSDL 164
+ ++ E + + G L T L+ +G D+
Sbjct: 112 VDVEATASLSKSTSLSFPITEAIAVDHTNGVHTQGIHERPVWDCSPPLGTFLNEKSGRDI 171
Query: 165 EHG-CRENELSSSITNGTHVEKVGALNSERDVESDDFFDPQDSMSVTSYTDGEDNTGTER 223
+G N +S++ +++ + + +++++E ++F++P + +S TS T+ ED E
Sbjct: 172 SNGGIGSNNATSNLEWQSYLLEPVRIKADKELEPENFYNPGELVSFTSNTEVEDFERAES 231
Query: 224 GVKLSTPGGEFFDA 237
L+T GEF+DA
Sbjct: 232 SHSLATHVGEFYDA 245
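The Expect values in this report can be related to the bit scores through the standard Karlin-Altschul relation E ≈ m · n · 2^(−S'), where S' is the bit score, m the query length, and n the database length. The sketch below is a rough cross-check only, not BLAST's own computation: it uses the raw lengths printed in this report (243 letters; 14,482,855 total letters), whereas BLAST uses length-corrected effective values, so the estimate comes out somewhat larger than the reported E-value. The function name `approx_evalue` is an illustrative assumption.

```python
def approx_evalue(bit_score, query_len, db_len):
    """Rough E-value from a bit score: E ~ m * n * 2**(-S').

    Assumption: raw query and database lengths are used; BLAST itself
    applies effective-length corrections, which lower the result.
    """
    return query_len * db_len * 2.0 ** (-bit_score)

QUERY_LEN = 243        # "(243 letters)" in the report header
DB_LEN = 14_482_855    # "14,482,855 total letters" for TAIR10_pep

# (hit, bit score, E-value) as printed in this report
HITS = [
    ("AT3G50910.1", 105.0, 4e-23),
    ("AT5G66480.1", 68.2, 5e-12),
]

estimates = {}
for hit, bits, reported in HITS:
    estimates[hit] = approx_evalue(bits, QUERY_LEN, DB_LEN)
    print(f"{hit}: estimated E = {estimates[hit]:.1e}, reported E = {reported:.0e}")
```

For both hits the uncorrected estimate lands within a small factor of the reported value, which is the agreement one would expect when effective lengths are ignored.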