Miyakogusa Predicted Gene
- Lj6g3v1463940.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1463940.1 Non Chatacterized Hit- tr|C5YFR4|C5YFR4_SORBI
Putative uncharacterized protein Sb06g015250
OS=Sorghu,46.43,4e-17,seg,NULL,CUFF.59664.1
(386 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G49100.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 120 2e-27
AT3G06868.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 95 1e-19
>AT5G49100.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G06868.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:19897700-19898890 REVERSE LENGTH=396
Length = 396
Score = 120 bits (301), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/164 (45%), Positives = 91/164 (55%), Gaps = 14/164 (8%)
Query: 235 DRKVXXXXXXXXXXXXXXXDFFERISTGLGDCTLRRVESQREGSKSK----VGSASHGMK 290
+RKV DFFERI+ G GDCTLRRVESQREG+ +K + S+G++
Sbjct: 235 ERKVSRSRSVGCGSRSFSGDFFERITNGFGDCTLRRVESQREGNNNKGNKVSSNPSNGVR 294
Query: 291 ERVRCGGLFGGFMMMT--XXXXXXXXXXXXXXXXXXADDGINGDSLSHGKGRSWGWTLAS 348
E VRCGG+FGGFM+MT N G+ RSWGW+ AS
Sbjct: 295 EMVRCGGIFGGFMIMTSSSSSSSSSSWVSSSSAEHHHHHNHNMGHGGGGRNRSWGWSFAS 354
Query: 349 PMRAFSSKNSSKDNKRGSSV------KNPTPNLSAIPSLLTVRS 386
PMRAF+S SS KRG ++ KN TPNL AIPSLL+VRS
Sbjct: 355 PMRAFTS--SSYSGKRGRTISDSTTSKNTTPNLGAIPSLLSVRS 396
Score = 59.7 bits (143), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 24/30 (80%), Positives = 26/30 (86%)
Query: 21 NGTQCINHPHRNSPGGICAFCLQEKLGKLV 50
+G QCINHP +PGGICAFCLQEKLGKLV
Sbjct: 11 DGMQCINHPFTKNPGGICAFCLQEKLGKLV 40
>AT3G06868.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G49100.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr3:2167872-2168981
FORWARD LENGTH=369
Length = 369
Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 60/145 (41%), Positives = 73/145 (50%), Gaps = 28/145 (19%)
Query: 254 DFFERISTGLGDCTLRRVESQREGSKSKV----GSASHGMKERVRCGGLFGGFMMMTXXX 309
DFFERIS G GDC LRR+ESQRE +K G A+ M E V+CGG+FGGFM+MT
Sbjct: 241 DFFERISNGFGDCALRRIESQREATKVISNGGGGEAADAMSEMVKCGGIFGGFMIMTSSS 300
Query: 310 XXXXXXXXXXXXXXXADDGINGDSLSHGKGRSWGWTLASPMRAFSSKN--------SSKD 361
G R+WGW ASPMRA ++ S+ D
Sbjct: 301 TTSSTTSSTVDHHHNHKMG----------NRNWGWAFASPMRAKATATHRGRTITESTAD 350
Query: 362 NKRGSSVKNPTPNLSAIPSLLTVRS 386
NK SS NL +IPSLL ++S
Sbjct: 351 NKNTSS------NLDSIPSLLALKS 369
Score = 57.8 bits (138), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 54/146 (36%), Gaps = 23/146 (15%)
Query: 22 GTQCINHPHRNSPGGICAFCLQEKLGKLVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 81
G QCI HP+ +PGGICA CLQEKLGKLV
Sbjct: 13 GMQCITHPYTKNPGGICALCLQEKLGKLV----------TSSFPVPKPNHLSSSSPKSFT 62
Query: 82 XXXXXXXXXXQKNHSHHDHVHASRLPFLLAXXXXXXXXX-------XXXXXXXDVVFKRS 134
+ D + + LPFLLA ++++KRS
Sbjct: 63 PSTTSLALSLSSASNGRDSTNNNNLPFLLAKKKKNMLAASSSSSSSSSSSSSANLIYKRS 122
Query: 135 KSTATPRMGKFLNEEDFSPRKKNKFW 160
KSTA E FS RK++ FW
Sbjct: 123 KSTAAAY------GESFSQRKRSGFW 142