Miyakogusa Predicted Gene
- Lj0g3v0292149.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0292149.1 Non Chatacterized Hit- tr|K3Z0N2|K3Z0N2_SETIT
Uncharacterized protein (Fragment) OS=Setaria italica
,44.53,1e-17,seg,NULL,CUFF.19525.1
(182 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G66800.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 106 8e-24
AT3G50640.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 86 2e-17
AT3G19200.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 62 3e-10
AT4G34419.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 59 1e-09
AT3G12970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 50 1e-06
>AT5G66800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G50640.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:26671433-26672116
FORWARD LENGTH=183
Length = 183
Score = 106 bits (265), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 105/201 (52%), Gaps = 43/201 (21%)
Query: 1 MACLDMYNNSEHKGXXXXXXXXXXPVSPRISFGNDFVDAK-QASKQERSF----RSDASV 55
MACL+MYN++ G P+SPRISF NDFV+ + + +K RS + +S
Sbjct: 1 MACLEMYNSNGGGGT---------PMSPRISFSNDFVEIRPETTKTTRSSPLSKQEGSSS 51
Query: 56 PVSSDFEFSVSNYNMMSADELFSKGRLLPFKDGGCNNQVQRPTTTLREELMVD------- 108
S +FEFSVSNY MM ADELFSKG+LLPFK+ NQVQR +
Sbjct: 52 SFSDNFEFSVSNYTMMPADELFSKGKLLPFKE---TNQVQRTLREELLVEEDEEEGPRDA 108
Query: 109 DDEFSLRPP--------KGSSSTRWKGFLGLRKSHIGSKKVDKSEGSSNRGVQAARSGLS 160
+ FSL+PP SS RWKG LGL+++H+GSK N + ++
Sbjct: 109 TNIFSLKPPIFSSSSSSSSSSKGRWKGLLGLKRAHVGSK---------NNEERFVHHMIN 159
Query: 161 NMTSQDLLIEG--GSSCRDVE 179
N I G GSSCR+++
Sbjct: 160 NNKQSQEAIGGREGSSCREMK 180
>AT3G50640.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G66800.1); Has 67 Blast hits to 67 proteins in
10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:18804027-18804639 REVERSE
LENGTH=166
Length = 166
Score = 85.5 bits (210), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 76/130 (58%), Gaps = 17/130 (13%)
Query: 27 SPRISFGNDFVDAKQASKQERS----FRSDASVPVSSDFEFSVSNYNMMSADELFSKGRL 82
+ RISF N+FV+ + +S RS S+P S+DF FSV++Y+M+ ADE+F KG++
Sbjct: 13 AARISFSNEFVEIRSEKSNAKSNNINSRSSFSMP-SADFAFSVTDYSMIPADEIFLKGKI 71
Query: 83 LPFKDGGCNNQVQRPTTTLREEL----MVDDDEFSLRPPKGSSST-----RWKGFLGLRK 133
LPFK+ + V R MVD + FSLRP SSS+ W+ LGL++
Sbjct: 72 LPFKE---TSHVHRTLGEELLTEEEGSMVDGNTFSLRPILLSSSSFSTKGTWRELLGLKR 128
Query: 134 SHIGSKKVDK 143
+H+ SKK DK
Sbjct: 129 THVRSKKTDK 138
>AT3G19200.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G34419.1); Has 51 Blast hits to 51 proteins in
10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 51; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:6649478-6650006 REVERSE
LENGTH=143
Length = 143
Score = 61.6 bits (148), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 64/115 (55%), Gaps = 18/115 (15%)
Query: 26 VSPRISFGNDFVDAKQASKQERSFRSDASV-----PVSSDFEFSVSNYNMMS-ADELFSK 79
V RISF NDF D+ SK S R+D + PVSSDF+F+V N+ S ADE+F
Sbjct: 15 VDARISFSNDFADSD--SKHTAS-RADGQMKYKEAPVSSDFKFNVENFGFTSAADEIFFG 71
Query: 80 GRLLPFKDGGCNNQVQRPTTTLREELMVDDDEFSLRPPKGSSSTRWKGFLGLRKS 134
G LLP + QR TTLR+EL D + ++ KGS + WK LGL KS
Sbjct: 72 GVLLPLE-----KTTQRKVTTLRDELSAQDSDRTI-SSKGSRNW-WK--LGLNKS 117
>AT4G34419.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G19200.1). | chr4:16456597-16456986 FORWARD
LENGTH=129
Length = 129
Score = 59.3 bits (142), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 60/105 (57%), Gaps = 11/105 (10%)
Query: 28 PRISFGNDFVDAKQASKQERSFRSDASVPVSSD-FEFSVSNYNMMSADELFSKGRLLPFK 86
PRISF + F A+K E +A PVSSD FEF V N++M +ADE+F G +LP K
Sbjct: 18 PRISFSSGFA----ATKHEMIKYKEA--PVSSDDFEFGVENFSMTTADEIFFDGMILPLK 71
Query: 87 DGGCNNQVQRPTTTLREELMVDDDEFSLRPPKGSSSTRWKGFLGL 131
+ N +R +TLREEL +D + KGSS W+ LGL
Sbjct: 72 EE--VNTTKR-MSTLREELSEEDGDSPRSKSKGSSGW-WRERLGL 112
>AT3G12970.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G56020.1); Has 2408 Blast hits to 418 proteins
in 91 species: Archae - 0; Bacteria - 41; Metazoa - 198;
Fungi - 63; Plants - 125; Viruses - 13; Other Eukaryotes
- 1968 (source: NCBI BLink). | chr3:4141329-4142474
REVERSE LENGTH=381
Length = 381
Score = 49.7 bits (117), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 16/88 (18%)
Query: 60 DFEFSVSN-YNMMSADELFSKGRLLPFKDGGCNNQVQRPTTTL---------REEL---- 105
DFEF + + M+SADELFS G+L+P K G ++P T++ R E+
Sbjct: 43 DFEFLLEDPVTMLSADELFSDGKLVPLKFSGVTYPEEKPITSVVHTAVKPCRRLEMEISG 102
Query: 106 MVDDDEFSLRPPKGSSSTRWKGFLGLRK 133
+VD FS R P+ + RW+ LGL++
Sbjct: 103 VVDPYLFSPRAPR--CTVRWRELLGLKR 128