Miyakogusa Predicted Gene
- Lj1g3v2927350.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2927350.1 Non Chatacterized Hit- tr|G7JNM1|G7JNM1_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,34.05,4e-18,seg,NULL; coiled-coil,NULL,CUFF.29702.1
(476 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G37440.2 | Symbols: | unknown protein; LOCATED IN: cellular_... 153 3e-37
AT4G37440.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 152 3e-37
AT3G59670.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 147 1e-35
AT3G50040.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 108 9e-24
>AT4G37440.2 | Symbols: | unknown protein; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G50040.1);
Has 121 Blast hits to 117 proteins in 32 species: Archae
- 0; Bacteria - 6; Metazoa - 13; Fungi - 5; Plants - 66;
Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink).
| chr4:17601647-17603766 FORWARD LENGTH=444
Length = 444
Score = 153 bits (386), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/269 (36%), Positives = 142/269 (52%), Gaps = 13/269 (4%)
Query: 78 IDNTECXXXXXXXDTGSGVETDS-GSAXGLTDSEGE------SSVSDDWSEPLLFRKKKL 130
+D EC +G TD S+ G TDSE E S + ++ S PL RK+KL
Sbjct: 73 VDILECNDNIEIQVSGCDDGTDGYSSSFGGTDSEHENDQEVDSMICNETSLPLWVRKRKL 132
Query: 131 TDHWRKFIRP-LMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQLEFLKFSIDDFGVKS 189
TDHWR+F++P LMWRCKWIEL K+L +QA KY+KE+ EY K+LE ++ GVK+
Sbjct: 133 TDHWRRFVQPTLMWRCKWIELKYKELQNQAQKYDKEVEEYYQAKKLELENVKSEELGVKA 192
Query: 190 V-PVSDSIYRNRVMXXXXXXXAEE-CDLSSYVSNHNIFSYYENKNCRHDAGLEDFHSDAV 247
+ P+ + R+M EE D++SY SNHN+FSYY+ + D L D ++
Sbjct: 193 LPPLPCYTQKTRLMKRKTRKRVEETADVTSYASNHNLFSYYDCRKSLADIALND---NSR 249
Query: 248 AIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIEKLQSQVGNLKTRIDNVISENP 307
+ N +E ++ L + D IL KIE +S+ NLK R+D V+SENP
Sbjct: 250 NLDKKNKSAKDETAFSEETPPLEFREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENP 309
Query: 308 GNFCYVSQLSMIEPSDGFNHSGHGSASLA 336
F + ++ + +D + S LA
Sbjct: 310 SIFPLANTVNPLGAADVYTSSEQQKPLLA 338
>AT4G37440.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G50040.1); Has 220 Blast hits to 205 proteins
in 55 species: Archae - 0; Bacteria - 15; Metazoa - 50;
Fungi - 11; Plants - 76; Viruses - 3; Other Eukaryotes -
65 (source: NCBI BLink). | chr4:17601647-17603846
FORWARD LENGTH=471
Length = 471
Score = 152 bits (385), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/269 (36%), Positives = 142/269 (52%), Gaps = 13/269 (4%)
Query: 78 IDNTECXXXXXXXDTGSGVETDS-GSAXGLTDSEGE------SSVSDDWSEPLLFRKKKL 130
+D EC +G TD S+ G TDSE E S + ++ S PL RK+KL
Sbjct: 73 VDILECNDNIEIQVSGCDDGTDGYSSSFGGTDSEHENDQEVDSMICNETSLPLWVRKRKL 132
Query: 131 TDHWRKFIRP-LMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQLEFLKFSIDDFGVKS 189
TDHWR+F++P LMWRCKWIEL K+L +QA KY+KE+ EY K+LE ++ GVK+
Sbjct: 133 TDHWRRFVQPTLMWRCKWIELKYKELQNQAQKYDKEVEEYYQAKKLELENVKSEELGVKA 192
Query: 190 V-PVSDSIYRNRVMXXXXXXXAEE-CDLSSYVSNHNIFSYYENKNCRHDAGLEDFHSDAV 247
+ P+ + R+M EE D++SY SNHN+FSYY+ + D L D ++
Sbjct: 193 LPPLPCYTQKTRLMKRKTRKRVEETADVTSYASNHNLFSYYDCRKSLADIALND---NSR 249
Query: 248 AIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIEKLQSQVGNLKTRIDNVISENP 307
+ N +E ++ L + D IL KIE +S+ NLK R+D V+SENP
Sbjct: 250 NLDKKNKSAKDETAFSEETPPLEFREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENP 309
Query: 308 GNFCYVSQLSMIEPSDGFNHSGHGSASLA 336
F + ++ + +D + S LA
Sbjct: 310 SIFPLANTVNPLGAADVYTSSEQQKPLLA 338
>AT3G59670.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G37440.2); Has 77 Blast hits to 77 proteins in
14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 73; Viruses - 0; Other Eukaryotes - 4
(source: NCBI BLink). | chr3:22040485-22042380 FORWARD
LENGTH=517
Length = 517
Score = 147 bits (372), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 141/259 (54%), Gaps = 21/259 (8%)
Query: 117 DDWSEPLLFRKKKLTDHWRKFIRPLMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQLE 176
D +S FRKK+LT+HWR+FIRPLMWR KW+EL +++L S+AL+Y KEL YD K
Sbjct: 122 DSFSSIFHFRKKRLTNHWRRFIRPLMWRSKWVELRIRELESRALEYPKELELYDQEK--- 178
Query: 177 FLKFSIDDF-------GVKSVPVSDSIYRNRVMXXXXXXXAEEC--DLSSYVSNHNIFSY 227
L+ +ID G+KS+P S+ Y+ R E D++SY++ HN+FSY
Sbjct: 179 -LEANIDPSVLESCGEGIKSLPFSNPCYKKRAAKKRRKRKKVESTDDIASYMACHNLFSY 237
Query: 228 YENKNCRHDA-GLEDFHSDAVAIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIE 286
E K D GL D DA P S D+ E + L+D S H D D ++L KIE
Sbjct: 238 IETKRLSSDGMGLADDFGDA-KDPRS--DSNEPVDLDDADSLFHHRDGDSVLEEVLWKIE 294
Query: 287 KLQSQVGNLKTRIDNVISENPGNFCYVSQLSMIEPSDGFNHSGHGSASLAGNENQLPVSF 346
+ SQV LKT++D V+S+N F LS++ S + + S GN + +
Sbjct: 295 LVHSQVHRLKTQVDVVLSKNTARFSSSENLSLLAASSAPS----PTVSAGGNGDVISFGA 350
Query: 347 IHASSQHKSELYVEDQLLA 365
I+ +SQH ++ + D + +
Sbjct: 351 IYNASQHMADYGLGDIVFS 369
>AT3G50040.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G37440.2); Has 70 Blast hits to 70 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:18549489-18551019 REVERSE
LENGTH=421
Length = 421
Score = 108 bits (270), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 128/270 (47%), Gaps = 17/270 (6%)
Query: 115 VSDDWSEPLLFRKKKLTDHWRKFIRPLMWRCKWIELHVKKLNSQALKYEKELAEYDYRKQ 174
DD +E L KKK D WR+ +P+MWRCKWIEL VK++ SQA YEKE+ +Y KQ
Sbjct: 96 TCDDGTEFLGLPKKKTNDRWRRLTKPIMWRCKWIELKVKEIQSQARGYEKEVKDYYLTKQ 155
Query: 175 LEFLKFSIDDFGVKSVPVSDSIYRNRVMXXXXXXXAEE-CDLSSYVSNHNIFSYYENK-- 231
+ K ++ F KS+P ++ R V EE D+++Y+SNHN+FSY + +
Sbjct: 156 FDLEKSKLEGFDGKSIPFRENNQRRNVFKRGRRKRVEETTDVAAYMSNHNLFSYADKRVP 215
Query: 232 -NCRHDAGLEDFHSDAVAIPMSNVDNIEELKLNDMLSSLHREDNDKSFNDILQKIEKLQS 290
N + DF + A D IE+ + ++S L + +D L KI++ Q
Sbjct: 216 VNVKGQYLDSDFGTGRKAT--GKQDAIED---DSLISEL--DCSDDVLAKFLCKIDEAQG 268
Query: 291 QVGNLKTRIDNVI-SENPGNFCYVSQLSMIEPSDGFNHSGHGSASLAGNENQLPVSFIHA 349
+ L+ R+D ++ P + + Q+ D +G A + + P++ +
Sbjct: 269 KARRLRKRVDQLMWDSQPAHTSSMPQMVAPCHRDSMIQTGKKCALV-----EAPLTHVQN 323
Query: 350 SSQHKSELYVEDQLLAKNTLSTLEANTNRP 379
Q ++E ++ + + N P
Sbjct: 324 GQQCIPADHIEHLMVPQTHIGGQCLTNNSP 353