Miyakogusa Predicted Gene
- Lj1g3v4692840.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4692840.1 tr|G7IKD9|G7IKD9_MEDTR PIF-like protein
OS=Medicago truncatula GN=MTR_2g021280 PE=4
SV=1,34.18,8e-19,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,gene.Ljchr1_pseudomol_20120830.path1.gene10411.1
(260 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 63 2e-10
AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 63 2e-10
AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 54 7e-08
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 2e-07
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 53 2e-07
AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 50 1e-06
>AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
- 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
LENGTH=449
Length = 449
Score = 62.8 bits (151), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 60/143 (41%), Gaps = 1/143 (0%)
Query: 61 STSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXX 120
S++ K W F+ + + G T+F K+GW I NE +G
Sbjct: 161 SSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQ 220
Query: 121 XXXXXXXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLP 180
R+ W+ W ++ G + + WD +F A +E W I ENP + R K +P
Sbjct: 221 LKNHWDCTRKAWKIWCQLVGA-SSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVP 279
Query: 181 FAHDLTILYKDVVATGEHAWAPS 203
A L I++ V+ GE PS
Sbjct: 280 HADQLAIIFNGVIEPGETYTPPS 302
Score = 58.9 bits (141), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 4/123 (3%)
Query: 66 PKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXXXXXXX 125
PKA W+ + FV C+ Q G GT+F+K+GW I F E +G
Sbjct: 3 PKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHW 62
Query: 126 XXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLPFAHDL 185
R+W+ W ++ + + + W+ + F A D+ W + ENP+ + R L HDL
Sbjct: 63 DTMSRQWKIWRRLV-ETSFMNWNPESNRFRATDDDWANYLQENPDAGQYR---LSVPHDL 118
Query: 186 TIL 188
L
Sbjct: 119 KKL 121
>AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1743234-1744751
REVERSE LENGTH=449
Length = 449
Score = 62.8 bits (151), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 60/143 (41%), Gaps = 1/143 (0%)
Query: 61 STSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXX 120
S++ K W F+ + + G T+F K+GW I NE +G
Sbjct: 161 SSNPQTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQ 220
Query: 121 XXXXXXXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLP 180
R+ W+ W ++ G + + WD +F A +E W I ENP + R K +P
Sbjct: 221 LKNHWDCTRKAWKIWCQLVGA-SSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVP 279
Query: 181 FAHDLTILYKDVVATGEHAWAPS 203
A L I++ V+ GE PS
Sbjct: 280 HADQLAIIFNGVIEPGETYTPPS 302
Score = 58.9 bits (141), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 4/123 (3%)
Query: 66 PKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXXXXXXXXXX 125
PKA W+ + FV C+ Q G GT+F+K+GW I F E +G
Sbjct: 3 PKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNHW 62
Query: 126 XXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLPFAHDL 185
R+W+ W ++ + + + W+ + F A D+ W + ENP+ + R L HDL
Sbjct: 63 DTMSRQWKIWRRLV-ETSFMNWNPESNRFRATDDDWANYLQENPDAGQYR---LSVPHDL 118
Query: 186 TIL 188
L
Sbjct: 119 KKL 121
>AT3G11290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
- 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
LENGTH=460
Length = 460
Score = 54.3 bits (129), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 57/140 (40%), Gaps = 2/140 (1%)
Query: 58 SNMSTSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFNELSGRXXX 117
SN S+ + K W FV + G +++ K+ W I + N+ +G+
Sbjct: 157 SNFSSPQ-SKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFT 215
Query: 118 XXXXXXXXXXXRRKWRTWYKVCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREK 177
R+ W+ W +V G + WD F A DE W+ + EN R K
Sbjct: 216 RPQLKNHWDCTRKSWKIWCQVIGAPV-MKWDATSRTFGATDEDWKNYLKENHRAAPFRRK 274
Query: 178 GLPFAHDLTILYKDVVATGE 197
LP A L ++K ++ G+
Sbjct: 275 QLPHADKLATIFKGLIEPGK 294
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 53.1 bits (126), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 56/134 (41%), Gaps = 9/134 (6%)
Query: 60 MSTSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFN-ELSGRXXXX 118
+S + K W +++ YFV+ + Q+ G G F+K+ WI + FN SG+
Sbjct: 162 LSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKR 221
Query: 119 XXXXXXXXXXRRKWRTWYK---VCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLR 175
K +YK KE G WD + A D W+ I ++P R
Sbjct: 222 VLRHRY-----NKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYR 276
Query: 176 EKGLPFAHDLTILY 189
K LP +DL ++
Sbjct: 277 MKSLPSYNDLDTIF 290
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 52.8 bits (125), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 56/134 (41%), Gaps = 9/134 (6%)
Query: 60 MSTSKIPKASWDVQVTGYFVQACLIQVANGEHQGTNFTKKGWIGIEKQFN-ELSGRXXXX 118
+S + K W +++ YFV+ + Q+ G G F+K+ WI + FN SG+
Sbjct: 162 LSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKR 221
Query: 119 XXXXXXXXXXRRKWRTWYK---VCGKETGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLR 175
K +YK KE G WD + A D W+ I ++P R
Sbjct: 222 VLRHRY-----NKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHPLARTYR 276
Query: 176 EKGLPFAHDLTILY 189
K LP +DL ++
Sbjct: 277 MKSLPSYNDLDTIF 290
>AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
10 (source: NCBI BLink). | chr1:10598764-10599527
FORWARD LENGTH=222
Length = 222
Score = 50.4 bits (119), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 35/66 (53%)
Query: 143 TGVGWDHVKHNFDAPDEWWEKKIMENPNYEKLREKGLPFAHDLTILYKDVVATGEHAWAP 202
+G GWD F APDE W + +PN++ ++ + + DL I++ DVVATG A
Sbjct: 88 SGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHFEDLQIIFGDVVATGSFAVGM 147
Query: 203 SSGVMP 208
S P
Sbjct: 148 SDSTCP 153