Miyakogusa Predicted Gene
- Lj0g3v0359799.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0359799.1 Non Chatacterized Hit- tr|C5XYY7|C5XYY7_SORBI
Putative uncharacterized protein Sb04g008610
OS=Sorghu,28.48,8e-18,coiled-coil,NULL; seg,NULL,CUFF.24775.1
(457 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G58110.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 197 1e-50
AT3G58110.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 197 2e-50
AT2G42370.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 177 1e-44
>AT3G58110.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G42370.1). |
chr3:21516775-21519129 FORWARD LENGTH=754
Length = 754
Score = 197 bits (502), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 206/410 (50%), Gaps = 14/410 (3%)
Query: 25 LEEHNIELSLGQDNAXXXXXXXXXXXXXXIMMEFELSKEEEPGMWLIDQRNNVGEPFLRQ 84
+EEH +EL+LGQ+ M+ E +K+EE W + ++ G FLR+
Sbjct: 339 VEEHMLELNLGQETVSEMVSGEERGPVEGQPMDVEENKKEEDERWAWNGDSHAGSHFLRR 398
Query: 85 CRNVDVN--GMDCGLVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXFHLSPKYSNHFEGMT 142
C + D + F P + +G+
Sbjct: 399 CNHSSAREGDEDNHIEGSMEMGEDEPIEDVEEEETEEDTEKHEGGFPFFPN-GDSLQGVG 457
Query: 143 SGTGSIIHAMEAGQRPFSSGIDLHDNP-GGDFLSSRDDPPMISGS---SLFGNGH-KRDI 197
G + A G ++SG+ +H N GGDFL+SR + M GS SLFGNG+ KR+I
Sbjct: 458 QGNLMLGDASPLG---YNSGLQIHGNSIGGDFLASRGEMHMAMGSGSSSLFGNGNNKREI 514
Query: 198 GLVDNHNSHHFLNVSNKRMRSDSP-WNSKPVDFEMCMEQMEHCMGKVRMMYASKDEAFEE 256
+N ++H N NKR+R++ P W+ KP +MC++QM + K R+ +A KD E+
Sbjct: 515 EH-ENGITYHSHNPINKRLRTEEPSWDEKPPPVDMCLDQMAYWAEKARLSFAEKDREREQ 573
Query: 257 SNMNGQLLLNELQKRDDEIDRLHKAKIEESQRRQMEMYRLEKELYMMQSLVEGYRKAMKE 316
S +N Q L+NELQ + I L + K EE QR+ + +Y+LE EL MM S+VEGYRKA+K
Sbjct: 574 SVINQQYLMNELQSKTAMIQELERTKFEEQQRKDIMIYKLESELRMMTSVVEGYRKALKI 633
Query: 317 TQKAFAEYRARCP-QADEPLYKDVPGSGGLVLSVTXXXXXXXXXXXXXXXXXXXXXXXFG 375
TQKA E+R RCP + D+ +Y DV GSGGLVLS T
Sbjct: 634 TQKASREHRKRCPLRDDKQVYMDVKGSGGLVLSTTEIEKLRLKQEEEDRMQRVLAKRQID 693
Query: 376 DVELTYMGELESHMIVIESFNDRLMAMENQVKHLKEVKAKSKVSDPPECA 425
D E ++ + E HM +E N+RL+ E++VK L+E ++SK + E A
Sbjct: 694 DFEHNWLNKFEEHMEAVELLNERLIENEDEVKILRETLSESKNIETSEVA 743
>AT3G58110.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: cultured cell;
BEST Arabidopsis thaliana protein match is: unknown
protein (TAIR:AT2G42370.1); Has 2534 Blast hits to 1905
proteins in 233 species: Archae - 11; Bacteria - 102;
Metazoa - 890; Fungi - 241; Plants - 124; Viruses - 59;
Other Eukaryotes - 1107 (source: NCBI BLink). |
chr3:21516775-21519129 FORWARD LENGTH=784
Length = 784
Score = 197 bits (500), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 206/410 (50%), Gaps = 14/410 (3%)
Query: 25 LEEHNIELSLGQDNAXXXXXXXXXXXXXXIMMEFELSKEEEPGMWLIDQRNNVGEPFLRQ 84
+EEH +EL+LGQ+ M+ E +K+EE W + ++ G FLR+
Sbjct: 369 VEEHMLELNLGQETVSEMVSGEERGPVEGQPMDVEENKKEEDERWAWNGDSHAGSHFLRR 428
Query: 85 CRNVDVN--GMDCGLVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXFHLSPKYSNHFEGMT 142
C + D + F P + +G+
Sbjct: 429 CNHSSAREGDEDNHIEGSMEMGEDEPIEDVEEEETEEDTEKHEGGFPFFPN-GDSLQGVG 487
Query: 143 SGTGSIIHAMEAGQRPFSSGIDLHDNP-GGDFLSSRDDPPMISGS---SLFGNGH-KRDI 197
G + A G ++SG+ +H N GGDFL+SR + M GS SLFGNG+ KR+I
Sbjct: 488 QGNLMLGDASPLG---YNSGLQIHGNSIGGDFLASRGEMHMAMGSGSSSLFGNGNNKREI 544
Query: 198 GLVDNHNSHHFLNVSNKRMRSDSP-WNSKPVDFEMCMEQMEHCMGKVRMMYASKDEAFEE 256
+N ++H N NKR+R++ P W+ KP +MC++QM + K R+ +A KD E+
Sbjct: 545 EH-ENGITYHSHNPINKRLRTEEPSWDEKPPPVDMCLDQMAYWAEKARLSFAEKDREREQ 603
Query: 257 SNMNGQLLLNELQKRDDEIDRLHKAKIEESQRRQMEMYRLEKELYMMQSLVEGYRKAMKE 316
S +N Q L+NELQ + I L + K EE QR+ + +Y+LE EL MM S+VEGYRKA+K
Sbjct: 604 SVINQQYLMNELQSKTAMIQELERTKFEEQQRKDIMIYKLESELRMMTSVVEGYRKALKI 663
Query: 317 TQKAFAEYRARCP-QADEPLYKDVPGSGGLVLSVTXXXXXXXXXXXXXXXXXXXXXXXFG 375
TQKA E+R RCP + D+ +Y DV GSGGLVLS T
Sbjct: 664 TQKASREHRKRCPLRDDKQVYMDVKGSGGLVLSTTEIEKLRLKQEEEDRMQRVLAKRQID 723
Query: 376 DVELTYMGELESHMIVIESFNDRLMAMENQVKHLKEVKAKSKVSDPPECA 425
D E ++ + E HM +E N+RL+ E++VK L+E ++SK + E A
Sbjct: 724 DFEHNWLNKFEEHMEAVELLNERLIENEDEVKILRETLSESKNIETSEVA 773
>AT2G42370.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G58110.2); Has 205 Blast hits to 191 proteins
in 60 species: Archae - 3; Bacteria - 23; Metazoa - 73;
Fungi - 8; Plants - 34; Viruses - 0; Other Eukaryotes -
64 (source: NCBI BLink). | chr2:17643334-17645533
FORWARD LENGTH=715
Length = 715
Score = 177 bits (449), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/256 (39%), Positives = 153/256 (59%), Gaps = 4/256 (1%)
Query: 159 FSSGIDLHDNPGGDFLSSRDDPPMISGSSLFGNGHKRDIGLVDNHNSHHFLN-VSNKRMR 217
++SG+ +H + DFL+ R M+ G S FGN +KR+ G +N S+HF N S KR++
Sbjct: 445 YNSGLQVHGSSTCDFLAPRAVMHMVPGRSHFGNDNKREFGH-ENDISYHFDNPASTKRLK 503
Query: 218 SDSPWNSKPVDFEMCMEQMEHCMGKVRMMYASKDEAFEESNMNGQLLLNELQKRDDEIDR 277
+ S W+ KPV F++CMEQ++H K ++ Y KD+A ESNM Q+L NELQ+R+D I +
Sbjct: 504 TPS-WDDKPVPFDICMEQIKHLADKAKLSYVEKDQACGESNMREQMLQNELQRREDIIQQ 562
Query: 278 LHKAKIEESQRRQMEMYRLEKELYMMQSLVEGYRKAMKETQKAFAEYRARCPQADEPLYK 337
LHK EE ++ +E+Y+LE EL MM S++ Y+KA+KE+QKA ++R CP D+P+Y
Sbjct: 563 LHKESYEELHKKNVEIYKLENELRMMTSVLAWYQKALKESQKACRKHRKVCPLLDKPIYI 622
Query: 338 DVPGSGGLVLSVTXXXXXXXXXXXXXXXXXXXXXXXFGDVELTYMGELESHM-IVIESFN 396
DV G+GGLVLS +V ++ E E ++ +E +
Sbjct: 623 DVKGTGGLVLSTAEIEKLRLKEEKEEGMRRVLIERQVKEVGSLWIKEYEVNLKKKVELLD 682
Query: 397 DRLMAMENQVKHLKEV 412
+L+ +N++K LKE
Sbjct: 683 GKLIGFQNKMKLLKET 698