Miyakogusa Predicted Gene
- Lj5g3v1206650.3
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1206650.3 CUFF.55004.3
(395 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                  Score    E
Sequences producing significant alignments:                       (bits)   Value
AT5G43745.1 | Symbols: | Protein of unknown function (DUF1012) ...   380   e-105
AT5G02940.1 | Symbols: | Protein of unknown function (DUF1012) ...   337   1e-92
AT5G49960.1 | Symbols: | unknown protein; CONTAINS InterPro DOM...    53   3e-07
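The summary table above can be read programmatically. The following is a minimal sketch, assuming each hit line begins with the sequence identifier and ends with the bit score and E-value as its last two whitespace-separated fields. The helper names `parse_evalue` and `parse_hit_line` are hypothetical and not part of BLAST. This report prints one E-value as `e-105`, with no mantissa, which Python's `float()` rejects, so the sketch prepends `1` in that case.

```python
def parse_evalue(text):
    """Convert an E-value string to float, accepting the mantissa-free form 'e-105'."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Return (sequence id, bit score, E-value) for one summary line.

    Assumption: the last two whitespace-separated fields are the
    bit score and the E-value, as in the table above.
    """
    _description, bits, evalue = line.rsplit(None, 2)
    seq_id = line.split()[0]
    return seq_id, float(bits), parse_evalue(evalue)


hits = [
    "AT5G43745.1 | Symbols: | Protein of unknown function (DUF1012) ... 380 e-105",
    "AT5G02940.1 | Symbols: | Protein of unknown function (DUF1012) ... 337 1e-92",
    "AT5G49960.1 | Symbols: | unknown protein; CONTAINS InterPro DOM... 53 3e-07",
]
for hit in hits:
    print(parse_hit_line(hit))
```

Splitting from the right keeps the parser independent of the description text, which may itself contain spaces and pipe characters.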
>AT5G43745.1 | Symbols: | Protein of unknown function (DUF1012) |
chr5:17569435-17574954 REVERSE LENGTH=817
Length = 817
Score = 380 bits (976), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/296 (63%), Positives = 218/296 (73%), Gaps = 12/296 (4%)
Query: 100 KRMIQYMSLYFILRLTRTKFVDLMIKVVQPMLQDMLQTLSAASLPLACVSNTLNKPTPLK 159
K +I + LY + R+ + K+ Q L ++Q A LP AC SN+L PTPLK
Sbjct: 84 KVVIGCIPLYAVFRIAQ--------KICQE-LPRLVQNSVGAGLPFACASNSL--PTPLK 132
Query: 160 LDVSLPSLYDIRWSLARLLYLFNIQLERNVATFFVVLLIACFSFVVIGGLLFFKLRGNKQ 219
LDVS PS DIRW LAR LYLFNIQLE+N+ TF V L+IAC SFV+IGGLLFFK R +
Sbjct: 133 LDVSFPSFQDIRWGLARFLYLFNIQLEKNIGTFLVALMIACVSFVIIGGLLFFKFRKD-L 191
Query: 220 SLEDCVWEAWACLCSSSTHLKQPTRIERVIGFLLAIWGILFYSRLLSTMTEQFRNNMQRL 279
LEDC+WEAWACL SSSTHLKQ TRIERVIGF+LAIWGILFYSRLLSTMTEQFR NM +L
Sbjct: 192 PLEDCLWEAWACLISSSTHLKQKTRIERVIGFVLAIWGILFYSRLLSTMTEQFRYNMTKL 251
Query: 280 REGAQLQVLETDHIIVCGMNSHLPFILKQLNKYHEFSVRLGTATARKQRILLMSDLPRKQ 339
REGAQ+QVLE DHII+CG+NSHLPFILKQLN YHE +VRLGTATARKQR+LLMSD PRKQ
Sbjct: 252 REGAQMQVLEADHIIICGINSHLPFILKQLNSYHEHAVRLGTATARKQRLLLMSDTPRKQ 311
Query: 340 IDRIADNIAKDLNHIDVXXXXXXXXXXXXFEXXXXXXXXXXXXLPTKGERFEVDTD 395
+D++A+ +KD NHID+ FE LPTKG+R+EVDTD
Sbjct: 312 MDKLAEAYSKDFNHIDILTKSCSLNLTKSFERAAASMARAIIILPTKGDRYEVDTD 367
>AT5G02940.1 | Symbols: | Protein of unknown function (DUF1012) |
chr5:684671-689674 REVERSE LENGTH=813
Length = 813
Score = 337 bits (863), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 178/367 (48%), Positives = 237/367 (64%), Gaps = 31/367 (8%)
Query: 34 RFMPWTKSSALHEY---GIRAHS-EGRWEVDSHRS-DVKSNSKTNKHVEENLGTESIWME 88
R + +S +LH GI++ S G ++V S R+ D + +K K +
Sbjct: 23 RLASFHRSLSLHSLPLGGIKSSSFRGTFKVKSQRTGDTEPPNKNFKDL------------ 70
Query: 89 KNKSSSQGPQAKRMIQYMSLYFILRLTRTKFVDLMIKVVQPMLQDMLQTLSAASLPLACV 148
N + K +I + LY +LR+ + F +L +++Q A LP AC
Sbjct: 71 -NSKFYKSLPYKLVIGCIPLYAVLRIAQKIFQEL---------PNLIQNSVKAGLPFACA 120
Query: 149 SNTLNKPTPLKLDVSLPSLYDIRWSLARLLYLFNIQLERNVATFFVVLLIACFSFVVIGG 208
SN ++K LK ++PS +DI+W LAR YLFN QLE+N+ T FVVLLI CFSFV+IGG
Sbjct: 121 SNAIDKHPLLK---AIPSSHDIKWGLARSSYLFNTQLEKNLGTVFVVLLITCFSFVIIGG 177
Query: 209 LLFFKLRGNKQSLEDCVWEAWACLCSSSTHLKQPTRIERVIGFLLAIWGILFYSRLLSTM 268
L FFK R + SLEDC+WEAWACL ++ THL+Q TR ER+IGF+LAIWGI+FYSRLLSTM
Sbjct: 178 LFFFKFRKD-TSLEDCLWEAWACLVNADTHLEQKTRFERLIGFVLAIWGIVFYSRLLSTM 236
Query: 269 TEQFRNNMQRLREGAQLQVLETDHIIVCGMNSHLPFILKQLNKYHEFSVRLGTATARKQR 328
TEQFR +M+++REGA +QVLE+DHII+CG+NSHLPFILKQLN Y + +VRLGT TARKQ
Sbjct: 237 TEQFRYHMKKVREGAHMQVLESDHIIICGINSHLPFILKQLNSYQQHAVRLGTTTARKQT 296
Query: 329 ILLMSDLPRKQIDRIADNIAKDLNHIDVXXXXXXXXXXXXFEXXXXXXXXXXXXLPTKGE 388
+LLMSD PRK++D++A+ AKD + +D+ FE LPTKG+
Sbjct: 297 LLLMSDTPRKEMDKLAEAYAKDFDQLDILTKSCSLNMTKSFERAAACMARAIIILPTKGD 356
Query: 389 RFEVDTD 395
R+EVDTD
Sbjct: 357 RYEVDTD 363
>AT5G49960.1 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF1012
(InterPro:IPR010420); BEST Arabidopsis thaliana protein
match is: Protein of unknown function (DUF1012)
(TAIR:AT5G02940.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:20324173-20327687
REVERSE LENGTH=824
Length = 824
Score = 53.1 bits (126), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/106 (24%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 204 VVIGGLLFFKLRGNKQSLEDCVWEAWACLCSSSTHLKQPTRIERVIGFLLAIWGILFYSR 263
+V GGL + + + +++ +W +W + S +H + R++ ++ G+L ++
Sbjct: 208 IVYGGLALYAV--SDCGVDEALWLSWTFVADSGSHADRVGVGARIVSVAISAGGMLIFAT 265
Query: 264 LLSTMTEQFRNNMQRLREGAQLQVLETDHIIVCGMNSHLPFILKQL 309
+L +++ + LR+G +VLE++HI++ G + L +LKQL
Sbjct: 266 MLGLISDAISKMVDSLRKGKS-EVLESNHILILGWSDKLGSLLKQL 310
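The Identities, Positives and Gaps figures reported for each alignment above are counts over the alignment length, followed by a whole-number percentage. In this report the printed percentages match the fraction truncated to an integer (for example, 188/296 is about 63.5% and is printed as 63%). The following is a minimal sketch that extracts the counts and recomputes the percentages; the name `parse_stats` is hypothetical, and the truncation behaviour is inferred from the values in this report rather than taken from BLAST documentation.

```python
import re

# Matches fields such as "Identities = 188/296 (63%)".
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, alignment length, reported percent)} for one statistics line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in STATS.findall(line)
    }


line = "Identities = 188/296 (63%), Positives = 218/296 (73%), Gaps = 12/296 (4%)"
for name, (count, total, pct) in parse_stats(line).items():
    # Integer division truncates, which reproduces the percentages printed in this report.
    recomputed = 100 * count // total
    print(name, count, total, pct, recomputed)
```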