Miyakogusa Predicted Gene
- Lj5g3v2029570.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v2029570.1 Non Characterized Hit- tr|F4JSL7|F4JSL7_ARATH
Uncharacterized protein OS=Arabidopsis thaliana
GN=At4,36.1,6e-18,DUF4378,Domain of unknown function DUF4378;
VARLMGL,NULL; seg,NULL,CUFF.56443.1
(674 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                  Score      E
Sequences producing significant alignments:                       (bits)   Value

AT5G51850.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   134   2e-31
AT5G62170.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   113   4e-25
AT4G25430.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    89   1e-17
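
Note: the summary rows above can be read programmatically. The following is a minimal sketch, assuming each row starts with the subject ID and ends with the bit score followed by the E-value, as in this report. The names `HIT_LINE` and `parse_hit_line` are hypothetical and not part of BLAST.

```python
import re

# Assumed row layout: <subject id> <free-text description> <bits> <E-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s.*\s"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)


def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the row does not match."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    return (
        match.group("id"),
        float(match.group("bits")),
        float(match.group("evalue")),
    )


row = "AT5G51850.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 134 2e-31"
print(parse_hit_line(row))
```

The pattern does not handle E-values written without a leading digit (for example `e-100`), which some BLAST versions emit for very strong hits.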
>AT5G51850.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G62170.1); Has 384 Blast hits to 375 proteins
in 79 species: Archae - 0; Bacteria - 14; Metazoa - 135;
Fungi - 31; Plants - 92; Viruses - 0; Other Eukaryotes -
112 (source: NCBI BLink). | chr5:21079419-21081478
FORWARD LENGTH=590
Length = 590
Score = 134 bits (338), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 157/501 (31%), Positives = 221/501 (44%), Gaps = 124/501 (24%)
Query: 122 PGTKTPTLVARLMGLDLLPDXXXXXXXXXXCLSTPNPHYHQNRTRQHIQIIKHRNSTDNV 181
PG+KTP LVARLMGLDLLPD L T + H+ I HR S
Sbjct: 115 PGSKTPNLVARLMGLDLLPDKTDLNHSLSD-LHTMSSHH----------ITSHRLSKKG- 162
Query: 182 STRSLPETPRISSARRSDVVEHRLSLQINKENMGLGEDLEGPRFSFSKRKYDENSSRSPS 241
TRSLP +PRISSAR+SD HRLSLQ+N+E F S+ K D+ S SP
Sbjct: 163 -TRSLPVSPRISSARKSDFDIHRLSLQLNREK----------EFGRSRLKEDQEESHSPR 211
Query: 242 HYARQIVKQVKESV--SRKVGLDITNTVKNREQGREDVVNQSKFKKPTKISVKPLDETSP 299
YARQIVKQ+KE V R VG+DITN+VKNRE + ++ T +S P S
Sbjct: 212 DYARQIVKQIKERVVTRRVVGMDITNSVKNRE-----ARPSHELRRDTTVSCSPRTRFSE 266
Query: 300 GKHSNQSYSPRPSRFMDTKHKPN---TTKPSPIAPNYQNTKPSPSPPPMVNIEAELSRVL 356
K + QS T HKPN +++P PI KP P+P +L
Sbjct: 267 -KENKQS----------TSHKPNSSSSSRPEPII-----QKPKPTPV-----------IL 299
Query: 357 TKPKPQALLQKELNNPKSVQKHKKPTQIRN--KP-PQTSIRNKQEESFITRSPSNTRAND 413
+ + Q +++ P ++ K K T+ R KP P + IRN++ E+F++ S D
Sbjct: 300 GEKQSQNRVKQRQLKPINLCK-KAETETRRPIKPSPTSDIRNRKRETFLSDS------RD 352
Query: 414 IKTKSKRTHXXXXXXXXXXXXXXXXXXXXKSNPSPPTIKIPQIQVKTQTQESDDIQEAKS 473
+K K H + PP +I + + + + E+ I+
Sbjct: 353 VKAKP--LHKIKKFKKIPKSNDLENISATR----PPHQQINERE-RLISNEAASIR---- 401
Query: 474 STQLFSSLRQSTLCTRGRTNDEDKANGVYTATGAGDEGPEYQYITTLLSRTGVHKATSLP 533
S+ + + + S R D+ A + E YI +++ G+
Sbjct: 402 SSSMHKTEKNSPQVARNHKFDD----------AATEINSEQDYIIRIMNLAGIKS----- 446
Query: 534 HHHFQWFSSTHPLDPLLFHRLEQH--YPLSNSFASSIESYRDCKFRQKNHLGPRCNRRLM 591
S LD +F +LE YP S + A LG CNRRL+
Sbjct: 447 -------DSQAMLDLSIFRKLEHFGDYP-SGTLA----------------LG--CNRRLL 480
Query: 592 FDLVDELLSEILVRPKGKGQG 612
FDLV+E+L E + + +G QG
Sbjct: 481 FDLVNEILIETVAKRRGNYQG 501
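
Note: the percentages on the "Identities" line appear to be the count divided by the alignment length, truncated to a whole number (for example, 124/501 is 24.75% and is shown as 24%). A minimal sketch under that assumption follows; `parse_stats_line` is a hypothetical helper name.

```python
import re

# Matches fragments such as "Identities = 157/501 (31%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def truncated_percent(count, length):
    """Integer percentage, rounded down (assumed to mirror the report)."""
    return (100 * count) // length


def parse_stats_line(line):
    """Map each statistic name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


line = "Identities = 157/501 (31%), Positives = 221/501 (44%), Gaps = 124/501 (24%)"
for name, (count, length, reported) in parse_stats_line(line).items():
    print(name, truncated_percent(count, length), reported)
```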
>AT5G62170.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G51850.1); Has 381 Blast hits to 359 proteins
in 81 species: Archae - 0; Bacteria - 16; Metazoa - 101;
Fungi - 21; Plants - 99; Viruses - 3; Other Eukaryotes -
141 (source: NCBI BLink). | chr5:24973115-24975475
REVERSE LENGTH=703
Length = 703
Score = 113 bits (283), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 133/372 (35%), Positives = 179/372 (48%), Gaps = 59/372 (15%)
Query: 61 IPNHHNTVPKGAEAPRNSLES--EDGTVTSISKEENFKIP---KIRTSGSTRGXXXXXXX 115
I +HH +PKG +APRNSLES E+ + + K+ N I KI+T R
Sbjct: 61 INHHHLHLPKGVDAPRNSLESTEEETSFSPTRKDGNLNISMGIKIKTKPQARSSSASLTP 120
Query: 116 XXXXXXPGTKTPTLVARLMGLDLLPDXXXXXXXXXXC-------LSTPNPHYHQNRTRQH 168
P KTPTLVARLMGLDL+PD L TP H ++H
Sbjct: 121 TETYS-PSIKTPTLVARLMGLDLVPDNYRSSPTPSSSSSSTLIDLKTPTRSSH---AKKH 176
Query: 169 IQIIKHRNSTDNVSTRSLPETPRISSARRS---DVVEH-RLSLQINKENMGLGEDLEGP- 223
RNS D TRSLPETPRIS RRS + EH R SL + N+ + + E
Sbjct: 177 RHYSLQRNSVDG-GTRSLPETPRISLGRRSVDVNCYEHQRSSLHLRDNNINVFPERESGI 235
Query: 224 ---RFSFSKRKYDENSSRSPSHYARQIVKQVKESVS--RKVGLDITNTVKNREQGREDVV 278
R + K +++ +RSP YARQIV Q+KE+VS R++G DITN Q RE V
Sbjct: 236 NNVRLTRVKEIHEDKENRSPREYARQIVMQLKENVSRRRRMGTDITN---KETQPRE--V 290
Query: 279 NQSKFKKPTKISVKPLDETSPGKHSNQSYSPRPSRFMDTKHKPNTTKPSPIAPNYQNTKP 338
++SK K +K ++ D +S SPR + P TKP+ + N +K
Sbjct: 291 HESK-KASSKTTIITHDVSS---------SPR----LGLTEVPK-TKPTSLQTNNVASKI 335
Query: 339 SPSPPPMVNIEAELSRVLTKPKPQALLQKELNNPKSVQKHKKPTQIRN---KPPQTSIRN 395
+ V + L V +P+ +KE KS +K KKP ++ KPPQ+
Sbjct: 336 LETTAMKVQDKTRLPTVHEEPQGT---EKE-KQRKSTKKCKKPENFKSRLVKPPQS---- 387
Query: 396 KQEESFITRSPS 407
QEE F+ RSP+
Sbjct: 388 MQEEPFV-RSPA 398
Score = 90.1 bits (222), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 86/148 (58%), Gaps = 29/148 (19%)
Query: 511 GPEYQYITTLLSRTGVHKATSLPHHHFQWFSSTHPLDPLLFHRLEQHYPLSNSFASSIES 570
G E +YIT L RTG+ + T P + +WFS +HPLDP +F+ LE H+ ++++
Sbjct: 486 GGELEYITRTLRRTGIDRDT--PISYAKWFSPSHPLDPSIFYFLE-HFAVTST------- 535
Query: 571 YRDCKFRQKN---HLGPRCNRRLMFDLVDELLSEIL-----VRP-KGKGQGKSHRGL--- 618
R +N +L RCNR+L+F LVDE+L++IL ++P +S R L
Sbjct: 536 ------RPRNSPENLSLRCNRKLLFHLVDEILADILKPHINLKPWVCHYPIRSQRNLKGS 589
Query: 619 -LLETVWKRVRSFPRAKCEVLEDIDGLI 645
L++ + +R+ FP AKC VLEDID L+
Sbjct: 590 ELIDELSRRIERFPLAKCLVLEDIDALV 617
>AT4G25430.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G51850.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:12998571-13000211 FORWARD LENGTH=459
Length = 459
Score = 88.6 bits (218), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/212 (38%), Positives = 108/212 (50%), Gaps = 39/212 (18%)
Query: 70 KGAEAPRNSLE-SEDGTVTSISKEENFKIPK----IRTSG--STRGXXXXXXXXXXXXXP 122
KG APRNSL+ SE+ +++ N+K+ + I G ST P
Sbjct: 62 KGLVAPRNSLDLSEESPLST-----NYKLEREGLNISVGGKKSTLRGLLVDTPSHNCNLP 116
Query: 123 GTKTPTLVARLMGLDLLPDXXXXXXXXXXCLSTPNPHYHQNRTRQHIQIIKHRNSTDNVS 182
TKTP +VARLMGLDLLPD T +P +N R HR S +
Sbjct: 117 RTKTPNVVARLMGLDLLPDNLEL---------TRSP---RNGVR------GHRLSGNGSG 158
Query: 183 TRSLPETPRISSARRSDVVEHRLSLQINKENMGLGEDLEGPRFSFSKRKYDENSSRSPSH 242
TRSLP +PRIS SD HRLSL++N+EN + E R + K DE S SP +
Sbjct: 159 TRSLPASPRIS----SDSENHRLSLELNREN---NKHEEFVRTRLKELKQDEQSP-SPRY 210
Query: 243 YARQIVKQVKESV-SRKVGLDITNTVKNREQG 273
RQIVKQ K+ V +RK G+D+TN ++ + G
Sbjct: 211 SGRQIVKQTKKRVTTRKFGMDVTNLLEKKRAG 242