Miyakogusa Predicted Gene
- Lj5g3v0528950.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v0528950.1 Non Chatacterized Hit- tr|I3T0N3|I3T0N3_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.89,0,SUBFAMILY
NOT NAMED,NULL; FAMILY NOT NAMED,NULL,CUFF.53214.1
(360 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 612 e-175
AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 545 e-155
AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 495 e-140
AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 495 e-140
AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 495 e-140
AT3G01720.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 72 7e-13
>AT5G25265.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane,
membrane; EXPRESSED IN: cultured cell, leaf; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:8754794-8756855 REVERSE LENGTH=366
Length = 366
Score = 612 bits (1577), Expect = e-175, Method: Compositional matrix adjust.
Identities = 299/366 (81%), Positives = 323/366 (88%), Gaps = 6/366 (1%)
Query: 1 MGCSGNLFFTILITFSVALITYNIIISGNAPLRQDFPGPSRRPTITIDPIIEMPL----R 56
MGC G LF+ +LIT SVALITYNIIIS NAPL+Q FPG S I+IDP+IE+P R
Sbjct: 1 MGCGGTLFYPLLITLSVALITYNIIISANAPLKQGFPGRSSSSDISIDPVIELPRGGGSR 60
Query: 57 RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQAD--PDSSMGGFTRILHSGKPDA 114
+ RLFHTAVTASDSVYNTWQCRVMY+WFKK QA P S MGGFTRILHSGKPD
Sbjct: 61 NNDGKRIRLFHTAVTASDSVYNTWQCRVMYYWFKKIQASAGPGSEMGGFTRILHSGKPDQ 120
Query: 115 FMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNL 174
+MDEIPTFVAQPLPSGMDQGY+VLNRPWAFVQWLQQ DIKEDYILMSEPDHIIVKPIPNL
Sbjct: 121 YMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHIIVKPIPNL 180
Query: 175 AKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKESLKKIAPTWM 234
AKDG+GAAFPFFYIEPKKYE VLRKY+PE GPVTNIDPIGNSPVIVGK++LKKIAPTWM
Sbjct: 181 AKDGLGAAFPFFYIEPKKYEKVLRKYYPEVRGPVTNIDPIGNSPVIVGKDALKKIAPTWM 240
Query: 235 NVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKEIGKSYIIHY 294
NVSLAMKKDPE DKAFGWVLEMYAYAV+SALHGV NIL+KDFMIQPPWD E+G YIIHY
Sbjct: 241 NVSLAMKKDPEADKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDIEVGDKYIIHY 300
Query: 295 TYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLVKMVNEATAS 354
TYGCDY+MKG+LTYGKIGEWRFDKRSYD PP+NLT+PPPGV +SVVTLVKM+NEATA+
Sbjct: 301 TYGCDYDMKGKLTYGKIGEWRFDKRSYDSKPPPRNLTMPPPGVSQSVVTLVKMINEATAN 360
Query: 355 IPNWYS 360
IPNW S
Sbjct: 361 IPNWGS 366
>AT2G25260.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT5G25265.1); Has 35333 Blast hits
to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr2:10755617-10757766 REVERSE LENGTH=358
Length = 358
Score = 545 bits (1405), Expect = e-155, Method: Compositional matrix adjust.
Identities = 261/361 (72%), Positives = 304/361 (84%), Gaps = 4/361 (1%)
Query: 1 MGCSGNLFFTILITFSVALIT-YNIIISGNAPLRQDFPGPSRRPTITIDPIIEMPLRRHS 59
MG G FF IL+T S+ LI YN I+S + PLRQ+ PG RR + D I ++ S
Sbjct: 1 MGFRGKYFFPILMTLSLFLIIRYNYIVSDDPPLRQELPG--RRSASSGDDITYT-VKTPS 57
Query: 60 SSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTRILHSGKPDAFMDEI 119
+KRLFHTAVTA+DSVY+TWQCRVMY+W+ +F+ +P S MGG+TRILHSG+PD MDEI
Sbjct: 58 KKTKRLFHTAVTATDSVYSTWQCRVMYYWYNRFRDEPGSDMGGYTRILHSGRPDGLMDEI 117
Query: 120 PTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNLAKDGM 179
PTFVA PLPSG+D+GY+VLNRPWAFVQWLQQA I+EDYILM+EPDHIIVKPIPNLA+ +
Sbjct: 118 PTFVADPLPSGVDKGYVVLNRPWAFVQWLQQAHIEEDYILMAEPDHIIVKPIPNLARGNL 177
Query: 180 GAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKESLKKIAPTWMNVSLA 239
AAFPFFYIEPKKYE+VLRK+FP+ENGP++ IDPIGNSPVIV K +L KIAPTWMNVSLA
Sbjct: 178 AAAFPFFYIEPKKYESVLRKFFPKENGPISRIDPIGNSPVIVTKNALMKIAPTWMNVSLA 237
Query: 240 MKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKEIGKSYIIHYTYGCD 299
MK DP+TDKAFGWVLEMYAYAV+SALHGV NIL+KDFMIQPPWD E K++IIHYTYGCD
Sbjct: 238 MKNDPQTDKAFGWVLEMYAYAVSSALHGVSNILHKDFMIQPPWDTETKKTFIIHYTYGCD 297
Query: 300 YNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLVKMVNEATASIPNWY 359
++MKG++ GKIGEWRFDKRSY PP+NLTLPP GVPESVVTLV M+NEATA+IPNW
Sbjct: 298 FDMKGKMMVGKIGEWRFDKRSYGDKPPPRNLTLPPRGVPESVVTLVTMINEATANIPNWE 357
Query: 360 S 360
S
Sbjct: 358 S 358
>AT5G13500.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 495 bits (1274), Expect = e-140, Method: Compositional matrix adjust.
Identities = 228/313 (72%), Positives = 266/313 (84%), Gaps = 1/313 (0%)
Query: 47 IDPIIEMPLR-RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTR 105
+DP+++MPL R + SS FH A+TA+D+ YN WQCR+MY+W+K+ +A P S MGGFTR
Sbjct: 43 LDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTR 102
Query: 106 ILHSGKPDAFMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDH 165
ILHSG D MDEIPTFV PLP G+D+GY+VLNRPWAFVQWL++A IKEDY+LM+EPDH
Sbjct: 103 ILHSGNSDNLMDEIPTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDH 162
Query: 166 IIVKPIPNLAKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKES 225
+ V P+PNLA G AAFPFFYI P+KYE ++RKY+P E GPVTNIDPIGNSPVI+ KES
Sbjct: 163 VFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKES 222
Query: 226 LKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKE 285
L+KIAPTWMNVSL MK DPETDKAFGWVLEMY YA+ASA+HGVR+IL KDFM+QPPWD
Sbjct: 223 LEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLS 282
Query: 286 IGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLV 345
+IIHYTYGCDYNMKGELTYGKIGEWRFDKRS+ PP+N++LPPPGVPESVVTLV
Sbjct: 283 TKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLV 342
Query: 346 KMVNEATASIPNW 358
KMVNEATA+IPNW
Sbjct: 343 KMVNEATATIPNW 355
>AT5G13500.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 228 Blast hits to 200 proteins in 21 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
| chr5:4338676-4340827 FORWARD LENGTH=358
Length = 358
Score = 495 bits (1274), Expect = e-140, Method: Compositional matrix adjust.
Identities = 228/313 (72%), Positives = 266/313 (84%), Gaps = 1/313 (0%)
Query: 47 IDPIIEMPLR-RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTR 105
+DP+++MPL R + SS FH A+TA+D+ YN WQCR+MY+W+K+ +A P S MGGFTR
Sbjct: 43 LDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTR 102
Query: 106 ILHSGKPDAFMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDH 165
ILHSG D MDEIPTFV PLP G+D+GY+VLNRPWAFVQWL++A IKEDY+LM+EPDH
Sbjct: 103 ILHSGNSDNLMDEIPTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDH 162
Query: 166 IIVKPIPNLAKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKES 225
+ V P+PNLA G AAFPFFYI P+KYE ++RKY+P E GPVTNIDPIGNSPVI+ KES
Sbjct: 163 VFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKES 222
Query: 226 LKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKE 285
L+KIAPTWMNVSL MK DPETDKAFGWVLEMY YA+ASA+HGVR+IL KDFM+QPPWD
Sbjct: 223 LEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLS 282
Query: 286 IGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLV 345
+IIHYTYGCDYNMKGELTYGKIGEWRFDKRS+ PP+N++LPPPGVPESVVTLV
Sbjct: 283 TKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLV 342
Query: 346 KMVNEATASIPNW 358
KMVNEATA+IPNW
Sbjct: 343 KMVNEATATIPNW 355
>AT5G13500.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:4338676-4340827 FORWARD
LENGTH=358
Length = 358
Score = 495 bits (1274), Expect = e-140, Method: Compositional matrix adjust.
Identities = 228/313 (72%), Positives = 266/313 (84%), Gaps = 1/313 (0%)
Query: 47 IDPIIEMPLR-RHSSSSKRLFHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTR 105
+DP+++MPL R + SS FH A+TA+D+ YN WQCR+MY+W+K+ +A P S MGGFTR
Sbjct: 43 LDPVVQMPLNIRKAKSSPAPFHVALTATDAPYNKWQCRIMYYWYKQKKALPGSDMGGFTR 102
Query: 106 ILHSGKPDAFMDEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDH 165
ILHSG D MDEIPTFV PLP G+D+GY+VLNRPWAFVQWL++A IKEDY+LM+EPDH
Sbjct: 103 ILHSGNSDNLMDEIPTFVVDPLPPGLDRGYVVLNRPWAFVQWLERATIKEDYVLMAEPDH 162
Query: 166 IIVKPIPNLAKDGMGAAFPFFYIEPKKYETVLRKYFPEENGPVTNIDPIGNSPVIVGKES 225
+ V P+PNLA G AAFPFFYI P+KYE ++RKY+P E GPVTNIDPIGNSPVI+ KES
Sbjct: 163 VFVNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNIDPIGNSPVIISKES 222
Query: 226 LKKIAPTWMNVSLAMKKDPETDKAFGWVLEMYAYAVASALHGVRNILYKDFMIQPPWDKE 285
L+KIAPTWMNVSL MK DPETDKAFGWVLEMY YA+ASA+HGVR+IL KDFM+QPPWD
Sbjct: 223 LEKIAPTWMNVSLTMKNDPETDKAFGWVLEMYGYAIASAIHGVRHILRKDFMLQPPWDLS 282
Query: 286 IGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSYDHVAPPKNLTLPPPGVPESVVTLV 345
+IIHYTYGCDYNMKGELTYGKIGEWRFDKRS+ PP+N++LPPPGVPESVVTLV
Sbjct: 283 TKGKFIIHYTYGCDYNMKGELTYGKIGEWRFDKRSHLRGPPPRNMSLPPPGVPESVVTLV 342
Query: 346 KMVNEATASIPNW 358
KMVNEATA+IPNW
Sbjct: 343 KMVNEATATIPNW 355
>AT3G01720.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G25265.1);
Has 374 Blast hits to 211 proteins in 23 species: Archae
- 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316;
Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink).
| chr3:262412-265608 REVERSE LENGTH=802
Length = 802
Score = 71.6 bits (174), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 114/277 (41%), Gaps = 46/277 (16%)
Query: 66 FHTAVTASDSVYNTWQCRVMYHWFKKFQADPDSSMGGFTRIL---------HSGKPDAFM 116
HT + + Y WQ H F++ G TR+L + G A
Sbjct: 394 IHTLFSTECTTYFDWQTVGFMHSFRQ-----SGQPGNITRLLSCTDEALKNYKGHDLAPT 448
Query: 117 DEIPTFVAQPLPSGMDQGYIVLNRPWAFVQWLQQADIKEDYILMSEPDHIIVKPIPNL-- 174
+P+ PL Y +N+P A V WL +I +Y+++ + D I+ PI
Sbjct: 449 HYVPSMSRHPLTGDW---YPAINKPAAVVHWLHHTNIDAEYVVILDADMILRGPITPWEF 505
Query: 175 -AKDGMGAAFPFFYIEPKKYETV-LRKYFPEENGPVTNIDPIGNSPVIVGKESLKKIAPT 232
A G + P+ Y+ + L PE V + +I+ E L+K A
Sbjct: 506 KAARGRPVSTPYDYLIGCDNDLARLHTRNPEACDKVGGV-------IIMHIEDLRKFAMY 558
Query: 233 WMNVSLAMKKDPE------TDKAF--GWVLEMYAYAVASALHGVRNILYKDFMIQPPWDK 284
W+ + ++ D E T + GW+ EMY Y+ +A +R+ + K+ MI P +
Sbjct: 559 WLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPGYVP 618
Query: 285 EIGKSYIIHYTYGCDYNMKGELTYGKIGEWRFDKRSY 321
E G Y + + YG ++ K+G W FDK ++
Sbjct: 619 EPGADYRV-FHYGLEF---------KVGNWSFDKANW 645