Miyakogusa Predicted Gene
- Lj1g3v3904630.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3904630.1 Non Chatacterized Hit- tr|Q9FFY0|Q9FFY0_ARATH
Putative uncharacterized protein OS=Arabidopsis
thalia,29.79,9e-17,seg,NULL,CUFF.31473.1
(771 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G13950.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 360 2e-99
AT5G13950.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 360 2e-99
AT5G13950.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 360 2e-99
AT3G45830.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 81 3e-15
AT1G02290.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 80 5e-15
>AT5G13950.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G02290.1). | chr5:4496196-4500206 REVERSE
LENGTH=954
Length = 954
Score = 360 bits (925), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 201/435 (46%), Positives = 276/435 (63%), Gaps = 24/435 (5%)
Query: 1 MAADQRRKRLN-----GASSSEPDKTKRKNLELVQSDLTMKSHVFVEWDGNHQKVVAKRE 55
MAADQRRKR+N G SS E + KRK L H+ +EWD N KVV+K+E
Sbjct: 1 MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSKKE 60
Query: 56 QIGISWRQMKPFASFAHDGHNILADVLAIPEEIFGLDSLSEVLSYEVWNTLLLDSERNFL 115
Q+G+S+R ++ F N+LA V +P E F L++LSEVLS EVW + L D ERN+L
Sbjct: 61 QVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYL 120
Query: 116 MQFLPSDLKPHQVVRELLSGDNFHFGNPFVKWGSLLCSGSLHPDMIVNQEQHLKSDKRAY 175
QFLP + QVV+ LL G+NFHFGNP + WG+ +CSG HPD IV++E+ L++DKR Y
Sbjct: 121 RQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRY 180
Query: 176 YSQIHNYHKDMVGFLVKLKRRWESCKDPEKEILQMIRRS--------KNDVDKRTQPNLN 227
YS + YH+D++ +L LK +WESCKDPEK+I++M+ S + V+ R++
Sbjct: 181 YSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGSVLYNFLFKRRTVEVRSRGGNA 240
Query: 228 EFIVDDHDGNVTATSESCSWDAEEKA-HSDNQISSIGQRDEHQKRVLDKGFSKGKPRNLM 286
+ V+ +TA S S SW+ ++K SDN IS + + E Q+R G K K +N
Sbjct: 241 Q--VNGSCQGLTAASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNG 298
Query: 287 VSSDMPNVGEKPKKGDKIHKLNEYSSDGDKYMSYIKISKKQHELVKSMKLSCKSIQSHAL 346
V NVG K +K + + K + +DG KYMSY+KISKKQH++V SMK S KSIQS AL
Sbjct: 299 V-----NVGGKVRKKNVLPKDSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRAL 353
Query: 347 NHILGDLVLDNIHAQPYGVFIKEEQQKLHDHWLRLVKKDLPAAHANWAERLMQKHAVKNS 406
N I G+ +D++ QPYGVF++EEQ+KL+ HWL LV KDLPAA+A W +QK + +S
Sbjct: 354 NRIFGN--IDSLDVQPYGVFVEEEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISS 410
Query: 407 LVVEMKDKSNPLAED 421
+ E+K+K N ED
Sbjct: 411 MGRELKEKCNLWMED 425
Score = 80.9 bits (198), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 123/295 (41%), Gaps = 61/295 (20%)
Query: 504 SISEGVPFSSDGHAWQ--ASERPHYHDSSVSHDYT-ANGLSLVN-PQVNEEQQTQVISLD 559
S+ EG F S G WQ R Y YT + GLS+++ P+ +E + I L+
Sbjct: 597 SLGEGSQFCSGGDVWQPVGGIRQSYISRQA---YTPSGGLSIIHHPEGDEVAKNCFIDLE 653
Query: 560 SNLHLEDPGKEFLHRQSDDGPFSSCQSQDRIDFLQSLIKGKGVNSYHPEQ---------- 609
SN+ E ++ L R +++ F S + ++ + LQSL G+G+ S EQ
Sbjct: 654 SNMPEEVDRRKMLQRNANNS-FGSFPNNNQNEILQSLFNGQGMVSRTTEQLHPLLKVPLN 712
Query: 610 ----KCAGSSFQT-SSNVMMGDDKFSSHFKEPLQTSLTLDQGNRRASEVYMPESMSGNIY 664
+ G FQ +N MM ++FS F++ + S Q +R + + SMS NIY
Sbjct: 713 EEHKQIMGIGFQQEGNNNMMVGNQFSGQFQDQIPASHAFPQDQQRQVDAFGQGSMSDNIY 772
Query: 665 SERGRYLIPRQDALSLSKGNITNWAAKSTGMSAPPRPHVNTGDFIGHNWFSSDHQVRLGW 724
RG +L+ R D W T + +P ++TG + NW Q R W
Sbjct: 773 --RG-FLMQRPD-----------WNPNCTQIGVTAQPLLSTGPLLNQNW-----QFRSMW 813
Query: 725 NGSNNVXXXXXXXXXXXX-------------------XXXDQSLFSVLSPCSQLR 760
+N V DQSLFSVLS CSQLR
Sbjct: 814 ANTNGVGCTSQGSQTGTERDPSLLRVANNAEQIIHRGSSSDQSLFSVLSQCSQLR 868
>AT5G13950.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
LENGTH=939
Length = 939
Score = 360 bits (925), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 200/428 (46%), Positives = 272/428 (63%), Gaps = 25/428 (5%)
Query: 1 MAADQRRKRLN-----GASSSEPDKTKRKNLELVQSDLTMKSHVFVEWDGNHQKVVAKRE 55
MAADQRRKR+N G SS E + KRK L H+ +EWD N KVV+K+E
Sbjct: 1 MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSKKE 60
Query: 56 QIGISWRQMKPFASFAHDGHNILADVLAIPEEIFGLDSLSEVLSYEVWNTLLLDSERNFL 115
Q+G+S+R ++ F N+LA V +P E F L++LSEVLS EVW + L D ERN+L
Sbjct: 61 QVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYL 120
Query: 116 MQFLPSDLKPHQVVRELLSGDNFHFGNPFVKWGSLLCSGSLHPDMIVNQEQHLKSDKRAY 175
QFLP + QVV+ LL G+NFHFGNP + WG+ +CSG HPD IV++E+ L++DKR Y
Sbjct: 121 RQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRY 180
Query: 176 YSQIHNYHKDMVGFLVKLKRRWESCKDPEKEILQMI-RRSKNDVDKRTQPNLNEFIVDDH 234
YS + YH+D++ +L LK +WESCKDPEK+I++M+ RS+ + V+
Sbjct: 181 YSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQ----------VNGS 230
Query: 235 DGNVTATSESCSWDAEEKA-HSDNQISSIGQRDEHQKRVLDKGFSKGKPRNLMVSSDMPN 293
+TA S S SW+ ++K SDN IS + + E Q+R G K K +N V N
Sbjct: 231 CQGLTAASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNGV-----N 285
Query: 294 VGEKPKKGDKIHKLNEYSSDGDKYMSYIKISKKQHELVKSMKLSCKSIQSHALNHILGDL 353
VG K +K + + K + +DG KYMSY+KISKKQH++V SMK S KSIQS ALN I G+
Sbjct: 286 VGGKVRKKNVLPKDSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRIFGN- 344
Query: 354 VLDNIHAQPYGVFIKEEQQKLHDHWLRLVKKDLPAAHANWAERLMQKHAVKNSLVVEMKD 413
+D++ QPYGVF++EEQ+KL+ HWL LV KDLPAA+A W +QK + +S+ E+K+
Sbjct: 345 -IDSLDVQPYGVFVEEEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISSMGRELKE 402
Query: 414 KSNPLAED 421
K N ED
Sbjct: 403 KCNLWMED 410
Score = 80.5 bits (197), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 120/292 (41%), Gaps = 55/292 (18%)
Query: 504 SISEGVPFSSDGHAWQASERPHYHDSSVSHDYTANGLSLVN-PQVNEEQQTQVISLDSNL 562
S+ EG F S G WQ S + GLS+++ P+ +E + I L+SN+
Sbjct: 582 SLGEGSQFCSGGDVWQPVGGIRQSYISRQAYTPSGGLSIIHHPEGDEVAKNCFIDLESNM 641
Query: 563 HLEDPGKEFLHRQSDDGPFSSCQSQDRIDFLQSLIKGKGVNSYHPEQ------------- 609
E ++ L R +++ F S + ++ + LQSL G+G+ S EQ
Sbjct: 642 PEEVDRRKMLQRNANNS-FGSFPNNNQNEILQSLFNGQGMVSRTTEQLHPLLKVPLNEEH 700
Query: 610 -KCAGSSFQT-SSNVMMGDDKFSSHFKEPLQTSLTLDQGNRRASEVYMPESMSGNIYSER 667
+ G FQ +N MM ++FS F++ + S Q +R + + SMS NIY R
Sbjct: 701 KQIMGIGFQQEGNNNMMVGNQFSGQFQDQIPASHAFPQDQQRQVDAFGQGSMSDNIY--R 758
Query: 668 GRYLIPRQDALSLSKGNITNWAAKSTGMSAPPRPHVNTGDFIGHNWFSSDHQVRLGWNGS 727
G +L+ R D W T + +P ++TG + NW Q R W +
Sbjct: 759 G-FLMQRPD-----------WNPNCTQIGVTAQPLLSTGPLLNQNW-----QFRSMWANT 801
Query: 728 NNVXXXXXXXXXXXX-------------------XXXDQSLFSVLSPCSQLR 760
N V DQSLFSVLS CSQLR
Sbjct: 802 NGVGCTSQGSQTGTERDPSLLRVANNAEQIIHRGSSSDQSLFSVLSQCSQLR 853
>AT5G13950.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
LENGTH=939
Length = 939
Score = 360 bits (925), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 200/428 (46%), Positives = 272/428 (63%), Gaps = 25/428 (5%)
Query: 1 MAADQRRKRLN-----GASSSEPDKTKRKNLELVQSDLTMKSHVFVEWDGNHQKVVAKRE 55
MAADQRRKR+N G SS E + KRK L H+ +EWD N KVV+K+E
Sbjct: 1 MAADQRRKRMNSANVIGTSSREHYRAKRKKNASPDGALRSGDHITLEWDRNRSKVVSKKE 60
Query: 56 QIGISWRQMKPFASFAHDGHNILADVLAIPEEIFGLDSLSEVLSYEVWNTLLLDSERNFL 115
Q+G+S+R ++ F N+LA V +P E F L++LSEVLS EVW + L D ERN+L
Sbjct: 61 QVGLSFRHLREFVDVVPPRRNVLAQVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYL 120
Query: 116 MQFLPSDLKPHQVVRELLSGDNFHFGNPFVKWGSLLCSGSLHPDMIVNQEQHLKSDKRAY 175
QFLP + QVV+ LL G+NFHFGNP + WG+ +CSG HPD IV++E+ L++DKR Y
Sbjct: 121 RQFLPEGVDVEQVVQALLDGENFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRY 180
Query: 176 YSQIHNYHKDMVGFLVKLKRRWESCKDPEKEILQMI-RRSKNDVDKRTQPNLNEFIVDDH 234
YS + YH+D++ +L LK +WESCKDPEK+I++M+ RS+ + V+
Sbjct: 181 YSNLEKYHQDIIDYLQTLKEKWESCKDPEKDIVKMMWGRSRGGNAQ----------VNGS 230
Query: 235 DGNVTATSESCSWDAEEKA-HSDNQISSIGQRDEHQKRVLDKGFSKGKPRNLMVSSDMPN 293
+TA S S SW+ ++K SDN IS + + E Q+R G K K +N V N
Sbjct: 231 CQGLTAASGSSSWNEDDKPDSSDNMISPVVRCGEVQRRSKRSGLEKEKTQNNGV-----N 285
Query: 294 VGEKPKKGDKIHKLNEYSSDGDKYMSYIKISKKQHELVKSMKLSCKSIQSHALNHILGDL 353
VG K +K + + K + +DG KYMSY+KISKKQH++V SMK S KSIQS ALN I G+
Sbjct: 286 VGGKVRKKNVLPKDSIQQTDGAKYMSYLKISKKQHQIVTSMKQSGKSIQSRALNRIFGN- 344
Query: 354 VLDNIHAQPYGVFIKEEQQKLHDHWLRLVKKDLPAAHANWAERLMQKHAVKNSLVVEMKD 413
+D++ QPYGVF++EEQ+KL+ HWL LV KDLPAA+A W +QK + +S+ E+K+
Sbjct: 345 -IDSLDVQPYGVFVEEEQKKLNAHWLHLV-KDLPAAYAIWKRLQLQKRDIISSMGRELKE 402
Query: 414 KSNPLAED 421
K N ED
Sbjct: 403 KCNLWMED 410
Score = 80.5 bits (197), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 120/292 (41%), Gaps = 55/292 (18%)
Query: 504 SISEGVPFSSDGHAWQASERPHYHDSSVSHDYTANGLSLVN-PQVNEEQQTQVISLDSNL 562
S+ EG F S G WQ S + GLS+++ P+ +E + I L+SN+
Sbjct: 582 SLGEGSQFCSGGDVWQPVGGIRQSYISRQAYTPSGGLSIIHHPEGDEVAKNCFIDLESNM 641
Query: 563 HLEDPGKEFLHRQSDDGPFSSCQSQDRIDFLQSLIKGKGVNSYHPEQ------------- 609
E ++ L R +++ F S + ++ + LQSL G+G+ S EQ
Sbjct: 642 PEEVDRRKMLQRNANNS-FGSFPNNNQNEILQSLFNGQGMVSRTTEQLHPLLKVPLNEEH 700
Query: 610 -KCAGSSFQT-SSNVMMGDDKFSSHFKEPLQTSLTLDQGNRRASEVYMPESMSGNIYSER 667
+ G FQ +N MM ++FS F++ + S Q +R + + SMS NIY R
Sbjct: 701 KQIMGIGFQQEGNNNMMVGNQFSGQFQDQIPASHAFPQDQQRQVDAFGQGSMSDNIY--R 758
Query: 668 GRYLIPRQDALSLSKGNITNWAAKSTGMSAPPRPHVNTGDFIGHNWFSSDHQVRLGWNGS 727
G +L+ R D W T + +P ++TG + NW Q R W +
Sbjct: 759 G-FLMQRPD-----------WNPNCTQIGVTAQPLLSTGPLLNQNW-----QFRSMWANT 801
Query: 728 NNVXXXXXXXXXXXX-------------------XXXDQSLFSVLSPCSQLR 760
N V DQSLFSVLS CSQLR
Sbjct: 802 NGVGCTSQGSQTGTERDPSLLRVANNAEQIIHRGSSSDQSLFSVLSQCSQLR 853
>AT3G45830.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G02290.1); Has 499 Blast hits to 438 proteins
in 100 species: Archae - 0; Bacteria - 7; Metazoa - 236;
Fungi - 15; Plants - 108; Viruses - 2; Other Eukaryotes
- 131 (source: NCBI BLink). | chr3:16841277-16845173
FORWARD LENGTH=1298
Length = 1298
Score = 80.9 bits (198), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 70/142 (49%), Gaps = 6/142 (4%)
Query: 81 VLAIPEEIFGLDSLSEVLSYEVWNTLLLDSERNFLMQFLPS--DLKPHQVVRELLSGDNF 138
+IP E++ L SL ++LS +VWN L + ER L +LP L + ++EL G NF
Sbjct: 80 TCSIPFELYDLPSLEDILSVDVWNECLTEKERFSLSSYLPDVDQLTFMRTLKELFEGCNF 139
Query: 139 HFGNPFVKWGSLLCSGSLHPDMIVNQEQHLKSDKRAYYSQIHNYHKDMVGFLVKLKRRWE 198
HFG+P K +L G P + E + +Y + YH DMV L + + W
Sbjct: 140 HFGSPVKKLFDMLKGGQCEPRNTLYLEGRSLFLRTKHYHSLRKYHNDMVVNLCQTRDAWT 199
Query: 199 SCK----DPEKEILQMIRRSKN 216
SCK D + +L +++ K
Sbjct: 200 SCKGYSIDEKLRVLNIVKSQKT 221
>AT1G02290.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G45830.1); Has 134 Blast hits to 134 proteins
in 37 species: Archae - 0; Bacteria - 0; Metazoa - 54;
Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:450646-451977 REVERSE
LENGTH=443
Length = 443
Score = 80.1 bits (196), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 64/117 (54%), Gaps = 4/117 (3%)
Query: 84 IPEEIFGLDSLSEVLSYEVWNTLLLDSERNFLMQFLPSDLKPHQ---VVRELLSGDNFHF 140
IP E++ L L+ +LS E WN+LL + ER FL FLP D+ P ++ELL G N +F
Sbjct: 38 IPYELYDLPDLTGILSVETWNSLLTEEERFFLSCFLP-DMDPQTFSLTMQELLDGANLYF 96
Query: 141 GNPFVKWGSLLCSGSLHPDMIVNQEQHLKSDKRAYYSQIHNYHKDMVGFLVKLKRRW 197
GNP K+ L G P + +E + +R YY + YH+ ++ +++R W
Sbjct: 97 GNPEDKFYKNLLGGLFTPKVACFKEGVMFVKRRKYYYSLKFYHEKLIRTFTEMQRVW 153