Miyakogusa Predicted Gene
- Lj6g3v1538070.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1538070.1 Non Chatacterized Hit- tr|G7IMV9|G7IMV9_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,74.31,0,seg,NULL; DUF4378,Domain of unknown function
DUF4378; VARLMGL,NULL; SUBFAMILY NOT NAMED,NULL; PHOSPH,CUFF.59591.1
(925 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 285 9e-77
AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 281 1e-75
AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 189 7e-48
AT5G26910.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 176 8e-44
AT3G05750.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 147 4e-35
AT3G05750.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 133 5e-31
AT1G67040.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 62 2e-09
AT3G53540.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 56 1e-07
>AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G58650.1); Has 1322 Blast hits to 684 proteins
in 162 species: Archae - 4; Bacteria - 497; Metazoa -
157; Fungi - 101; Plants - 155; Viruses - 0; Other
Eukaryotes - 408 (source: NCBI BLink). |
chr5:9466169-9469523 REVERSE LENGTH=853
Length = 853
Score = 285 bits (730), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 275/924 (29%), Positives = 451/924 (48%), Gaps = 123/924 (13%)
Query: 15 GGFLHLFDWTSKPRKKLFA-SKSDLPEPMKKERKADYNVAPYLMDDDENGVGASARGSCD 73
GGFL+LFDW K RKKLF+ S S+L E K+ + L++ DE G +S D
Sbjct: 11 GGFLNLFDWHGKSRKKLFSGSTSELSEESKQPAQNLLKSRVSLIEVDEIGKSSSNNQRSD 70
Query: 74 HSY-ASSVTDDEAYGTRPPSVVARLMGXXXXXXXXXXXXYGTPYSDTRSLQDAQYFRKNL 132
S ASSVT D+ GTR PSVVARLMG P D L+ +Q N
Sbjct: 71 SSCCASSVTSDDGQGTRAPSVVARLMGLESLPVPNVQEPRLNPDLDPFLLRPSQ----NT 126
Query: 133 SHQHDCQALYSGNLVEKIEGSSRNFMEPKPQKTITRPIEKFQTEVLPPKSAKSIPVTHHK 192
+ + L NL +G S + ++ + +PIE+FQ+E PP+SAK I VT+++
Sbjct: 127 NRWDAYENLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAKPICVTNNR 186
Query: 193 LLSPIKTPGFVPTNNAAYIMEAAARIIEPGSQASGKXXXXXXXXXXXXXXVKDLNKFEPS 252
LSPI++PGFVP+ N Y+MEAA+R+IEP + + +F PS
Sbjct: 187 HLSPIRSPGFVPSRNPIYVMEAASRMIEPSPRMVAR------------------TRFSPS 228
Query: 253 PKGPLIGPSSMTSRVRDLKEKRETSQRTTRLSETSHRPVESNAVKYLKGQSLNRSWNGSV 312
SS+ R++DL+EK E +Q+ + + ++ +KY G+ +
Sbjct: 229 NSP-----SSVPMRIQDLREKLEAAQKVS----SRQNSNDTFNLKYPSGKHNEK------ 273
Query: 313 DATIRPPSHAEEDSSSKKKGRSISLAIQAK-----VNVQRREG---LSGGKSLTGQKEHL 364
R + S+SK G+S + ++ K V+ Q + G LS ++ QKE
Sbjct: 274 ----RITTSLTTPSTSKFMGKSSTDGLKGKVKPSYVSAQAKAGTTPLSVTRNSANQKEKA 329
Query: 365 DSKSNQPSRANVQRNLHKKSSGQNSSGVLRQNNLKQNYSIDKDKLPSKPSVSNSQGRKVP 424
D+K N R S G+N + +QNN KQN +D PS SV N + KV
Sbjct: 330 DAKKCVVKSQNALRG-APISMGKN---MFKQNNQKQNC---RDNQPSMTSVLNQKSSKVN 382
Query: 425 NGDSSYGRHRSSS-GKSIAKSKVGSKKSAMEVTDSEKEVLYTSTNNFPRKKRSTDKDWND 483
N + S S K + S ++K+ ++ S K+ L PR K+ +
Sbjct: 383 NKVVNKVPVESGSISKQLGLSTASAEKNT-SLSLSRKKTL-------PRSKKLPNGMQKS 434
Query: 484 RAVDNLFIDKTPKPVKSNQVSNKQYGWGEEVKKKDMDVVSFTFTTPLARSNPCFETSGQA 543
D+ ++ +K N + G++ +KK+MDV+SFTF++P+ G +
Sbjct: 435 GISDDKRTKRSENMIKCNITIDGGLNKGKDDRKKEMDVISFTFSSPIK---------GLS 485
Query: 544 SQNYNGPSLDQRIKRVLLDTDNSRSPIGYNVIGGDALGILLEQKLRELTGGVEASSDDVS 603
S + + + + DTD S + +N IGGD+L LLEQKLRELT +E+SS ++
Sbjct: 486 SDSLSS------TQGIGQDTD---SAVSFN-IGGDSLNALLEQKLRELTSKLESSSCSLT 535
Query: 604 KVRQPSVSAPMSDGQVTNLNWRLQQNKDQDVLSTNKLXXXXXXXXXXXXLPELSLKHNSW 663
+ +PS S PM + +N + + + + + N L K
Sbjct: 536 Q-EEPSYSIPMDE-----MNGMISFSSEYEKSTQNGLRKVLSESESVSDCTSFYDKQKFQ 589
Query: 664 VDEMEPQLFNCREPSPISVLEPSFSIESYDSSMSTDFTSTEGSKLYSTVQV---QEVHGL 720
+ E E S IS + + + SS S F+ + Y T+Q QE+ +
Sbjct: 590 IQAEE------HEVSSISTVTEA---DDLRSSCSKGFSDCRQTAEYGTIQSSSDQELTWV 640
Query: 721 NFSRNFYINEYDTELSDSASSTSTGTMVKKHTGTFSAMKFGRSNTWELDYVKDILCNVEL 780
+ + + + ++ELS+S + S ++ WE +Y+ +IL + +L
Sbjct: 641 SLNESHQAQD-ESELSESVVTLSYSEAEERL-------------DWEFEYISEILGSDQL 686
Query: 781 MYMDFSLGRAREIVNPHLFNQLESRKGGFKSDAESRIQRKVIFDCVSECMDLRCRR-YVG 839
M +++LG A +++ LF+++E R ++I+RK +FD V++C+ LRC + ++G
Sbjct: 687 MVKEYALGMATDVLPASLFDEMEGR----GEVTAAKIKRKTLFDFVNKCLALRCEQMFMG 742
Query: 840 GGYKMWTKGVAMVKRKEWLAEDVYKEISGWRGMGDSMVDELVEKDMSSQYGKWLDFEVDG 899
+ KG + ++++WLAE++ +EI G + M + M+DELV+K+MSS G+WLDFE +
Sbjct: 743 SCRGLLGKGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERET 802
Query: 900 FELGTEVVDQIVNSLFDDVVTEIL 923
+E G ++ +IV++L DD+V +++
Sbjct: 803 YEEGIDIEGEIVSTLVDDLVNDLV 826
>AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G58650.1). |
chr5:9466169-9469523 REVERSE LENGTH=852
Length = 852
Score = 281 bits (720), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 275/924 (29%), Positives = 451/924 (48%), Gaps = 124/924 (13%)
Query: 15 GGFLHLFDWTSKPRKKLFA-SKSDLPEPMKKERKADYNVAPYLMDDDENGVGASARGSCD 73
GGFL+LFDW K RKKLF+ S S+L E K+ + L++ DE G +S D
Sbjct: 11 GGFLNLFDWHGKSRKKLFSGSTSELSES-KQPAQNLLKSRVSLIEVDEIGKSSSNNQRSD 69
Query: 74 HSY-ASSVTDDEAYGTRPPSVVARLMGXXXXXXXXXXXXYGTPYSDTRSLQDAQYFRKNL 132
S ASSVT D+ GTR PSVVARLMG P D L+ +Q N
Sbjct: 70 SSCCASSVTSDDGQGTRAPSVVARLMGLESLPVPNVQEPRLNPDLDPFLLRPSQ----NT 125
Query: 133 SHQHDCQALYSGNLVEKIEGSSRNFMEPKPQKTITRPIEKFQTEVLPPKSAKSIPVTHHK 192
+ + L NL +G S + ++ + +PIE+FQ+E PP+SAK I VT+++
Sbjct: 126 NRWDAYENLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAKPICVTNNR 185
Query: 193 LLSPIKTPGFVPTNNAAYIMEAAARIIEPGSQASGKXXXXXXXXXXXXXXVKDLNKFEPS 252
LSPI++PGFVP+ N Y+MEAA+R+IEP + + +F PS
Sbjct: 186 HLSPIRSPGFVPSRNPIYVMEAASRMIEPSPRMVAR------------------TRFSPS 227
Query: 253 PKGPLIGPSSMTSRVRDLKEKRETSQRTTRLSETSHRPVESNAVKYLKGQSLNRSWNGSV 312
SS+ R++DL+EK E +Q+ + + ++ +KY G+ +
Sbjct: 228 NSP-----SSVPMRIQDLREKLEAAQKVS----SRQNSNDTFNLKYPSGKHNEK------ 272
Query: 313 DATIRPPSHAEEDSSSKKKGRSISLAIQAK-----VNVQRREG---LSGGKSLTGQKEHL 364
R + S+SK G+S + ++ K V+ Q + G LS ++ QKE
Sbjct: 273 ----RITTSLTTPSTSKFMGKSSTDGLKGKVKPSYVSAQAKAGTTPLSVTRNSANQKEKA 328
Query: 365 DSKSNQPSRANVQRNLHKKSSGQNSSGVLRQNNLKQNYSIDKDKLPSKPSVSNSQGRKVP 424
D+K N R S G+N + +QNN KQN +D PS SV N + KV
Sbjct: 329 DAKKCVVKSQNALRG-APISMGKN---MFKQNNQKQNC---RDNQPSMTSVLNQKSSKVN 381
Query: 425 NGDSSYGRHRSSS-GKSIAKSKVGSKKSAMEVTDSEKEVLYTSTNNFPRKKRSTDKDWND 483
N + S S K + S ++K+ ++ S K+ L PR K+ +
Sbjct: 382 NKVVNKVPVESGSISKQLGLSTASAEKNT-SLSLSRKKTL-------PRSKKLPNGMQKS 433
Query: 484 RAVDNLFIDKTPKPVKSNQVSNKQYGWGEEVKKKDMDVVSFTFTTPLARSNPCFETSGQA 543
D+ ++ +K N + G++ +KK+MDV+SFTF++P+ G +
Sbjct: 434 GISDDKRTKRSENMIKCNITIDGGLNKGKDDRKKEMDVISFTFSSPIK---------GLS 484
Query: 544 SQNYNGPSLDQRIKRVLLDTDNSRSPIGYNVIGGDALGILLEQKLRELTGGVEASSDDVS 603
S + + + + DTD S + +N IGGD+L LLEQKLRELT +E+SS ++
Sbjct: 485 SDSLSS------TQGIGQDTD---SAVSFN-IGGDSLNALLEQKLRELTSKLESSSCSLT 534
Query: 604 KVRQPSVSAPMSDGQVTNLNWRLQQNKDQDVLSTNKLXXXXXXXXXXXXLPELSLKHNSW 663
+ +PS S PM + +N + + + + + N L K
Sbjct: 535 Q-EEPSYSIPMDE-----MNGMISFSSEYEKSTQNGLRKVLSESESVSDCTSFYDKQKFQ 588
Query: 664 VDEMEPQLFNCREPSPISVLEPSFSIESYDSSMSTDFTSTEGSKLYSTVQV---QEVHGL 720
+ E E S IS + + + SS S F+ + Y T+Q QE+ +
Sbjct: 589 IQAEE------HEVSSISTVTEA---DDLRSSCSKGFSDCRQTAEYGTIQSSSDQELTWV 639
Query: 721 NFSRNFYINEYDTELSDSASSTSTGTMVKKHTGTFSAMKFGRSNTWELDYVKDILCNVEL 780
+ + + + ++ELS+S + S ++ WE +Y+ +IL + +L
Sbjct: 640 SLNESHQAQD-ESELSESVVTLSYSEAEERL-------------DWEFEYISEILGSDQL 685
Query: 781 MYMDFSLGRAREIVNPHLFNQLESRKGGFKSDAESRIQRKVIFDCVSECMDLRCRR-YVG 839
M +++LG A +++ LF+++E R ++I+RK +FD V++C+ LRC + ++G
Sbjct: 686 MVKEYALGMATDVLPASLFDEMEGR----GEVTAAKIKRKTLFDFVNKCLALRCEQMFMG 741
Query: 840 GGYKMWTKGVAMVKRKEWLAEDVYKEISGWRGMGDSMVDELVEKDMSSQYGKWLDFEVDG 899
+ KG + ++++WLAE++ +EI G + M + M+DELV+K+MSS G+WLDFE +
Sbjct: 742 SCRGLLGKGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERET 801
Query: 900 FELGTEVVDQIVNSLFDDVVTEIL 923
+E G ++ +IV++L DD+V +++
Sbjct: 802 YEEGIDIEGEIVSTLVDDLVNDLV 825
>AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.1); Has 2350 Blast
hits to 1412 proteins in 248 species: Archae - 0;
Bacteria - 487; Metazoa - 577; Fungi - 236; Plants -
184; Viruses - 4; Other Eukaryotes - 862 (source: NCBI
BLink). | chr3:21696349-21699219 REVERSE LENGTH=820
Length = 820
Score = 189 bits (481), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 193/606 (31%), Positives = 281/606 (46%), Gaps = 116/606 (19%)
Query: 15 GGFLHLFDWTSKPRKKLFASK-SDLPEPMK--KERKADYNVAPY-LMDDDENGVGASARG 70
G FL+LFDW K RKKLF+S S L E K KE + ++ P+ + + D++ +
Sbjct: 11 GAFLNLFDWHGKSRKKLFSSNLSQLSEESKQAKENVQNPSITPHSVFEVDQSVKNPTYNP 70
Query: 71 SCDHSY-ASSVTDDEAYGTRPPSVVARLMGXXXXXXXXXXXXYGTPYSD---TRSLQDAQ 126
D S ASSVT D+ R SVVARLMG P D RS + A
Sbjct: 71 RSDSSCCASSVTSDDGNVVRA-SVVARLMGLEGLPLPNVLEPRVNPDLDPYFLRSSRQAN 129
Query: 127 YFRKNLSHQHDCQALYSGNLVEKIEGSSRNFMEPKPQKTITRPIEKFQTEVLPPKSAKSI 186
+ N+ Q D + +L SR P+ R IE+FQTE LPP+SAK I
Sbjct: 130 TWDANVDRQSDFDGVSWDHL------DSRTSKGPR-----KRMIERFQTETLPPRSAKPI 178
Query: 187 PVTHHKLLSPIKTPGFVPTNNAAYIMEAAARIIEPGSQASGKXXXXXXXXXXXXXXVKDL 246
VTH+KLLSPI+ PGFVP+ N AY+MEAA+R+IE + + V
Sbjct: 179 SVTHNKLLSPIRNPGFVPSRNPAYVMEAASRMIEQSPRMIAR-----------TRMVSSS 227
Query: 247 NKFEPSPKGPLIGPSSMTSRVRDLKEKRETSQR-TTRLSETSHRPVESNAVKYLKGQSLN 305
+ P P R+RDLKEK E +Q+ +T + + S+ ++ +YL+G
Sbjct: 228 DSSSPVPL-----------RIRDLKEKLEAAQKASTSVPQISN---DTRNSRYLRGDQNE 273
Query: 306 R--------SWNGSVDATIRPPSHAEEDS-SSKKKGRSISLAIQAKVNVQRREGLSGGKS 356
+ S++ ++PPS A + SS +K S+S++ SG K
Sbjct: 274 KKTTVLGKNSYDALKGGEVKPPSFAAQAKVSSNQKQDSLSMSS------------SGNKR 321
Query: 357 L-TGQKEHLDSKSNQPSRANVQRNLHKKSSGQNSSGVLRQNNLKQNYSIDKDKLPSKPSV 415
+ +GQKE +++K +RA +N K SS VLRQNN KQN
Sbjct: 322 MSSGQKEKVEAK----NRAVKSQNSSKGSSLSTGKNVLRQNNQKQNCR------------ 365
Query: 416 SNSQGRKVPNGDSSYGRHRSSSGKSIAKSKVGSKKSAMEVTDSEK--EVLYTSTNNFPRK 473
N Q R+V N K + +S SK S ++ +EK + + + PR
Sbjct: 366 DNQQSRRVMN---------KVVNKVLVESGSISKSSGFTMSSAEKPTSLPLSRKKSLPRS 416
Query: 474 KRSTDKDWNDRAVDNLFIDKTPKPVKSNQVSNKQYGWGEEVKKKDMDVVSFTFTTPLARS 533
K+ + ++ I + K +K N + ++ +K+DMDV+SFTF++ +
Sbjct: 417 KKPRNGVQESGIYEDKRIKRGEKSIKCNISIDGDSSTSKDDQKRDMDVISFTFSSSIK-- 474
Query: 534 NPCFETSGQASQNYNGPSLDQRIKRVLLDTDNSRSPIGYNVIGGDALGILLEQKLRELTG 593
G +S + G D + S I +NVIGGD+L LLEQKLRELT
Sbjct: 475 -------GLSSPHSQGTKQD------------ADSAIRFNVIGGDSLNALLEQKLRELTT 515
Query: 594 GVEASS 599
+E+SS
Sbjct: 516 KIESSS 521
Score = 121 bits (304), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 116/195 (59%), Gaps = 15/195 (7%)
Query: 732 DTELSDSASSTSTGTMVKKHTGTFSAMKFGRSNTWELDYVKDILCNVELMYMDFSLGRA- 790
D EL+ +S+ S T+ + + T WEL+Y+ +IL + +LM+ DF+ G
Sbjct: 628 DQELTWGSSNESQHTLDETESATLD---------WELEYITEILNSGQLMFQDFASGTTT 678
Query: 791 -REIVNPHLFNQLESRKGGFKSDAESRIQRKVIFDCVSECMDLRCRRYVGGGYK-MWTKG 848
++ LF+++E +G S + +RK +FDCV++C+ ++ R + G K M G
Sbjct: 679 NESLLPSSLFDEMERSRGAATS---MKTERKALFDCVNQCLAVKFERMLIGSCKGMMMSG 735
Query: 849 VAMVKRKEWLAEDVYKEISGWRGMGDSMVDELVEKDMSSQYGKWLDFEVDGFELGTEVVD 908
+++ ++ LAE+V +E+ G + M + M+DELV+ DMS G+W+ +E + FE G ++
Sbjct: 736 GILLEHRDLLAEEVNREVKGLKKMREMMIDELVDHDMSCFEGRWIGYEREMFEEGIDMEG 795
Query: 909 QIVNSLFDDVVTEIL 923
+IV++L DD+V++IL
Sbjct: 796 EIVSALVDDLVSDIL 810
>AT5G26910.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G58650.1); Has 990 Blast hits to 447 proteins
in 125 species: Archae - 0; Bacteria - 525; Metazoa -
80; Fungi - 59; Plants - 91; Viruses - 0; Other
Eukaryotes - 235 (source: NCBI BLink). |
chr5:9466804-9469523 REVERSE LENGTH=638
Length = 638
Score = 176 bits (445), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 191/613 (31%), Positives = 292/613 (47%), Gaps = 87/613 (14%)
Query: 15 GGFLHLFDWTSKPRKKLFA-SKSDLPEPMKKERKADYNVAPYLMDDDENGVGASARGSCD 73
GGFL+LFDW K RKKLF+ S S+L E K+ + L++ DE G +S D
Sbjct: 11 GGFLNLFDWHGKSRKKLFSGSTSELSEESKQPAQNLLKSRVSLIEVDEIGKSSSNNQRSD 70
Query: 74 HSY-ASSVTDDEAYGTRPPSVVARLMGXXXXXXXXXXXXYGTPYSDTRSLQDAQYFRKNL 132
S ASSVT D+ GTR PSVVARLMG P D L+ +Q N
Sbjct: 71 SSCCASSVTSDDGQGTRAPSVVARLMGLESLPVPNVQEPRLNPDLDPFLLRPSQ----NT 126
Query: 133 SHQHDCQALYSGNLVEKIEGSSRNFMEPKPQKTITRPIEKFQTEVLPPKSAKSIPVTHHK 192
+ + L NL +G S + ++ + +PIE+FQ+E PP+SAK I VT+++
Sbjct: 127 NRWDAYENLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAKPICVTNNR 186
Query: 193 LLSPIKTPGFVPTNNAAYIMEAAARIIEPGSQASGKXXXXXXXXXXXXXXVKDLNKFEPS 252
LSPI++PGFVP+ N Y+MEAA+R+IEP + +
Sbjct: 187 HLSPIRSPGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSP--------------- 231
Query: 253 PKGPLIGPSSMTSRVRDLKEKRETSQRTTRLSETSHRPVESNAVKYLKGQSLNRSWNGSV 312
SS+ R++DL+EK E +Q+ + ++ ++ +KY G+ +
Sbjct: 232 --------SSVPMRIQDLREKLEAAQKVSSRQNSN----DTFNLKYPSGKHNEK------ 273
Query: 313 DATIRPPSHAEEDSSSKKKGRSISLAIQAK-----VNVQRREG---LSGGKSLTGQKEHL 364
R + S+SK G+S + ++ K V+ Q + G LS ++ QKE
Sbjct: 274 ----RITTSLTTPSTSKFMGKSSTDGLKGKVKPSYVSAQAKAGTTPLSVTRNSANQKEKA 329
Query: 365 DSKSNQPSRANVQRNLHKKSSGQNSSGVLRQNNLKQNYSIDKDKLPSKPSVSNSQGRKVP 424
D+K N R S G+N + +QNN KQN +D PS SV N + KV
Sbjct: 330 DAKKCVVKSQNALRGA-PISMGKN---MFKQNNQKQNC---RDNQPSMTSVLNQKSSKVN 382
Query: 425 NGDSSYGRHRSSS-GKSIAKSKVGSKKSAMEVTDSEKEVLYTSTNNFPRKKRSTDKDWND 483
N + S S K + S ++K+ ++ S K+ L PR K+ +
Sbjct: 383 NKVVNKVPVESGSISKQLGLSTASAEKNT-SLSLSRKKTL-------PRSKKLPNGMQKS 434
Query: 484 RAVDNLFIDKTPKPVKSNQVSNKQYGWGEEVKKKDMDVVSFTFTTPLARSNPCFETSGQA 543
D+ ++ +K N + G++ +KK+MDV+SFTF++P+ G +
Sbjct: 435 GISDDKRTKRSENMIKCNITIDGGLNKGKDDRKKEMDVISFTFSSPIK---------GLS 485
Query: 544 SQNYNGPSLDQRIKRVLLDTDNSRSPIGYNVIGGDALGILLEQKLRELTGGVEASSDDVS 603
S + + + + DTD S + +N IGGD+L LLEQKLRELT +E+SS ++
Sbjct: 486 SDSLSS------TQGIGQDTD---SAVSFN-IGGDSLNALLEQKLRELTSKLESSSCSLT 535
Query: 604 KVRQPSVSAPMSD 616
+ +PS S PM +
Sbjct: 536 Q-EEPSYSIPMDE 547
>AT3G05750.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: membrane;
EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.1); Has 2317 Blast
hits to 1467 proteins in 247 species: Archae - 4;
Bacteria - 750; Metazoa - 557; Fungi - 182; Plants -
180; Viruses - 0; Other Eukaryotes - 644 (source: NCBI
BLink). | chr3:1704677-1707546 FORWARD LENGTH=801
Length = 801
Score = 147 bits (370), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 182/632 (28%), Positives = 274/632 (43%), Gaps = 107/632 (16%)
Query: 3 VEKEGTKNGGYVGGFLHLFDWTSKPRKKLFASKSDLPEPMKKERKADYNVA------PYL 56
VE E + GGFL++FDW K RKKLF+S S + + ++ N P L
Sbjct: 2 VEMEAVERKRSRGGFLNMFDWPGKSRKKLFSSSSSSSKLSEGSKQEKQNAQNPSKSWPSL 61
Query: 57 MDDDENGVGAS--ARGSCDHSYASSVTDDEAYGTRPPSVVARLMGXXXXXXXXXXXXYGT 114
++ DE G ++ R S ++ +DD G++ PSVVARLMG
Sbjct: 62 IEGDEIGKNSTYNPRSDSSCSTSTPTSDD-GQGSKAPSVVARLMGLESIPVPNALEPRRN 120
Query: 115 PYSDTRSLQDAQYFRKNLSHQHDCQALYSGNLVEKIEGSSRNFMEPKPQKTITRPIEKFQ 174
P D L+ + + S + L NL +G S + ++ + K RPI++FQ
Sbjct: 121 PDFDPYFLRSS----RKASTWDAYENLGYVNLRSDYDGISWDHLDSRMNKECNRPIDRFQ 176
Query: 175 TEVLPPKSAKSIPVTHHKLLSPIKTPGFVPTNNAAYIMEAAARIIEPGSQASGKXXXXXX 234
TE LPP+SAK IPVTH++LLSPI++PGFV + N A +ME A+R+IEP + K
Sbjct: 177 TETLPPRSAKPIPVTHNRLLSPIRSPGFVQSRNPASVMEEASRMIEPSPRVVAKTRFSSS 236
Query: 235 XXXXXXXXVKDLNKFEPSPKGPLIGPSSMTSRVRDLKEKRETSQRTTRLSETSHRPVESN 294
SS+ ++RDLKEK E SQ+ P SN
Sbjct: 237 DSS-----------------------SSLPMKIRDLKEKLEASQK-------GQSPQISN 266
Query: 295 AVKYLKGQSLNRSWNGSVDA--TIRPPSHAEEDS--------SSKKKGRSISLAIQAKVN 344
G N+ + G D T P E ++ SK K + S++ AK N
Sbjct: 267 ------GTCNNKCFRGKQDEKRTTLPLKTQERNNLLGESRFGGSKGKVKPPSVSAHAKAN 320
Query: 345 -VQRREG--LSGGKSLTGQKEHLDSKSNQPSRANVQRNLHKKSSGQNSSGVLRQNNLKQN 401
+ +R+ LS G + D K ++ + ++ K+SS V + NN KQN
Sbjct: 321 TIHKRDSSMLSNG--------YRDQKKKVETKNRIVKSGLKESSASTRKTVDKPNNQKQN 372
Query: 402 YSIDKDKLPSKPSVSNSQGRKVPNGDSSYGRHRSSSGKSIAKSKVGSKKSAM--EVTDSE 459
++ SVSN +GRKV K + ++ +KK
Sbjct: 373 QF-------AETSVSNQRGRKV----------MKKVNKVLVENGTTTKKPGFTATSAKKS 415
Query: 460 KEVLYTSTNNFPRKKRSTDKDWNDRAVDNLFIDKTPKPVKSNQVSNKQYGWGEEVKKKDM 519
+ N R K+ + + I K K +K N + G++ +KKDM
Sbjct: 416 TSSSLSRKKNLSRSKKPANGVQEAGVNSDKRIKKGEKVIKCNITVDGGLKTGDDDRKKDM 475
Query: 520 DVVSFTFTTPLARSNPCFETSGQASQNYNGPSLDQRIKRVLLDTDNSRSPIGYNVIGGDA 579
DV+SFTF++P+ G S D + D D + S + +N I D+
Sbjct: 476 DVISFTFSSPI-----------------KGLSSDSQYFLKKNDQD-AESALCFNKIDSDS 517
Query: 580 LGILLEQKLRELTGGVEASSDDVSKVRQPSVS 611
L LLE+KLRELT +E+S +++ + S S
Sbjct: 518 LNFLLEKKLRELTSKMESSCSSLTQEEESSGS 549
Score = 103 bits (257), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 102/159 (64%), Gaps = 6/159 (3%)
Query: 766 WELDYVKDILCNVELMYMDFSLGRAREIVNPHLFNQLESRKGGFKSDAESRIQRKVIFDC 825
WEL+Y+ +I+ + +LM +FSLG A +I+ LF++ E ++ DA +I+RK +FD
Sbjct: 645 WELEYITEIIASGQLMIKEFSLGMATDILPLSLFDETEGKR-----DARGKIERKTLFDL 699
Query: 826 VSECMDLRCRRYVGGGYK-MWTKGVAMVKRKEWLAEDVYKEISGWRGMGDSMVDELVEKD 884
V++ + L+C + G K + K ++R+E LA+ V KE G + M + M+DELV+ D
Sbjct: 700 VNQWLTLKCEQMFMGTCKGVLGKQDIFLERREILADQVLKEAQGLKKMREMMMDELVDND 759
Query: 885 MSSQYGKWLDFEVDGFELGTEVVDQIVNSLFDDVVTEIL 923
MSS GKWLD+ + +E G E+ ++IV+ L DD++ +++
Sbjct: 760 MSSCEGKWLDYMRETYEEGIEIEEEIVSELVDDLINDLI 798
>AT3G05750.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: membrane;
EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.3); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr3:1705300-1707546 FORWARD LENGTH=698
Length = 698
Score = 133 bits (335), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 141/482 (29%), Positives = 213/482 (44%), Gaps = 94/482 (19%)
Query: 145 NLVEKIEGSSRNFMEPKPQKTITRPIEKFQTEVLPPKSAKSIPVTHHKLLSPIKTPGFVP 204
NL +G S + ++ + K RPI++FQTE LPP+SAK IPVTH++LLSPI++PGFV
Sbjct: 44 NLRSDYDGISWDHLDSRMNKECNRPIDRFQTETLPPRSAKPIPVTHNRLLSPIRSPGFVQ 103
Query: 205 TNNAAYIMEAAARIIEPGSQASGKXXXXXXXXXXXXXXVKDLNKFEPSPKGPLIGPSSMT 264
+ N A +ME A+R+IEP + K +F S SS+
Sbjct: 104 SRNPASVMEEASRMIEPSPRVVAK------------------TRFSSSDSS-----SSLP 140
Query: 265 SRVRDLKEKRETSQRTTRLSETSHRPVESNAVKYLKGQSLNRSWNGSVDA--TIRPPSHA 322
++RDLKEK E SQ+ P SN G N+ + G D T P
Sbjct: 141 MKIRDLKEKLEASQK-------GQSPQISN------GTCNNKCFRGKQDEKRTTLPLKTQ 187
Query: 323 EEDS--------SSKKKGRSISLAIQAKVN-VQRREG--LSGGKSLTGQKEHLDSKSNQP 371
E ++ SK K + S++ AK N + +R+ LS G + D K
Sbjct: 188 ERNNLLGESRFGGSKGKVKPPSVSAHAKANTIHKRDSSMLSNG--------YRDQKKKVE 239
Query: 372 SRANVQRNLHKKSSGQNSSGVLRQNNLKQNYSIDKDKLPSKPSVSNSQGRKVPNGDSSYG 431
++ + ++ K+SS V + NN KQN ++ SVSN +GRKV
Sbjct: 240 TKNRIVKSGLKESSASTRKTVDKPNNQKQNQF-------AETSVSNQRGRKV-------- 284
Query: 432 RHRSSSGKSIAKSKVGSKKSAM--EVTDSEKEVLYTSTNNFPRKKRSTDKDWNDRAVDNL 489
K + ++ +KK + N R K+ + +
Sbjct: 285 --MKKVNKVLVENGTTTKKPGFTATSAKKSTSSSLSRKKNLSRSKKPANGVQEAGVNSDK 342
Query: 490 FIDKTPKPVKSNQVSNKQYGWGEEVKKKDMDVVSFTFTTPLARSNPCFETSGQASQNYNG 549
I K K +K N + G++ +KKDMDV+SFTF++P+ G
Sbjct: 343 RIKKGEKVIKCNITVDGGLKTGDDDRKKDMDVISFTFSSPIK-----------------G 385
Query: 550 PSLDQRIKRVLLDTDNSRSPIGYNVIGGDALGILLEQKLRELTGGVEASSDDVSKVRQPS 609
S D + D D + S + +N I D+L LLE+KLRELT +E+S +++ + S
Sbjct: 386 LSSDSQYFLKKNDQD-AESALCFNKIDSDSLNFLLEKKLRELTSKMESSCSSLTQEEESS 444
Query: 610 VS 611
S
Sbjct: 445 GS 446
Score = 103 bits (257), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 102/159 (64%), Gaps = 6/159 (3%)
Query: 766 WELDYVKDILCNVELMYMDFSLGRAREIVNPHLFNQLESRKGGFKSDAESRIQRKVIFDC 825
WEL+Y+ +I+ + +LM +FSLG A +I+ LF++ E ++ DA +I+RK +FD
Sbjct: 542 WELEYITEIIASGQLMIKEFSLGMATDILPLSLFDETEGKR-----DARGKIERKTLFDL 596
Query: 826 VSECMDLRCRRYVGGGYK-MWTKGVAMVKRKEWLAEDVYKEISGWRGMGDSMVDELVEKD 884
V++ + L+C + G K + K ++R+E LA+ V KE G + M + M+DELV+ D
Sbjct: 597 VNQWLTLKCEQMFMGTCKGVLGKQDIFLERREILADQVLKEAQGLKKMREMMMDELVDND 656
Query: 885 MSSQYGKWLDFEVDGFELGTEVVDQIVNSLFDDVVTEIL 923
MSS GKWLD+ + +E G E+ ++IV+ L DD++ +++
Sbjct: 657 MSSCEGKWLDYMRETYEEGIEIEEEIVSELVDDLINDLI 695
>AT1G67040.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 20 plant
structures; EXPRESSED DURING: 11 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G26910.3); Has 89 Blast hits to 84 proteins in
15 species: Archae - 0; Bacteria - 0; Metazoa - 5; Fungi
- 2; Plants - 82; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr1:25019105-25021922 REVERSE
LENGTH=826
Length = 826
Score = 62.0 bits (149), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 118/280 (42%), Gaps = 59/280 (21%)
Query: 4 EKEGTKNGGYVGGFLHLFDWTSK-PRKKLFASKSDLPEPMKKER--------KADYNVAP 54
EK + GG VG F LFDW + +KKLF+ KS LP +R K+ N+
Sbjct: 17 EKRPNRLGGCVGVFFQLFDWNRRFAKKKLFSRKSLLPGKQVSKRFGGNEKMLKSKLNLI- 75
Query: 55 YLMDDDENGVGASARGSCDHSYASSVTDDEAYGTRPPSVVARLMGXXXXXXXXXXXXYGT 114
DDEN RGS + + V + + + R PS+VARLMG
Sbjct: 76 ----DDEN------RGSFPNR--NEVMEVKKHEMRSPSLVARLMGLESMP---------- 113
Query: 115 PYSDTRSLQDAQYFRKNLSHQHDCQALYSGNLVEKIEGSSRNFMEPKPQKTIT------R 168
S+ R + + S D ++ E+ E S + + P+ + T
Sbjct: 114 --SNHRDKGKNKKKKPLFSQIQDTDKCDLFDVEEEEEDSGVDKLRPQKMQRTTGVCDRRV 171
Query: 169 PIEKFQTEVLPPKSAKSIPVTHH-------KLLSPIKTPGFVPTNNAAYIMEAAARIIEP 221
++KF +E L K+ + HH KL SP+++P ++ +++AAARI+EP
Sbjct: 172 AVKKFGSEALQIKNVLTRVRKHHQYNHQHQKLASPVRSPRM--NRRSSRLIDAAARILEP 229
Query: 222 GSQ-ASGKXXXXXXXXXXXXXXVKDLNKFEPSPKGPLIGP 260
G + A G + +FE + K P++ P
Sbjct: 230 GKRNAKGAIAYPGST---------GIRRFENAAKEPVVSP 260
>AT3G53540.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF3741
(InterPro:IPR022212); BEST Arabidopsis thaliana protein
match is: Protein of unknown function (DUF3741)
(TAIR:AT4G28760.2); Has 1710 Blast hits to 868 proteins
in 206 species: Archae - 2; Bacteria - 409; Metazoa -
304; Fungi - 204; Plants - 304; Viruses - 2; Other
Eukaryotes - 485 (source: NCBI BLink). |
chr3:19846805-19850670 REVERSE LENGTH=924
Length = 924
Score = 55.8 bits (133), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/249 (23%), Positives = 110/249 (44%), Gaps = 23/249 (9%)
Query: 676 EPSPISVLEPSFSIESYDSSMSTDFTSTEGSKLYSTVQVQEVHGLNFSRNFYINEYDTEL 735
+PSP+SVLE SF + S + S + L +Q+ ++ + + D +
Sbjct: 693 QPSPVSVLEASFDDDVSSGSECFESVSADLRGLRMQLQLLKLESATYKEGGMLVSSDEDT 752
Query: 736 SDSASSTSTG-TMVKKHTGTFSAMKFGRSNTWELDYVKDILCNVELMYMDFSLGRAREIV 794
SST T M+ K R W+ Y+ D+L N D ++ A V
Sbjct: 753 DQEESSTITDEAMITKEL---------REEDWKSSYLVDLLANSSFSDSDHNIVMATTPV 803
Query: 795 NPHLFNQLESRKGGFKSDAESRIQRKVIFDCVSECMDLRCRRYVGGGYKMWTKGVAMVKR 854
P LF LE + K+ +R++RK++FD +S + L + + + W K + +
Sbjct: 804 EPSLFEDLEKKYSSVKT--STRLERKLLFDQISREV-LHMLKQLSDPHP-WVKSTKVCPK 859
Query: 855 KEWLAEDVYKEISGWRGMGDSMVDELVEKDMSSQYGKWLDFEVDGFELGTEVVDQIVNSL 914
W A + + + R + ++ + D+ + +WL E D +G E+ +++ L
Sbjct: 860 --WDANKIQETL---RDLVTRKDEKPSKYDVEEKELQWLSLEDDIEIIGREI--EVM--L 910
Query: 915 FDDVVTEIL 923
D+++TE++
Sbjct: 911 TDELITELV 919