Miyakogusa Predicted Gene
- Lj3g3v1605130.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v1605130.1 Non Characterized Hit- tr|I1NBS4|I1NBS4_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.34652
PE,65.03,0,SUBFAMILY NOT NAMED,NULL; PHOSPHATIDYLINOSITOL
N-ACETYLGLUCOSAMINYLTRANSFERASE SUBUNIT P (DOWN
SYNDR,NODE_28956_length_3099_cov_38.585674.path2.1
(915 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 299 8e-81
AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 297 2e-80
AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 196 9e-50
AT5G26910.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 193 6e-49
AT3G05750.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 163 6e-40
AT3G05750.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 135 1e-31
AT2G39435.1 | Symbols: | Phosphatidylinositol N-acetyglucosamin... 52 2e-06
AT3G53540.1 | Symbols: | unknown protein; LOCATED IN: plasma me... 52 2e-06
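The summary rows above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: it assumes each row has the layout shown above (sequence id, a "|"-delimited description, bit score, E-value). The names HIT_ROW, parse_hit_rows and best_hit_per_locus are hypothetical helpers introduced for illustration.

```python
import re

# One summary row: sequence id, "|"-delimited description, bit score, E-value.
# The layout is an assumption taken from the rows listed in this report.
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s+\|.*\s"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d*\.?\d*e[-+]?\d+|\d+\.\d+)\s*$"
)


def parse_hit_rows(lines):
    """Return (sequence_id, bit_score, e_value) tuples for rows that match."""
    hits = []
    for line in lines:
        match = HIT_ROW.match(line.strip())
        if match is None:
            continue  # header line or other non-row text
        evalue = match.group("evalue")
        if evalue.startswith("e"):
            # Legacy BLAST may print very small values as "e-100".
            evalue = "1" + evalue
        hits.append((match.group("id"), float(match.group("bits")), float(evalue)))
    return hits


def best_hit_per_locus(hits):
    """Keep the highest-scoring gene model for each locus (id before the dot)."""
    best = {}
    for seq_id, bits, evalue in hits:
        locus = seq_id.split(".")[0]
        if locus not in best or bits > best[locus][1]:
            best[locus] = (seq_id, bits, evalue)
    return best


rows = [
    "Sequences producing significant alignments: (bits) Value",
    "AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 299 8e-81",
    "AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 297 2e-80",
    "AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 196 9e-50",
    "AT5G26910.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 193 6e-49",
]
hits = parse_hit_rows(rows)
best = best_hit_per_locus(hits)
```

Grouping by locus is a design choice for this list, where several rows (AT5G26910.1, .2, .3) are alternative gene models of the same Arabidopsis locus.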
>AT5G26910.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G58650.1); Has 1322 Blast hits to 684 proteins
in 162 species: Archae - 4; Bacteria - 497; Metazoa -
157; Fungi - 101; Plants - 155; Viruses - 0; Other
Eukaryotes - 408 (source: NCBI BLink). |
chr5:9466169-9469523 REVERSE LENGTH=853
Length = 853
Score = 299 bits (765), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 285/938 (30%), Positives = 447/938 (47%), Gaps = 138/938 (14%)
Query: 3 MEKRRSKGSFLSLFDWNAKSRKKLLWNDPNLPEVSKQGKENVVTLPESQLRRIKVDENGA 62
+E++RS+G FL+LFDW+ KSRKKL + E+S++ K+ L +S++ I+VDE G
Sbjct: 4 VERKRSRGGFLNLFDWHGKSRKKLF--SGSTSELSEESKQPAQNLLKSRVSLIEVDEIGK 61
Query: 63 SPSNMASGDFSSNLS-ICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHG 121
S SN D S S + SD+G G++AP +VARLMGL+SLP E LN
Sbjct: 62 SSSNNQRSDSSCCASSVTSDDGQGTRAPSVVARLMGLESLPVPNVQE---PRLNPDLDPF 118
Query: 122 VSHCNEVALPMDEFCPRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAK 181
+ ++ D + Y+N+ + S D ++ R N+P++RFQ+E PP+SAK
Sbjct: 119 LLRPSQNTNRWDAYENLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAK 178
Query: 182 PIPVTHNKLLSPIKSPGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRI 240
PI VT+N+ LSPI+SPGF+P +N ++MEAA+++IE SP+ R R PS SSVP+RI
Sbjct: 179 PICVTNNRHLSPIRSPGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSPSSVPMRI 238
Query: 241 LDLKERLEAAQCAFTPEKLVGPSNANPANGILYERSSNSHK---------CTSAFKGSRD 291
DL+E+LEAAQ K+ N+N + Y ++ K TS F G
Sbjct: 239 QDLREKLEAAQ------KVSSRQNSNDTFNLKYPSGKHNEKRITTSLTTPSTSKFMG--- 289
Query: 292 SEKNSSSHSATRRRSDSLALQAKPNVQNRDTLNSNGNRKYVKQKEQKEIKSNQLSRSQKP 351
K+S+ + + ++ QAK ++ N+K ++ +KS R
Sbjct: 290 --KSSTDGLKGKVKPSYVSAQAKAGTTPLSVTRNSANQKEKADAKKCVVKSQNALRG--- 344
Query: 352 SSDRDVHQRTCTSRNSNVLGQNNQKQNC------MTTTSKPISKIDSNKATARASSSESS 405
S N+ QNNQKQNC MT+ S +NK + S
Sbjct: 345 ---------APISMGKNMFKQNNQKQNCRDNQPSMTSVLNQKSSKVNNKVVNKVPVESGS 395
Query: 406 IGTR--KTTGRGAKNVNVQPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPD 463
I + +T KN ++ R + T R + LP+ +K IS RS +
Sbjct: 396 ISKQLGLSTASAEKNTSLSLSR---KKTLPRSKKLPN-----GMQKSGISDDKRTKRSEN 447
Query: 464 HAVNNFQSKSIKCNFTTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDSPSSTEQAMEI 523
IKCN T DG +++ + K+ DVISFTF SP++ DS SST+
Sbjct: 448 M---------IKCNITIDGGLNKGKDDRKKEMDVISFTFSSPIKGLSSDSLSSTQ----- 493
Query: 524 RNSVGVNSPGHNDNSYHRNLSLSPPGLNMIDSDAXXXXXXXXXXXXTSRLNLPQCTLATE 583
G+ + S++ I D+ TS+L C+L E
Sbjct: 494 ----GIGQDTDSAVSFN------------IGGDSLNALLEQKLRELTSKLESSSCSLTQE 537
Query: 584 XXXXXXXXXXQDKVPSMVSITSKEQDKSFYPDQFSDKLDCMHNYHCSSGDPVLNLNQ--- 640
D++ M+S +S+ Y + L + + S D ++
Sbjct: 538 ---EPSYSIPMDEMNGMISFSSE------YEKSTQNGLRKVLSESESVSDCTSFYDKQKF 588
Query: 641 QIQTSEVREDPRCSSKDANDLGFQHPNAVTVLETSFASESYLDSEDSTYGSTVYSSMQDE 700
QIQ E + +A+DL S S+ + D + T+ SS D+
Sbjct: 589 QIQAEEHEVSSISTVTEADDL------------RSSCSKGFSDCRQTAEYGTIQSS-SDQ 635
Query: 701 EVSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELE--YI 758
E++ + +ES +E + SE + L E ++ E YI
Sbjct: 636 ELT-WVSLNESHQAQDESELSES-----------------VVTLSYSEAEERLDWEFEYI 677
Query: 759 QDILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVSE 818
+IL + M +E+ +G A V+ +LFD +E +G +K++RK LFD V++
Sbjct: 678 SEILGSDQLMVKEYALGMATDVLPASLFDEMEGRGEVTA-------AKIKRKTLFDFVNK 730
Query: 819 CLELRFTQAFVGRCKS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMST 877
CL LR Q F+G C+ + +++ WLAEEL +E+ G + M E+M+DELV K+MS+
Sbjct: 731 CLALRCEQMFMGSCRGLLGKGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSS 790
Query: 878 GCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLLLG 915
G+WLDF+ E +EEG ++E +I+++L+++LV+DL+ G
Sbjct: 791 FEGRWLDFERETYEEGIDIEGEIVSTLVDDLVNDLVSG 828
>AT5G26910.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: mitochondrion;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G58650.1). |
chr5:9466169-9469523 REVERSE LENGTH=852
Length = 852
Score = 297 bits (761), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 286/939 (30%), Positives = 447/939 (47%), Gaps = 141/939 (15%)
Query: 3 MEKRRSKGSFLSLFDWNAKSRKKLLWNDPNLPEVSKQGKENVVTLPESQLRRIKVDENGA 62
+E++RS+G FL+LFDW+ KSRKKL + SKQ +N++ +S++ I+VDE G
Sbjct: 4 VERKRSRGGFLNLFDWHGKSRKKLFSGSTSELSESKQPAQNLL---KSRVSLIEVDEIGK 60
Query: 63 SPSNMASGDFSSNLS-ICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHG 121
S SN D S S + SD+G G++AP +VARLMGL+SLP E LN
Sbjct: 61 SSSNNQRSDSSCCASSVTSDDGQGTRAPSVVARLMGLESLPVPNVQE---PRLNPDLDPF 117
Query: 122 VSHCNEVALPMDEFCPRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAK 181
+ ++ D + Y+N+ + S D ++ R N+P++RFQ+E PP+SAK
Sbjct: 118 LLRPSQNTNRWDAYENLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAK 177
Query: 182 PIPVTHNKLLSPIKSPGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRI 240
PI VT+N+ LSPI+SPGF+P +N ++MEAA+++IE SP+ R R PS SSVP+RI
Sbjct: 178 PICVTNNRHLSPIRSPGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSPSSVPMRI 237
Query: 241 LDLKERLEAAQCAFTPEKLVGPSNANPANGILYERSSNSHK---------CTSAFKGSRD 291
DL+E+LEAAQ K+ N+N + Y ++ K TS F G
Sbjct: 238 QDLREKLEAAQ------KVSSRQNSNDTFNLKYPSGKHNEKRITTSLTTPSTSKFMG--- 288
Query: 292 SEKNSSSHSATRRRSDSLALQAKPNVQNRDTLNSNGNRKYVKQKEQKEIKSNQLSRSQKP 351
K+S+ + + ++ QAK ++ N+K ++ +KS R
Sbjct: 289 --KSSTDGLKGKVKPSYVSAQAKAGTTPLSVTRNSANQKEKADAKKCVVKSQNALRG--- 343
Query: 352 SSDRDVHQRTCTSRNSNVLGQNNQKQNC------MTTTSKPISKIDSNKATARASSSESS 405
S N+ QNNQKQNC MT+ S +NK + S
Sbjct: 344 ---------APISMGKNMFKQNNQKQNCRDNQPSMTSVLNQKSSKVNNKVVNKVPVESGS 394
Query: 406 IGTR--KTTGRGAKNVNVQPKRSSLRATDNRKEFLP-SKTESISQKKKFISRSSHEARSP 462
I + +T KN ++ + +RK+ LP SK +K IS RS
Sbjct: 395 ISKQLGLSTASAEKNTSL---------SLSRKKTLPRSKKLPNGMQKSGISDDKRTKRSE 445
Query: 463 DHAVNNFQSKSIKCNFTTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDSPSSTEQAME 522
+ IKCN T DG +++ + K+ DVISFTF SP++ DS SST+
Sbjct: 446 NM---------IKCNITIDGGLNKGKDDRKKEMDVISFTFSSPIKGLSSDSLSSTQ---- 492
Query: 523 IRNSVGVNSPGHNDNSYHRNLSLSPPGLNMIDSDAXXXXXXXXXXXXTSRLNLPQCTLAT 582
G+ + S++ I D+ TS+L C+L
Sbjct: 493 -----GIGQDTDSAVSFN------------IGGDSLNALLEQKLRELTSKLESSSCSLTQ 535
Query: 583 EXXXXXXXXXXQDKVPSMVSITSKEQDKSFYPDQFSDKLDCMHNYHCSSGDPVLNLNQ-- 640
E D++ M+S +S+ Y + L + + S D ++
Sbjct: 536 E---EPSYSIPMDEMNGMISFSSE------YEKSTQNGLRKVLSESESVSDCTSFYDKQK 586
Query: 641 -QIQTSEVREDPRCSSKDANDLGFQHPNAVTVLETSFASESYLDSEDSTYGSTVYSSMQD 699
QIQ E + +A+DL S S+ + D + T+ SS D
Sbjct: 587 FQIQAEEHEVSSISTVTEADDL------------RSSCSKGFSDCRQTAEYGTIQSS-SD 633
Query: 700 EEVSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELE--Y 757
+E++ + +ES +E + SE + L E ++ E Y
Sbjct: 634 QELT-WVSLNESHQAQDESELSES-----------------VVTLSYSEAEERLDWEFEY 675
Query: 758 IQDILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVS 817
I +IL + M +E+ +G A V+ +LFD +E +G +K++RK LFD V+
Sbjct: 676 ISEILGSDQLMVKEYALGMATDVLPASLFDEMEGRGEVTA-------AKIKRKTLFDFVN 728
Query: 818 ECLELRFTQAFVGRCKS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMS 876
+CL LR Q F+G C+ + +++ WLAEEL +E+ G + M E+M+DELV K+MS
Sbjct: 729 KCLALRCEQMFMGSCRGLLGKGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMS 788
Query: 877 TGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLLLG 915
+ G+WLDF+ E +EEG ++E +I+++L+++LV+DL+ G
Sbjct: 789 SFEGRWLDFERETYEEGIDIEGEIVSTLVDDLVNDLVSG 827
>AT3G58650.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.1); Has 2350 Blast
hits to 1412 proteins in 248 species: Archae - 0;
Bacteria - 487; Metazoa - 577; Fungi - 236; Plants -
184; Viruses - 4; Other Eukaryotes - 862 (source: NCBI
BLink). | chr3:21696349-21699219 REVERSE LENGTH=820
Length = 820
Score = 196 bits (497), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 189/521 (36%), Positives = 258/521 (49%), Gaps = 66/521 (12%)
Query: 3 MEKRRSKGSFLSLFDWNAKSRKKLLW-NDPNLPEVSKQGKENVVTLPESQLRRIKVDENG 61
+E++R +G+FL+LFDW+ KSRKKL N L E SKQ KENV + +VD++
Sbjct: 4 VERKRPRGAFLNLFDWHGKSRKKLFSSNLSQLSEESKQAKENVQNPSITPHSVFEVDQSV 63
Query: 62 ASPSNMASGDFSSNLS-ICSDEGCGSKAPGLVARLMGLDSLPA------SANTELSCTSL 114
+P+ D S S + SD+G +A +VARLMGL+ LP N +L L
Sbjct: 64 KNPTYNPRSDSSCCASSVTSDDGNVVRA-SVVARLMGLEGLPLPNVLEPRVNPDLDPYFL 122
Query: 115 NGSSSHGVSHCNEVALPMDEFCPRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEM 174
S N +D D ++ H + S + R R +E RFQTE
Sbjct: 123 RSSRQANTWDAN-----VDRQSDFDGVSWDH---LDSRTSKGPRKRMIE-----RFQTET 169
Query: 175 LPPKSAKPIPVTHNKLLSPIKSPGFLPPKNAAHLMEAAAKIIEASPQHYTRDRMPSVRSS 234
LPP+SAKPI VTHNKLLSPI++PGF+P +N A++MEAA+++IE SP+ R RM S S
Sbjct: 170 LPPRSAKPISVTHNKLLSPIRNPGFVPSRNPAYVMEAASRMIEQSPRMIARTRMVSSSDS 229
Query: 235 S--VPLRILDLKERLEAAQCAFTPEKLVGPSNANPANGILYERSSNSHKCTSAF-KGSRD 291
S VPLRI DLKE+LEAAQ A T P +N Y R + K T+ K S D
Sbjct: 230 SSPVPLRIRDLKEKLEAAQKASTSV----PQISNDTRNSRYLRGDQNEKKTTVLGKNSYD 285
Query: 292 SEKNSSSHSATRRRSDSLALQAKPNV-QNRDTL--NSNGNRKYVK-QKEQKEIKSNQLSR 347
+ K + S A QAK + Q +D+L +S+GN++ QKE+ E K N+ +
Sbjct: 286 ALKGGEV------KPPSFAAQAKVSSNQKQDSLSMSSSGNKRMSSGQKEKVEAK-NRAVK 338
Query: 348 SQKPSSDRDVHQRTCTSRNSNVLGQNNQKQNCM-TTTSKPISKIDSNKATARASSSESSI 406
SQ S + + S NVL QNNQKQNC S+ + NK + S S
Sbjct: 339 SQNSS------KGSSLSTGKNVLRQNNQKQNCRDNQQSRRVMNKVVNKVLVESGSISKSS 392
Query: 407 GTRKTTGRGAKNVNVQPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPDHAV 466
G ++ ++ + K+S R+ R ES + K I R
Sbjct: 393 GFTMSSAEKPTSLPLSRKKSLPRSKKPRNGV----QESGIYEDKRIKRG----------- 437
Query: 467 NNFQSKSIKCNFTTDGRIHQDAFNMKEGKDVISFTFMSPLR 507
KSIKCN + DG + K DVISFTF S ++
Sbjct: 438 ----EKSIKCNISIDGDSSTSKDDQKRDMDVISFTFSSSIK 474
Score = 120 bits (302), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 101/163 (61%), Gaps = 9/163 (5%)
Query: 754 ELEYIQDILENADFMSEEFVMGQA--DTVIMPNLFDLLENQGSSGTENYGDEYSKLERKV 811
ELEYI +IL + M ++F G ++++ +LFD +E + T K ERK
Sbjct: 654 ELEYITEILNSGQLMFQDFASGTTTNESLLPSSLFDEMERSRGAATS------MKTERKA 707
Query: 812 LFDCVSECLELRFTQAFVGRCKSWP-RWVTSVQRKRWLAEELYKEMFGFRNMEEVMVDEL 870
LFDCV++CL ++F + +G CK ++ + LAEE+ +E+ G + M E+M+DEL
Sbjct: 708 LFDCVNQCLAVKFERMLIGSCKGMMMSGGILLEHRDLLAEEVNREVKGLKKMREMMIDEL 767
Query: 871 VSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLL 913
V DMS G+W+ ++ E FEEG ++E +I+++L+++LVSD+L
Sbjct: 768 VDHDMSCFEGRWIGYEREMFEEGIDMEGEIVSALVDDLVSDIL 810
>AT5G26910.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G58650.1); Has 990 Blast hits to 447 proteins
in 125 species: Archae - 0; Bacteria - 525; Metazoa -
80; Fungi - 59; Plants - 91; Viruses - 0; Other
Eukaryotes - 235 (source: NCBI BLink). |
chr5:9466804-9469523 REVERSE LENGTH=638
Length = 638
Score = 193 bits (490), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 183/536 (34%), Positives = 269/536 (50%), Gaps = 66/536 (12%)
Query: 3 MEKRRSKGSFLSLFDWNAKSRKKLLWNDPNLPEVSKQGKENVVTLPESQLRRIKVDENGA 62
+E++RS+G FL+LFDW+ KSRKKL + E+S++ K+ L +S++ I+VDE G
Sbjct: 4 VERKRSRGGFLNLFDWHGKSRKKLFSGSTS--ELSEESKQPAQNLLKSRVSLIEVDEIGK 61
Query: 63 SPSNMASGDFSSNLS-ICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHG 121
S SN D S S + SD+G G++AP +VARLMGL+SLP E LN
Sbjct: 62 SSSNNQRSDSSCCASSVTSDDGQGTRAPSVVARLMGLESLPVPNVQE---PRLNPDLDPF 118
Query: 122 VSHCNEVALPMDEFCPRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAK 181
+ ++ D + Y+N+ + S D ++ R N+P++RFQ+E PP+SAK
Sbjct: 119 LLRPSQNTNRWDAYENLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAK 178
Query: 182 PIPVTHNKLLSPIKSPGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRI 240
PI VT+N+ LSPI+SPGF+P +N ++MEAA+++IE SP+ R R PS SSVP+RI
Sbjct: 179 PICVTNNRHLSPIRSPGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSPSSVPMRI 238
Query: 241 LDLKERLEAAQCAFTPEKLVGPSNANPANGILYERSSNSHK---------CTSAFKGSRD 291
DL+E+LEAAQ K+ N+N + Y ++ K TS F G
Sbjct: 239 QDLREKLEAAQ------KVSSRQNSNDTFNLKYPSGKHNEKRITTSLTTPSTSKFMG--- 289
Query: 292 SEKNSSSHSATRRRSDSLALQAKPNVQNRDTLNSNGNRKYVKQKEQKEIKSNQLSRSQKP 351
K+S+ + + ++ QAK ++ N+K ++ +KS R
Sbjct: 290 --KSSTDGLKGKVKPSYVSAQAKAGTTPLSVTRNSANQKEKADAKKCVVKSQNALRG--- 344
Query: 352 SSDRDVHQRTCTSRNSNVLGQNNQKQNC------MTTTSKPISKIDSNKATARASSSESS 405
S N+ QNNQKQNC MT+ S +NK + S
Sbjct: 345 ---------APISMGKNMFKQNNQKQNCRDNQPSMTSVLNQKSSKVNNKVVNKVPVESGS 395
Query: 406 IGTR--KTTGRGAKNVNVQPKRSSLRATDNRKEFLP-SKTESISQKKKFISRSSHEARSP 462
I + +T KN ++ SL +RK+ LP SK +K IS RS
Sbjct: 396 ISKQLGLSTASAEKNTSL-----SL----SRKKTLPRSKKLPNGMQKSGISDDKRTKRSE 446
Query: 463 DHAVNNFQSKSIKCNFTTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDSPSSTE 518
+ IKCN T DG +++ + K+ DVISFTF SP++ DS SST+
Sbjct: 447 NM---------IKCNITIDGGLNKGKDDRKKEMDVISFTFSSPIKGLSSDSLSSTQ 493
>AT3G05750.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: membrane;
EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.1); Has 2317 Blast
hits to 1467 proteins in 247 species: Archae - 4;
Bacteria - 750; Metazoa - 557; Fungi - 182; Plants -
180; Viruses - 0; Other Eukaryotes - 644 (source: NCBI
BLink). | chr3:1704677-1707546 FORWARD LENGTH=801
Length = 801
Score = 163 bits (412), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 179/535 (33%), Positives = 257/535 (48%), Gaps = 72/535 (13%)
Query: 3 MEKRRSKGSFLSLFDW---NAKSRKKLLWNDPNLPEVSKQGKENVVTLPESQLRRIKVDE 59
+E++RS+G FL++FDW + K + L E SKQ K+N +S I+ DE
Sbjct: 7 VERKRSRGGFLNMFDWPGKSRKKLFSSSSSSSKLSEGSKQEKQNAQNPSKSWPSLIEGDE 66
Query: 60 NGA-SPSNMASGDFSSNLSICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSS 118
G S N S S + SD+G GSKAP +VARLMGL+S+P E N
Sbjct: 67 IGKNSTYNPRSDSSCSTSTPTSDDGQGSKAPSVVARLMGLESIPVPNALE---PRRNPDF 123
Query: 119 SHGVSHCNEVALPMDEFCPRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPK 178
+ A D + Y+N+ + S D ++ R K NRP+ RFQTE LPP+
Sbjct: 124 DPYFLRSSRKASTWDAYENLGYVNLRSDYDGISWDHLDSRMNKECNRPIDRFQTETLPPR 183
Query: 179 SAKPIPVTHNKLLSPIKSPGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVP 237
SAKPIPVTHN+LLSPI+SPGF+ +N A +ME A+++IE SP+ + R S SSS+P
Sbjct: 184 SAKPIPVTHNRLLSPIRSPGFVQSRNPASVMEEASRMIEPSPRVVAKTRFSSSDSSSSLP 243
Query: 238 LRILDLKERLEAAQCAFTPEKLVGPSNANPANGILYERSSNSHKCTSAFKGSRDSEKNSS 297
++I DLKE+LEA+Q +P+ G N +KC F+G +D EK ++
Sbjct: 244 MKIRDLKEKLEASQKGQSPQISNGTCN---------------NKC---FRGKQD-EKRTT 284
Query: 298 SHSATRRRSD-----------------SLALQAKPN-VQNRD-TLNSNGNRKYVKQKEQK 338
T+ R++ S++ AK N + RD ++ SNG Y QK++
Sbjct: 285 LPLKTQERNNLLGESRFGGSKGKVKPPSVSAHAKANTIHKRDSSMLSNG---YRDQKKKV 341
Query: 339 EIKSNQLSRSQKPSSDRDVHQRTCTSRNSNVLGQNNQKQNCMTTTSKPISKIDSNKATAR 398
E K+ + K SS S V NNQKQN TS +S K +
Sbjct: 342 ETKNRIVKSGLKESS---------ASTRKTVDKPNNQKQNQFAETS--VSNQRGRKVMKK 390
Query: 399 ASSSESSIGTRKTTGRGAKNVNVQPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHE 458
+ GT TT + K +S S+ +++S+ KK +
Sbjct: 391 VNKVLVENGT--TTKKPGFTATSAKKSTSSSL---------SRKKNLSRSKKPANGVQEA 439
Query: 459 ARSPDHAVNNFQSKSIKCNFTTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDS 513
+ D + + K IKCN T DG + + K+ DVISFTF SP++ DS
Sbjct: 440 GVNSDKRIKKGE-KVIKCNITVDGGLKTGDDDRKKDMDVISFTFSSPIKGLSSDS 493
Score = 105 bits (263), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 130/227 (57%), Gaps = 12/227 (5%)
Query: 689 YGSTVYSSMQDEEVSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCE 748
Y ++ + DEEV+ +S T E++ ++ +S + N+ +++ L E
Sbjct: 584 YKKKIFQAEDDEEVNSFS-TAENLQISCSTSFSSSRNDYHH--NIEETELSESVALSEAE 640
Query: 749 VSSNMELEYIQDILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLE 808
+ ELEYI +I+ + M +EF +G A ++ +LFD TE D K+E
Sbjct: 641 EGHDWELEYITEIIASGQLMIKEFSLGMATDILPLSLFD--------ETEGKRDARGKIE 692
Query: 809 RKVLFDCVSECLELRFTQAFVGRCKS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMV 867
RK LFD V++ L L+ Q F+G CK + ++R+ LA+++ KE G + M E+M+
Sbjct: 693 RKTLFDLVNQWLTLKCEQMFMGTCKGVLGKQDIFLERREILADQVLKEAQGLKKMREMMM 752
Query: 868 DELVSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLLL 914
DELV DMS+ GKWLD+ E +EEG E+E++I++ L+++L++DL++
Sbjct: 753 DELVDNDMSSCEGKWLDYMRETYEEGIEIEEEIVSELVDDLINDLIM 799
>AT3G05750.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: membrane;
EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G26910.3); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr3:1705300-1707546 FORWARD LENGTH=698
Length = 698
Score = 135 bits (341), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 132/394 (33%), Positives = 192/394 (48%), Gaps = 65/394 (16%)
Query: 140 YMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKLLSPIKSPGF 199
Y+N+ + S D ++ R K NRP+ RFQTE LPP+SAKPIPVTHN+LLSPI+SPGF
Sbjct: 42 YVNLRSDYDGISWDHLDSRMNKECNRPIDRFQTETLPPRSAKPIPVTHNRLLSPIRSPGF 101
Query: 200 LPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRILDLKERLEAAQCAFTPEK 258
+ +N A +ME A+++IE SP+ + R S SSS+P++I DLKE+LEA+Q +P+
Sbjct: 102 VQSRNPASVMEEASRMIEPSPRVVAKTRFSSSDSSSSLPMKIRDLKEKLEASQKGQSPQI 161
Query: 259 LVGPSNANPANGILYERSSNSHKCTSAFKGSRDSEKNSSSHSATRRRSD----------- 307
G N +KC F+G +D EK ++ T+ R++
Sbjct: 162 SNGTCN---------------NKC---FRGKQD-EKRTTLPLKTQERNNLLGESRFGGSK 202
Query: 308 ------SLALQAKPN-VQNRD-TLNSNGNRKYVKQKEQKEIKSNQLSRSQKPSSDRDVHQ 359
S++ AK N + RD ++ SNG Y QK++ E K+ + K SS
Sbjct: 203 GKVKPPSVSAHAKANTIHKRDSSMLSNG---YRDQKKKVETKNRIVKSGLKESS------ 253
Query: 360 RTCTSRNSNVLGQNNQKQNCMTTTSKPISKIDSNKATARASSSESSIGTRKTTGRGAKNV 419
S V NNQKQN TS +S K + + GT TT +
Sbjct: 254 ---ASTRKTVDKPNNQKQNQFAETS--VSNQRGRKVMKKVNKVLVENGT--TTKKPGFTA 306
Query: 420 NVQPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCNFT 479
K +S S+ +++S+ KK + + D + + K IKCN T
Sbjct: 307 TSAKKSTSSSL---------SRKKNLSRSKKPANGVQEAGVNSDKRIKKGE-KVIKCNIT 356
Query: 480 TDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDS 513
DG + + K+ DVISFTF SP++ DS
Sbjct: 357 VDGGLKTGDDDRKKDMDVISFTFSSPIKGLSSDS 390
Score = 105 bits (263), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 130/227 (57%), Gaps = 12/227 (5%)
Query: 689 YGSTVYSSMQDEEVSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCE 748
Y ++ + DEEV+ +S T E++ ++ +S + N+ +++ L E
Sbjct: 481 YKKKIFQAEDDEEVNSFS-TAENLQISCSTSFSSSRNDYHH--NIEETELSESVALSEAE 537
Query: 749 VSSNMELEYIQDILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLE 808
+ ELEYI +I+ + M +EF +G A ++ +LFD TE D K+E
Sbjct: 538 EGHDWELEYITEIIASGQLMIKEFSLGMATDILPLSLFD--------ETEGKRDARGKIE 589
Query: 809 RKVLFDCVSECLELRFTQAFVGRCKS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMV 867
RK LFD V++ L L+ Q F+G CK + ++R+ LA+++ KE G + M E+M+
Sbjct: 590 RKTLFDLVNQWLTLKCEQMFMGTCKGVLGKQDIFLERREILADQVLKEAQGLKKMREMMM 649
Query: 868 DELVSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLLL 914
DELV DMS+ GKWLD+ E +EEG E+E++I++ L+++L++DL++
Sbjct: 650 DELVDNDMSSCEGKWLDYMRETYEEGIEIEEEIVSELVDDLINDLIM 696
>AT2G39435.1 | Symbols: | Phosphatidylinositol
N-acetylglucosaminyltransferase subunit P-related |
chr2:16464806-16466492 REVERSE LENGTH=464
Length = 464
Score = 52.0 bits (123), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 124/276 (44%), Gaps = 40/276 (14%)
Query: 651 PRC--SSKDANDLGFQHPNAVTVLETSFA------SESYLD-SEDSTYGSTVYSSMQDEE 701
P C +S+DA+ P+ V+VLE F SE LD SED Y + + Q E
Sbjct: 211 PECQTNSEDAH-----QPSPVSVLEPMFYEDNLDDSEDILDDSEDLPYPNFLSLENQLET 265
Query: 702 VSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELEYIQDI 761
+ S+++ ++G E +S + + A+K+ +G + + + YI DI
Sbjct: 266 LKSESESY------SDGSGMEVSSDEESALDSAIKESKESEPIGFLDTQESRDSSYIDDI 319
Query: 762 LENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVSECLE 821
L + V G+ D VI P +F+ LE + + T + + +RK+LFD V+ L
Sbjct: 320 LAEVLLGDKNCVPGKRDLVITPKIFEKLEKKYYTET-----SWKRSDRKILFDRVNSSL- 373
Query: 822 LRFTQAFVGRCKSWPRWVTSVQRKR-------WLAEELYKEMFGFRNMEEVMVDELVSKD 874
+ ++F + P W V R+ L +EL+K + E+ + ++K
Sbjct: 374 VEILESF----SATPTWKKPVSRRLGTALSTCGLKQELWKVL---SRQEKRSKKKSLAKV 426
Query: 875 MSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVS 910
+WL+ + + E+E I+ L++E+VS
Sbjct: 427 PVIDIDEWLELEADDESVVCELESMIVDELLSEVVS 462
>AT3G53540.1 | Symbols: | unknown protein; LOCATED IN: plasma
membrane; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s:
Protein of unknown function DUF3741
(InterPro:IPR022212); BEST Arabidopsis thaliana protein
match is: Protein of unknown function (DUF3741)
(TAIR:AT4G28760.2); Has 1710 Blast hits to 868 proteins
in 206 species: Archae - 2; Bacteria - 409; Metazoa -
304; Fungi - 204; Plants - 304; Viruses - 2; Other
Eukaryotes - 485 (source: NCBI BLink). |
chr3:19846805-19850670 REVERSE LENGTH=924
Length = 924
Score = 52.0 bits (123), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 117/262 (44%), Gaps = 67/262 (25%)
Query: 651 PRCSSKDANDLGFQHPNAVTVLETSFASESYLDSEDSTYGSTVYSS---------MQDEE 701
PR SSK+ + P+ V+VLE SF +D + GS + S MQ +
Sbjct: 684 PRESSKEGD-----QPSPVSVLEASF-------DDDVSSGSECFESVSADLRGLRMQLQL 731
Query: 702 VSDYSQTHE--SVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELEYIQ 759
+ S T++ + ++++ ++ SST T M K++ + + Y+
Sbjct: 732 LKLESATYKEGGMLVSSDEDTDQEESSTITDEAMITKELRE----------EDWKSSYLV 781
Query: 760 DILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVS-E 818
D+L N+ F + + A T + P+LF+ LE + SS + ++LERK+LFD +S E
Sbjct: 782 DLLANSSFSDSDHNIVMATTPVEPSLFEDLEKKYSSVKTS-----TRLERKLLFDQISRE 836
Query: 819 CL----ELRFTQAFVGRCKSWPRWVTS---------VQRK--------------RWLAEE 851
L +L +V K P+W + V RK +WL+ E
Sbjct: 837 VLHMLKQLSDPHPWVKSTKVCPKWDANKIQETLRDLVTRKDEKPSKYDVEEKELQWLSLE 896
Query: 852 LYKEMFGFRNMEEVMVDELVSK 873
E+ G R +E ++ DEL+++
Sbjct: 897 DDIEIIG-REIEVMLTDELITE 917