Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003964A_C01 KMC003964A_c01
(544 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||S36638 glycoprotein EP1 - carrot gi|349437|gb|AAA33136.1| N... 214 4e-62
pir||T14580 SIEP1L protein precursor - beet gi|1107526|emb|CAA61... 223 9e-58
ref|NP_178007.1| hypothetical protein; protein id: At1g78860.1 [... 205 3e-56
gb|AAN31898.1| putative glycoprotein (EP1) [Arabidopsis thaliana] 199 2e-50
ref|NP_178006.1| glycoprotein(EP1), putative; protein id: At1g78... 199 2e-50
>pir||S36638 glycoprotein EP1 - carrot gi|349437|gb|AAA33136.1| N-glycosylation
sites: (130..138), (244..252), (352..360), (734..742),
(748..756), (865..873)
Length = 389
Score = 214 bits (545), Expect(2) = 4e-62
Identities = 105/155 (67%), Positives = 123/155 (78%), Gaps = 4/155 (2%)
Frame = -1
Query: 460 SESLFRWVWEANRGNPVKENATFSLGTDGNLVLADADGRIAWQTNTANKGVVAFRLLPNG 281
+ESL RWVWEANRGNPV ENAT + G DGNLVLA ++G++AWQT+TANKGVV ++LPNG
Sbjct: 84 TESLMRWVWEANRGNPVDENATLTFGPDGNLVLARSNGQVAWQTSTANKGVVGLKILPNG 143
Query: 280 NMVLLDANGKFVWQSFDHPTDTILVGQYLRAGGPRELVSRLSEKENVNGPYSLVLESKGL 101
NMVL D+ GKF+WQSFD PTDT+LVGQ L+ G +LVSR S ENVNGPYSLV+E KGL
Sbjct: 144 NMVLYDSKGKFLWQSFDTPTDTLLVGQSLKMGAVTKLVSRASPGENVNGPYSLVMEPKGL 203
Query: 100 GLYYKPKNAPKPIRYWSESYVEK----GSLENVTF 8
LYYKP +PKPIRY+S S K SL+NVTF
Sbjct: 204 HLYYKPTTSPKPIRYYSFSLFTKLNKNESLQNVTF 238
Score = 45.8 bits (107), Expect(2) = 4e-62
Identities = 20/27 (74%), Positives = 24/27 (88%)
Frame = -3
Query: 542 FTSPFQVGFYNTTPNSFTLALRVGLQR 462
FTSPFQ+ FYN TP +FTLALR+GL+R
Sbjct: 57 FTSPFQLCFYNQTPTAFTLALRMGLRR 83
>pir||T14580 SIEP1L protein precursor - beet gi|1107526|emb|CAA61158.1| SIEP1L
protein [Beta vulgaris]
Length = 391
Score = 223 bits (569), Expect = 9e-58
Identities = 117/185 (63%), Positives = 138/185 (74%), Gaps = 5/185 (2%)
Frame = -1
Query: 544 SSPLHSKLVSTTPPQ-TPSLSLCVWASNASESLFRWVWE----ANRGNPVKENATFSLGT 380
+SP +TTP T +L +C+ S ESLFRWVWE ANRGNPV ENATFSLGT
Sbjct: 60 NSPFQLCFYNTTPNSYTLALRMCLVRS---ESLFRWVWEVRHEANRGNPVGENATFSLGT 116
Query: 379 DGNLVLADADGRIAWQTNTANKGVVAFRLLPNGNMVLLDANGKFVWQSFDHPTDTILVGQ 200
DGNLVLADAD RIAWQTNT NKGVV F+LLPNGNMVL DA KF+WQSFD+PTDT+LVGQ
Sbjct: 117 DGNLVLADADRRIAWQTNTTNKGVVGFKLLPNGNMVLHDAKDKFIWQSFDYPTDTLLVGQ 176
Query: 199 YLRAGGPRELVSRLSEKENVNGPYSLVLESKGLGLYYKPKNAPKPIRYWSESYVEKGSLE 20
LR GP +LVSR+S K N NGPY+ V+E + L +YYK N+PKP+ Y+S + K SL+
Sbjct: 177 SLRLDGPTKLVSRVSAKLNSNGPYTFVMEPRVLTMYYKSPNSPKPLLYFSMLDLSKSSLK 236
Query: 19 NVTFT 5
VTF+
Sbjct: 237 EVTFS 241
Score = 43.9 bits (102), Expect = 0.001
Identities = 20/27 (74%), Positives = 23/27 (85%)
Frame = -3
Query: 542 FTSPFQVGFYNTTPNSFTLALRVGLQR 462
F SPFQ+ FYNTTPNS+TLALR+ L R
Sbjct: 59 FNSPFQLCFYNTTPNSYTLALRMCLVR 85
>ref|NP_178007.1| hypothetical protein; protein id: At1g78860.1 [Arabidopsis
thaliana] gi|25296191|pir||A96818 hypothetical protein
F9K20.9 [imported] - Arabidopsis thaliana
gi|3834308|gb|AAC83024.1| Strong similarity to
glycoprotein EP1 gb|L16983 Daucus carota and a member of
S locus glycoprotein family PF|00954. EST gb|AA720110
comes from this gene. [Arabidopsis thaliana]
Length = 443
Score = 205 bits (522), Expect(2) = 3e-56
Identities = 98/155 (63%), Positives = 118/155 (75%), Gaps = 1/155 (0%)
Frame = -1
Query: 469 SNASESLFRWVWEANRGNPVKENATFSLGTDGNLVLADADGRIAWQTNTANKGVVAFRLL 290
+ A ES RWVWEANRG+PVKENAT + G DGNLVLA+ADGR+ WQTNTANKGVV ++L
Sbjct: 80 NRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNTANKGVVGIKIL 139
Query: 289 PNGNMVLLDANGKFVWQSFDHPTDTILVGQYLRAGGPRELVSRLSEKENVNGPYSLVLES 110
NGNMV+ D+NGKFVWQSFD PTDT+LVGQ L+ G +LVSRLS N NGPYSLV+E+
Sbjct: 140 ENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVNANGPYSLVMEA 199
Query: 109 KGLGLYYKPKNAPKPIRYWS-ESYVEKGSLENVTF 8
K L LYY PKPI Y+ E + + L+++TF
Sbjct: 200 KKLVLYYTTNKTPKPIGYYEYEFFTKIAQLQSMTF 234
Score = 35.0 bits (79), Expect(2) = 3e-56
Identities = 14/24 (58%), Positives = 20/24 (83%)
Frame = -3
Query: 542 FTSPFQVGFYNTTPNSFTLALRVG 471
F+ F++ FYNTT N++TLALR+G
Sbjct: 56 FSDNFRLCFYNTTQNAYTLALRIG 79
>gb|AAN31898.1| putative glycoprotein (EP1) [Arabidopsis thaliana]
Length = 441
Score = 199 bits (506), Expect = 2e-50
Identities = 98/169 (57%), Positives = 122/169 (71%), Gaps = 1/169 (0%)
Frame = -1
Query: 511 TPPQTPSLSLCVWASNASESLFRWVWEANRGNPVKENATFSLGTDGNLVLADADGRIAWQ 332
T P +L+L + + ES RWVWEANRG+PVKENAT + G DGNLVLA+ADGR+ WQ
Sbjct: 67 TTPNAYTLALRI-GNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQ 125
Query: 331 TNTANKGVVAFRLLPNGNMVLLDANGKFVWQSFDHPTDTILVGQYLRAGGPRELVSRLSE 152
TNTANKG V ++L NGNMV+ D++GKFVWQSFD PTDT+LVGQ L+ G +LVSRLS
Sbjct: 126 TNTANKGAVGIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSP 185
Query: 151 KENVNGPYSLVLESKGLGLYYKPKNAPKPIRYWS-ESYVEKGSLENVTF 8
N NGPYSLV+E+K L LYY PKPI Y+ E + + +++TF
Sbjct: 186 SVNTNGPYSLVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQFQSMTF 234
Score = 38.1 bits (87), Expect = 0.070
Identities = 15/24 (62%), Positives = 21/24 (87%)
Frame = -3
Query: 542 FTSPFQVGFYNTTPNSFTLALRVG 471
F+ F++ FYNTTPN++TLALR+G
Sbjct: 56 FSDNFRLCFYNTTPNAYTLALRIG 79
>ref|NP_178006.1| glycoprotein(EP1), putative; protein id: At1g78850.1, supported by
cDNA: gi_15450841 [Arabidopsis thaliana]
gi|25406575|pir||H96817 hypothetical protein F9K20.10
[imported] - Arabidopsis thaliana
gi|3834309|gb|AAC83025.1| Strong similarity to
glycoprotein EP1 gb|L16983 Daucus carota and a member of
S locus glycoprotein family PF|00954. ESTs gb|F13813,
gb|T21052, gb|R30218 and gb|W43262 come from this gene.
[Arabidopsis thaliana] gi|15450842|gb|AAK96692.1| Strong
similarity to glycoprotein EP1 [Arabidopsis thaliana]
Length = 441
Score = 199 bits (506), Expect = 2e-50
Identities = 98/169 (57%), Positives = 122/169 (71%), Gaps = 1/169 (0%)
Frame = -1
Query: 511 TPPQTPSLSLCVWASNASESLFRWVWEANRGNPVKENATFSLGTDGNLVLADADGRIAWQ 332
T P +L+L + + ES RWVWEANRG+PVKENAT + G DGNLVLA+ADGR+ WQ
Sbjct: 67 TTPNAYTLALRI-GNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQ 125
Query: 331 TNTANKGVVAFRLLPNGNMVLLDANGKFVWQSFDHPTDTILVGQYLRAGGPRELVSRLSE 152
TNTANKG V ++L NGNMV+ D++GKFVWQSFD PTDT+LVGQ L+ G +LVSRLS
Sbjct: 126 TNTANKGAVGIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSP 185
Query: 151 KENVNGPYSLVLESKGLGLYYKPKNAPKPIRYWS-ESYVEKGSLENVTF 8
N NGPYSLV+E+K L LYY PKPI Y+ E + + +++TF
Sbjct: 186 SVNTNGPYSLVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQFQSMTF 234
Score = 38.1 bits (87), Expect = 0.070
Identities = 15/24 (62%), Positives = 21/24 (87%)
Frame = -3
Query: 542 FTSPFQVGFYNTTPNSFTLALRVG 471
F+ F++ FYNTTPN++TLALR+G
Sbjct: 56 FSDNFRLCFYNTTPNAYTLALRIG 79
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 565,252,428
Number of Sequences: 1393205
Number of extensions: 14088526
Number of successful extensions: 50895
Number of sequences better than 10.0: 533
Number of HSP's better than 10.0 without gapping: 46565
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 49987
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18750593680
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)