Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000219A_C01 KMC000219A_c01
(541 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAD32567.1| NT3 [Nicotiana tabacum] 57 3e-11
ref|NP_171807.1| unknown protein; protein id: At1g03080.1 [Arabi... 47 4e-07
ref|NP_193212.1| centromere protein homolog; protein id: At4g147... 33 2e-04
ref|NP_192180.1| unknown protein; protein id: At4g02710.1 [Arabi... 42 0.004
dbj|BAB01254.1| centromere protein [Arabidopsis thaliana] 32 0.015
>gb|AAD32567.1| NT3 [Nicotiana tabacum]
Length = 612
Score = 56.6 bits (135), Expect(2) = 3e-11
Identities = 27/41 (65%), Positives = 33/41 (79%)
Frame = -3
Query: 509 EMTEQARRDSEQIGRLQFEVQNIQYILLKLADEKNNKGESQ 387
E +EQAR+ SE+IGRLQ E+Q IQYILLKL DEK +K S+
Sbjct: 153 ESSEQARKGSEKIGRLQLEIQKIQYILLKLEDEKKSKARSR 193
Score = 32.7 bits (73), Expect(2) = 3e-11
Identities = 14/37 (37%), Positives = 26/37 (69%)
Frame = -2
Query: 393 KPNRYLLLRDFIQIGRKNSRRRRKVCVCGCSRKPSTN 283
+ N ++L++FI IGR+NS +++K +C C R S++
Sbjct: 196 RSNTGIILKNFIHIGRRNSEKKKKAHLC-CFRPSSSS 231
>ref|NP_171807.1| unknown protein; protein id: At1g03080.1 [Arabidopsis thaliana]
gi|25518002|pir||F86161 F10O3.10 protein - Arabidopsis
thaliana gi|4587570|gb|AAD25801.1|AC006550_9 Strong
similarity to gi|2244833 centromere protein homolog from
Arabidopsis thaliana chromosome 4 contig gb|Z97337. ESTs
gb|T20765 and gb|AA586277 come from this gene
Length = 1744
Score = 46.6 bits (109), Expect(2) = 4e-07
Identities = 19/40 (47%), Positives = 33/40 (82%)
Frame = -3
Query: 506 MTEQARRDSEQIGRLQFEVQNIQYILLKLADEKNNKGESQ 387
++EQARR SE+IGRLQ E+Q +Q++LLKL ++ ++ +++
Sbjct: 1663 ISEQARRGSEKIGRLQLEIQRLQFLLLKLEGDREDRAKAK 1702
Score = 28.5 bits (62), Expect(2) = 4e-07
Identities = 13/32 (40%), Positives = 19/32 (58%), Gaps = 3/32 (9%)
Frame = -2
Query: 378 LLLRDFIQIGRKNSRRRR---KVCVCGCSRKP 292
+LLRD+I G + RR+R + CGC + P
Sbjct: 1710 ILLRDYIYSGVRGERRKRIKKRFAFCGCVQPP 1741
>ref|NP_193212.1| centromere protein homolog; protein id: At4g14760.1 [Arabidopsis
thaliana] gi|7488075|pir||E71410 probable centromere
protein - Arabidopsis thaliana gi|2244833|emb|CAB10255.1|
centromere protein homolog [Arabidopsis thaliana]
gi|7268182|emb|CAB78518.1| centromere protein homolog
[Arabidopsis thaliana]
Length = 1676
Score = 33.5 bits (75), Expect(2) = 2e-04
Identities = 16/37 (43%), Positives = 26/37 (70%)
Frame = -3
Query: 506 MTEQARRDSEQIGRLQFEVQNIQYILLKLADEKNNKG 396
+ E++R SE+I +LQ ++QNI+ +LKL D +KG
Sbjct: 1597 VVEKSRSGSEKIEQLQNKMQNIEQTVLKLEDGTKSKG 1633
Score = 32.3 bits (72), Expect(2) = 2e-04
Identities = 15/33 (45%), Positives = 19/33 (57%)
Frame = -2
Query: 378 LLLRDFIQIGRKNSRRRRKVCVCGCSRKPSTNE 280
+LLRD I G K S R++K CGC R + E
Sbjct: 1644 ILLRDIIHKGGKRSARKKKNRFCGCIRSSTKEE 1676
>ref|NP_192180.1| unknown protein; protein id: At4g02710.1 [Arabidopsis thaliana]
gi|7486853|pir||T01078 hypothetical protein T10P11.2.2 -
Arabidopsis thaliana gi|3892059|gb|AAC78272.1|AAC78272
predicted protein of unknown function [Arabidopsis
thaliana] gi|7269756|emb|CAB77756.1| predicted protein of
unknown function [Arabidopsis thaliana]
Length = 1111
Score = 42.4 bits (98), Expect = 0.004
Identities = 19/38 (50%), Positives = 29/38 (76%)
Frame = -3
Query: 500 EQARRDSEQIGRLQFEVQNIQYILLKLADEKNNKGESQ 387
E ARR +E+IGRLQ E+Q IQ++L+KL E+ ++ S+
Sbjct: 1033 EHARRGTEKIGRLQSEIQRIQFLLMKLEGEREHRLRSK 1070
>dbj|BAB01254.1| centromere protein [Arabidopsis thaliana]
Length = 1728
Score = 31.6 bits (70), Expect(2) = 0.015
Identities = 14/33 (42%), Positives = 18/33 (54%)
Frame = -2
Query: 378 LLLRDFIQIGRKNSRRRRKVCVCGCSRKPSTNE 280
+LLRD I G K + R++K CGC R E
Sbjct: 1696 ILLRDIIHKGGKRTARKKKNRFCGCMRSSGNEE 1728
Score = 27.7 bits (60), Expect(2) = 0.015
Identities = 13/29 (44%), Positives = 22/29 (75%)
Frame = -3
Query: 500 EQARRDSEQIGRLQFEVQNIQYILLKLAD 414
E++R SE+I ++Q E+QNI+ +LKL +
Sbjct: 1650 EKSRIGSEKIEQMQQEMQNIERTVLKLEE 1678
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 432,449,781
Number of Sequences: 1393205
Number of extensions: 8842623
Number of successful extensions: 30270
Number of sequences better than 10.0: 13
Number of HSP's better than 10.0 without gapping: 29662
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30259
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)