Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000783A_C01 KMC000783A_c01
(578 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAC67503.1| SET-domain-containing protein [Nicotiana tabacum] 92 6e-18
ref|NP_565056.1| expressed protein; protein id: At1g73100.1, sup... 81 8e-15
gb|AAK28968.1|AF344446_1 SUVH3 [Arabidopsis thaliana] 81 8e-15
ref|NP_196113.1| SET-domain protein-like; protein id: At5g04940.... 79 4e-14
gb|AAK28975.1|AF344453_1 SET1 [Oryza sativa] 76 3e-13
>emb|CAC67503.1| SET-domain-containing protein [Nicotiana tabacum]
Length = 704
Score = 91.7 bits (226), Expect = 6e-18
Identities = 40/64 (62%), Positives = 50/64 (77%)
Frame = -3
Query: 576 PNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPLKVGQKKKKCLCGSVKC 397
PNV W+ VVR++ NEA H+AF+AIRHIPPM ELT+DYG+ K ++KKCLCGS+ C
Sbjct: 643 PNVYWQLVVRQSNNEATYHIAFFAIRHIPPMQELTFDYGMD---KADHRRKKCLCGSLNC 699
Query: 396 RGYF 385
RGYF
Sbjct: 700 RGYF 703
>ref|NP_565056.1| expressed protein; protein id: At1g73100.1, supported by cDNA:
gi_14625477, supported by cDNA: gi_20466307 [Arabidopsis
thaliana] gi|25406251|pir||F96756 hypothetical protein
F3N23.30 [imported] - Arabidopsis thaliana
gi|5903099|gb|AAD55657.1|AC008017_30 Unknown protein
[Arabidopsis thaliana] gi|20466308|gb|AAM20471.1|
unknown protein [Arabidopsis thaliana]
gi|25083988|gb|AAN72148.1| unknown protein [Arabidopsis
thaliana]
Length = 669
Score = 81.3 bits (199), Expect = 8e-15
Identities = 36/69 (52%), Positives = 48/69 (69%), Gaps = 5/69 (7%)
Frame = -3
Query: 576 PNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPLKVGQK-----KKKCLC 412
PNV W+PV+RE E+ +H+AF+A+RHIPPM ELTYDYGI + + ++ CLC
Sbjct: 600 PNVFWQPVIREGNGESVIHIAFFAMRHIPPMAELTYDYGISPTSEARDESLLHGQRTCLC 659
Query: 411 GSVKCRGYF 385
GS +CRG F
Sbjct: 660 GSEQCRGSF 668
>gb|AAK28968.1|AF344446_1 SUVH3 [Arabidopsis thaliana]
Length = 669
Score = 81.3 bits (199), Expect = 8e-15
Identities = 36/69 (52%), Positives = 48/69 (69%), Gaps = 5/69 (7%)
Frame = -3
Query: 576 PNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPLKVGQK-----KKKCLC 412
PNV W+PV+RE E+ +H+AF+A+RHIPPM ELTYDYGI + + ++ CLC
Sbjct: 600 PNVFWQPVIREGNGESVIHIAFFAMRHIPPMAELTYDYGISPTSEARDESLLHGQRTCLC 659
Query: 411 GSVKCRGYF 385
GS +CRG F
Sbjct: 660 GSEQCRGSF 668
>ref|NP_196113.1| SET-domain protein-like; protein id: At5g04940.1, supported by
cDNA: gi_13517742 [Arabidopsis thaliana]
gi|10178033|dbj|BAB11516.1| SET-domain protein-like
[Arabidopsis thaliana]
gi|13517743|gb|AAK28966.1|AF344444_1 SUVH1 [Arabidopsis
thaliana]
Length = 670
Score = 79.0 bits (193), Expect = 4e-14
Identities = 37/69 (53%), Positives = 44/69 (63%), Gaps = 5/69 (7%)
Frame = -3
Query: 576 PNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPLKVGQ-----KKKKCLC 412
PNV W+PV EN ++ +HVAF+AI HIPPM ELTYDYG+ P K+KC C
Sbjct: 601 PNVFWQPVSYENNSQLFVHVAFFAISHIPPMTELTYDYGVSRPSGTQNGNPLYGKRKCFC 660
Query: 411 GSVKCRGYF 385
GS CRG F
Sbjct: 661 GSAYCRGSF 669
>gb|AAK28975.1|AF344453_1 SET1 [Oryza sativa]
Length = 812
Score = 76.3 bits (186), Expect = 3e-13
Identities = 37/70 (52%), Positives = 48/70 (67%), Gaps = 6/70 (8%)
Frame = -3
Query: 576 PNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYG-----IVLPLKVG-QKKKKCL 415
PNV W+PV+ ++ +E H+AF+AI+HIPPM ELTYDYG + L + G +K K CL
Sbjct: 742 PNVFWQPVLYDHGDEGYPHIAFFAIKHIPPMTELTYDYGQSQGNVQLGINSGCRKSKNCL 801
Query: 414 CGSVKCRGYF 385
C S KCRG F
Sbjct: 802 CWSRKCRGSF 811
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 471,732,819
Number of Sequences: 1393205
Number of extensions: 9252028
Number of successful extensions: 18017
Number of sequences better than 10.0: 190
Number of HSP's better than 10.0 without gapping: 17503
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 17883
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)