Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003806A_C01 KMC003806A_c01
(627 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_564407.1| expressed protein; protein id: At1g32730.1, sup... 55 6e-07
ref|NP_758307.1| hypothetical protein [Mycoplasma penetrans] gi|... 39 0.056
ref|XP_235683.1| similar to keratin protein K6irs [Homo sapiens]... 39 0.073
ref|NP_082518.1| RIKEN cDNA 2600017A12 [Mus musculus] gi|2505532... 38 0.12
ref|XP_236420.1| similar to Protein C20orf129 [Rattus norvegicus] 38 0.12
>ref|NP_564407.1| expressed protein; protein id: At1g32730.1, supported by cDNA:
116730. [Arabidopsis thaliana] gi|25403412|pir||C86452
protein F6N18.11 [imported] - Arabidopsis thaliana
gi|6714274|gb|AAF25970.1|AC017118_7 F6N18.11
[Arabidopsis thaliana] gi|21536980|gb|AAM61321.1|
unknown [Arabidopsis thaliana]
Length = 327
Score = 55.5 bits (132), Expect = 6e-07
Identities = 39/139 (28%), Positives = 67/139 (48%)
Frame = -1
Query: 627 RMRIQPFVDKGLHRLEKFAVDDDINKPIDEEPSEVSKRPKSCSDERLSAISDLIDKINKA 448
+ RI V GL +L++ D+ D++ + + +E+ SA++++IDK+NKA
Sbjct: 210 KKRIAETVKAGLVKLKRL----DLGSSSDDQDDIKRRVKRKKWEEKGSALNEIIDKLNKA 265
Query: 447 RSEEDLNSCLEMKSQLFNLEVDSGSIEILDHETPENEIAKSESTSAEELDYSSPKLVVTA 268
R+EEDL SCLEMKS+L + T+A E + P +V
Sbjct: 266 RTEEDLKSCLEMKSKL---------------------CGQVSPTAASEKNKIFPGVVRKV 304
Query: 267 EVDQDTLNTVDRYLSSLEQ 211
E+ ++ L + L S ++
Sbjct: 305 EMSEEALQKIAENLQSFDK 323
>ref|NP_758307.1| hypothetical protein [Mycoplasma penetrans]
gi|26454383|dbj|BAC44711.1| hypothetical protein
[Mycoplasma penetrans]
Length = 915
Score = 38.9 bits (89), Expect = 0.056
Identities = 39/156 (25%), Positives = 69/156 (44%), Gaps = 16/156 (10%)
Frame = -1
Query: 615 QPFVDK--GLHRLEKFAVDDDINKPIDEEPSEVSKRPKSCSDERLSAISDLIDKI-NKAR 445
+ FVD + L F DD +KP E PS + + K + + +S++ D + NK
Sbjct: 731 EKFVDTEFNVEELYSFQTSDDDSKPTVESPSSLEEEVKEFFSDLSAELSNIPDVVENKKE 790
Query: 444 SEED-------LNSCLE----MKSQLFNLEVDSGSIEILDHETPENEIAKSEST-SAEEL 301
+ +D +NS L+ SQLF E +++ + + S ST S++E
Sbjct: 791 NNDDIVIEDISINSDLDSLDLSSSQLFETEFLPNELKLDEEMNNQFSTLVSPSTESSKEA 850
Query: 300 DYSSPKLVVTAEV-DQDTLNTVDRYLSSLEQHVEEL 196
D P + V D+ + + ++L L+ E+L
Sbjct: 851 DVYEPNISADKIVSDKQITSDLQKFLEDLKVEKEKL 886
>ref|XP_235683.1| similar to keratin protein K6irs [Homo sapiens] [Rattus norvegicus]
Length = 1675
Score = 38.5 bits (88), Expect = 0.073
Identities = 28/85 (32%), Positives = 43/85 (49%)
Frame = -1
Query: 435 DLNSCLEMKSQLFNLEVDSGSIEILDHETPENEIAKSESTSAEELDYSSPKLVVTAEVDQ 256
D N L M + NL++DS E+ ++ IA +EEL +S KL VTA
Sbjct: 349 DTNVILSMDNNR-NLDLDSIIAEV---QSQYEIIAHKSKAESEELYHSKAKLQVTAVKHG 404
Query: 255 DTLNTVDRYLSSLEQHVEEL*GEVS 181
D+L + +S L + ++ L GE+S
Sbjct: 405 DSLKEIKMEISELNRTIQRLQGEIS 429
>ref|NP_082518.1| RIKEN cDNA 2600017A12 [Mus musculus] gi|25055327|ref|XP_135878.2|
RIKEN cDNA 2600017A12 [Mus musculus]
gi|22902415|gb|AAH37711.1| Similar to HIV TAT specific
factor 1 [Mus musculus] gi|26340228|dbj|BAC33777.1|
unnamed protein product [Mus musculus]
Length = 757
Score = 37.7 bits (86), Expect = 0.12
Identities = 29/118 (24%), Positives = 52/118 (43%), Gaps = 1/118 (0%)
Frame = -1
Query: 600 KGLHRLEKFAVDDDINKPIDEEPSEVSKRPKSCSDERLSAISDLIDK-INKARSEEDLNS 424
+G L+K + DDD + +E+ SE + S + + + +D+ ++ ED+
Sbjct: 529 EGEDSLKKESEDDDSEEESEEDSSEKQSQDGSDKEIEENGVKKDVDQDVSDKEFPEDVEK 588
Query: 423 CLEMKSQLFNLEVDSGSIEILDHETPENEIAKSESTSAEELDYSSPKLVVTAEVDQDT 250
E +++ E D GS +LD E E E + EE D ++V D D+
Sbjct: 589 ESE-ENETDKSEFDEGSERVLDEEGSEREFEEDSDEKEEEGDDDEEEVVYERVFDDDS 645
>ref|XP_236420.1| similar to Protein C20orf129 [Rattus norvegicus]
Length = 1413
Score = 37.7 bits (86), Expect = 0.12
Identities = 30/104 (28%), Positives = 52/104 (49%), Gaps = 2/104 (1%)
Frame = -1
Query: 561 DINKPIDEEPSEVSKRPKSCSDERLSAISDLIDKINKARSEEDLNSCLEMKSQLFNLEVD 382
++ DE+ EV+KR + + +I+ L+D +NK ++LNS E K+ L+
Sbjct: 953 NVTHSTDEDDDEVTKRDPPSASAKSISIAALLD-VNKEEPNKELNSKKEGKASPSFLKKG 1011
Query: 381 SGSIEILDHETPE--NEIAKSESTSAEELDYSSPKLVVTAEVDQ 256
S + L TPE +AK+++ + + SS LV E +Q
Sbjct: 1012 SQKLRSLLSLTPEKRENLAKNKAPAFYRMCSSSDTLVSEGEENQ 1055
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 487,752,968
Number of Sequences: 1393205
Number of extensions: 9528049
Number of successful extensions: 31204
Number of sequences better than 10.0: 125
Number of HSP's better than 10.0 without gapping: 29471
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31010
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25586195130
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)