Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC014470A_C01 KMC014470A_c01
(932 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAA97190.1| contains similarity to DNA-binding protein~gene_... 175 8e-43
ref|NP_194262.1| putative protein; protein id: At4g25320.1, supp... 171 2e-41
ref|NP_201032.1| putative protein; protein id: At5g62260.1 [Arab... 161 2e-38
ref|NP_192945.2| putative DNA-binding protein; protein id: At4g1... 157 2e-37
ref|NP_194008.1| putative DNA binding protein; protein id: At4g2... 152 5e-36
>dbj|BAA97190.1| contains similarity to DNA-binding protein~gene_id:MMI9.9
[Arabidopsis thaliana] gi|26451694|dbj|BAC42942.1|
unknown protein [Arabidopsis thaliana]
gi|28973553|gb|AAO64101.1| unknown protein [Arabidopsis
thaliana]
Length = 404
Score = 175 bits (444), Expect = 8e-43
Identities = 94/162 (58%), Positives = 115/162 (70%), Gaps = 4/162 (2%)
Frame = +1
Query: 457 AATPAVTEKKKRGRPRKYGPDGRAIPGAAAAAAATPLSPMPISSSIPLTGDFSAWKRGRG 636
A+T + KKKRGRPRKY PDG P LSP PISSSIPL+GD+ WKRG+
Sbjct: 68 ASTGSDPTKKKRGRPRKYAPDGSLNPRFLRPT----LSPTPISSSIPLSGDYQ-WKRGKA 122
Query: 637 R----PVESIKKSFKLDFESPGPPAAPGPGEGIAYSIGGNFTAHVLTVNSGEDITMKIMS 804
+ P+E +KKS K ++ SP P P G++ +G NFT H TVN GED+TMK+M
Sbjct: 123 QQQHQPLEFVKKSHKFEYGSPAPTP---PLPGLSCYVGANFTTHQFTVNGGEDVTMKVMP 179
Query: 805 FSQQGARAICILSATGTISNVTLRQPSSSGGTLTYEGRFEIL 930
+SQQG+RAICILSATG+ISNVTL QP+++GGTLTYEGRFEIL
Sbjct: 180 YSQQGSRAICILSATGSISNVTLGQPTNAGGTLTYEGRFEIL 221
>ref|NP_194262.1| putative protein; protein id: At4g25320.1, supported by cDNA:
gi_20466212 [Arabidopsis thaliana]
gi|7486058|pir||T05553 hypothetical protein F24A6.160 -
Arabidopsis thaliana gi|4454020|emb|CAA23073.1| putative
protein [Arabidopsis thaliana]
gi|7269383|emb|CAB81343.1| putative protein [Arabidopsis
thaliana] gi|20466213|gb|AAM20424.1| putative protein
[Arabidopsis thaliana] gi|28059577|gb|AAO30071.1|
putative protein [Arabidopsis thaliana]
Length = 404
Score = 171 bits (432), Expect = 2e-41
Identities = 102/184 (55%), Positives = 116/184 (62%), Gaps = 4/184 (2%)
Frame = +1
Query: 391 PGSFHVAPRIENNL--DFSRAMVPAATPAVTEKKKRGRPRKYGPDGRAIPGAAAAAAATP 564
P + VA + N FS M T A KKKRGRPRKY PDG +
Sbjct: 54 PAAATVAAAVTENAATPFSLTMPTENTSAEQLKKKRGRPRKYNPDGTLV---------VT 104
Query: 565 LSPMPISSSIPLTGDFSAWKRGRGRPVES--IKKSFKLDFESPGPPAAPGPGEGIAYSIG 738
LSPMPISSS+PLT +F KRGRGR + +KKS F+ P G G A +G
Sbjct: 105 LSPMPISSSVPLTSEFPPRKRGRGRGKSNRWLKKSQMFQFDR-SPVDTNLAGVGTADFVG 163
Query: 739 GNFTAHVLTVNSGEDITMKIMSFSQQGARAICILSATGTISNVTLRQPSSSGGTLTYEGR 918
NFT HVL VN+GED+TMKIM+FSQQG+RAICILSA G ISNVTLRQ +SGGTLTYEGR
Sbjct: 164 ANFTPHVLIVNAGEDVTMKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGR 223
Query: 919 FEIL 930
FEIL
Sbjct: 224 FEIL 227
>ref|NP_201032.1| putative protein; protein id: At5g62260.1 [Arabidopsis thaliana]
Length = 441
Score = 161 bits (407), Expect = 2e-38
Identities = 96/196 (48%), Positives = 117/196 (58%), Gaps = 38/196 (19%)
Frame = +1
Query: 457 AATPAVTEKKKRGRPRKYGPDGRAIPGAAAAAAATPLSPMPISSSIPLTGDFSAWKRGRG 636
A+T + KKKRGRPRKY PDG P LSP PISSSIPL+GD+ WKRG+
Sbjct: 68 ASTGSDPTKKKRGRPRKYAPDGSLNPRFLRPT----LSPTPISSSIPLSGDYQ-WKRGKA 122
Query: 637 R----PVESIKKSFKLDFESPGPP---------------------------------AAP 705
+ P+E +KKS K ++ SP AAP
Sbjct: 123 QQQHQPLEFVKKSHKFEYGSPDVGKWDQHNWILLGTLLSEEAITLRPTNANSVLLSLAAP 182
Query: 706 GPG-EGIAYSIGGNFTAHVLTVNSGEDITMKIMSFSQQGARAICILSATGTISNVTLRQP 882
P G++ +G NFT H TVN GED+TMK+M +SQQG+RAICILSATG+ISNVTL QP
Sbjct: 183 TPPLPGLSCYVGANFTTHQFTVNGGEDVTMKVMPYSQQGSRAICILSATGSISNVTLGQP 242
Query: 883 SSSGGTLTYEGRFEIL 930
+++GGTLTYEGRFEIL
Sbjct: 243 TNAGGTLTYEGRFEIL 258
>ref|NP_192945.2| putative DNA-binding protein; protein id: At4g12080.1, supported by
cDNA: gi_17979484 [Arabidopsis thaliana]
gi|17979485|gb|AAL50079.1| AT4g12080/F16J13_150
[Arabidopsis thaliana] gi|23506149|gb|AAN31086.1|
At4g12080/F16J13_150 [Arabidopsis thaliana]
Length = 356
Score = 157 bits (398), Expect = 2e-37
Identities = 101/224 (45%), Positives = 124/224 (55%), Gaps = 34/224 (15%)
Frame = +1
Query: 361 GGHGVVRDEAPGSFHVAPRIENNLDFSRAMVP----------------------AATPAV 474
GG VVR +AP FHVA R E++ ++ P T A
Sbjct: 20 GGITVVRSDAPSDFHVAQRSESSNQSPTSVTPPPPQPSSHHTAPPPLQISTVTTTTTTAA 79
Query: 475 TE-------KKKRGRPRKYGPDGRAIPGAAAAAAATPLSPMPISSSIPLTG----DFSAW 621
E KKKRGRPRKYGPDG + A + P+S P S +P DFSA
Sbjct: 80 MEGISGGLMKKKRGRPRKYGPDGTVV-----ALSPKPISSAPAPSHLPPPSSHVIDFSAS 134
Query: 622 -KRGRGRPVESIKKSFKLDFESPGPPAAPGPGEGIAYSIGGNFTAHVLTVNSGEDITMKI 798
KR + +P S ++ K + GE S+GGNFT H++TVN+GED+TMKI
Sbjct: 135 EKRSKVKPTNSFNRT-KYHHQ------VENLGEWAPCSVGGNFTPHIITVNTGEDVTMKI 187
Query: 799 MSFSQQGARAICILSATGTISNVTLRQPSSSGGTLTYEGRFEIL 930
+SFSQQG R+IC+LSA G IS+VTLRQP SSGGTLTYEGRFEIL
Sbjct: 188 ISFSQQGPRSICVLSANGVISSVTLRQPDSSGGTLTYEGRFEIL 231
>ref|NP_194008.1| putative DNA binding protein; protein id: At4g22770.1, supported by
cDNA: 12041. [Arabidopsis thaliana]
gi|7486882|pir||T04572 hypothetical protein T12H17.160 -
Arabidopsis thaliana gi|2827554|emb|CAA16562.1| putative
DNA binding protein [Arabidopsis thaliana]
gi|7269124|emb|CAB79232.1| putative DNA binding protein
[Arabidopsis thaliana] gi|21537115|gb|AAM61456.1|
putative DNA binding protein [Arabidopsis thaliana]
Length = 334
Score = 152 bits (385), Expect = 5e-36
Identities = 98/214 (45%), Positives = 117/214 (53%), Gaps = 24/214 (11%)
Frame = +1
Query: 361 GGHGVVRDEAPGSFHVAPRIENNLDFSRAMVPAATPAVTE----------------KKKR 492
GG VVR AP FH+APR E + ++ P P KK+R
Sbjct: 16 GGVTVVRSNAPSDFHMAPRSETSNTPPNSVAPPPPPPPQNSFTPSAAMDGFSSGPIKKRR 75
Query: 493 GRPRKYGPDGRAIPGAAAAAAATPLSPMPISSSIPLTG---DFSAW--KRGRGRPVESIK 657
GRPRKYG DG AA LSP PISS+ P T DFS KRG+ +P
Sbjct: 76 GRPRKYGHDG----------AAVTLSPNPISSAAPTTSHVIDFSTTSEKRGKMKPATPTP 125
Query: 658 KSF---KLDFESPGPPAAPGPGEGIAYSIGGNFTAHVLTVNSGEDITMKIMSFSQQGARA 828
SF K E+ G E S NFT H++TVN+GED+T +I+SFSQQG+ A
Sbjct: 126 SSFIRPKYQVENLG--------EWSPSSAAANFTPHIITVNAGEDVTKRIISFSQQGSLA 177
Query: 829 ICILSATGTISNVTLRQPSSSGGTLTYEGRFEIL 930
IC+L A G +S+VTLRQP SSGGTLTYEGRFEIL
Sbjct: 178 ICVLCANGVVSSVTLRQPDSSGGTLTYEGRFEIL 211
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 872,957,282
Number of Sequences: 1393205
Number of extensions: 22519967
Number of successful extensions: 160799
Number of sequences better than 10.0: 512
Number of HSP's better than 10.0 without gapping: 107950
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 150904
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 51859780984
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)