Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC011584A_C01 KMC011584A_c01
(632 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAC69997.1| HMG I/Y like protein [Glycine max] 146 3e-34
pir||T02029 DNA-binding protein pabf - common tobacco gi|555655|... 145 4e-34
pir||G96525 protein T1N15.25 [imported] - Arabidopsis thaliana g... 128 5e-29
ref|NP_175295.1| unknown protein; protein id: At1g48620.1 [Arabi... 128 5e-29
gb|AAG50847.1|AC074308_3 hypothetical protein, 3' partial [Arabi... 128 5e-29
>emb|CAC69997.1| HMG I/Y like protein [Glycine max]
Length = 413
Score = 146 bits (368), Expect = 3e-34
Identities = 103/258 (39%), Positives = 127/258 (48%), Gaps = 55/258 (21%)
Frame = +3
Query: 21 MDPSSISSPSASAPPTATVPFAADPNNHPPPPPAPADNPAHPPYAEMIYTAIGALKEKDG 200
MDP+SI P P TVPF +P+NH P A N HPPY EMIYTAIGALKEKDG
Sbjct: 1 MDPTSIPPP-----PATTVPFTVEPSNHVTP--ADNTNTNHPPYDEMIYTAIGALKEKDG 53
Query: 201 SSKRAINKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAP-PIL--- 368
SSKRAI KY+EQVYKD L +H +LLTHHL RLK+ G+L++ KKSYKLPGS P P+L
Sbjct: 54 SSKRAIGKYMEQVYKD-LPPTHSALLTHHLNRLKSAGLLILVKKSYKLPGSDPLPVLQAQ 112
Query: 369 ------------------------------------PPPPENIAAGAGAAPSP-ASRPRG 437
P P+ IA G +P P RG
Sbjct: 113 KPRGRPPKLKSQPNTELTWPALALNDNPALQSAKRGPGRPKKIAGPVGVSPGPMVPGRRG 172
Query: 438 RP----------RKVQPPQHLPQTLTLAGVQLQPQLPPQQQ----VQPEAQPQTQNLPPQ 575
RP R +PP+ + +G++ +P PP+ + V P A P LP
Sbjct: 173 RPPGTGRSKLPKRPGRPPKPKSVSAISSGLKRRPGRPPKAESNVNVIPFAAPVAPGLPTV 232
Query: 576 FNNVQQNAAPAQNAEPVG 629
V + P + P G
Sbjct: 233 QPIVPTASVPNGSPRPRG 250
>pir||T02029 DNA-binding protein pabf - common tobacco gi|555655|gb|AAA50196.1|
DNA-binding protein
Length = 546
Score = 145 bits (366), Expect = 4e-34
Identities = 101/217 (46%), Positives = 121/217 (55%), Gaps = 16/217 (7%)
Frame = +3
Query: 21 MDPSSISSPSASAPPT----ATVPFAADPNNHPPPPPAPADNPAHPPYAEMIYTAIGALK 188
MDPS + P+ + PT V A P PPPPAP+ +P HPPYAEMI AI ALK
Sbjct: 1 MDPS-MDLPTTTESPTFNSAQVVNHAPTPTPPQPPPPAPSFSPTHPPYAEMITAAITALK 59
Query: 189 EKDGSSKRAINKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKL---PGSAP 359
E+DGSS+ AI KYI++VY + L +H +LLTHHLKRLK +G L M K SY L PGSAP
Sbjct: 60 ERDGSSRIAIAKYIDRVYTN-LPPNHSALLTHHLKRLKNSGYLAMVKHSYMLAGPPGSAP 118
Query: 360 PILPPPPENIAAGAGAAPSPAS-RPRGRPRKVQP-PQHLPQTLTLAGVQLQPQLPPQQQV 533
P PP + + G G S S R GRP K++P Q Q A VQ Q Q Q Q
Sbjct: 119 P--PPSADADSNGVGTDVSSLSKRKPGRPPKLKPEAQPHAQPQVQAQVQFQDQFQAQLQA 176
Query: 534 QPEAQPQTQ-----NLPPQFNNVQQNA--APAQNAEP 623
Q +AQ Q Q PQF +QQ P Q +P
Sbjct: 177 QLQAQLQAQQQQAAQFQPQFQLIQQQPQYLPQQQFQP 213
>pir||G96525 protein T1N15.25 [imported] - Arabidopsis thaliana
gi|8778700|gb|AAF79708.1|AC020889_16 T1N15.25
[Arabidopsis thaliana]
Length = 594
Score = 128 bits (322), Expect = 5e-29
Identities = 83/202 (41%), Positives = 109/202 (53%), Gaps = 23/202 (11%)
Frame = +3
Query: 39 SSPSASAPPTATVPFAADPNNHPPPPPAPAD-NPAHPPYAEMIYTAIGALKEKDGSSKRA 215
+ P+A APP + A P P P P + +HPPY++MI TAI AL E DGSSK+A
Sbjct: 155 TGPTAVAPPNNIHLYQAAPPQQPQTSPVPPHPSISHPPYSDMICTAIAALNEPDGSSKQA 214
Query: 216 INKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAPPILPPPPENIAA 395
I++YIE++Y + +H +LLTHHLK LKT+G+LVM KKSYKL S PP PPPP ++A
Sbjct: 215 ISRYIERIYTG-IPTAHGALLTHHLKTLKTSGILVMVKKSYKL-ASTPP--PPPPTSVAP 270
Query: 396 G---------------------AGAAPSPASRPRGRPRKVQPPQHLPQTLTLAGVQL-QP 509
A + P R RGRP K +P PQ LT + Q
Sbjct: 271 SLEPPRSDFIVNENQPLPDPVLASSTPQTIKRGRGRPPKAKPDVVQPQPLTNGKLTWEQS 330
Query: 510 QLPPQQQVQPEAQPQTQNLPPQ 575
+LP + + + QP L PQ
Sbjct: 331 ELPVSRPEEIQIQPPQLPLQPQ 352
>ref|NP_175295.1| unknown protein; protein id: At1g48620.1 [Arabidopsis thaliana]
Length = 479
Score = 128 bits (322), Expect = 5e-29
Identities = 83/202 (41%), Positives = 109/202 (53%), Gaps = 23/202 (11%)
Frame = +3
Query: 39 SSPSASAPPTATVPFAADPNNHPPPPPAPAD-NPAHPPYAEMIYTAIGALKEKDGSSKRA 215
+ P+A APP + A P P P P + +HPPY++MI TAI AL E DGSSK+A
Sbjct: 40 TGPTAVAPPNNIHLYQAAPPQQPQTSPVPPHPSISHPPYSDMICTAIAALNEPDGSSKQA 99
Query: 216 INKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAPPILPPPPENIAA 395
I++YIE++Y + +H +LLTHHLK LKT+G+LVM KKSYKL S PP PPPP ++A
Sbjct: 100 ISRYIERIYTG-IPTAHGALLTHHLKTLKTSGILVMVKKSYKL-ASTPP--PPPPTSVAP 155
Query: 396 G---------------------AGAAPSPASRPRGRPRKVQPPQHLPQTLTLAGVQL-QP 509
A + P R RGRP K +P PQ LT + Q
Sbjct: 156 SLEPPRSDFIVNENQPLPDPVLASSTPQTIKRGRGRPPKAKPDVVQPQPLTNGKLTWEQS 215
Query: 510 QLPPQQQVQPEAQPQTQNLPPQ 575
+LP + + + QP L PQ
Sbjct: 216 ELPVSRPEEIQIQPPQLPLQPQ 237
>gb|AAG50847.1|AC074308_3 hypothetical protein, 3' partial [Arabidopsis thaliana]
Length = 332
Score = 128 bits (322), Expect = 5e-29
Identities = 83/202 (41%), Positives = 109/202 (53%), Gaps = 23/202 (11%)
Frame = +3
Query: 39 SSPSASAPPTATVPFAADPNNHPPPPPAPAD-NPAHPPYAEMIYTAIGALKEKDGSSKRA 215
+ P+A APP + A P P P P + +HPPY++MI TAI AL E DGSSK+A
Sbjct: 40 TGPTAVAPPNNIHLYQAAPPQQPQTSPVPPHPSISHPPYSDMICTAIAALNEPDGSSKQA 99
Query: 216 INKYIEQVYKDQLTQSHESLLTHHLKRLKTNGMLVMDKKSYKLPGSAPPILPPPPENIAA 395
I++YIE++Y + +H +LLTHHLK LKT+G+LVM KKSYKL S PP PPPP ++A
Sbjct: 100 ISRYIERIYTG-IPTAHGALLTHHLKTLKTSGILVMVKKSYKL-ASTPP--PPPPTSVAP 155
Query: 396 G---------------------AGAAPSPASRPRGRPRKVQPPQHLPQTLTLAGVQL-QP 509
A + P R RGRP K +P PQ LT + Q
Sbjct: 156 SLEPPRSDFIVNENQPLPDPVLASSTPQTIKRGRGRPPKAKPDVVQPQPLTNGKLTWEQS 215
Query: 510 QLPPQQQVQPEAQPQTQNLPPQ 575
+LP + + + QP L PQ
Sbjct: 216 ELPVSRPEEIQIQPPQLPLQPQ 237
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 676,695,841
Number of Sequences: 1393205
Number of extensions: 19811763
Number of successful extensions: 353436
Number of sequences better than 10.0: 4349
Number of HSP's better than 10.0 without gapping: 125790
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 280386
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26154777244
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)