Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005337A_C01 KMC005337A_c01
(579 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||T12180 probable transcription factor - fava bean gi|2104681... 172 2e-42
ref|NP_193416.1| nuclear antigen homolog; protein id: At4g16830.... 104 1e-29
gb|AAM61393.1| nuclear antigen homolog [Arabidopsis thaliana] 97 5e-27
ref|NP_199532.1| putative protein; protein id: At5g47210.1, supp... 89 8e-27
gb|AAM63072.1| nuclear RNA binding protein A-like protein [Arabi... 89 8e-27
>pir||T12180 probable transcription factor - fava bean
gi|2104681|emb|CAA66481.1| transcription factor [Vicia
faba]
Length = 370
Score = 172 bits (437), Expect = 2e-42
Identities = 98/148 (66%), Positives = 109/148 (73%), Gaps = 6/148 (4%)
Frame = +3
Query: 153 KRRGLRTRASRPNRLRCLPSHFLLAQAVRDARNEPSRGGRGGARGAGRGFG-----RGRG 317
K +G R + ++P +L P+ AQAVR++RNE RG RGG RG GRGFG RGRG
Sbjct: 39 KDQGKRAQTNKPAQLPSKPAP--PAQAVRESRNEGGRGSRGG-RGGGRGFGGERGGRGRG 95
Query: 318 FNRDFS-NDENSFPASGAPDNLGPFEGDSEKASERRGYGAPRGPYRGGGGGRRGGFSNGE 494
F RDFS NDENSFPAS APD+ G EG EK SERRG+G PR PYRGG RRGGFSNGE
Sbjct: 96 FGRDFSSNDENSFPASRAPDSQGAVEG--EKFSERRGFGGPRPPYRGG---RRGGFSNGE 150
Query: 495 ADEEGRPRRAFERHSGTGRGSGFKREGA 578
EEGRPRRAFERHSGTGRGS FKR+GA
Sbjct: 151 GGEEGRPRRAFERHSGTGRGSEFKRDGA 178
Score = 77.4 bits (189), Expect = 1e-13
Identities = 43/65 (66%), Positives = 46/65 (70%), Gaps = 3/65 (4%)
Frame = +2
Query: 38 MATMNPLDGLGDDAEDPSQLIAAEQLKAAAAAATAPPKKAAGAQDQGKQA---KPAQMPS 208
MAT NP D LGDD EDPSQLI EQLKAAAA P K A +DQGK+A KPAQ+PS
Sbjct: 1 MATTNPFDLLGDDVEDPSQLIITEQLKAAAA-----PTKKAAEKDQGKRAQTNKPAQLPS 55
Query: 209 KPLPP 223
KP PP
Sbjct: 56 KPAPP 60
>ref|NP_193416.1| nuclear antigen homolog; protein id: At4g16830.1, supported by
cDNA: 118826., supported by cDNA: gi_17380879, supported
by cDNA: gi_20465928 [Arabidopsis thaliana]
gi|7488137|pir||F71435 probable nuclear antigen -
Arabidopsis thaliana gi|2245037|emb|CAB10456.1| nuclear
antigen homolog [Arabidopsis thaliana]
gi|6492264|gb|AAF14243.1|AF110227_1 nuclear RNA binding
protein [Arabidopsis thaliana]
gi|7268434|emb|CAB80954.1| nuclear antigen homolog
[Arabidopsis thaliana] gi|17380880|gb|AAL36252.1|
putative nuclear antigen homolog [Arabidopsis thaliana]
gi|20465929|gb|AAM20150.1| putative nuclear antigen-like
protein [Arabidopsis thaliana]
gi|22022571|gb|AAM83242.1| AT4g16830/dl4440w
[Arabidopsis thaliana] gi|23308317|gb|AAN18128.1|
At4g16830/dl4440w [Arabidopsis thaliana]
Length = 355
Score = 104 bits (260), Expect(2) = 1e-29
Identities = 73/125 (58%), Positives = 83/125 (66%), Gaps = 7/125 (5%)
Frame = +3
Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG-FNRDFSNDENSFPASGAPDNLGPFEGDS 401
AQAVR+AR++ RGG G RG GF RGRG +NRD D N+ + G G EGD
Sbjct: 55 AQAVREARSDAPRGG--GGRG---GFNRGRGGYNRD---DGNNGYSGGYTKPSG--EGDV 104
Query: 402 EKAS-ERRGYG-APRGPYRGGGGG----RRGGFSNGEADEEGRPRRAFERHSGTGRGSGF 563
K+S ERRG G APRG +RG GGG RRGGFSN D E RPRRAFER SGTGRGS F
Sbjct: 105 SKSSYERRGGGGAPRGSFRGEGGGPGGGRRGGFSNEGGDGE-RPRRAFERRSGTGRGSDF 163
Query: 564 KREGA 578
KR+G+
Sbjct: 164 KRDGS 168
Score = 47.0 bits (110), Expect(2) = 1e-29
Identities = 28/66 (42%), Positives = 36/66 (54%), Gaps = 4/66 (6%)
Frame = +2
Query: 38 MATMNPLDGLGDDAEDPSQLIAA----EQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMP 205
MAT+NP D L DDAEDPSQL A ++ K + ++ P K A ++P
Sbjct: 1 MATLNPFDLLDDDAEDPSQLAVAIEKIDKSKKSGQVSSLPAKSA------------PKLP 48
Query: 206 SKPLPP 223
SKPLPP
Sbjct: 49 SKPLPP 54
>gb|AAM61393.1| nuclear antigen homolog [Arabidopsis thaliana]
Length = 354
Score = 97.1 bits (240), Expect(2) = 5e-27
Identities = 69/125 (55%), Positives = 80/125 (63%), Gaps = 7/125 (5%)
Frame = +3
Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG-FNRDFSNDENSFPASGAPDNLGPFEGDS 401
AQAVR+AR++ RG RG G GF RG G +NRD D N+ + G G EGD
Sbjct: 55 AQAVREARSDAPRG-----RGRG-GFSRGHGGYNRD---DGNNGYSGGYTKPSG--EGDV 103
Query: 402 EKAS-ERRGYG-APRGPYRGGGGG----RRGGFSNGEADEEGRPRRAFERHSGTGRGSGF 563
K+S ERRG G APRG +RG GGG RRGGFSN E E RPRR +ER SGTGRGS F
Sbjct: 104 SKSSYERRGGGGAPRGSFRGEGGGPGGGRRGGFSN-EGGEGERPRRTYERRSGTGRGSDF 162
Query: 564 KREGA 578
KR+G+
Sbjct: 163 KRDGS 167
Score = 45.8 bits (107), Expect(2) = 5e-27
Identities = 27/66 (40%), Positives = 36/66 (53%), Gaps = 4/66 (6%)
Frame = +2
Query: 38 MATMNPLDGLGDDAEDPSQLIAA----EQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMP 205
MAT+NP D L DDAEDPSQL + ++ K + ++ P K A ++P
Sbjct: 1 MATLNPFDLLDDDAEDPSQLAVSIEKIDKSKKSGPVSSLPAKSA------------PKLP 48
Query: 206 SKPLPP 223
SKPLPP
Sbjct: 49 SKPLPP 54
>ref|NP_199532.1| putative protein; protein id: At5g47210.1, supported by cDNA:
19104. [Arabidopsis thaliana] gi|8809603|dbj|BAA97154.1|
gene_id:MQL5.6~pir||G71444~similar to unknown protein
[Arabidopsis thaliana] gi|22655182|gb|AAM98181.1|
putative protein [Arabidopsis thaliana]
Length = 357
Score = 89.0 bits (219), Expect(2) = 8e-27
Identities = 60/127 (47%), Positives = 74/127 (58%), Gaps = 10/127 (7%)
Frame = +3
Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG---FNRDFSNDENSFPASGAPDNLGPFEG 395
+QAVR++RN P +GGRGG G G GF RGRG +NRD N++ AP N F G
Sbjct: 51 SQAVRESRNAP-QGGRGGTGGRG-GFSRGRGNGGYNRDNRNND-------APGNENGFSG 101
Query: 396 DSEKASERRGYGAPRGPYRGG---GGGR----RGGFSNGEADEEGRPRRAFERHSGTGRG 554
+ SE GA RG GG GGGR RGG +NGE+ + RP R ++RHS TG G
Sbjct: 102 GYRRPSEDAD-GASRGGSVGGYRVGGGREGPRRGGVANGESGDVERPPRNYDRHSRTGHG 160
Query: 555 SGFKREG 575
+G KR G
Sbjct: 161 TGMKRNG 167
Score = 53.1 bits (126), Expect(2) = 8e-27
Identities = 30/62 (48%), Positives = 34/62 (54%)
Frame = +2
Query: 38 MATMNPLDGLGDDAEDPSQLIAAEQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMPSKPL 217
MA++NP D LGDDAEDPSQL A K AAA P KA A+ P+KP
Sbjct: 1 MASLNPFDLLGDDAEDPSQLAVALSQKVEKAAAAVQPPKA------------AKFPTKPA 48
Query: 218 PP 223
PP
Sbjct: 49 PP 50
>gb|AAM63072.1| nuclear RNA binding protein A-like protein [Arabidopsis thaliana]
Length = 357
Score = 89.0 bits (219), Expect(2) = 8e-27
Identities = 60/127 (47%), Positives = 74/127 (58%), Gaps = 10/127 (7%)
Frame = +3
Query: 225 AQAVRDARNEPSRGGRGGARGAGRGFGRGRG---FNRDFSNDENSFPASGAPDNLGPFEG 395
+QAVR++RN P +GGRGG G G GF RGRG +NRD N++ AP N F G
Sbjct: 51 SQAVRESRNAP-QGGRGGTGGRG-GFSRGRGNGGYNRDNRNND-------APGNENGFSG 101
Query: 396 DSEKASERRGYGAPRGPYRGG---GGG----RRGGFSNGEADEEGRPRRAFERHSGTGRG 554
+ SE GA RG GG GGG RRGG +NGE+ + RP R ++RHS TG G
Sbjct: 102 GYRRPSEDAD-GASRGGSVGGYRVGGGLEGPRRGGVANGESGDVERPPRNYDRHSRTGHG 160
Query: 555 SGFKREG 575
+G KR G
Sbjct: 161 TGMKRNG 167
Score = 53.1 bits (126), Expect(2) = 8e-27
Identities = 30/62 (48%), Positives = 34/62 (54%)
Frame = +2
Query: 38 MATMNPLDGLGDDAEDPSQLIAAEQLKAAAAAATAPPKKAAGAQDQGKQAKPAQMPSKPL 217
MA++NP D LGDDAEDPSQL A K AAA P KA A+ P+KP
Sbjct: 1 MASLNPFDLLGDDAEDPSQLAVALSQKVEKAAAAVQPPKA------------AKFPTKPA 48
Query: 218 PP 223
PP
Sbjct: 49 PP 50
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 611,367,225
Number of Sequences: 1393205
Number of extensions: 18112776
Number of successful extensions: 277100
Number of sequences better than 10.0: 5310
Number of HSP's better than 10.0 without gapping: 106430
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 210279
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)