Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC001555A_C01 KMC001555A_c01
(1229 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_200152.1| putative protein; protein id: At5g53400.1, supp... 317 2e-85
gb|AAM65278.1| unknown [Arabidopsis thaliana] 316 4e-85
ref|NP_194518.1| putative protein; protein id: At4g27890.1 [Arab... 301 1e-80
ref|NP_035078.1| nuclear distribution gene C homolog (Aspergillu... 187 2e-46
emb|CAB95231.1| MNUDC-like protein [Leishmania major] 187 3e-46
>ref|NP_200152.1| putative protein; protein id: At5g53400.1, supported by cDNA: 38105.
[Arabidopsis thaliana] gi|8843769|dbj|BAA97317.1|
contains similarity to nuclear movement protein
nudC~gene_id:MYN8.1 [Arabidopsis thaliana]
gi|27765036|gb|AAO23639.1| At5g53400 [Arabidopsis
thaliana]
Length = 304
Score = 317 bits (813), Expect = 2e-85
Identities = 167/300 (55%), Positives = 205/300 (67%), Gaps = 24/300 (8%)
Frame = -1
Query: 1136 SKTDASPSQSKPSPSPTFAATFDRSNPAAFLERVFDFVSRESDFLATEAAEKEVASLVRA 957
S+ + S S+P P F AT +NP FLE+VFDF+ +SDFL +AE E+ VRA
Sbjct: 5 SEVEEESSSSRPMIFP-FRATLSSANPLGFLEKVFDFLGEQSDFLKKPSAEDEIVVAVRA 63
Query: 956 AREK-----RKKKREEELKASEEKAK------AEKRLKEAEKKP-------------KAE 849
A+EK +KK +E +K E+KA+ EK++++ KP K +
Sbjct: 64 AKEKLKKAEKKKAEKESVKPVEKKAEKEIVKLVEKKVEKESVKPTIAASSAEPIEVEKPK 123
Query: 848 ENDKKEGTAAIVPNKGNGMDLEKYSWTQTLQEVTVNVPVPHGTKSRFVVCEIKKNHLKVG 669
E ++K+ + IVPNKGNG DLE YSW Q LQEVTVN+PVP GTK+R VVCEIKKN LKVG
Sbjct: 124 EEEEKKESGPIVPNKGNGTDLENYSWIQNLQEVTVNIPVPTGTKARTVVCEIKKNRLKVG 183
Query: 668 LKGQPPIIEGEFFRPVKPDDCYWSIEDQSALSILLTKHDQMDWWKCLVKGDPEIDTQKVE 489
LKGQ PI++GE +R VKPDDCYW+IEDQ +SILLTK DQM+WWKC VKG+PEIDTQKVE
Sbjct: 184 LKGQDPIVDGELYRSVKPDDCYWNIEDQKVISILLTKSDQMEWWKCCVKGEPEIDTQKVE 243
Query: 488 PENSRLSDLDAETRQTVEKMMFDQRQKSMGLPTRDELHEARNF*RNLCLSIREMDFSRAK 309
PE S+L DLD ETR TVEKMMFDQRQK MGLPT +EL + + + EMDFS AK
Sbjct: 244 PETSKLGDLDPETRSTVEKMMFDQRQKQMGLPTSEEL-QKQEILKKFMSEHPEMDFSNAK 302
>gb|AAM65278.1| unknown [Arabidopsis thaliana]
Length = 304
Score = 316 bits (810), Expect = 4e-85
Identities = 166/300 (55%), Positives = 205/300 (68%), Gaps = 24/300 (8%)
Frame = -1
Query: 1136 SKTDASPSQSKPSPSPTFAATFDRSNPAAFLERVFDFVSRESDFLATEAAEKEVASLVRA 957
S+ + S S+P P F AT +NP FLE+VFDF+ +SDFL +AE E+ VRA
Sbjct: 5 SEVEEESSSSRPMIFP-FRATLSSANPLGFLEKVFDFLGEQSDFLKKPSAEDEIVVAVRA 63
Query: 956 AREK-----RKKKREEELKASEEKAK------AEKRLKEAEKKP-------------KAE 849
A+EK +KK +E +K E+KA+ EK++++ KP K +
Sbjct: 64 AKEKLKKAEKKKAEKESVKPVEKKAEKEIVKLVEKKVEKESVKPTMAASSAEPIEVEKPK 123
Query: 848 ENDKKEGTAAIVPNKGNGMDLEKYSWTQTLQEVTVNVPVPHGTKSRFVVCEIKKNHLKVG 669
+ ++K+ + IVPNKGNG DLE YSW Q LQEVTVN+PVP GTK+R VVCEIKKN LKVG
Sbjct: 124 DEEEKKESGPIVPNKGNGTDLENYSWIQNLQEVTVNIPVPTGTKARTVVCEIKKNRLKVG 183
Query: 668 LKGQPPIIEGEFFRPVKPDDCYWSIEDQSALSILLTKHDQMDWWKCLVKGDPEIDTQKVE 489
LKGQ PI++GE +R VKPDDCYW+IEDQ +SILLTK DQM+WWKC VKG+PEIDTQKVE
Sbjct: 184 LKGQDPIVDGELYRSVKPDDCYWNIEDQKVISILLTKSDQMEWWKCCVKGEPEIDTQKVE 243
Query: 488 PENSRLSDLDAETRQTVEKMMFDQRQKSMGLPTRDELHEARNF*RNLCLSIREMDFSRAK 309
PE S+L DLD ETR TVEKMMFDQRQK MGLPT +EL + + + EMDFS AK
Sbjct: 244 PETSKLGDLDPETRSTVEKMMFDQRQKQMGLPTSEEL-QKQEILKKFMSEHPEMDFSNAK 302
>ref|NP_194518.1| putative protein; protein id: At4g27890.1 [Arabidopsis thaliana]
gi|7487473|pir||T09028 hypothetical protein T27E11.130 -
Arabidopsis thaliana gi|4972120|emb|CAB43977.1| putative
protein [Arabidopsis thaliana] gi|7269642|emb|CAB81438.1|
putative protein [Arabidopsis thaliana]
gi|23296480|gb|AAN13067.1| unknown protein [Arabidopsis
thaliana]
Length = 293
Score = 301 bits (771), Expect = 1e-80
Identities = 164/285 (57%), Positives = 198/285 (68%), Gaps = 17/285 (5%)
Frame = -1
Query: 1112 QSKPSPSPTFAATFDRSNPAAFLERVFDFVSRESDFLATEAAEKEVASLVRAA----REK 945
+++PS P F A+FD SNP AFLE+V D + +ES+FL + AEKE+ + V AA RE
Sbjct: 9 EARPSMVP-FTASFDPSNPIAFLEKVLDVIGKESNFLKKDTAEKEIVAAVMAAKQRLREA 67
Query: 944 RKKKREEELKASEEKAKAEK-RLKEAE-KKPKAE----------ENDKKEGTAA-IVPNK 804
KKK E+E S E K +K LK E +KPK E E K+E + IVPNK
Sbjct: 68 EKKKLEKESVKSMEVEKPKKDSLKPTELEKPKEESLMATDPMEIEKPKEEKESGPIVPNK 127
Query: 803 GNGMDLEKYSWTQTLQEVTVNVPVPHGTKSRFVVCEIKKNHLKVGLKGQPPIIEGEFFRP 624
GNG+D EKYSW Q LQEVT+N+P+P GTKSR V CEIKKN LKVGLKGQ I++GEFF
Sbjct: 128 GNGLDFEKYSWGQNLQEVTINIPMPEGTKSRSVTCEIKKNRLKVGLKGQDLIVDGEFFNS 187
Query: 623 VKPDDCYWSIEDQSALSILLTKHDQMDWWKCLVKGDPEIDTQKVEPENSRLSDLDAETRQ 444
VKPDDC+W+IEDQ +S+LLTK DQM+WWK VKG+PEIDTQKVEPE S+L DLD ETR
Sbjct: 188 VKPDDCFWNIEDQKMISVLLTKQDQMEWWKYCVKGEPEIDTQKVEPETSKLGDLDPETRA 247
Query: 443 TVEKMMFDQRQKSMGLPTRDELHEARNF*RNLCLSIREMDFSRAK 309
+VEKMMFDQRQK MGLP DE+ E ++ + MDFS AK
Sbjct: 248 SVEKMMFDQRQKQMGLPRSDEI-EKKDMLKKFMAQNPGMDFSNAK 291
>ref|NP_035078.1| nuclear distribution gene C homolog (Aspergillus) [Mus musculus]
gi|20846655|ref|XP_124402.1| nuclear distribution gene C
homolog [Mus musculus] gi|2654358|emb|CAA75677.1| MNUDC
protein [Mus musculus] gi|2808636|emb|CAA57201.1| Sig 92
[Mus musculus] gi|15030022|gb|AAH11253.1| Similar to
nuclear distribution gene C homolog (Aspergillus) [Mus
musculus] gi|26328485|dbj|BAC27981.1| unnamed protein
product [Mus musculus]
Length = 332
Score = 187 bits (476), Expect = 2e-46
Identities = 100/220 (45%), Positives = 141/220 (63%), Gaps = 2/220 (0%)
Frame = -1
Query: 962 RAAREKRKKKREEELKASEEKAKAEKRLKEAEKKPKAEENDKKEGTAAIVPNKGNGMDLE 783
R E +KK E+ +A + + K+ + + EE++K +G + PN GNG DL
Sbjct: 114 RLQLEIDQKKDAEDQEAQLKNGSLDSPGKQDAEDEEDEEDEKDKGK--LKPNLGNGADLP 171
Query: 782 KYSWTQTLQEVTVNVP--VPHGTKSRFVVCEIKKNHLKVGLKGQPPIIEGEFFRPVKPDD 609
Y WTQTL E+ + VP V K + VV +I++ HL+VGLKGQPP+++GE + VK ++
Sbjct: 172 NYRWTQTLAELDLAVPFRVSFRLKGKDVVVDIQRRHLRVGLKGQPPVVDGELYNEVKVEE 231
Query: 608 CYWSIEDQSALSILLTKHDQMDWWKCLVKGDPEIDTQKVEPENSRLSDLDAETRQTVEKM 429
W IED +++ L K ++M+WW LV DPEI+T+K+ PENS+LSDLD+ETR VEKM
Sbjct: 232 SSWLIEDGKVVTVHLEKINKMEWWNRLVTSDPEINTKKINPENSKLSDLDSETRSMVEKM 291
Query: 428 MFDQRQKSMGLPTRDELHEARNF*RNLCLSIREMDFSRAK 309
M+DQRQKSMGLPT DE + + + EMDFS+AK
Sbjct: 292 MYDQRQKSMGLPTSDE-QKKQEILKKFMDQHPEMDFSKAK 330
>emb|CAB95231.1| MNUDC-like protein [Leishmania major]
Length = 328
Score = 187 bits (475), Expect = 3e-46
Identities = 103/244 (42%), Positives = 147/244 (60%), Gaps = 6/244 (2%)
Frame = -1
Query: 1022 SRESDFLATEAAEKEVASLVRAAREKRKKKREEELK-----ASEEKAKAEKRLKEAEKKP 858
+R S + +E+EV ++ A + +R++EL+ +E KAK E EAE
Sbjct: 84 ARTSSRVEVLESEEEVNAVAAAQAAEEAAQRKQELERAQREVAEAKAKKEAAGNEAETGA 143
Query: 857 KAEENDKKEGTAAIVPNKGNGMDLEKYSWTQTLQEVTVNVPVPH-GTKSRFVVCEIKKNH 681
D + + P NG EKY ++Q+LQE V VP+P K + V I +H
Sbjct: 144 TEAFADTGDAPRGLPPTAANGFAYEKYIFSQSLQEAEVRVPLPAVNVKGKQVSIVITSSH 203
Query: 680 LKVGLKGQPPIIEGEFFRPVKPDDCYWSIEDQSALSILLTKHDQMDWWKCLVKGDPEIDT 501
L VG+KGQPPI++G+ + V+ ++C W+IED + + L K + M+WWK + +GDPEID
Sbjct: 204 LTVGMKGQPPIVDGDLYSKVRAEECMWTIEDGHTVVVTLYKVNSMEWWKTIFQGDPEIDL 263
Query: 500 QKVEPENSRLSDLDAETRQTVEKMMFDQRQKSMGLPTRDELHEARNF*RNLCLSIREMDF 321
QKV PENS+L DLD++TRQTVEKMM+DQRQK MG PT DE + ++ R + EMDF
Sbjct: 264 QKVIPENSKLDDLDSDTRQTVEKMMYDQRQKMMGKPTSDE-QKKQDMLRKFMEAHPEMDF 322
Query: 320 SRAK 309
S+AK
Sbjct: 323 SQAK 326
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,085,323,680
Number of Sequences: 1393205
Number of extensions: 27154378
Number of successful extensions: 249625
Number of sequences better than 10.0: 3033
Number of HSP's better than 10.0 without gapping: 125888
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 208020
length of database: 448,689,247
effective HSP length: 126
effective length of database: 273,145,417
effective search space used: 77300153011
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)