FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0486, 180 aa
1>>>pF1KE0486 180 - 180 aa - 180 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9005+/-0.000329; mu= 14.2091+/- 0.020
mean_var=56.4251+/-11.577, 0's: 0 Z-trim(114.4): 12 B-trim: 604 in 1/52
Lambda= 0.170741
statistics sampled from 24203 (24213) to 24203 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.679), E-opt: 0.2 (0.284), width: 16
Scan time: 5.700
The best scores are: opt bits E(85289)
NP_076958 (OMIM: 610152) centromere protein M isof ( 180) 1160 293.6 1.1e-79
NP_001291299 (OMIM: 610152) centromere protein M i ( 146) 953 242.6 2.1e-64
XP_011528670 (OMIM: 610152) PREDICTED: centromere ( 236) 870 222.2 4.6e-58
NP_001002876 (OMIM: 610152) centromere protein M i ( 107) 668 172.3 2.2e-43
NP_001291300 (OMIM: 610152) centromere protein M i ( 132) 653 168.6 3.5e-42
NP_001291302 (OMIM: 610152) centromere protein M i ( 73) 461 121.2 3.6e-28
NP_001103685 (OMIM: 610152) centromere protein M i ( 58) 296 80.5 5.2e-16
NP_001291301 (OMIM: 610152) centromere protein M i ( 125) 291 79.5 2.3e-15
>>NP_076958 (OMIM: 610152) centromere protein M isoform (180 aa)
initn: 1160 init1: 1160 opt: 1160 Z-score: 1551.3 bits: 293.6 E(85289): 1.1e-79
Smith-Waterman score: 1160; 100.0% identity (100.0% similar) in 180 aa overlap (1-180:1-180)
10 20 30 40 50 60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_076 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_076 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_076 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
130 140 150 160 170 180
>>NP_001291299 (OMIM: 610152) centromere protein M isofo (146 aa)
initn: 953 init1: 953 opt: 953 Z-score: 1277.1 bits: 242.6 E(85289): 2.1e-64
Smith-Waterman score: 953; 100.0% identity (100.0% similar) in 146 aa overlap (35-180:1-146)
10 20 30 40 50 60
pF1KE0 RPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNRPRID
::::::::::::::::::::::::::::::
NP_001 MLKEDCASELKVHLAKSLPLPSSVNRPRID
10 20 30
70 80 90 100 110 120
pF1KE0 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
40 50 60 70 80 90
130 140 150 160 170 180
pF1KE0 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
100 110 120 130 140
>>XP_011528670 (OMIM: 610152) PREDICTED: centromere prot (236 aa)
initn: 870 init1: 870 opt: 870 Z-score: 1163.5 bits: 222.2 E(85289): 4.6e-58
Smith-Waterman score: 870; 100.0% identity (100.0% similar) in 134 aa overlap (1-134:1-134)
10 20 30 40 50 60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
::::::::::::::
XP_011 AHTYQSPLLYCDLEGLPSWKPGGRPCSPHSMASRCIIPCPGPPALDVGHSARERDPCRPQ
130 140 150 160 170 180
>>NP_001002876 (OMIM: 610152) centromere protein M isofo (107 aa)
initn: 668 init1: 668 opt: 668 Z-score: 899.7 bits: 172.3 E(85289): 2.2e-43
Smith-Waterman score: 668; 99.1% identity (100.0% similar) in 106 aa overlap (1-106:1-106)
10 20 30 40 50 60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
:::::::::::::::::::::::::::::::::::::::::::.::
NP_001 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGGGRL
70 80 90 100
130 140 150 160 170 180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
>>NP_001291300 (OMIM: 610152) centromere protein M isofo (132 aa)
initn: 653 init1: 653 opt: 653 Z-score: 878.4 bits: 168.6 E(85289): 3.5e-42
Smith-Waterman score: 653; 100.0% identity (100.0% similar) in 103 aa overlap (1-103:1-103)
10 20 30 40 50 60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
:::::::::::::::::::::::::::::::::::::::::::
NP_001 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGGKYVPRLLLPTPSQGKA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
NP_001 GAAVGFLLRHPG
130
>>NP_001291302 (OMIM: 610152) centromere protein M isofo (73 aa)
initn: 461 init1: 461 opt: 461 Z-score: 626.6 bits: 121.2 E(85289): 3.6e-28
Smith-Waterman score: 461; 98.6% identity (100.0% similar) in 72 aa overlap (35-106:1-72)
10 20 30 40 50 60
pF1KE0 RPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNRPRID
::::::::::::::::::::::::::::::
NP_001 MLKEDCASELKVHLAKSLPLPSSVNRPRID
10 20 30
70 80 90 100 110 120
pF1KE0 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
:::::::::::::::::::::::::::::::::::::::.::
NP_001 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGGGRL
40 50 60 70
130 140 150 160 170 180
pF1KE0 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
>>NP_001103685 (OMIM: 610152) centromere protein M isofo (58 aa)
initn: 296 init1: 296 opt: 296 Z-score: 408.5 bits: 80.5 E(85289): 5.2e-16
Smith-Waterman score: 296; 97.9% identity (100.0% similar) in 48 aa overlap (133-180:11-58)
110 120 130 140 150 160
pF1KE0 GAGRESHCSIHRHTVVKLAHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSA
:.::::::::::::::::::::::::::::
NP_001 MGRVWDLPGVLKVEGFRATMAQRLVRVLQICAGHVPGVSA
10 20 30 40
170 180
pF1KE0 LNLLSLLRSSEGPSLEDL
::::::::::::::::::
NP_001 LNLLSLLRSSEGPSLEDL
50
>>NP_001291301 (OMIM: 610152) centromere protein M isofo (125 aa)
initn: 294 init1: 277 opt: 291 Z-score: 396.8 bits: 79.5 E(85289): 2.3e-15
Smith-Waterman score: 291; 74.2% identity (83.3% similar) in 66 aa overlap (35-100:1-64)
10 20 30 40 50 60
pF1KE0 RPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNRPRID
::::::::::::::::::::::::::::::
NP_001 MLKEDCASELKVHLAKSLPLPSSVNRPRID
10 20 30
70 80 90 100 110 120
pF1KE0 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
:::::::::::: ..... : : : . ::::
NP_001 LIVFVVNLHSKYRIREARTSAFSVVK--FCSLVCFLTLAWPPQSPEHRGVPAPCGCQLLL
40 50 60 70 80
130 140 150 160 170 180
pF1KE0 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
NP_001 GEGVFPRHRCWAGEPLQHSPAHRGEAGPHLSKPPALL
90 100 110 120
180 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 04:54:27 2016 done: Thu Nov 3 04:54:28 2016
Total Scan time: 5.700 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]