FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE9617, 344 aa
1>>>pF1KE9617 344 - 344 aa - 344 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6136+/-0.000367; mu= 14.4552+/- 0.023
mean_var=67.2637+/-13.621, 0's: 0 Z-trim(113.5): 7 B-trim: 0 in 0/53
Lambda= 0.156381
statistics sampled from 22804 (22811) to 22804 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.655), E-opt: 0.2 (0.267), width: 16
Scan time: 7.960
The best scores are: opt bits E(85289)
NP_201576 (OMIM: 611503) centromere protein L isof ( 344) 2315 531.2 1.2e-150
NP_001164653 (OMIM: 611503) centromere protein L i ( 344) 2315 531.2 1.2e-150
NP_001120653 (OMIM: 611503) centromere protein L i ( 390) 1404 325.7 1e-88
>>NP_201576 (OMIM: 611503) centromere protein L isoform (344 aa)
initn: 2315 init1: 2315 opt: 2315 Z-score: 2825.4 bits: 531.2 E(85289): 1.2e-150
Smith-Waterman score: 2315; 100.0% identity (100.0% similar) in 344 aa overlap (1-344:1-344)
10 20 30 40 50 60
pF1KE9 MDSYSAPESTPSASSRPEDYFIGATPLQKRLESVRKQSSFILTPPRRKIPQCSQLQEDVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_201 MDSYSAPESTPSASSRPEDYFIGATPLQKRLESVRKQSSFILTPPRRKIPQCSQLQEDVD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 PQKVAFLLHKQWTLYSLTPLYKFSYSNLKEYSRLLNAFIVAEKQKGLAVEVGEDFNIKVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_201 PQKVAFLLHKQWTLYSLTPLYKFSYSNLKEYSRLLNAFIVAEKQKGLAVEVGEDFNIKVI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE9 FSTLLGMKGTQRDPEAFLVQIVSKSQLPSENREGKVLWTGWFCCVFGDSLLETVSEDFTC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_201 FSTLLGMKGTQRDPEAFLVQIVSKSQLPSENREGKVLWTGWFCCVFGDSLLETVSEDFTC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE9 LPLFLANGAESNTAIIGTWFQKTFDCYFSPLAINAFNLSWMAAMWTACKMDHYVATTEFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_201 LPLFLANGAESNTAIIGTWFQKTFDCYFSPLAINAFNLSWMAAMWTACKMDHYVATTEFL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE9 WSVPCSPQSLDISFAIHPEDAKALWDSVHKTPGEVTQEEVDLFMDCLYSHFHRHFKIHLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_201 WSVPCSPQSLDISFAIHPEDAKALWDSVHKTPGEVTQEEVDLFMDCLYSHFHRHFKIHLS
250 260 270 280 290 300
310 320 330 340
pF1KE9 ATRLVRVSTSVASAHTDGKIKILCHKYLIGVLAYLTELAIFQIE
::::::::::::::::::::::::::::::::::::::::::::
NP_201 ATRLVRVSTSVASAHTDGKIKILCHKYLIGVLAYLTELAIFQIE
310 320 330 340
>>NP_001164653 (OMIM: 611503) centromere protein L isofo (344 aa)
initn: 2315 init1: 2315 opt: 2315 Z-score: 2825.4 bits: 531.2 E(85289): 1.2e-150
Smith-Waterman score: 2315; 100.0% identity (100.0% similar) in 344 aa overlap (1-344:1-344)
10 20 30 40 50 60
pF1KE9 MDSYSAPESTPSASSRPEDYFIGATPLQKRLESVRKQSSFILTPPRRKIPQCSQLQEDVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MDSYSAPESTPSASSRPEDYFIGATPLQKRLESVRKQSSFILTPPRRKIPQCSQLQEDVD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 PQKVAFLLHKQWTLYSLTPLYKFSYSNLKEYSRLLNAFIVAEKQKGLAVEVGEDFNIKVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PQKVAFLLHKQWTLYSLTPLYKFSYSNLKEYSRLLNAFIVAEKQKGLAVEVGEDFNIKVI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE9 FSTLLGMKGTQRDPEAFLVQIVSKSQLPSENREGKVLWTGWFCCVFGDSLLETVSEDFTC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FSTLLGMKGTQRDPEAFLVQIVSKSQLPSENREGKVLWTGWFCCVFGDSLLETVSEDFTC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE9 LPLFLANGAESNTAIIGTWFQKTFDCYFSPLAINAFNLSWMAAMWTACKMDHYVATTEFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LPLFLANGAESNTAIIGTWFQKTFDCYFSPLAINAFNLSWMAAMWTACKMDHYVATTEFL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE9 WSVPCSPQSLDISFAIHPEDAKALWDSVHKTPGEVTQEEVDLFMDCLYSHFHRHFKIHLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 WSVPCSPQSLDISFAIHPEDAKALWDSVHKTPGEVTQEEVDLFMDCLYSHFHRHFKIHLS
250 260 270 280 290 300
310 320 330 340
pF1KE9 ATRLVRVSTSVASAHTDGKIKILCHKYLIGVLAYLTELAIFQIE
::::::::::::::::::::::::::::::::::::::::::::
NP_001 ATRLVRVSTSVASAHTDGKIKILCHKYLIGVLAYLTELAIFQIE
310 320 330 340
>>NP_001120653 (OMIM: 611503) centromere protein L isofo (390 aa)
initn: 1404 init1: 1404 opt: 1404 Z-score: 1713.7 bits: 325.7 E(85289): 1e-88
Smith-Waterman score: 2124; 87.8% identity (87.8% similar) in 376 aa overlap (15-344:15-390)
10 20 30 40 50 60
pF1KE9 MDSYSAPESTPSASSRPEDYFIGATPLQKRLESVRKQSSFILTPPRRKIPQCSQLQEDVD
::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MDSYSAPESTPSASSRPEDYFIGATPLQKRLESVRKQSSFILTPPRRKIPQCSQLQEDVD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE9 PQKVAFLLHKQWTLYSLTPLYKFSYSNLKEYSRLLNAFIVAEKQKGLAVEVGEDFNIKVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PQKVAFLLHKQWTLYSLTPLYKFSYSNLKEYSRLLNAFIVAEKQKGLAVEVGEDFNIKVI
70 80 90 100 110 120
130 140
pF1KE9 FSTLLGMKGTQRDPEAFLVQ----------------------------------------
::::::::::::::::::::
NP_001 FSTLLGMKGTQRDPEAFLVQGLILSPRLEYSGTILVDCNLCLLGSSDPSTLAFQVAGTAG
130 140 150 160 170 180
150 160 170 180 190
pF1KE9 ------IVSKSQLPSENREGKVLWTGWFCCVFGDSLLETVSEDFTCLPLFLANGAESNTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 ACHHTRIVSKSQLPSENREGKVLWTGWFCCVFGDSLLETVSEDFTCLPLFLANGAESNTA
190 200 210 220 230 240
200 210 220 230 240 250
pF1KE9 IIGTWFQKTFDCYFSPLAINAFNLSWMAAMWTACKMDHYVATTEFLWSVPCSPQSLDISF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IIGTWFQKTFDCYFSPLAINAFNLSWMAAMWTACKMDHYVATTEFLWSVPCSPQSLDISF
250 260 270 280 290 300
260 270 280 290 300 310
pF1KE9 AIHPEDAKALWDSVHKTPGEVTQEEVDLFMDCLYSHFHRHFKIHLSATRLVRVSTSVASA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AIHPEDAKALWDSVHKTPGEVTQEEVDLFMDCLYSHFHRHFKIHLSATRLVRVSTSVASA
310 320 330 340 350 360
320 330 340
pF1KE9 HTDGKIKILCHKYLIGVLAYLTELAIFQIE
::::::::::::::::::::::::::::::
NP_001 HTDGKIKILCHKYLIGVLAYLTELAIFQIE
370 380 390
344 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 06:24:01 2016 done: Sun Nov 6 06:24:02 2016
Total Scan time: 7.960 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]