FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6662, 451 aa
1>>>pF1KB6662 451 - 451 aa - 451 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.1779+/-0.000983; mu= 7.1256+/- 0.060
mean_var=215.0850+/-44.268, 0's: 0 Z-trim(112.5): 143 B-trim: 0 in 0/52
Lambda= 0.087452
statistics sampled from 13073 (13218) to 13073 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.743), E-opt: 0.2 (0.406), width: 16
Scan time: 3.500
The best scores are:                                  opt  bits E(32554)
CCDS5995.1 MSR1 gene_id:4481|Hs108|chr8       ( 451) 3054 397.9 1.1e-110
CCDS5996.1 MSR1 gene_id:4481|Hs108|chr8       ( 388) 2291 301.5 9.6e-82
CCDS5997.1 MSR1 gene_id:4481|Hs108|chr8       ( 358) 2287 301.0 1.3e-81
CCDS6064.1 SCARA5 gene_id:286133|Hs108|chr8   ( 495)  643  93.7 4.4e-19
CCDS3709.1 PRSS12 gene_id:8492|Hs108|chr4     ( 875)  438  68.1 4.1e-11
>>CCDS5995.1 MSR1 gene_id:4481|Hs108|chr8 (451 aa)
initn: 3054 init1: 3054 opt: 3054 Z-score: 2100.5 bits: 397.9 E(32554): 1.1e-110
Smith-Waterman score: 3054; 100.0% identity (100.0% similar) in 451 aa overlap (1-451:1-451)
10 20 30 40 50 60
pF1KB6 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB6 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB6 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB6 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB6 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF
370 380 390 400 410 420
430 440 450
pF1KB6 GRESSIEECKIRQWGTRACSHSEDAGVTCTL
:::::::::::::::::::::::::::::::
CCDS59 GRESSIEECKIRQWGTRACSHSEDAGVTCTL
430 440 450
>>CCDS5996.1 MSR1 gene_id:4481|Hs108|chr8 (388 aa)
initn: 2281 init1: 2281 opt: 2291 Z-score: 1581.1 bits: 301.5 E(32554): 9.6e-82
Smith-Waterman score: 2458; 85.8% identity (86.0% similar) in 451 aa overlap (1-451:1-388)
10 20 30 40 50 60
pF1KB6 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB6 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB6 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB6 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE
::::::::::::::::::::::::::::::::::::::::::::.
CCDS59 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLS---------------
310 320 330 340
370 380 390 400 410 420
pF1KB6 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF
::::::::::::
CCDS59 ------------------------------------------------TGPIWLNEVFCF
350
430 440 450
pF1KB6 GRESSIEECKIRQWGTRACSHSEDAGVTCTL
:::::::::::::::::::::::::::::::
CCDS59 GRESSIEECKIRQWGTRACSHSEDAGVTCTL
360 370 380
>>CCDS5997.1 MSR1 gene_id:4481|Hs108|chr8 (358 aa)
initn: 2447 init1: 2287 opt: 2287 Z-score: 1578.8 bits: 301.0 E(32554): 1.3e-81
Smith-Waterman score: 2287; 99.7% identity (99.7% similar) in 346 aa overlap (1-346:1-346)
10 20 30 40 50 60
pF1KB6 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB6 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB6 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB6 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE
:::::::::::::::::::::::::::::::::::::::::::: :
CCDS59 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLRPVQLTDHIRAGPS
310 320 330 340 350
370 380 390 400 410 420
pF1KB6 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF
>>CCDS6064.1 SCARA5 gene_id:286133|Hs108|chr8 (495 aa)
initn: 790 init1: 451 opt: 643 Z-score: 456.0 bits: 93.7 E(32554): 4.4e-19
Smith-Waterman score: 910; 35.5% identity (63.9% similar) in 482 aa overlap (13-449:15-492)
10 20 30 40
pF1KB6 MEQWDHFHNQQEDTDS-CSESVKFDARSMTALL----PPNPKNSPSL----QEKLKSF
::.: : .: ::.::.. : : : :. .:...
CCDS60 MENKAMYLHTVSDCDTSSICEDS--FDGRSLSKLNLCEDGPCHKRRASICCTQLGSLSAL
10 20 30 40 50
50 60 70 80 90 100
pF1KB6 KAALIALYLLVFAVLIPLIGIVAAQL------LKWETKNCSVSSTNANDITQSLTG---K
: :...:::::: .:. .. ..... :: :.: . . . :. : .
CCDS60 KHAVLGLYLLVFLILVGIFILAVSRPRSSPDDLKALTRNVNRLNESFRDLQLRLLQAPLQ
60 70 80 90 100 110
110 120 130 140 150
pF1KB6 GNDSEEEMRFQEVFMEH------MSNMEKRIQHIL-DMEANLMDTEHFQNFSMTTDQ--R
.. .:. . :...... ... .:.. : ..:. ..:: : .. :. .
CCDS60 ADLTEQVWKVQDALQNQSDSLLALAGAVQRLEGALWGLQAQAVQTE--QAVALLRDRTGQ
120 130 140 150 160 170
160 170 180 190 200
pF1KB6 FNDI----LLQLSTLFSSVQ----GHGNAIDEISKSLISLNTTLLDLQLNIENLNGKIQE
.: : ::.. .: : :.. .: ... . :. : :. ...:: ...
CCDS60 QSDTAQLELYQLQVESNSSQLLLRRHAGLLDGLARRVGILGEELADVGGVLRGLNHSLSY
180 190 200 210 220 230
210 220 230 240 250 260
pF1KB6 NTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEIKGEVKVLNNITNDLRLKDWEHSQ
.. .. ... :. : :.: . .. .: .: ..: :. .:: .:.::::::::::
CCDS60 DVALHRTRLQDLRVLVSNASEDTRRLRLAHVGMELQLKQELAMLNAVTEDLRLKDWEHSI
240 250 260 270 280 290
270 280 290 300 310
pF1KB6 TLRNITLIQGPPGPPGEKGDRGPTGESG-P-----RGFPGPIGPPGLKGDRGAIGFPGSR
.::::.: .::::: :..::.: :. : : ::.:: : ::: : .: : :.
CCDS60 ALRNISLAKGPPGPKGDQGDEGKEGRPGIPGLPGLRGLPGERGTPGLPGPKGDDGKLGAT
300 310 320 330 340 350
320 330 340 350 360 370
pF1KB6 GLPGYAGRPGNSGPKGQKGEKG--SGNT--LTPFTKVRLVGGSGPHEGRVEILHSGQWGT
: :. : :. ::::.::::: .:.. . .:::.::::::::::. :. .:::
CCDS60 GPMGMRGFKGDRGPKGEKGEKGDRAGDASGVEAPMMIRLVNGSGPHEGRVEVYHDRRWGT
360 370 380 390 400 410
380 390 400 410 420 430
pF1KB6 ICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCFGRESSIEECKIRQ
.::: :. . :.:::: ::. ::. :...:.:::::: ::...: : : : .: .:.. .
CCDS60 VCDDGWDKKDGDVVCRMLGFRGVEEVYRTARFGQGTGRIWMDDVACKGTEETIFRCSFSK
420 430 440 450 460 470
440 450
pF1KB6 WGTRACSHSEDAGVTCTL
::. :.:.:::.:::
CCDS60 WGVTNCGHAEDASVTCNRH
480 490
>>CCDS3709.1 PRSS12 gene_id:8492|Hs108|chr4 (875 aa)
initn: 727 init1: 438 opt: 438 Z-score: 313.0 bits: 68.1 E(32554): 4.1e-11
Smith-Waterman score: 438; 52.9% identity (77.9% similar) in 104 aa overlap (347-450:277-380)
320 330 340 350 360 370
pF1KB6 RGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHEGRVEILHSGQWGTICD
: .::.:::. ::::::. :.:::::.::
CCDS37 LLCEKDIWQGGVCPQKMAAAVTCSFSHGPTFPIIRLAGGSSVHEGRVELYHAGQWGTVCD
250 260 270 280 290 300
380 390 400 410 420 430
pF1KB6 DRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCFGRESSIEECKIRQWGT
:.:. ..:.::.:: :. . . :.::.:.::. :.:: : : : :::.: .::
CCDS37 DQWDDADAEVICRQLGLSGIAKAWHQAYFGEGSGPVMLDEVRCTGNELSIEQCPKSSWGE
310 320 330 340 350 360
440 450
pF1KB6 RACSHSEDAGVTCTL
. :.:.:::::.::
CCDS37 HNCGHKEDAGVSCTPLTDGVIRLAGGKGSHEGRLEVYYRGQWGTVCDDGWTELNTYVVCR
370 380 390 400 410 420
451 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 01:32:23 2016 done: Tue Nov 8 01:32:23 2016
Total Scan time: 3.500 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]
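
Note on the E(32554) column: the reported expectation values appear to follow from the Z-scores through an extreme-value distribution. The sketch below is a cross-check, not FASTA's own code. It assumes the Z-score is scaled to mean 50 and standard deviation 10, and that E() is the library size (32554) multiplied by the extreme-value tail probability. The function name and the REPORTED list are illustrative; the values are copied from the alignment headers in this report.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def evalue_from_zscore(z_score: float, library_size: int) -> float:
    """Estimate E() from a Z-score (assumed scaled to mean 50, SD 10)."""
    z_norm = (z_score - 50.0) / 10.0
    # Extreme-value tail: P(Z > z) = 1 - exp(-exp(-(pi/sqrt(6)) * z - gamma))
    tail = math.exp(-(math.pi / math.sqrt(6.0)) * z_norm - EULER_GAMMA)
    # expm1 keeps precision when the tail is far below machine epsilon
    p_value = -math.expm1(-tail)
    return library_size * p_value


# (Z-score, reported E()) pairs copied from the alignment headers above
REPORTED = [
    (2100.5, 1.1e-110),
    (1581.1, 9.6e-82),
    (1578.8, 1.3e-81),
    (456.0, 4.4e-19),
    (313.0, 4.1e-11),
]

if __name__ == "__main__":
    for z, reported in REPORTED:
        estimate = evalue_from_zscore(z, 32554)
        print(f"Z={z:7.1f}  estimated E={estimate:.2e}  reported E={reported:.1e}")
```

Small differences from the reported values are expected, because the Z-scores are printed to only one decimal place.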