FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB6662, 451 aa 1>>>pF1KB6662 451 - 451 aa - 451 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.1779+/-0.000983; mu= 7.1256+/- 0.060 mean_var=215.0850+/-44.268, 0's: 0 Z-trim(112.5): 143 B-trim: 0 in 0/52 Lambda= 0.087452 statistics sampled from 13073 (13218) to 13073 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.743), E-opt: 0.2 (0.406), width: 16 Scan time: 3.500 The best scores are: opt bits E(32554) CCDS5995.1 MSR1 gene_id:4481|Hs108|chr8 ( 451) 3054 397.9 1.1e-110 CCDS5996.1 MSR1 gene_id:4481|Hs108|chr8 ( 388) 2291 301.5 9.6e-82 CCDS5997.1 MSR1 gene_id:4481|Hs108|chr8 ( 358) 2287 301.0 1.3e-81 CCDS6064.1 SCARA5 gene_id:286133|Hs108|chr8 ( 495) 643 93.7 4.4e-19 CCDS3709.1 PRSS12 gene_id:8492|Hs108|chr4 ( 875) 438 68.1 4.1e-11 >>CCDS5995.1 MSR1 gene_id:4481|Hs108|chr8 (451 aa) initn: 3054 init1: 3054 opt: 3054 Z-score: 2100.5 bits: 397.9 E(32554): 1.1e-110 Smith-Waterman score: 3054; 100.0% identity (100.0% similar) in 451 aa overlap (1-451:1-451) 10 20 30 40 50 60 pF1KB6 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB6 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB6 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB6 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB6 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF 370 380 390 400 410 420 430 440 450 pF1KB6 GRESSIEECKIRQWGTRACSHSEDAGVTCTL ::::::::::::::::::::::::::::::: CCDS59 GRESSIEECKIRQWGTRACSHSEDAGVTCTL 430 440 450 >>CCDS5996.1 MSR1 gene_id:4481|Hs108|chr8 (388 aa) initn: 2281 init1: 2281 opt: 2291 Z-score: 1581.1 bits: 301.5 E(32554): 9.6e-82 Smith-Waterman score: 2458; 85.8% identity (86.0% similar) in 451 aa overlap (1-451:1-388) 10 20 30 40 50 60 pF1KB6 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB6 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB6 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB6 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE ::::::::::::::::::::::::::::::::::::::::::::. CCDS59 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLS--------------- 310 320 330 340 370 380 390 400 410 420 pF1KB6 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF :::::::::::: CCDS59 ------------------------------------------------TGPIWLNEVFCF 350 430 440 450 pF1KB6 GRESSIEECKIRQWGTRACSHSEDAGVTCTL ::::::::::::::::::::::::::::::: CCDS59 GRESSIEECKIRQWGTRACSHSEDAGVTCTL 360 370 380 >>CCDS5997.1 MSR1 gene_id:4481|Hs108|chr8 (358 aa) initn: 2447 init1: 2287 opt: 2287 Z-score: 1578.8 bits: 301.0 E(32554): 1.3e-81 Smith-Waterman score: 2287; 99.7% identity (99.7% similar) in 346 aa overlap (1-346:1-346) 10 20 30 40 50 60 pF1KB6 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MEQWDHFHNQQEDTDSCSESVKFDARSMTALLPPNPKNSPSLQEKLKSFKAALIALYLLV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 FAVLIPLIGIVAAQLLKWETKNCSVSSTNANDITQSLTGKGNDSEEEMRFQEVFMEHMSN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MEKRIQHILDMEANLMDTEHFQNFSMTTDQRFNDILLQLSTLFSSVQGHGNAIDEISKSL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB6 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 ISLNTTLLDLQLNIENLNGKIQENTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB6 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 KGEVKVLNNITNDLRLKDWEHSQTLRNITLIQGPPGPPGEKGDRGPTGESGPRGFPGPIG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB6 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHE :::::::::::::::::::::::::::::::::::::::::::: : CCDS59 PPGLKGDRGAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTLRPVQLTDHIRAGPS 310 320 330 340 350 370 380 390 400 410 420 pF1KB6 GRVEILHSGQWGTICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCF >>CCDS6064.1 SCARA5 gene_id:286133|Hs108|chr8 (495 aa) initn: 790 init1: 451 opt: 643 Z-score: 456.0 bits: 93.7 E(32554): 4.4e-19 Smith-Waterman score: 910; 35.5% identity (63.9% similar) in 482 aa overlap (13-449:15-492) 10 20 30 40 pF1KB6 MEQWDHFHNQQEDTDS-CSESVKFDARSMTALL----PPNPKNSPSL----QEKLKSF ::.: : .: ::.::.. : : : :. .:... CCDS60 MENKAMYLHTVSDCDTSSICEDS--FDGRSLSKLNLCEDGPCHKRRASICCTQLGSLSAL 10 20 30 40 50 50 60 70 80 90 100 pF1KB6 KAALIALYLLVFAVLIPLIGIVAAQL------LKWETKNCSVSSTNANDITQSLTG---K : :...:::::: .:. .. ..... :: :.: . . . :. : . CCDS60 KHAVLGLYLLVFLILVGIFILAVSRPRSSPDDLKALTRNVNRLNESFRDLQLRLLQAPLQ 60 70 80 90 100 110 110 120 130 140 150 pF1KB6 GNDSEEEMRFQEVFMEH------MSNMEKRIQHIL-DMEANLMDTEHFQNFSMTTDQ--R .. .:. . :...... ... .:.. : ..:. ..:: : .. :. . CCDS60 ADLTEQVWKVQDALQNQSDSLLALAGAVQRLEGALWGLQAQAVQTE--QAVALLRDRTGQ 120 130 140 150 160 170 160 170 180 190 200 pF1KB6 FNDI----LLQLSTLFSSVQ----GHGNAIDEISKSLISLNTTLLDLQLNIENLNGKIQE .: : ::.. .: : :.. .: ... . :. : :. ...:: ... CCDS60 QSDTAQLELYQLQVESNSSQLLLRRHAGLLDGLARRVGILGEELADVGGVLRGLNHSLSY 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB6 NTFKQQEEISKLEERVYNVSAEIMAMKEEQVHLEQEIKGEVKVLNNITNDLRLKDWEHSQ .. .. ... :. : :.: . .. .: .: ..: :. .:: .:.:::::::::: CCDS60 DVALHRTRLQDLRVLVSNASEDTRRLRLAHVGMELQLKQELAMLNAVTEDLRLKDWEHSI 240 250 260 270 280 290 270 280 290 300 310 pF1KB6 TLRNITLIQGPPGPPGEKGDRGPTGESG-P-----RGFPGPIGPPGLKGDRGAIGFPGSR .::::.: .::::: :..::.: :. : : ::.:: : ::: : .: : :. CCDS60 ALRNISLAKGPPGPKGDQGDEGKEGRPGIPGLPGLRGLPGERGTPGLPGPKGDDGKLGAT 300 310 320 330 340 350 320 330 340 350 360 370 pF1KB6 GLPGYAGRPGNSGPKGQKGEKG--SGNT--LTPFTKVRLVGGSGPHEGRVEILHSGQWGT : :. : :. ::::.::::: .:.. . .:::.::::::::::. :. .::: CCDS60 GPMGMRGFKGDRGPKGEKGEKGDRAGDASGVEAPMMIRLVNGSGPHEGRVEVYHDRRWGT 360 370 380 390 400 410 380 390 400 410 420 430 pF1KB6 ICDDRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCFGRESSIEECKIRQ .::: :. . :.:::: ::. ::. :...:.:::::: ::...: : : : .: .:.. . CCDS60 VCDDGWDKKDGDVVCRMLGFRGVEEVYRTARFGQGTGRIWMDDVACKGTEETIFRCSFSK 420 430 440 450 460 470 440 450 pF1KB6 WGTRACSHSEDAGVTCTL ::. :.:.:::.::: CCDS60 WGVTNCGHAEDASVTCNRH 480 490 >>CCDS3709.1 PRSS12 gene_id:8492|Hs108|chr4 (875 aa) initn: 727 init1: 438 opt: 438 Z-score: 313.0 bits: 68.1 E(32554): 4.1e-11 Smith-Waterman score: 438; 52.9% identity (77.9% similar) in 104 aa overlap (347-450:277-380) 320 330 340 350 360 370 pF1KB6 RGLPGYAGRPGNSGPKGQKGEKGSGNTLTPFTKVRLVGGSGPHEGRVEILHSGQWGTICD : .::.:::. ::::::. :.:::::.:: CCDS37 LLCEKDIWQGGVCPQKMAAAVTCSFSHGPTFPIIRLAGGSSVHEGRVELYHAGQWGTVCD 250 260 270 280 290 300 380 390 400 410 420 430 pF1KB6 DRWEVRVGQVVCRSLGYPGVQAVHKAAHFGQGTGPIWLNEVFCFGRESSIEECKIRQWGT :.:. ..:.::.:: :. . . :.::.:.::. :.:: : : : :::.: .:: CCDS37 DQWDDADAEVICRQLGLSGIAKAWHQAYFGEGSGPVMLDEVRCTGNELSIEQCPKSSWGE 310 320 330 340 350 360 440 450 pF1KB6 RACSHSEDAGVTCTL . :.:.:::::.:: CCDS37 HNCGHKEDAGVSCTPLTDGVIRLAGGKGSHEGRLEVYYRGQWGTVCDDGWTELNTYVVCR 370 380 390 400 410 420 451 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:32:23 2016 done: Tue Nov 8 01:32:23 2016 Total Scan time: 3.500 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]