FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9944, 652 aa 1>>>pF1KB9944 652 - 652 aa - 652 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.7293+/-0.000892; mu= 11.4874+/- 0.054 mean_var=220.7332+/-46.279, 0's: 0 Z-trim(114.4): 43 B-trim: 0 in 0/51 Lambda= 0.086326 statistics sampled from 14892 (14928) to 14892 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.459), width: 16 Scan time: 4.370 The best scores are: opt bits E(32554) CCDS53848.1 CHFR gene_id:55743|Hs108|chr12 ( 652) 4557 580.5 2.4e-165 CCDS53849.1 CHFR gene_id:55743|Hs108|chr12 ( 664) 3658 468.6 1.2e-131 CCDS53847.1 CHFR gene_id:55743|Hs108|chr12 ( 572) 3246 417.2 3.2e-116 CCDS31937.1 CHFR gene_id:55743|Hs108|chr12 ( 623) 3246 417.2 3.3e-116 >>CCDS53848.1 CHFR gene_id:55743|Hs108|chr12 (652 aa) initn: 4557 init1: 4557 opt: 4557 Z-score: 3081.9 bits: 580.5 E(32554): 2.4e-165 Smith-Waterman score: 4557; 99.8% identity (100.0% similar) in 652 aa overlap (1-652:1-652) 10 20 30 40 50 60 pF1KB9 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 ESLSEKQGMTQESFDTSGAGAGRGADPRVPPSSPATQVCFEEPQPSTSTSDLFPTASASS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 ESLSEKQGMTQESFDTSGAGAGRGADPRVPPSSPATQVCFEEPQPSTSTSDLFPTASASS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 TEPSPAGRERSSSCGSGGGGISPKGSGPSVASDEVSSFASALPDRKTASFSSLEPQDQED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TEPSPAGRERSSSCGSGGGGISPKGSGPSVASDEVSSFASALPDRKTASFSSLEPQDQED 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 LEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEETLTCIICQDLLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 LEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEETLTCIICQDLLH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 DCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVEAYLIQHPDKSRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 DCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVEAYLIQHPDKSRS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB9 EEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSDISQPYVVCRQCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSDISQPYVVCRQCP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB9 EYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQGSHALCTCCFQPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQGSHALCTCCFQPM 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB9 PDRRVEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELNLGDKCLDGVLNN ::::.::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 PDRRAEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELNLGDKCLDGVLNN 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB9 NSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVLCYCCGLRSFREL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 NSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVLCYCCGLRSFREL 550 560 570 580 590 600 610 620 630 640 650 pF1KB9 TYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQTRFKN :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQTRFKN 610 620 630 640 650 >>CCDS53849.1 CHFR gene_id:55743|Hs108|chr12 (664 aa) initn: 3647 init1: 3647 opt: 3658 Z-score: 2476.7 bits: 468.6 E(32554): 1.2e-131 Smith-Waterman score: 4523; 98.0% identity (98.2% similar) in 664 aa overlap (1-652:1-664) 10 20 30 40 50 60 pF1KB9 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY 70 80 90 100 110 120 130 140 150 160 pF1KB9 ESLSEKQGMTQESF------------DTSGAGAGRGADPRVPPSSPATQVCFEEPQPSTS :::::::::::::: :::::::::::::::::::::::::::::::::: CCDS53 ESLSEKQGMTQESFEANKENVFHGTKDTSGAGAGRGADPRVPPSSPATQVCFEEPQPSTS 130 140 150 160 170 180 170 180 190 200 210 220 pF1KB9 TSDLFPTASASSTEPSPAGRERSSSCGSGGGGISPKGSGPSVASDEVSSFASALPDRKTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TSDLFPTASASSTEPSPAGRERSSSCGSGGGGISPKGSGPSVASDEVSSFASALPDRKTA 190 200 210 220 230 240 230 240 250 260 270 280 pF1KB9 SFSSLEPQDQEDLEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 SFSSLEPQDQEDLEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEE 250 260 270 280 290 300 290 300 310 320 330 340 pF1KB9 TLTCIICQDLLHDCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TLTCIICQDLLHDCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVE 310 320 330 340 350 360 350 360 370 380 390 400 pF1KB9 AYLIQHPDKSRSEEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 AYLIQHPDKSRSEEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSD 370 380 390 400 410 420 410 420 430 440 450 460 pF1KB9 ISQPYVVCRQCPEYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 ISQPYVVCRQCPEYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQG 430 440 450 460 470 480 470 480 490 500 510 520 pF1KB9 SHALCTCCFQPMPDRRVEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELN ::::::::::::::::.::::::::::::::::::::::::::::::::::::::::::: CCDS53 SHALCTCCFQPMPDRRAEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELN 490 500 510 520 530 540 530 540 550 560 570 580 pF1KB9 LGDKCLDGVLNNNSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 LGDKCLDGVLNNNSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVL 550 560 570 580 590 600 590 600 610 620 630 640 pF1KB9 CYCCGLRSFRELTYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 CYCCGLRSFRELTYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQT 610 620 630 640 650 660 650 pF1KB9 RFKN :::: CCDS53 RFKN >>CCDS53847.1 CHFR gene_id:55743|Hs108|chr12 (572 aa) initn: 4004 init1: 3232 opt: 3246 Z-score: 2200.1 bits: 417.2 E(32554): 3.2e-116 Smith-Waterman score: 3847; 87.4% identity (87.6% similar) in 652 aa overlap (1-652:1-572) 10 20 30 40 50 60 pF1KB9 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHR----- 70 80 90 100 110 130 140 150 160 170 180 pF1KB9 ESLSEKQGMTQESFDTSGAGAGRGADPRVPPSSPATQVCFEEPQPSTSTSDLFPTASASS CCDS53 ------------------------------------------------------------ 190 200 210 220 230 240 pF1KB9 TEPSPAGRERSSSCGSGGGGISPKGSGPSVASDEVSSFASALPDRKTASFSSLEPQDQED ::::::::::::::::::::::::::::::::::::::::::::: CCDS53 ---------------SGGGGISPKGSGPSVASDEVSSFASALPDRKTASFSSLEPQDQED 120 130 140 150 160 250 260 270 280 290 300 pF1KB9 LEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEETLTCIICQDLLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 LEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEETLTCIICQDLLH 170 180 190 200 210 220 310 320 330 340 350 360 pF1KB9 DCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVEAYLIQHPDKSRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 DCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVEAYLIQHPDKSRS 230 240 250 260 270 280 370 380 390 400 410 420 pF1KB9 EEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSDISQPYVVCRQCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSDISQPYVVCRQCP 290 300 310 320 330 340 430 440 450 460 470 480 pF1KB9 EYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQGSHALCTCCFQPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 EYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQGSHALCTCCFQPM 350 360 370 380 390 400 490 500 510 520 530 540 pF1KB9 PDRRVEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELNLGDKCLDGVLNN ::::.::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 PDRRAEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELNLGDKCLDGVLNN 410 420 430 440 450 460 550 560 570 580 590 600 pF1KB9 NSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVLCYCCGLRSFREL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 NSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVLCYCCGLRSFREL 470 480 490 500 510 520 610 620 630 640 650 pF1KB9 TYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQTRFKN :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQTRFKN 530 540 550 560 570 >>CCDS31937.1 CHFR gene_id:55743|Hs108|chr12 (623 aa) initn: 4139 init1: 3240 opt: 3246 Z-score: 2199.7 bits: 417.2 E(32554): 3.3e-116 Smith-Waterman score: 4089; 91.7% identity (93.1% similar) in 652 aa overlap (1-652:1-623) 10 20 30 40 50 60 pF1KB9 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MERPEEGKQSPPPQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 CRIVVDEKSGQVTLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 ESLSEKQGMTQESFDTSGAGAGRGADPRVPPSSPATQVCFEEPQPSTSTSDLFPTASASS ::::::::::::::. :: : . . :. :: ...: CCDS31 ESLSEKQGMTQESFEM------------VP-------CCVAQAGLKLLGSSDPPTLASQS 130 140 150 160 190 200 210 220 230 240 pF1KB9 TEPSPAGRERSSSCGSGGGGISPKGSGPSVASDEVSSFASALPDRKTASFSSLEPQDQED . :::::::::::::::::::::::::::::::::::::::::::::: CCDS31 IVIT----------GSGGGGISPKGSGPSVASDEVSSFASALPDRKTASFSSLEPQDQED 170 180 190 200 210 250 260 270 280 290 300 pF1KB9 LEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEETLTCIICQDLLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 LEPVKKKMRGDGDLDLNGQLLVAQPRRNAQTVHEDVRAAAGKPDKMEETLTCIICQDLLH 220 230 240 250 260 270 310 320 330 340 350 360 pF1KB9 DCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVEAYLIQHPDKSRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 DCVSLQPCMHTFCAACYSGWMERSSLCPTCRCPVERICKNHILNNLVEAYLIQHPDKSRS 280 290 300 310 320 330 370 380 390 400 410 420 pF1KB9 EEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSDISQPYVVCRQCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EEDVQSMDARNKITQDMLQPKVRRSFSDEEGSSEDLLELSDVDSESSDISQPYVVCRQCP 340 350 360 370 380 390 430 440 450 460 470 480 pF1KB9 EYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQGSHALCTCCFQPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EYRRQAAQPPHCPAPEGEPGAPQALGDAPSTSVSLTTAVQDYVCPLQGSHALCTCCFQPM 400 410 420 430 440 450 490 500 510 520 530 540 pF1KB9 PDRRVEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELNLGDKCLDGVLNN ::::.::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 PDRRAEREQDPRVAPQQCAVCLQPFCHLYWGCTRTGCYGCLAPFCELNLGDKCLDGVLNN 460 470 480 490 500 510 550 560 570 580 590 600 pF1KB9 NSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVLCYCCGLRSFREL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 NSYESDILKNYLATRGLTWKNMLTESLVALQRGVFLLSDYRVTGDTVLCYCCGLRSFREL 520 530 540 550 560 570 610 620 630 640 650 pF1KB9 TYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQTRFKN :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 TYQYRQNIPASELPVAVTSRPDCYWGRNCRTQVKAHHAMKFNHICEQTRFKN 580 590 600 610 620 652 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 06:15:06 2016 done: Sun Nov 6 06:15:07 2016 Total Scan time: 4.370 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]