FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE9254, 748 aa 1>>>pF1KE9254 748 - 748 aa - 748 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0994+/-0.000849; mu= 13.2851+/- 0.051 mean_var=119.8449+/-23.797, 0's: 0 Z-trim(111.1): 11 B-trim: 0 in 0/50 Lambda= 0.117156 statistics sampled from 12108 (12115) to 12108 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.372), width: 16 Scan time: 3.950 The best scores are: opt bits E(32554) CCDS33406.1 HJURP gene_id:55355|Hs108|chr2 ( 748) 5033 861.8 0 CCDS63167.1 HJURP gene_id:55355|Hs108|chr2 ( 694) 4149 712.4 5.9e-205 CCDS63166.1 HJURP gene_id:55355|Hs108|chr2 ( 663) 3948 678.4 9.6e-195 >>CCDS33406.1 HJURP gene_id:55355|Hs108|chr2 (748 aa) initn: 5033 init1: 5033 opt: 5033 Z-score: 4600.1 bits: 861.8 E(32554): 0 Smith-Waterman score: 5033; 100.0% identity (100.0% similar) in 748 aa overlap (1-748:1-748) 10 20 30 40 50 60 pF1KE9 MLGTLRAMEGEDVEDDQLLQKLRASRRRFQRRMQRLIEKYNQPFEDTPVVQMATLTYETP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MLGTLRAMEGEDVEDDQLLQKLRASRRRFQRRMQRLIEKYNQPFEDTPVVQMATLTYETP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 QGLRIWGGRLIKERNEGEIQDSSMKPADRTDGSVQAAAWGPELPSHRTVLGADSKSGEVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 QGLRIWGGRLIKERNEGEIQDSSMKPADRTDGSVQAAAWGPELPSHRTVLGADSKSGEVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE9 ATSDQEESVAWALAPAVPQSPLKNELRRKYLTQVDILLQGAEYFECAGNRAGRDVRVTPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 ATSDQEESVAWALAPAVPQSPLKNELRRKYLTQVDILLQGAEYFECAGNRAGRDVRVTPL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE9 PSLASPAVPAPGYCSRISRKSPGDPAKPASSPREWDPLHPSSTDMALVPRNDSLSLQETS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PSLASPAVPAPGYCSRISRKSPGDPAKPASSPREWDPLHPSSTDMALVPRNDSLSLQETS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE9 SSSFLSSQPFEDDDICNVTISDLYAGMLHSMSRLLSTKPSSIISTKTFIMQNWNSRRRHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SSSFLSSQPFEDDDICNVTISDLYAGMLHSMSRLLSTKPSSIISTKTFIMQNWNSRRRHR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE9 YKSRMNKTYCKGARRSQRSSKENFIPCSEPVKGTGALRDCKNVLDVSCRKTGLKLEKAFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 YKSRMNKTYCKGARRSQRSSKENFIPCSEPVKGTGALRDCKNVLDVSCRKTGLKLEKAFL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE9 EVNRPQIHKLDPSWKERKVTPSKYSSLIYFDSSATYNLDEENRFRTLKWLISPVKIVSRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 EVNRPQIHKLDPSWKERKVTPSKYSSLIYFDSSATYNLDEENRFRTLKWLISPVKIVSRP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE9 TIRQGHGENRQREIEIRFDQLHREYCLSPRNQPRRMCLPDSWAMNMYRGGPASPGGLQGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TIRQGHGENRQREIEIRFDQLHREYCLSPRNQPRRMCLPDSWAMNMYRGGPASPGGLQGL 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE9 ETRRLSLPSSKAKAKSLSEAFENLGKRSLEAGRCLPKSDSSSSLPKTNPTHSATRPQQTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 ETRRLSLPSSKAKAKSLSEAFENLGKRSLEAGRCLPKSDSSSSLPKTNPTHSATRPQQTS 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE9 DLHVQGNSSGIFRKSVSPSKTLSVPDKEVPGHGRNRYDEIKEEFDKLHQKYCLKSPGQMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DLHVQGNSSGIFRKSVSPSKTLSVPDKEVPGHGRNRYDEIKEEFDKLHQKYCLKSPGQMT 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE9 VPLCIGVSTDKASMEVRYQTEGFLGKLNPDPHFQGFQKLPSSPLGCRKSLLGSTAIEAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VPLCIGVSTDKASMEVRYQTEGFLGKLNPDPHFQGFQKLPSSPLGCRKSLLGSTAIEAPS 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE9 STCVARAITRDGTRDHQFPAKRPRLSEPQGSGRQGNSLGASDGVDNTVRPGDQGSSSQPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 STCVARAITRDGTRDHQFPAKRPRLSEPQGSGRQGNSLGASDGVDNTVRPGDQGSSSQPN 670 680 690 700 710 720 730 740 pF1KE9 SEERGENTSYRMEEKSDFMLEKLETKSV :::::::::::::::::::::::::::: CCDS33 SEERGENTSYRMEEKSDFMLEKLETKSV 730 740 >>CCDS63167.1 HJURP gene_id:55355|Hs108|chr2 (694 aa) initn: 4149 init1: 4149 opt: 4149 Z-score: 3793.1 bits: 712.4 E(32554): 5.9e-205 Smith-Waterman score: 4559; 92.8% identity (92.8% similar) in 748 aa overlap (1-748:1-694) 10 20 30 40 50 60 pF1KE9 MLGTLRAMEGEDVEDDQLLQKLRASRRRFQRRMQRLIEKYNQPFEDTPVVQMATLTYETP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 MLGTLRAMEGEDVEDDQLLQKLRASRRRFQRRMQRLIEKYNQPFEDTPVVQMATLTYETP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 QGLRIWGGRLIKERNEGEIQDSSMKPADRTDGSVQAAAWGPELPSHRTVLGADSKSGEVD :::::::::::::::::::: CCDS63 QGLRIWGGRLIKERNEGEIQ---------------------------------------- 70 80 130 140 150 160 170 180 pF1KE9 ATSDQEESVAWALAPAVPQSPLKNELRRKYLTQVDILLQGAEYFECAGNRAGRDVRVTPL :::::::::::::::::::::::::::::::::::::::::::::: CCDS63 --------------PAVPQSPLKNELRRKYLTQVDILLQGAEYFECAGNRAGRDVRVTPL 90 100 110 120 190 200 210 220 230 240 pF1KE9 PSLASPAVPAPGYCSRISRKSPGDPAKPASSPREWDPLHPSSTDMALVPRNDSLSLQETS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 PSLASPAVPAPGYCSRISRKSPGDPAKPASSPREWDPLHPSSTDMALVPRNDSLSLQETS 130 140 150 160 170 180 250 260 270 280 290 300 pF1KE9 SSSFLSSQPFEDDDICNVTISDLYAGMLHSMSRLLSTKPSSIISTKTFIMQNWNSRRRHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 SSSFLSSQPFEDDDICNVTISDLYAGMLHSMSRLLSTKPSSIISTKTFIMQNWNSRRRHR 190 200 210 220 230 240 310 320 330 340 350 360 pF1KE9 YKSRMNKTYCKGARRSQRSSKENFIPCSEPVKGTGALRDCKNVLDVSCRKTGLKLEKAFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 YKSRMNKTYCKGARRSQRSSKENFIPCSEPVKGTGALRDCKNVLDVSCRKTGLKLEKAFL 250 260 270 280 290 300 370 380 390 400 410 420 pF1KE9 EVNRPQIHKLDPSWKERKVTPSKYSSLIYFDSSATYNLDEENRFRTLKWLISPVKIVSRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 EVNRPQIHKLDPSWKERKVTPSKYSSLIYFDSSATYNLDEENRFRTLKWLISPVKIVSRP 310 320 330 340 350 360 430 440 450 460 470 480 pF1KE9 TIRQGHGENRQREIEIRFDQLHREYCLSPRNQPRRMCLPDSWAMNMYRGGPASPGGLQGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 TIRQGHGENRQREIEIRFDQLHREYCLSPRNQPRRMCLPDSWAMNMYRGGPASPGGLQGL 370 380 390 400 410 420 490 500 510 520 530 540 pF1KE9 ETRRLSLPSSKAKAKSLSEAFENLGKRSLEAGRCLPKSDSSSSLPKTNPTHSATRPQQTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 ETRRLSLPSSKAKAKSLSEAFENLGKRSLEAGRCLPKSDSSSSLPKTNPTHSATRPQQTS 430 440 450 460 470 480 550 560 570 580 590 600 pF1KE9 DLHVQGNSSGIFRKSVSPSKTLSVPDKEVPGHGRNRYDEIKEEFDKLHQKYCLKSPGQMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 DLHVQGNSSGIFRKSVSPSKTLSVPDKEVPGHGRNRYDEIKEEFDKLHQKYCLKSPGQMT 490 500 510 520 530 540 610 620 630 640 650 660 pF1KE9 VPLCIGVSTDKASMEVRYQTEGFLGKLNPDPHFQGFQKLPSSPLGCRKSLLGSTAIEAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 VPLCIGVSTDKASMEVRYQTEGFLGKLNPDPHFQGFQKLPSSPLGCRKSLLGSTAIEAPS 550 560 570 580 590 600 670 680 690 700 710 720 pF1KE9 STCVARAITRDGTRDHQFPAKRPRLSEPQGSGRQGNSLGASDGVDNTVRPGDQGSSSQPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 STCVARAITRDGTRDHQFPAKRPRLSEPQGSGRQGNSLGASDGVDNTVRPGDQGSSSQPN 610 620 630 640 650 660 730 740 pF1KE9 SEERGENTSYRMEEKSDFMLEKLETKSV :::::::::::::::::::::::::::: CCDS63 SEERGENTSYRMEEKSDFMLEKLETKSV 670 680 690 >>CCDS63166.1 HJURP gene_id:55355|Hs108|chr2 (663 aa) initn: 3948 init1: 3948 opt: 3948 Z-score: 3609.8 bits: 678.4 E(32554): 9.6e-195 Smith-Waterman score: 4295; 88.6% identity (88.6% similar) in 748 aa overlap (1-748:1-663) 10 20 30 40 50 60 pF1KE9 MLGTLRAMEGEDVEDDQLLQKLRASRRRFQRRMQRLIEKYNQPFEDTPVVQMATLTYETP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 MLGTLRAMEGEDVEDDQLLQKLRASRRRFQRRMQRLIEKYNQPFEDTPVVQMATLTYETP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE9 QGLRIWGGRLIKERNEGEIQDSSMKPADRTDGSVQAAAWGPELPSHRTVLGADSKSGEVD :::::::::::::::::::: CCDS63 QGLRIWGGRLIKERNEGEIQ---------------------------------------- 70 80 130 140 150 160 170 180 pF1KE9 ATSDQEESVAWALAPAVPQSPLKNELRRKYLTQVDILLQGAEYFECAGNRAGRDVRVTPL ::::::::::::::: CCDS63 ---------------------------------------------CAGNRAGRDVRVTPL 90 190 200 210 220 230 240 pF1KE9 PSLASPAVPAPGYCSRISRKSPGDPAKPASSPREWDPLHPSSTDMALVPRNDSLSLQETS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 PSLASPAVPAPGYCSRISRKSPGDPAKPASSPREWDPLHPSSTDMALVPRNDSLSLQETS 100 110 120 130 140 150 250 260 270 280 290 300 pF1KE9 SSSFLSSQPFEDDDICNVTISDLYAGMLHSMSRLLSTKPSSIISTKTFIMQNWNSRRRHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 SSSFLSSQPFEDDDICNVTISDLYAGMLHSMSRLLSTKPSSIISTKTFIMQNWNSRRRHR 160 170 180 190 200 210 310 320 330 340 350 360 pF1KE9 YKSRMNKTYCKGARRSQRSSKENFIPCSEPVKGTGALRDCKNVLDVSCRKTGLKLEKAFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 YKSRMNKTYCKGARRSQRSSKENFIPCSEPVKGTGALRDCKNVLDVSCRKTGLKLEKAFL 220 230 240 250 260 270 370 380 390 400 410 420 pF1KE9 EVNRPQIHKLDPSWKERKVTPSKYSSLIYFDSSATYNLDEENRFRTLKWLISPVKIVSRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 EVNRPQIHKLDPSWKERKVTPSKYSSLIYFDSSATYNLDEENRFRTLKWLISPVKIVSRP 280 290 300 310 320 330 430 440 450 460 470 480 pF1KE9 TIRQGHGENRQREIEIRFDQLHREYCLSPRNQPRRMCLPDSWAMNMYRGGPASPGGLQGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 TIRQGHGENRQREIEIRFDQLHREYCLSPRNQPRRMCLPDSWAMNMYRGGPASPGGLQGL 340 350 360 370 380 390 490 500 510 520 530 540 pF1KE9 ETRRLSLPSSKAKAKSLSEAFENLGKRSLEAGRCLPKSDSSSSLPKTNPTHSATRPQQTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 ETRRLSLPSSKAKAKSLSEAFENLGKRSLEAGRCLPKSDSSSSLPKTNPTHSATRPQQTS 400 410 420 430 440 450 550 560 570 580 590 600 pF1KE9 DLHVQGNSSGIFRKSVSPSKTLSVPDKEVPGHGRNRYDEIKEEFDKLHQKYCLKSPGQMT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 DLHVQGNSSGIFRKSVSPSKTLSVPDKEVPGHGRNRYDEIKEEFDKLHQKYCLKSPGQMT 460 470 480 490 500 510 610 620 630 640 650 660 pF1KE9 VPLCIGVSTDKASMEVRYQTEGFLGKLNPDPHFQGFQKLPSSPLGCRKSLLGSTAIEAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 VPLCIGVSTDKASMEVRYQTEGFLGKLNPDPHFQGFQKLPSSPLGCRKSLLGSTAIEAPS 520 530 540 550 560 570 670 680 690 700 710 720 pF1KE9 STCVARAITRDGTRDHQFPAKRPRLSEPQGSGRQGNSLGASDGVDNTVRPGDQGSSSQPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 STCVARAITRDGTRDHQFPAKRPRLSEPQGSGRQGNSLGASDGVDNTVRPGDQGSSSQPN 580 590 600 610 620 630 730 740 pF1KE9 SEERGENTSYRMEEKSDFMLEKLETKSV :::::::::::::::::::::::::::: CCDS63 SEERGENTSYRMEEKSDFMLEKLETKSV 640 650 660 748 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:05:08 2016 done: Sun Nov 6 12:05:09 2016 Total Scan time: 3.950 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]