FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6709, 713 aa 1>>>pF1KE6709 713 - 713 aa - 713 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4844+/-0.000952; mu= 18.7928+/- 0.057 mean_var=70.3124+/-13.614, 0's: 0 Z-trim(105.1): 19 B-trim: 0 in 0/51 Lambda= 0.152953 statistics sampled from 8229 (8236) to 8229 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.253), width: 16 Scan time: 2.820 The best scores are: opt bits E(32554) CCDS9303.1 MIPEP gene_id:4285|Hs108|chr13 ( 713) 4783 1065.1 0 CCDS12095.1 THOP1 gene_id:7064|Hs108|chr19 ( 689) 644 151.8 3.3e-36 CCDS3989.1 NLN gene_id:57486|Hs108|chr5 ( 704) 631 148.9 2.4e-35 >>CCDS9303.1 MIPEP gene_id:4285|Hs108|chr13 (713 aa) initn: 4783 init1: 4783 opt: 4783 Z-score: 5699.4 bits: 1065.1 E(32554): 0 Smith-Waterman score: 4783; 99.9% identity (100.0% similar) in 713 aa overlap (1-713:1-713) 10 20 30 40 50 60 pF1KE6 MLCVGRLGGLGARAAALPPRRAGRGSLEAGIRARRVSTSWSPVGAAFNVKPQGSRLDLFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 MLCVGRLGGLGARAAALPPRRAGRGSLEAGIRARRVSTSWSPVGAAFNVKPQGSRLDLFG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 ERRGLFGVPELSAPEGFHIAQEKALRKTELLVDRACSTPPGPQTVLIFDELSDSLCRVAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 ERRGLFGVPELSAPEGFHIAQEKALRKTELLVDRACSTPPGPQTVLIFDELSDSLCRVAD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 LADFVKIAHPEPAFREAAEEACRSIGTMVEKLNTNVDLYQSLQKLLADKKLVDSLDPETR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 LADFVKIAHPEPAFREAAEEACRSIGTMVEKLNTNVDLYQSLQKLLADKKLVDSLDPETR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 RVAELFMFDFEISGIHLDKEKRKRAVDLNVKILDLSSTFLMGTNFPNKIEKHLLPEHIRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 RVAELFMFDFEISGIHLDKEKRKRAVDLNVKILDLSSTFLMGTNFPNKIEKHLLPEHIRR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 NFTSAGDHIIIDGLHAESPDDLVREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 NFTSAGDHIIIDGLHAESPDDLVREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYST 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 FSHRALQGTIAKNPETVMQFLEKLSDKLSERTLKDFEMIRGMKMKLNPQNSEVMPWDPPY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 FSHRALQGTIAKNPETVMQFLEKLSDKLSERTLKDFEMIRGMKMKLNPQNSEVMPWDPPY 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 YSGVIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 YSGVIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 AVVHESEGLLGYIYCDFFQRADKPHQDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSRS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 AVVHESEGLLGYIYCDFFQRADKPHQDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSRS 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 SPTLLTPGMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRVV :::::::.:::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 SPTLLTPSMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRVV 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE6 NQFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 NQFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDIL 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE6 KETQEKFYGLPYVPNTAWQLRFSHLVGYGARYYSYLMSRAVASMVWKECFLQDPFNRAAG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 KETQEKFYGLPYVPNTAWQLRFSHLVGYGARYYSYLMSRAVASMVWKECFLQDPFNRAAG 610 620 630 640 650 660 670 680 690 700 710 pF1KE6 ERYRREMLAHGGGREPMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE ::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 ERYRREMLAHGGGREPMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE 670 680 690 700 710 >>CCDS12095.1 THOP1 gene_id:7064|Hs108|chr19 (689 aa) initn: 542 init1: 292 opt: 644 Z-score: 763.6 bits: 151.8 E(32554): 3.3e-36 Smith-Waterman score: 714; 27.5% identity (58.5% similar) in 607 aa overlap (120-694:80-672) 90 100 110 120 130 140 pF1KE6 LLVDRACSTPPGPQTVLIFDELSDSLCRVADLADFVKIAHPEPAFREAAEEACRSIGTMV .. :: . . : .: :. :: .... . CCDS12 QVGTQEFEDVSYESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEADKKLSEFD 50 60 70 80 90 100 150 160 170 180 190 200 pF1KE6 EKLNTNVDLYQSLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLDKE--------K ... :.:: . : .: ::: ::. : : .. . .:.:: .: : CCDS12 VEMSMREDVYQRIV-WLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRETQENIKRIK 110 120 130 140 150 160 210 220 230 240 pF1KE6 RKRA---VDLNVKILDLSSTFL------MG---TNFPNKIEKHLLPEHIRRNFTSAGDHI .: . .:.: : :. ..::: .: .: :..:: : . . : : CCDS12 KKLSLLCIDFN-KNLNEDTTFLPFTLQELGGLPEDFLNSLEKM---EDGKLKVTLKYPHY 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE6 --IIDGLHAESPDDLVREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYSTFSHRALQ .. :. :.:: : . :: ::.. : ..:.:. : . .:. CCDS12 FPLLKKCHVPETRRKVEEAFNCRCKEENCAILK---ELVTLRAQKSRLLGFHTHADYVLE 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE6 GTIAKNPETVMQFLEKLSDKLS-----ERTLKDFEMIRGMKMKLN-PQNSEVMPWDPPYY ..::. .:: ::..:..::. ::.. .:. :. . . : .... :: :: CCDS12 MNMAKTSQTVATFLDELAQKLKPLGEQERAVI-LELKRAECERRGLPFDGRIRAWDMRYY 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE6 SGVIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKLA . .. :: .. .: .: . . .:: . ..:::.... :. :.. : :::: . CCDS12 MNQVEETRYCVDQNLLKEYFPVQVVTHGLLGIYQELLGLAFHHEEGASA--WHEDVRLYT 350 360 370 380 390 430 440 450 460 470 pF1KE6 VVHESEG-LLGYIYCDFFQRADK-PHQDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSR . . : ..: .: :.. : : : : : .. : :..::. :. ..... :. . . CCDS12 ARDAASGEVVGKFYLDLYPREGKYGHAAC-FGLQPGCLRQDGSRQIAIAAMVANFTKPTA 400 410 420 430 440 450 480 490 500 510 520 530 pF1KE6 SSPTLLTPGMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRV ..:.:: .:. :::.::.::.. ..... .::. ::.:.:: ..: .. . . CCDS12 DAPSLLQHDEVETYFHEFGHVMHQLCSQAEFAMFSGTHVERDFVEAPSQMLENWVWEQEP 460 470 480 490 500 510 540 550 560 570 580 590 pF1KE6 VNQFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDI . ...:::.::. .:.... .: ::... .. :. : .:: : . . . . CCDS12 LLRMSRHYRTGSAVPRELLEKLIESRQANTGLFNLRQIVLAKVDQALHTQTDA-DPAEEY 520 530 540 550 560 570 600 610 620 630 640 650 pF1KE6 LKETQEKFYGLPYVPNTAWQLRFSHLVG-YGARYYSYLMSRAVASMVWKECFLQDP-FNR . :: . :.: .:.: :.::.: : :.::.:: :.. . ... : :. .: CCDS12 ARLCQE-ILGVPATPGTNMPATFGHLAGGYDAQYYGYLWSEVYSMDMFHTRFKQEGVLNS 580 590 600 610 620 630 660 670 680 690 700 710 pF1KE6 AAGERYRREMLAHGGGREPMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE .: :: .: ::... :.. .: . :. : :. CCDS12 KVGMDYRSCILRPGGSEDASAMLRRFLGRDPKQDAFLLSKGLQVGGCEPEPQVC 640 650 660 670 680 >>CCDS3989.1 NLN gene_id:57486|Hs108|chr5 (704 aa) initn: 515 init1: 271 opt: 631 Z-score: 747.9 bits: 148.9 E(32554): 2.4e-35 Smith-Waterman score: 659; 26.9% identity (56.0% similar) in 605 aa overlap (121-694:105-696) 100 110 120 130 140 150 pF1KE6 LVDRACSTPPGPQTVLIFDELSDSLCRVADLADFVKIAHPEPAFREAAEEACRSIGTMVE . :: . . . : :. :: . .. . CCDS39 VGMLGIEEVTYENCLQALADVEVKYIVERTMLDFPQHVSSDKEVRAASTEADKRLSRFDI 80 90 100 110 120 130 160 170 180 190 200 pF1KE6 KLNTNVDLYQSLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLD-------KEKRK ... :... . .: : .. ::.:: : . . .:.:: : .: CCDS39 EMSMRGDIFERIVHLQETCDL-GKIKPEARRYLEKSIKMGKRNGLHLPEQVQNEIKSMKK 140 150 160 170 180 190 210 220 230 240 250 pF1KE6 RAVDLNV---KILDLSSTFLMGTNFPNKIEKHLLPEHIRRNFTSAGD---HIIIDGLHAE : .: . : :. ..:::. : .: : ::. . .. .. : .: . : CCDS39 RMSELCIDFNKNLNEDDTFLV---F-SKAELGALPDDFIDSLEKTDDDKYKITLKYPHYF 200 210 220 230 240 260 270 280 290 300 pF1KE6 S--PDDLVREAAYKIFLYPNAGQLKCLEE-------LLSSRDLLAKLVGYSTFSHRALQG . :. .. . :. .: :: :: : .:::.:::: . .:. CCDS39 PVMKKCCIPETRRRMEMAFNT---RCKEENTIILQQLLPLRTKVAKLLGYSTHADFVLEM 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 TIAKNPETVMQFLEKLSDKLSERTLKDFEMIRGMKMKLNPQ-----NSEVMPWDPPYYSG . ::. : ::. ::.::. . :.: ..: : . .... :: :: CCDS39 NTAKSTSRVTAFLDDLSQKLKPLGEAEREFILNLKKKECKDRGFEYDGKINAWDLYYYMT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 VIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKLAVV . .:.:. . .: . . ::: ..:::.:. :: . ..::...: .: CCDS39 QTEELKYSIDQEFLKEYFPIEVVTEGLLNTYQELLGLSF--EQMTDAHVWNKSVTLYTVK 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 HESEG-LLGYIYCDFFQRADK-PHQDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSRSS .. : .:: .: :.. : : : : : .. : : ::. .. :..:..:. . . CCDS39 DKATGEVLGQFYLDLYPREGKYNHAAC-FGLQPGCLLPDGSRMMAVAALVVNFSQPVAGR 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 PTLLTPGMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRVVN :.:: ... :::.::.::.. ..: . . .:: :::.:::: ..: .. : . CCDS39 PSLLRHDEVRTYFHEFGHVMHQICAQTDFARFSGTNVETDFVEVPSQMLENWVWDVDSLR 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE6 QFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDILK ....::. :.:. ... .: :. : .. :. . .:: : . : ..... : CCDS39 RLSKHYKDGSPIADDLLEKLVASRLVNTGLLTLRQIVLSKVDQSLHTNTSL-DAASEYAK 550 560 570 580 590 600 610 620 630 640 650 pF1KE6 ETQEKFYGLPYVPNTAWQLRFSHLVG-YGARYYSYLMSRAVASMVWKECFLQDPF-NRAA .: . :. .:.: :.::.: : ..::.:: :.. . .. :: .. . : . CCDS39 YCSE-ILGVAATPGTNMPATFGHLAGGYDGQYYGYLWSEVFSMDMFYSCFKKEGIMNPEV 610 620 630 640 650 660 660 670 680 690 700 710 pF1KE6 GERYRREMLAHGGGREPMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE : .:: .: ::. . : :....:.. :. :. CCDS39 GMKYRNLILKPGGSLDGMDMLHNFLKREPNQKAFLMSRGLHAP 670 680 690 700 713 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:35:16 2016 done: Tue Nov 8 15:35:16 2016 Total Scan time: 2.820 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]