FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4562, 689 aa
  1>>>pF1KE4562 689 - 689 aa - 689 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3326+/-0.00103; mu= 14.2265+/- 0.061
 mean_var=76.4740+/-14.680, 0's: 0 Z-trim(104.8): 24 B-trim: 5 in 1/51
 Lambda= 0.146662
 statistics sampled from 8089 (8093) to 8089 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.615), E-opt: 0.2 (0.249), width: 16
 Scan time: 3.720

The best scores are:                                  opt  bits E(32554)
CCDS12095.1 THOP1 gene_id:7064|Hs108|chr19     ( 689) 4628 989.2 0
CCDS3989.1 NLN gene_id:57486|Hs108|chr5        ( 704) 2990 642.6 5.8e-184
CCDS9303.1 MIPEP gene_id:4285|Hs108|chr13      ( 713)  637 144.7 4.4e-34

>>CCDS12095.1 THOP1 gene_id:7064|Hs108|chr19 (689 aa)
 initn: 4628 init1: 4628 opt: 4628 Z-score: 5289.5 bits: 989.2 E(32554): 0
Smith-Waterman score: 4628; 99.9% identity (99.9% similar) in 689 aa overlap (1-689:1-689)

               10        20        30        40        50        60
pF1KE4 MKPPAACAGDMADAASPCSVVNDLRWDLSAQQIEERTRELIEQTKRVYDQVGTQEFEDVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MKPPAACAGDMADAASPCSVVNDLRWDLSAQQIEERTRELIEQTKRVYDQVGTQEFEDVS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE4 YESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEADKKLSEFDVEMSMREDVYQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 YESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEADKKLSEFDVEMSMREDVYQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE4 RIVWLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRETQENIKRIKKKLSLLCIDFNK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 RIVWLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRETQENIKRIKKKLSLLCIDFNK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE4 NLNEDTTFLPFTLQELGGLPEDFLNSLEKMEDGKLKVTLKYPHYFPLLKKCHVPETRRKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 NLNEDTTFLPFTLQELGGLPEDFLNSLEKMEDGKLKVTLKYPHYFPLLKKCHVPETRRKV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE4 EEAFNCRCKEENCAILKELVTLRAQKSRLLGFHTHADYVLEMNMAKTSQTVATFLDELAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 EEAFNCRCKEENCAILKELVTLRAQKSRLLGFHTHADYVLEMNMAKTSQTVATFLDELAQ
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE4 KPKPLGEQERAVILELKRAECERRGLPFDGRIRAWDMRYYMNQVEETRYCVDQNLLKEYF
       : ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KLKPLGEQERAVILELKRAECERRGLPFDGRIRAWDMRYYMNQVEETRYCVDQNLLKEYF
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE4 PVQVVTHGLLGIYQELLGLAFHHEEGASAWHEDVRLYTARDAASGEVVGKFYLDLYPREG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PVQVVTHGLLGIYQELLGLAFHHEEGASAWHEDVRLYTARDAASGEVVGKFYLDLYPREG
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE4 KYGHAACFGLQPGCLRQDGSRQIAIAAMVANFTKPTADAPSLLQHDEVETYFHEFGHVMH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KYGHAACFGLQPGCLRQDGSRQIAIAAMVANFTKPTADAPSLLQHDEVETYFHEFGHVMH
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KE4 QLCSQAEFAMFSGTHVERDFVEAPSQMLENWVWEQEPLLRMSRHYRTGSAVPRELLEKLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 QLCSQAEFAMFSGTHVERDFVEAPSQMLENWVWEQEPLLRMSRHYRTGSAVPRELLEKLI
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KE4 ESRQANTGLFNLRQIVLAKVDQALHTQTDADPAEEYARLCQEILGVPATPGTNMPATFGH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 ESRQANTGLFNLRQIVLAKVDQALHTQTDADPAEEYARLCQEILGVPATPGTNMPATFGH
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KE4 LAGGYDAQYYGYLWSEVYSMDMFHTRFKQEGVLNSKVGMDYRSCILRPGGSEDASAMLRR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LAGGYDAQYYGYLWSEVYSMDMFHTRFKQEGVLNSKVGMDYRSCILRPGGSEDASAMLRR
              610       620       630       640       650       660

              670       680
pF1KE4 FLGRDPKQDAFLLSKGLQVGGCEPEPQVC
       :::::::::::::::::::::::::::::
CCDS12 FLGRDPKQDAFLLSKGLQVGGCEPEPQVC
              670       680

>>CCDS3989.1 NLN gene_id:57486|Hs108|chr5 (704 aa)
 initn: 3015 init1: 2987 opt: 
2990 Z-score: 3416.2 bits: 642.6 E(32554): 5.8e-184 Smith-Waterman score: 2990; 64.7% identity (86.8% similar) in 657 aa overlap (22-678:46-702) 10 20 30 40 50 pF1KE4 MKPPAACAGDMADAASPCSVVNDLRWDLSAQQIEERTRELIEQTKRVYDQV : :::::: .::. ::.::: :::.::: : CCDS39 GGSRILLRMTLGREVMSPLQAMSSYTVAGRNVLRWDLSPEQIKTRTEELIVQTKQVYDAV 20 30 40 50 60 70 60 70 80 90 100 110 pF1KE4 GTQEFEDVSYESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEADKKLSEFDVE : .:.:.::. :.::::::: : :.:..:::::::: .:..:.:::::::.::.::.: CCDS39 GMLGIEEVTYENCLQALADVEVKYIVERTMLDFPQHVSSDKEVRAASTEADKRLSRFDIE 80 90 100 110 120 130 120 130 140 150 160 170 pF1KE4 MSMREDVYQRIVWLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRETQENIKRIKKKL :::: :...::: ::: . ...::: ::::. ::.:.:::::::...:..:: .::.. CCDS39 MSMRGDIFERIVHLQETCDLGKIKPEARRYLEKSIKMGKRNGLHLPEQVQNEIKSMKKRM 140 150 160 170 180 190 180 190 200 210 220 230 pF1KE4 SLLCIDFNKNLNEDTTFLPFTLQELGGLPEDFLNSLEKMEDGKLKVTLKYPHYFPLLKKC : :::::::::::: ::: :. :::.::.::..:::: .: : :.:::::::::..::: CCDS39 SELCIDFNKNLNEDDTFLVFSKAELGALPDDFIDSLEKTDDDKYKITLKYPHYFPVMKKC 200 210 220 230 240 250 240 250 260 270 280 290 pF1KE4 HVPETRRKVEEAFNCRCKEENCAILKELVTLRAQKSRLLGFHTHADYVLEMNMAKTSQTV .:::::..: ::: :::::: ::..:. ::.. ..:::. ::::.::::: ::... : CCDS39 CIPETRRRMEMAFNTRCKEENTIILQQLLPLRTKVAKLLGYSTHADFVLEMNTAKSTSRV 260 270 280 290 300 310 300 310 320 330 340 350 pF1KE4 ATFLDELAQKPKPLGEQERAVILELKRAECERRGLPFDGRIRAWDMRYYMNQVEETRYCV ..:::.:.:: ::::: :: ::.::. ::. ::. .::.: :::. :::.:.:: .: . CCDS39 TAFLDDLSQKLKPLGEAEREFILNLKKKECKDRGFEYDGKINAWDLYYYMTQTEELKYSI 320 330 340 350 360 370 360 370 380 390 400 410 pF1KE4 DQNLLKEYFPVQVVTHGLLGIYQELLGLAFHHEEGASAWHEDVRLYTARDAASGEVVGKF ::..::::::..:::.:::. :::::::.:.. 
: .:...: :::..: :.:::.:.: CCDS39 DQEFLKEYFPIEVVTEGLLNTYQELLGLSFEQMTDAHVWNKSVTLYTVKDKATGEVLGQF 380 390 400 410 420 430 420 430 440 450 460 470 pF1KE4 YLDLYPREGKYGHAACFGLQPGCLRQDGSRQIAIAAMVANFTKPTADAPSLLQHDEVETY :::::::::::.:::::::::::: ::::..:.::.:.::..:.: ::::.::::.:: CCDS39 YLDLYPREGKYNHAACFGLQPGCLLPDGSRMMAVAALVVNFSQPVAGRPSLLRHDEVRTY 440 450 460 470 480 490 480 490 500 510 520 530 pF1KE4 FHEFGHVMHQLCSQAEFAMFSGTHVERDFVEAPSQMLENWVWEQEPLLRMSRHYRTGSAV ::::::::::.:.:..:: ::::.:: ::::.::::::::::. . : :.:.::. :: . CCDS39 FHEFGHVMHQICAQTDFARFSGTNVETDFVEVPSQMLENWVWDVDSLRRLSKHYKDGSPI 500 510 520 530 540 550 540 550 560 570 580 590 pF1KE4 PRELLEKLIESRQANTGLFNLRQIVLAKVDQALHTQTDADPAEEYARLCQEILGVPATPG .:::::. :: .::::..::::::.::::.:::.:. : : :::. :.::::: :::: CCDS39 ADDLLEKLVASRLVNTGLLTLRQIVLSKVDQSLHTNTSLDAASEYAKYCSEILGVAATPG 560 570 580 590 600 610 600 610 620 630 640 650 pF1KE4 TNMPATFGHLAGGYDAQYYGYLWSEVYSMDMFHTRFKQEGVLNSKVGMDYRSCILRPGGS :::::::::::::::.::::::::::.:::::.. ::.::..: .::: ::. ::.:::: CCDS39 TNMPATFGHLAGGYDGQYYGYLWSEVFSMDMFYSCFKKEGIMNPEVGMKYRNLILKPGGS 620 630 640 650 660 670 660 670 680 pF1KE4 EDASAMLRRFLGRDPKQDAFLLSKGLQVGGCEPEPQVC :. ::. :: :.:.: :::.:.::. CCDS39 LDGMDMLHNFLKREPNQKAFLMSRGLHAP 680 690 700 >>CCDS9303.1 MIPEP gene_id:4285|Hs108|chr13 (713 aa) initn: 508 init1: 293 opt: 637 Z-score: 725.4 bits: 144.7 E(32554): 4.4e-34 Smith-Waterman score: 707; 27.3% identity (58.5% similar) in 607 aa overlap (80-672:120-694) 50 60 70 80 90 100 pF1KE4 QVGTQEFEDVSYESTLKALADVEVTYTVQRNILDFPQHVSPSKDIRTASTEADKKLSEFD .. :: . . : .: :. :: .... . CCDS93 LLVDRACSTPPGPQTVLIFDELSDSLCRVADLADFVKIAHPEPAFREAAEEACRSIGTMV 90 100 110 120 130 140 110 120 130 140 150 160 pF1KE4 VEMSMREDVYQRIV-WLQEKVQKDSLRPEAARYLERLIKLGRRNGLHLPRETQENIKRIK ... :.:: . : .: ::: ::. : : .. . .:.:: .: : CCDS93 EKLNTNVDLYQSLQKLLADKKLVDSLDPETRRVAELFMFDFEISGIHLDKE--------K 150 160 170 180 190 200 170 180 190 200 210 220 pF1KE4 KKLSLLCIDFN-KNLNEDTTFLPFTLQELGGLPEDFLNSLEKM---EDGKLKVTLKYPHY .: . .:.: : :. 
..::: .: .: :..:: : . . : : CCDS93 RKRA---VDLNVKILDLSSTFL------MG---TNFPNKIEKHLLPEHIRRNFTSAGDHI 210 220 230 240 230 240 250 260 270 280 pF1KE4 FPLLKKCHVPETRRKVEEAFNCRCKEENCAILK---ELVTLRAQKSRLLGFHTHADYVLE .. :. :.:: : . :: ::.. : ..:.:. : . .:. CCDS93 --IIDGLHAESPDDLVREAAYKIFLYPNAGQLKCLEELLSSRDLLAKLVGYSTFSHRALQ 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 MNMAKTSQTVATFLDELAQKPKPLGEQERAVI-LELKRAECERRGLPFDGRIRAWDMRYY ..::. .:: ::..:..: .::.. .:. :. . . : .... :: :: CCDS93 GTIAKNPETVMQFLEKLSDK-----LSERTLKDFEMIRGMKMKLN-PQNSEVMPWDPPYY 310 320 330 340 350 360 350 360 370 380 390 pF1KE4 MNQVEETRYCVDQNLLKEYFPVQVVTHGLLGIYQELLGLAFHHEEGASA--WHEDVRLYT . .. :: .. .: .: . . .:: . ..:::.... :. :.. : :::: . CCDS93 SGVIRAERYNIEPSLYCPFFSLGACMEGLNILLNRLLGISLYAEQPAKGEVWSEDVRKLA 370 380 390 400 410 420 400 410 420 430 440 450 pF1KE4 ARDAASGEVVGKFYLDLYPREGKYGHAAC-FGLQPGCLRQDGSRQIAIAAMVANFTKPTA . . : ..: .: :.. : : : : : .. : :..::. :. ..... :. . . CCDS93 VVHESEG-LLGYIYCDFFQRADK-PHQDCHFTIRGGRLKEDGDYQLPVVVLMLNLPRSSR 430 440 450 460 470 460 470 480 490 500 510 pF1KE4 DAPSLLQHDEVETYFHEFGHVMHQLCSQAEFAMFSGTHVERDFVEAPSQMLENWVWEQEP ..:.:: . .:. :::.::.::.. ..... .::. ::.:.:: ..: .. . . CCDS93 SSPTLLTPSMMENLFHEMGHAMHSMLGRTRYQHVTGTRCPTDFAEVPSILMEYFANDYRV 480 490 500 510 520 530 520 530 540 550 560 570 pF1KE4 LLRMSRHYRTGSAVPRELLEKLIESRQANTGLFNLRQIVLAKVDQALHTQTDA-DPAEEY . ...:::.::. .:.... .: ::... .. :. : .:: : . . . . CCDS93 VNQFARHYQTGQPLPKNMVSRLCESKKVCAAADMQLQVFYATLDQIYHGKHPLRNSTTDI 540 550 560 570 580 590 580 590 600 610 620 630 pF1KE4 ARLCQE-ILGVPATPGTNMPATFGHLAGGYDAQYYGYLWSEVYSMDMFHTRFKQEGVLNS . :: . :.: .:.: :.::.: : :.::.:: :.. . ... : :. .: CCDS93 LKETQEKFYGLPYVPNTAWQLRFSHLVG-YGARYYSYLMSRAVASMVWKECFLQDP-FNR 600 610 620 630 640 650 640 650 660 670 680 pF1KE4 KVGMDYRSCILRPGGSEDASAMLRRFLGRDPKQDAFLLSKGLQVGGCEPEPQVC .: :: .: ::... :.. .: . :. : :. 
CCDS93 AAGERYRREMLAHGGGREPMLMVEGMLQKCPSVDDFVSALVSDLDLDFETFLMDSE
       660       670       680       690       700       710

689 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 23:54:29 2016 done: Sat Nov 5 23:54:30 2016
 Total Scan time: 3.720 Total Display time: 0.070

Function used was FASTA [36.3.4 Apr, 2011]