FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5232, 251 aa
  1>>>pF1KE5232 251 - 251 aa - 251 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.9117+/-0.00076; mu= -0.2813+/- 0.046
 mean_var=209.4735+/-41.989, 0's: 0 Z-trim(116.5): 16 B-trim: 0 in 0/51
 Lambda= 0.088616
 statistics sampled from 17092 (17105) to 17092 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.525), width: 16
 Scan time: 2.410

The best scores are:                                   opt  bits E(32554)
CCDS34347.1 HDGFL1 gene_id:154150|Hs108|chr6  ( 251) 1747 234.8 4.3e-62
CCDS1156.1 HDGF gene_id:3068|Hs108|chr1       ( 240)  719 103.4 1.5e-22
CCDS44248.1 HDGF gene_id:3068|Hs108|chr1      ( 233)  598  87.9 6.7e-18
CCDS44247.1 HDGF gene_id:3068|Hs108|chr1      ( 256)  598  87.9 7.2e-18
CCDS59336.1 HDGFRP2 gene_id:84717|Hs108|chr19 ( 670)  495  75.0 1.4e-13
CCDS42472.1 HDGFRP2 gene_id:84717|Hs108|chr19 ( 671)  495  75.0 1.4e-13
CCDS32314.1 HDGFRP3 gene_id:50810|Hs108|chr15 ( 203)  449  68.8 3.3e-12
CCDS83348.1 PSIP1 gene_id:11168|Hs108|chr9    ( 329)  451  69.2   4e-12
CCDS6480.1 PSIP1 gene_id:11168|Hs108|chr9     ( 333)  451  69.2   4e-12
CCDS6479.1 PSIP1 gene_id:11168|Hs108|chr9     ( 530)  451  69.3 5.8e-12

>>CCDS34347.1 HDGFL1 gene_id:154150|Hs108|chr6 (251 aa)
 initn: 1747 init1: 1747 opt: 1747 Z-score: 1228.2 bits: 234.8 E(32554): 4.3e-62
Smith-Waterman score: 1747; 100.0% identity (100.0% similar) in 251 aa overlap (1-251:1-251)

               10        20        30        40        50        60
pF1KE5 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHMTQPNRYQVFFFGTHETAFLSPKRLFPYK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHMTQPNRYQVFFFGTHETAFLSPKRLFPYK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 ECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEGDEDKPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 
ECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEGDEDKPT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 HAGGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEEEAEAERAAEAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 HAGGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEEEAEAERAAEAE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 RAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVADEEASQEWHA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 RAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVADEEASQEWHA 190 200 210 220 230 240 250 pF1KE5 EAPGGGDRDSL ::::::::::: CCDS34 EAPGGGDRDSL 250 >>CCDS1156.1 HDGF gene_id:3068|Hs108|chr1 (240 aa) initn: 791 init1: 450 opt: 719 Z-score: 518.2 bits: 103.4 E(32554): 1.5e-22 Smith-Waterman score: 816; 57.4% identity (70.5% similar) in 251 aa overlap (9-251:10-240) 10 20 30 40 50 pF1KE5 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHMTQP------NRYQVFFFGTHETAFLSP :: :::::::.::: ::::::..: . :.::::::::::::::.: CCDS11 MSRSNRQKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 KRLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAE : ::::.: ::::::::::.::: :::::::::::.:: ...:. . : ::::::: CCDS11 KDLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSCVEEPEPEPEAAE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 GDEDKPTHAGGGGDELGKPDDDKPTEE--EKGPLKRSAGDPPEDAPKRPKEAAPDQEEEA :: :: .: :..:: :: :.:..: ::: ::: ::: ::.::::::: . :: CCDS11 GDGDKKGNAEGSSDEEGKLVIDEPAKEKNEKGALKRRAGDLLEDSPKRPKEAENPEGEEK 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 EAERAAEAERAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVAD :: . :.:: :. . 
::..:.::::: ::: :::: ::: CCDS11 EAA-TLEVER--------------PLPMEVEKNSTPSEPGSGRGPPQEEEEEEDEEE--- 190 200 210 220 240 250 pF1KE5 EEASQEWHAEAPGGGDRDSL ::..: ::::: :..:: CCDS11 -EATKE-DAEAPGIRDHESL 230 240 >>CCDS44248.1 HDGF gene_id:3068|Hs108|chr1 (233 aa) initn: 661 init1: 450 opt: 598 Z-score: 434.8 bits: 87.9 E(32554): 6.7e-18 Smith-Waterman score: 695; 57.3% identity (70.6% similar) in 218 aa overlap (36-251:36-233) 10 20 30 40 50 60 pF1KE5 MPMYKSGDLVFAKLKGYAHWPARIEHMTQPNRYQVFFFGTHETAFLSPKRLFPYKECKEK :.::::::::::::::.:: ::::.: ::: CCDS44 GGNRVQTSTLNCAGAAVIDEMPEAAVKSTANKYQVFFFGTHETAFLGPKDLFPYEESKEK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 FGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEGDEDKPTHAGGG :::::::.::: :::::::::::.:: ...:. . : ::::::::: :: .: :. CCDS44 FGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSCVEEPEPEPEAAEGDGDKKGNAEGS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 GDELGKPDDDKPTEE--EKGPLKRSAGDPPEDAPKRPKEAAPDQEEEAEAERAAEAERAA .:: :: :.:..: ::: ::: ::: ::.::::::: . :: :: . :.:: CCDS44 SDEEGKLVIDEPAKEKNEKGALKRRAGDLLEDSPKRPKEAENPEGEEKEAA-TLEVER-- 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVADEEASQEWHAEAP :. . ::..:.::::: ::: :::: ::: ::..: :::: CCDS44 ------------PLPMEVEKNSTPSEPGSGRGPPQEEEEEEDEEE----EATKE-DAEAP 190 200 210 220 250 pF1KE5 GGGDRDSL : :..:: CCDS44 GIRDHESL 230 >>CCDS44247.1 HDGF gene_id:3068|Hs108|chr1 (256 aa) initn: 661 init1: 450 opt: 598 Z-score: 434.2 bits: 87.9 E(32554): 7.2e-18 Smith-Waterman score: 695; 57.3% identity (70.6% similar) in 218 aa overlap (36-251:59-256) 10 20 30 40 50 60 pF1KE5 MPMYKSGDLVFAKLKGYAHWPARIEHMTQPNRYQVFFFGTHETAFLSPKRLFPYKECKEK :.::::::::::::::.:: ::::.: ::: CCDS44 GGRRAQIPDVSRATPHTIDEMPEAAVKSTANKYQVFFFGTHETAFLGPKDLFPYEESKEK 30 40 50 60 70 80 70 80 90 100 110 120 pF1KE5 FGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEGDEDKPTHAGGG :::::::.::: :::::::::::.:: ...:. . : ::::::::: :: .: :. 
CCDS44 FGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSCVEEPEPEPEAAEGDGDKKGNAEGS 90 100 110 120 130 140 130 140 150 160 170 180 pF1KE5 GDELGKPDDDKPTEE--EKGPLKRSAGDPPEDAPKRPKEAAPDQEEEAEAERAAEAERAA .:: :: :.:..: ::: ::: ::: ::.::::::: . :: :: . :.:: CCDS44 SDEEGKLVIDEPAKEKNEKGALKRRAGDLLEDSPKRPKEAENPEGEEKEAA-TLEVER-- 150 160 170 180 190 200 190 200 210 220 230 240 pF1KE5 AAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVADEEASQEWHAEAP :. . ::..:.::::: ::: :::: ::: ::..: :::: CCDS44 ------------PLPMEVEKNSTPSEPGSGRGPPQEEEEEEDEEE----EATKE-DAEAP 210 220 230 240 250 pF1KE5 GGGDRDSL : :..:: CCDS44 GIRDHESL 250 >>CCDS59336.1 HDGFRP2 gene_id:84717|Hs108|chr19 (670 aa) initn: 487 init1: 332 opt: 495 Z-score: 357.2 bits: 75.0 E(32554): 1.4e-13 Smith-Waterman score: 497; 39.9% identity (67.3% similar) in 208 aa overlap (6-196:1-206) 10 20 30 40 50 pF1KE5 MSAYGMPM-YKSGDLVFAKLKGYAHWPARIEHMTQ------PNRYQVFFFGTHETAFLSP :: .: :::::::.::: ::::::. ... ::.: .:::::::::::.: CCDS59 MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGP 10 20 30 40 50 60 70 80 90 100 pF1KE5 KRLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEP---- : :::: .::.:.::::::.::. :::::.::: .. : : .: . : ..: .: CCDS59 KDLFPYDKCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDS-EAPEANPADGS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 EAAEGDEDKPTHA------GGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEA .: : :::. . : ...:.. . .:. . ...: :::.. .. :: ..: CCDS59 DADEDDEDRGVMAVTAVTATAASDRMESDSDSDKSSDNSG-LKRKTPALKMSVSKRARKA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE5 APDQEEEAEAERAAEAERAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEE . : .. . . : .... . :. .. 
.: CCDS59 SSDLDQASVSPSEEENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASD 180 190 200 210 220 230 230 240 250 pF1KE5 LREEEVADEEASQEWHAEAPGGGDRDSL CCDS59 SDSKADSDGAKPEPVAMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPK 240 250 260 270 280 290 >>CCDS42472.1 HDGFRP2 gene_id:84717|Hs108|chr19 (671 aa) initn: 520 init1: 332 opt: 495 Z-score: 357.1 bits: 75.0 E(32554): 1.4e-13 Smith-Waterman score: 497; 39.9% identity (67.3% similar) in 208 aa overlap (6-196:1-206) 10 20 30 40 50 pF1KE5 MSAYGMPM-YKSGDLVFAKLKGYAHWPARIEHMTQ------PNRYQVFFFGTHETAFLSP :: .: :::::::.::: ::::::. ... ::.: .:::::::::::.: CCDS42 MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGP 10 20 30 40 50 60 70 80 90 100 pF1KE5 KRLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEP---- : :::: .::.:.::::::.::. :::::.::: .. : : .: . : ..: .: CCDS42 KDLFPYDKCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDS-EAPEANPADGS 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 EAAEGDEDKPTHA------GGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEA .: : :::. . : ...:.. . .:. . ...: :::.. .. :: ..: CCDS42 DADEDDEDRGVMAVTAVTATAASDRMESDSDSDKSSDNSG-LKRKTPALKMSVSKRARKA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE5 APDQEEEAEAERAAEAERAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEE . : .. . . : .... . :. .. .: CCDS42 SSDLDQASVSPSEEENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASD 180 190 200 210 220 230 230 240 250 pF1KE5 LREEEVADEEASQEWHAEAPGGGDRDSL CCDS42 SDSKADSDGAKPEPVAMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPK 240 250 260 270 280 290 >>CCDS32314.1 HDGFRP3 gene_id:50810|Hs108|chr15 (203 aa) initn: 483 init1: 301 opt: 449 Z-score: 332.7 bits: 68.8 E(32554): 3.3e-12 Smith-Waterman score: 449; 41.2% identity (66.0% similar) in 194 aa overlap (9-192:9-195) 10 20 30 40 50 pF1KE5 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHM----TQP--NRYQVFFFGTHETAFLSPK ::.:::::::.::: ::::::... 
..: :.: .:::::::::::.:: CCDS32 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 RLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEG :::::: :.:::: :::.::. ::::::::: :. . ....:. : :. :: CCDS32 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSS-----ETEG-EG 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 DEDKPTHAGGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEE----E . . . ::.. . : ..::. ::. . . . :. ... :... : CCDS32 GNTADASSEEEGDRV-EEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 AEAERAAEAERAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVA : . ..:. :. . :. : CCDS32 EENKSSSEGGDAGNDTRNTTSDLQKTSEGT 180 190 200 >>CCDS83348.1 PSIP1 gene_id:11168|Hs108|chr9 (329 aa) initn: 475 init1: 297 opt: 451 Z-score: 331.1 bits: 69.2 E(32554): 4e-12 Smith-Waterman score: 451; 35.9% identity (61.2% similar) in 245 aa overlap (9-237:5-240) 10 20 30 40 50 pF1KE5 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHM----TQP--NRYQVFFFGTHETAFLSPK .: :::.:::.::: :::::.... ..: :. .:::::::::::.:: CCDS83 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 RLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKG--SGDGPWPEPEAA .:::.: :::.::::::.::. :::::.::: :. :. :.... :.: : :.. CCDS83 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 EGDEDKPTHAGGGGDELGKPDD-DKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEEEA . :: . ...... : : : ..: ::.: : .::. : CCDS83 VSKEDTDHEEKASNEDVTKAVDITTPKAARRGR-KRKAEKQVET-----EEAGVVTTATA 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 EAERAAEAERAAAAAAATAVDEES--PFLVAVENGSAPSEPGLVCEPPQP-----EEEEL .. . .:. ::. . . . : .: . ::: .. : . ::.. CCDS83 SVNLKVSPKRGRPAATEVKIPKPRGRPKMV---KQPCPSESDIITEEDKSKKKGQEEKQP 180 190 200 210 220 230 240 250 pF1KE5 REEEVADEEASQEWHAEAPGGGDRDSL ... 
:::...: CCDS83 KKQPKKDEEGQKEEDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKK 230 240 250 260 270 280 >>CCDS6480.1 PSIP1 gene_id:11168|Hs108|chr9 (333 aa) initn: 475 init1: 297 opt: 451 Z-score: 331.1 bits: 69.2 E(32554): 4e-12 Smith-Waterman score: 451; 35.9% identity (61.2% similar) in 245 aa overlap (9-237:5-240) 10 20 30 40 50 pF1KE5 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHM----TQP--NRYQVFFFGTHETAFLSPK .: :::.:::.::: :::::.... ..: :. .:::::::::::.:: CCDS64 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 RLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKG--SGDGPWPEPEAA .:::.: :::.::::::.::. :::::.::: :. :. :.... :.: : :.. CCDS64 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 EGDEDKPTHAGGGGDELGKPDD-DKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEEEA . :: . ...... : : : ..: ::.: : .::. : CCDS64 VSKEDTDHEEKASNEDVTKAVDITTPKAARRGR-KRKAEKQVET-----EEAGVVTTATA 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 EAERAAEAERAAAAAAATAVDEES--PFLVAVENGSAPSEPGLVCEPPQP-----EEEEL .. . .:. ::. . . . : .: . ::: .. : . ::.. CCDS64 SVNLKVSPKRGRPAATEVKIPKPRGRPKMV---KQPCPSESDIITEEDKSKKKGQEEKQP 180 190 200 210 220 230 240 250 pF1KE5 REEEVADEEASQEWHAEAPGGGDRDSL ... :::...: CCDS64 KKQPKKDEEGQKEEDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKK 230 240 250 260 270 280 >>CCDS6479.1 PSIP1 gene_id:11168|Hs108|chr9 (530 aa) initn: 475 init1: 297 opt: 451 Z-score: 328.2 bits: 69.3 E(32554): 5.8e-12 Smith-Waterman score: 451; 35.9% identity (61.2% similar) in 245 aa overlap (9-237:5-240) 10 20 30 40 50 pF1KE5 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHM----TQP--NRYQVFFFGTHETAFLSPK .: :::.:::.::: :::::.... ..: :. .:::::::::::.:: CCDS64 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK 10 20 30 40 50 60 70 80 90 100 110 pF1KE5 RLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKG--SGDGPWPEPEAA .:::.: :::.::::::.::. :::::.::: :. :. :.... :.: : :.. 
CCDS64 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE5 EGDEDKPTHAGGGGDELGKPDD-DKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEEEA . :: . ...... : : : ..: ::.: : .::. : CCDS64 VSKEDTDHEEKASNEDVTKAVDITTPKAARRGR-KRKAEKQVET-----EEAGVVTTATA 120 130 140 150 160 170 180 190 200 210 220 pF1KE5 EAERAAEAERAAAAAAATAVDEES--PFLVAVENGSAPSEPGLVCEPPQP-----EEEEL .. . .:. ::. . . . : .: . ::: .. : . ::.. CCDS64 SVNLKVSPKRGRPAATEVKIPKPRGRPKMV---KQPCPSESDIITEEDKSKKKGQEEKQP 180 190 200 210 220 230 240 250 pF1KE5 REEEVADEEASQEWHAEAPGGGDRDSL ... :::...: CCDS64 KKQPKKDEEGQKEEDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKK 230 240 250 260 270 280

251 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 22:43:57 2016 done: Mon Nov 7 22:43:57 2016
 Total Scan time: 2.410 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]