FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2324, 333 aa 1>>>pF1KE2324 333 - 333 aa - 333 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.9581+/-0.00105; mu= -3.9298+/- 0.063 mean_var=294.3977+/-58.317, 0's: 0 Z-trim(113.2): 25 B-trim: 0 in 0/54 Lambda= 0.074749 statistics sampled from 13835 (13857) to 13835 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.426), width: 16 Scan time: 2.550 The best scores are: opt bits E(32554) CCDS6480.1 PSIP1 gene_id:11168|Hs108|chr9 ( 333) 2211 251.4 7.3e-67 CCDS83348.1 PSIP1 gene_id:11168|Hs108|chr9 ( 329) 2162 246.2 2.8e-65 CCDS6479.1 PSIP1 gene_id:11168|Hs108|chr9 ( 530) 2160 246.1 4.7e-65 CCDS32314.1 HDGFRP3 gene_id:50810|Hs108|chr15 ( 203) 613 78.9 3.8e-15 CCDS59336.1 HDGFRP2 gene_id:84717|Hs108|chr19 ( 670) 593 77.2 4.1e-14 CCDS42472.1 HDGFRP2 gene_id:84717|Hs108|chr19 ( 671) 593 77.2 4.1e-14 CCDS1156.1 HDGF gene_id:3068|Hs108|chr1 ( 240) 576 75.0 6.9e-14 >>CCDS6480.1 PSIP1 gene_id:11168|Hs108|chr9 (333 aa) initn: 2211 init1: 2211 opt: 2211 Z-score: 1313.9 bits: 251.4 E(32554): 7.3e-67 Smith-Waterman score: 2211; 100.0% identity (100.0% similar) in 333 aa overlap (1-333:1-333) 10 20 30 40 50 60 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETSVSKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETSVSKE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 DTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVNLKVSPKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 DTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVNLKVSPKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKKQPKKDEEGQKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 GRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKKQPKKDEEGQKE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 EDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKRKGGRNFQTAHRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 EDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKRKGGRNFQTAHRR 250 260 270 280 290 300 310 320 330 pF1KE2 NMLKGQHEKEAADRKRKQEEQMETEHQTTCNLQ ::::::::::::::::::::::::::::::::: CCDS64 NMLKGQHEKEAADRKRKQEEQMETEHQTTCNLQ 310 320 330 >>CCDS83348.1 PSIP1 gene_id:11168|Hs108|chr9 (329 aa) initn: 2162 init1: 2162 opt: 2162 Z-score: 1285.4 bits: 246.2 E(32554): 2.8e-65 Smith-Waterman score: 2162; 100.0% identity (100.0% similar) in 326 aa overlap (1-326:1-326) 10 20 30 40 50 60 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETSVSKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETSVSKE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 DTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVNLKVSPKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 DTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVNLKVSPKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKKQPKKDEEGQKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 GRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKKQPKKDEEGQKE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 EDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKRKGGRNFQTAHRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 EDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKRKGGRNFQTAHRR 250 260 270 280 290 300 310 320 330 pF1KE2 NMLKGQHEKEAADRKRKQEEQMETEHQTTCNLQ :::::::::::::::::::::::::: CCDS83 NMLKGQHEKEAADRKRKQEEQMETEHFAL 310 320 >>CCDS6479.1 PSIP1 gene_id:11168|Hs108|chr9 (530 aa) initn: 2247 init1: 2160 opt: 2160 Z-score: 1281.4 bits: 246.1 E(32554): 4.7e-65 Smith-Waterman score: 2160; 99.7% identity (100.0% similar) in 327 aa overlap (1-327:1-327) 10 20 30 40 50 60 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETSVSKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETSVSKE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 DTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVNLKVSPKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 DTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVNLKVSPKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 GRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKKQPKKDEEGQKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 GRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKKQPKKDEEGQKE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 EDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKRKGGRNFQTAHRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS64 EDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKRKGGRNFQTAHRR 250 260 270 280 290 300 310 320 330 pF1KE2 NMLKGQHEKEAADRKRKQEEQMETEHQTTCNLQ :::::::::::::::::::::::::.: CCDS64 NMLKGQHEKEAADRKRKQEEQMETEQQNKDEGKKPEVKKVEKKRETSMDSRLQRIHAEIK 310 320 330 340 350 360 >>CCDS32314.1 HDGFRP3 gene_id:50810|Hs108|chr15 (203 aa) initn: 604 init1: 572 opt: 613 Z-score: 385.4 bits: 78.9 E(32554): 3.8e-15 Smith-Waterman score: 613; 55.6% identity (78.8% similar) in 160 aa overlap (3-155:7-166) 10 20 30 40 50 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK :..: :::.:::::::::::::.::.:.::::::.:: :::::::::::::::: CCDS32 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS :.:::.: :.:.:: :::::::::::::.::: :::.. :: .::.. .. : . . CCDS32 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGEGGNTADA 70 80 90 100 110 120 120 130 140 150 160 pF1KE2 VSKEDTDHEE-----KASNEDV--TKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTAT :.:. :. : : .:: . . . :. :.....:: ... CCDS32 SSEEEGDRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKEEENKSSS 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE2 ASVNLKVSPKRGRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKK CCDS32 EGGDAGNDTRNTTSDLQKTSEGT 190 200 >>CCDS59336.1 HDGFRP2 gene_id:84717|Hs108|chr19 (670 aa) initn: 648 init1: 559 opt: 593 Z-score: 366.8 bits: 77.2 E(32554): 4.1e-14 Smith-Waterman score: 671; 34.9% identity (63.7% similar) in 358 aa overlap (1-327:1-353) 10 20 30 40 50 60 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP : . ::::::.:::::::::::::.:.. ::::::: :: :::::::::::::::::.:: CCDS59 MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGPKDLFP 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFS------SQQAATKQSNAS--SDVEVEE :.. :.::::::::::::::::::.:::....: :... . ..: . ::.. .. CCDS59 YDKCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDSEAPEANPADGSDADEDD 70 80 90 100 110 120 120 130 140 150 pF1KE2 KE------TSVS--------KEDTDHEEKASNEDVTKAVDITTPKAARRGRKRKAE-KQV .. :.:. . :.: .....: . . . ....:.:: ... :. CCDS59 EDRGVMAVTAVTATAASDRMESDSDSDKSSDNSGLKRKTPALKMSVSKRARKASSDLDQA 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE2 ETEEAGVVTTATASVNLKVSPKRGRPAA-TEVKIPK--PRGRPKMVKQPCPSESDIITEE . . .. ..: . :.: . : . :. :. : : : : : :.:: .. CCDS59 SVSPSEEENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASDSDSKADS 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE2 DKSKKKGQEEKQPKKQPKKDEEGQKEEDKPRKEP-DKKEGKKEVESKRKNLAKT---GVT : .: . . .. ... .. . . .: : .: ..: . . : : . CCDS59 DGAKPEPVAMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPKPERPPSS 250 260 270 280 290 300 280 290 300 310 320 pF1KE2 STSDSEEEGDDQEGEKKRKGGRNFQTAHRRNM-LKGQHEKEAADRKRKQEEQMETEHQTT :.:::. . :. .: ::. . :.::.. . ..:.: :. ...:. : :.. CCDS59 SSSDSDSDEVDRISEWKRR-----DEARRRELEARRRREQEEELRRLREQEKEEKERRRE 310 320 330 340 350 330 pF1KE2 CNLQ CCDS59 RADRGEAERGSGGSSGDELREDDEPVKKRGRKGRGRGPPSSSDSEPEAELEREAKKSAKK 360 370 380 390 400 410 >>CCDS42472.1 HDGFRP2 gene_id:84717|Hs108|chr19 (671 aa) initn: 648 init1: 559 opt: 593 Z-score: 366.8 bits: 77.2 E(32554): 4.1e-14 Smith-Waterman score: 671; 34.9% identity (63.7% similar) in 358 aa overlap (1-327:1-353) 10 20 30 40 50 60 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPKDIFP : . ::::::.:::::::::::::.:.. ::::::: :: :::::::::::::::::.:: CCDS42 MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGPKDLFP 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 YSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFS------SQQAATKQSNAS--SDVEVEE :.. :.::::::::::::::::::.:::....: :... . ..: . ::.. .. CCDS42 YDKCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDSEAPEANPADGSDADEDD 70 80 90 100 110 120 120 130 140 150 pF1KE2 KE------TSVS--------KEDTDHEEKASNEDVTKAVDITTPKAARRGRKRKAE-KQV .. :.:. . :.: .....: . . . ....:.:: ... :. CCDS42 EDRGVMAVTAVTATAASDRMESDSDSDKSSDNSGLKRKTPALKMSVSKRARKASSDLDQA 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE2 ETEEAGVVTTATASVNLKVSPKRGRPAA-TEVKIPK--PRGRPKMVKQPCPSESDIITEE . . .. ..: . :.: . : . :. :. : : : : : :.:: .. CCDS42 SVSPSEEENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASDSDSKADS 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE2 DKSKKKGQEEKQPKKQPKKDEEGQKEEDKPRKEP-DKKEGKKEVESKRKNLAKT---GVT : .: . . .. ... .. . . .: : .: ..: . . : : . CCDS42 DGAKPEPVAMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPKPERPPSS 250 260 270 280 290 300 280 290 300 310 320 pF1KE2 STSDSEEEGDDQEGEKKRKGGRNFQTAHRRNM-LKGQHEKEAADRKRKQEEQMETEHQTT :.:::. . :. .: ::. . :.::.. . ..:.: :. ...:. : :.. CCDS42 SSSDSDSDEVDRISEWKRR-----DEARRRELEARRRREQEEELRRLREQEKEEKERRRE 310 320 330 340 350 330 pF1KE2 CNLQ CCDS42 RADRGEAERGSGGSSGDELREDDEPVKKRGRKGRGRGPPSSSDSEPEAELEREAKKSAKK 360 370 380 390 400 410 >>CCDS1156.1 HDGF gene_id:3068|Hs108|chr1 (240 aa) initn: 609 init1: 535 opt: 576 Z-score: 362.8 bits: 75.0 E(32554): 6.9e-14 Smith-Waterman score: 611; 43.2% identity (64.8% similar) in 264 aa overlap (3-259:8-239) 10 20 30 40 50 pF1KE2 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGP ...: :::.:::::::::::::.::.:..::: .:: .::::::::::::: CCDS11 MSRSNRQKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 KDIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKET ::.::: :.:::.:::::::::.::::::.::: :: :. :.. :.: :: : : CCDS11 KDLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSC----VEEPEPEP 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 SVSKEDTDHEEKA--SNEDVTKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTATASVN ... : :.. .: :... : : : : :.: :: . ..:: . CCDS11 EAAEGDGDKKGNAEGSSDEEGKLV-IDEPA------KEKNEKGALKRRAGDL-------- 120 130 140 150 160 180 190 200 210 220 pF1KE2 LKVSPKRGRPAATEVKIPKPRGRPKM-----VKQPCPSESDIITEEDKSKKKGQEEKQPK :. :::: . : . :.:. : :..: : : .:.... .. . : CCDS11 LEDSPKRPKEAEN------PEGEEKEAATLEVERPLPME----VEKNSTPSEPGSGRGP- 170 180 190 200 210 230 240 250 260 270 280 pF1KE2 KQPKKDEEGQKEEDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGEKKR :...:: . ::.. :: . : .. :: CCDS11 --PQEEEEEEDEEEEATKEDAEAPGIRDHESL 220 230 240 290 300 310 320 330 pF1KE2 KGGRNFQTAHRRNMLKGQHEKEAADRKRKQEEQMETEHQTTCNLQ 333 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 15:50:45 2016 done: Sun Nov 6 15:50:45 2016 Total Scan time: 2.550 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]