FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5568, 431 aa
  1>>>pF1KE5568 431 - 431 aa - 431 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.3254+/-0.000932; mu= 0.4480+/- 0.056
 mean_var=228.4081+/-45.811, 0's: 0 Z-trim(113.9): 10 B-trim: 136 in 1/50
 Lambda= 0.084863
 statistics sampled from 14488 (14497) to 14488 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.769), E-opt: 0.2 (0.445), width: 16
 Scan time: 3.010

The best scores are:                                  opt  bits E(32554)
CCDS33249.1 ASTL gene_id:431705|Hs108|chr2   ( 431)  2947 373.3 2.5e-103
CCDS77173.1 MEP1B gene_id:4225|Hs108|chr18   ( 700)   464  69.5 1.2e-11
CCDS45846.1 MEP1B gene_id:4225|Hs108|chr18   ( 701)   464  69.5 1.2e-11
CCDS34856.1 BMP1 gene_id:649|Hs108|chr8      ( 730)   446  67.3 5.7e-11
CCDS7449.1 TLL2 gene_id:7093|Hs108|chr10     (1015)   449  67.8 5.7e-11
CCDS6026.1 BMP1 gene_id:649|Hs108|chr8       ( 986)   446  67.4 7.2e-11
CCDS56342.1 TLL1 gene_id:7092|Hs108|chr4     ( 392)   435  65.8 8.9e-11

>>CCDS33249.1 ASTL gene_id:431705|Hs108|chr2 (431 aa)
 initn: 2947 init1: 2947 opt: 2947 Z-score: 1968.5 bits: 373.3 E(32554): 2.5e-103
Smith-Waterman score: 2947; 99.5% identity (100.0% similar) in 431 aa overlap (1-431:1-431)

               10        20        30        40        50        60
pF1KE5 MEGVGGLWPWVLGLLSLPGVILGAPLASSCAGACGTSFPDGLTPEGTQASGDKDIPAINQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MEGVGGLWPWVLGLLSLPGVILGAPLASSCAGACGTSFPDGLTPEGTQASGDKDIPAINQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 GLILEETPESSFLIEGDIIRPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKYDEPSRQVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 GLILEETPESSFLIEGDIIRPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKYDEPSRQVI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 LEALAEFERSTCIRFVTYQDQRDFISIIPMYGCFSSVGRSGGMQVVSLAPTCLQKGRGIV
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LEALAEFERSTCIRFVTYQDQRDFISIIPMYGCFSSVGRSGGMQVVSLAPTCLQKGRGIV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 LHELMHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSRSSNMLTPYDYSSVMHYGR :::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::: CCDS33 LHELMHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSQSSNMLTPYDYSSVMHYGR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 LAFSRRGLPTITPLWAPSVHIGQRWNLSASDITRVLQLYGCSPSGPRPRGRGSHAHSTGR ::::::::::::::::::::::::::::::::::::.::::::::::::::::::::::: CCDS33 LAFSRRGLPTITPLWAPSVHIGQRWNLSASDITRVLKLYGCSPSGPRPRGRGSHAHSTGR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 SPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKLSAEASA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 SPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKLSAEASA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 RQPQTLASSPRSRPGAGAPGVAQEQSWLAGVSTKPTVPSSEAGIQPVPVQGSPALPGGCV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 RQPQTLASSPRSRPGAGAPGVAQEQSWLAGVSTKPTVPSSEAGIQPVPVQGSPALPGGCV 370 380 390 400 410 420 430 pF1KE5 PRNHFKGMSED ::::::::::: CCDS33 PRNHFKGMSED 430 >>CCDS77173.1 MEP1B gene_id:4225|Hs108|chr18 (700 aa) initn: 462 init1: 186 opt: 464 Z-score: 322.6 bits: 69.5 E(32554): 1.2e-11 Smith-Waterman score: 501; 37.8% identity (63.8% similar) in 254 aa overlap (41-284:20-258) 20 30 40 50 60 pF1KE5 VLGLLSLPGVILGAPLASSCAGACGTSFPDGL-TPEGTQASG--DKDIPAINQGLILEET :: :::. ...: :.:: ::.:: :. CCDS77 MDLWNLSWFLFLDALLVISGLATPENFDVDGGMDQDIFDINEGLGLD-- 10 20 30 40 70 80 90 100 110 120 pF1KE5 PESSFLIEGDII--RPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKYDEPSRQVILEALA :.:::: : . . . . .:: .:..: .. . .. :::.:. CCDS77 -----LFEGDIRLDRAQIRNSIIGEKYRWPH------TIPYVLEDSLEMNAKGVILNAFE 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 EFERSTCIRFVTYQDQRDFISIIPMYGCFSSVG-RSGGMQVVSLAPTCLQKGRGIVLHEL ... .::: : . . ..::.. ::.:::: : : : .:.. .: . . : ::. 
CCDS77 RYRLKTCIDFKPWAGETNYISVFKGSGCWSSVGNRRVGKQELSIGANCDRIAT--VQHEF 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE5 MHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSR---SSNMLTPYDYSSVMHYGRL .:.::::::..:.::: :.:. :..:: : : :: :... .::::.:::::.. CCDS77 LHALGFWHEQSRSDRDDYVRIMWDRILSGREHNFNTYSDDISDSLNVPYDYTSVMHYSKT 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE5 AFSRRGLPTITPLWAPSVH-IGQRWNLSASDITRVLQLYGCSPSGPRPRGRGSHAHSTGR ::. :::. . :::: ..: ::. .. :::.:: : CCDS77 AFQNGTEPTIVTRISDFEDVIGQRMDFSDSDLLKLNQLYNCSSSLSFMDSCSFELENVCG 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE5 SPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKLSAEASA CCDS77 MIQSSGDNADWQRVSQVPRGPESDHSNMGQCQGSGFFMHFDSSSVNVGATAVLESRTLYP 280 290 300 310 320 330 >>CCDS45846.1 MEP1B gene_id:4225|Hs108|chr18 (701 aa) initn: 462 init1: 186 opt: 464 Z-score: 322.6 bits: 69.5 E(32554): 1.2e-11 Smith-Waterman score: 501; 37.8% identity (63.8% similar) in 254 aa overlap (41-284:20-258) 20 30 40 50 60 pF1KE5 VLGLLSLPGVILGAPLASSCAGACGTSFPDGL-TPEGTQASG--DKDIPAINQGLILEET :: :::. ...: :.:: ::.:: :. CCDS45 MDLWNLSWFLFLDALLVISGLATPENFDVDGGMDQDIFDINEGLGLD-- 10 20 30 40 70 80 90 100 110 120 pF1KE5 PESSFLIEGDII--RPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKYDEPSRQVILEALA :.:::: : . . . . .:: .:..: .. . .. :::.:. CCDS45 -----LFEGDIRLDRAQIRNSIIGEKYRWPH------TIPYVLEDSLEMNAKGVILNAFE 50 60 70 80 90 130 140 150 160 170 180 pF1KE5 EFERSTCIRFVTYQDQRDFISIIPMYGCFSSVG-RSGGMQVVSLAPTCLQKGRGIVLHEL ... .::: : . . ..::.. ::.:::: : : : .:.. .: . . : ::. CCDS45 RYRLKTCIDFKPWAGETNYISVFKGSGCWSSVGNRRVGKQELSIGANCDRIAT--VQHEF 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE5 MHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSR---SSNMLTPYDYSSVMHYGRL .:.::::::..:.::: :.:. :..:: : : :: :... .::::.:::::.. CCDS45 LHALGFWHEQSRSDRDDYVRIMWDRILSGREHNFNTYSDDISDSLNVPYDYTSVMHYSKT 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE5 AFSRRGLPTITPLWAPSVH-IGQRWNLSASDITRVLQLYGCSPSGPRPRGRGSHAHSTGR ::. :::. . :::: ..: ::. .. 
:::.:: : CCDS45 AFQNGTEPTIVTRISDFEDVIGQRMDFSDSDLLKLNQLYNCSSSLSFMDSCSFELENVCG 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE5 SPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKLSAEASA CCDS45 MIQSSGDNADWQRVSQVPRGPESDHSNMGQCQGSGFFMHFDSSSVNVGATAVLESRTLYP 280 290 300 310 320 330 >>CCDS34856.1 BMP1 gene_id:649|Hs108|chr8 (730 aa) initn: 398 init1: 245 opt: 446 Z-score: 310.4 bits: 67.3 E(32554): 5.7e-11 Smith-Waterman score: 449; 38.1% identity (64.1% similar) in 231 aa overlap (94-300:130-351) 70 80 90 100 110 120 pF1KE5 LEETPESSFLIEGDIIRPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKYDEPSRQVILEA :: .::. ::...... .: :. .: CCDS34 TNGQPQRGACGRWRGRSRSRRAATSRPERVWP---DGVI--PFVIGGNFTGSQRAVFRQA 100 110 120 130 140 150 130 140 150 160 170 180 pF1KE5 LAEFERSTCIRFVTYQDQRDFISII--PMYGCFSSVGR-SGGMQVVSLAPTCLQKGRGIV . ..:. ::. :. :. ..: . : :: : ::: .:: :..:.. .: . ::: CCDS34 MRHWEKHTCVTFLERTDEDSYIVFTYRPC-GCCSYVGRRGGGPQAISIGKNCDK--FGIV 160 170 180 190 200 210 190 200 210 220 230 pF1KE5 LHELMHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSRSSNMLT---PYDYSSVMH .::: ::.:::::::: ::::.. . ..: :: : ::.: . ... . ::..:.:: CCDS34 VHELGHVVGFWHEHTRPDRDRHVSIVRENIQPGQEYNFLKMEPQEVESLGETYDFDSIMH 220 230 240 250 260 270 240 250 260 270 280 pF1KE5 YGRLAFSRRG--LPTITPLW-APSVH--IGQRWNLSASDITRVLQLYGCSPSG------- :.: .::: : : ::.: . . .:. :::: :: .::... .:: : : CCDS34 YARNTFSR-GIFLDTIVPKYEVNGVKPPIGQRTRLSKGDIAQARKLYKCPACGETLQDST 280 290 300 310 320 330 290 300 310 320 330 pF1KE5 -----PR-PRGRGSHAHSTGRSPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGP :. : : ..: : . : CCDS34 GNFSSPEYPNGYSAHMHCVWRISVTPGEKIILNFTSLDLYRSRLCWYDYVEVRDGFWRKA 340 350 360 370 380 390 >>CCDS7449.1 TLL2 gene_id:7093|Hs108|chr10 (1015 aa) initn: 342 init1: 219 opt: 449 Z-score: 310.4 bits: 67.8 E(32554): 5.7e-11 Smith-Waterman score: 450; 35.2% identity (62.5% similar) in 261 aa overlap (37-285:105-352) 10 20 30 40 50 60 pF1KE5 LWPWVLGLLSLPGVILGAPLASSCAGACGTSFPDGLTPE-GTQASGDKDIPAINQGLILE : :: . . ::. .: :: .. 
: CCDS74 FHIDKARDWTKQTVGATGHSTGGLEEQASESSPDTTAMDTGTKEAG-KDG---RENTTLL 80 90 100 110 120 130 70 80 90 100 110 120 pF1KE5 ETPESSFLIEGDIIRPSPFRLLSATSNK-WPMGGSGVVEVPFLLSSKYDEPSRQVILEAL ..: ... . . : : .. ... :: :: .:....... .: .. .:. CCDS74 HSP-GTLHAAAKTFSPRVRRATTSRTERIWP-GGV----IPYVIGGNFTGSQRAIFKQAM 140 150 160 170 180 130 140 150 160 170 180 pF1KE5 AEFERSTCIRFVTYQDQRDFISI-IPMYGCFSSVGR-SGGMQVVSLAPTCLQKGRGIVLH ..:. ::. :. :...:: . :: : ::: .:: :..:.. .: . ::: : CCDS74 RHWEKHTCVTFIERTDEESFIVFSYRTCGCCSYVGRRGGGPQAISIGKNCDK--FGIVAH 190 200 210 220 230 240 190 200 210 220 230 pF1KE5 ELMHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSRS---SNMLTPYDYSSVMHYG :: ::.:::::::: :::... . ..: :: : ::.: .. :.. ::..:.:::. CCDS74 ELGHVVGFWHEHTRPDRDQHVTIIRENIQPGQEYNFLKMEAGEVSSLGETYDFDSIMHYA 250 260 270 280 290 300 240 250 260 270 280 290 pF1KE5 RLAFSRRG--LPTITPLWAPS-VH--IGQRWNLSASDITRVLQLYGCSPSGPRPRGRGSH : .::: : : :: : . :. :::: :: .::... .:: : : CCDS74 RNTFSR-GVFLDTILPRQDDNGVRPTIGQRVRLSQGDIAQARKLYKCPACGETLQDTTGN 310 320 330 340 350 360 300 310 320 330 340 350 pF1KE5 AHSTGRSPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGESPHGWESPALKKL CCDS74 FSAPGFPNGYPSYSHCVWRISVTPGEKIVLNFTSMDLFKSRLCWYDYVEVRDGYWRKAPL 370 380 390 400 410 420 >>CCDS6026.1 BMP1 gene_id:649|Hs108|chr8 (986 aa) initn: 398 init1: 245 opt: 446 Z-score: 308.6 bits: 67.4 E(32554): 7.2e-11 Smith-Waterman score: 449; 38.1% identity (64.1% similar) in 231 aa overlap (94-300:130-351) 70 80 90 100 110 120 pF1KE5 LEETPESSFLIEGDIIRPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKYDEPSRQVILEA :: .::. ::...... .: :. .: CCDS60 TNGQPQRGACGRWRGRSRSRRAATSRPERVWP---DGVI--PFVIGGNFTGSQRAVFRQA 100 110 120 130 140 150 130 140 150 160 170 180 pF1KE5 LAEFERSTCIRFVTYQDQRDFISII--PMYGCFSSVGR-SGGMQVVSLAPTCLQKGRGIV . ..:. ::. :. :. ..: . : :: : ::: .:: :..:.. .: . ::: CCDS60 MRHWEKHTCVTFLERTDEDSYIVFTYRPC-GCCSYVGRRGGGPQAISIGKNCDK--FGIV 160 170 180 190 200 210 190 200 210 220 230 pF1KE5 LHELMHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIKSRSSNMLT---PYDYSSVMH .::: ::.:::::::: ::::.. . ..: :: : ::.: . ... 
. ::..:.:: CCDS60 VHELGHVVGFWHEHTRPDRDRHVSIVRENIQPGQEYNFLKMEPQEVESLGETYDFDSIMH 220 230 240 250 260 270 240 250 260 270 280 pF1KE5 YGRLAFSRRG--LPTITPLW-APSVH--IGQRWNLSASDITRVLQLYGCSPSG------- :.: .::: : : ::.: . . .:. :::: :: .::... .:: : : CCDS60 YARNTFSR-GIFLDTIVPKYEVNGVKPPIGQRTRLSKGDIAQARKLYKCPACGETLQDST 280 290 300 310 320 330 290 300 310 320 330 pF1KE5 -----PR-PRGRGSHAHSTGRSPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGP :. : : ..: : . : CCDS60 GNFSSPEYPNGYSAHMHCVWRISVTPGEKIILNFTSLDLYRSRLCWYDYVEVRDGFWRKA 340 350 360 370 380 390 >>CCDS56342.1 TLL1 gene_id:7092|Hs108|chr4 (392 aa) initn: 349 init1: 192 opt: 435 Z-score: 307.0 bits: 65.8 E(32554): 8.9e-11 Smith-Waterman score: 435; 36.4% identity (61.8% similar) in 228 aa overlap (83-299:146-364) 60 70 80 90 100 110 pF1KE5 KDIPAINQGLILEETPESSFLIEGDIIRPSPFRLLSATSNKWPMGGSGVVEVPFLLSSKY : : : :: :: .:....... CCDS56 RRIGFGLEQNNTVKGKVPLQFSGQNEKNRVPRAATSRTERIWP-GGV----IPYVIGGNF 120 130 140 150 160 170 120 130 140 150 160 pF1KE5 DEPSRQVILEALAEFERSTCIRFVTYQDQRDFISII--PMYGCFSSVGRSG-GMQVVSLA .: .. .:. ..:. ::. :. .:....: . : :: : ::: : : :..:.. CCDS56 TGSQRAMFKQAMRHWEKHTCVTFIERSDEESYIVFTYRPC-GCCSYVGRRGNGPQAISIG 180 190 200 210 220 170 180 190 200 210 220 pF1KE5 PTCLQKGRGIVLHELMHVLGFWHEHTRADRDRYIRVNWNEILPGFEINFIK---SRSSNM .: . :::.::: ::.:::::::: ::: .. . ..: :: : ::.: .. ... CCDS56 KNCDK--FGIVVHELGHVIGFWHEHTRPDRDNHVTIIRENIQPGQEYNFLKMEPGEVNSL 230 240 250 260 270 280 230 240 250 260 270 280 pF1KE5 LTPYDYSSVMHYGRLAFSRRG--LPTITPLWAPS---VHIGQRWNLSASDITRVLQLYGC ::..:.:::.: .::: : : :: : . :::: :: .::... .:: : CCDS56 GERYDFDSIMHYARNTFSR-GMFLDTILPSRDDNGIRPAIGQRTRLSKGDIAQARKLYRC 290 300 310 320 330 340 290 300 310 320 330 340 pF1KE5 SPSGPRPRGRGSHAHSTGRSPAPASLSLQRLLEALSAESRSPDPSGSSAGGQPVPAGPGE : . ... 
: : CCDS56 PACGETLQESNGNLSSPGFPNGYPSYTHCIWRVSVTPGEKVVFSLC 350 360 370 380 390

431 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 01:55:09 2016 done: Tue Nov 8 01:55:09 2016
 Total Scan time: 3.010 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]