FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5586, 371 aa 1>>>pF1KE5586 371 - 371 aa - 371 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1604+/-0.000734; mu= 13.1116+/- 0.044 mean_var=90.7605+/-18.495, 0's: 0 Z-trim(111.3): 7 B-trim: 99 in 1/51 Lambda= 0.134625 statistics sampled from 12278 (12281) to 12278 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.737), E-opt: 0.2 (0.377), width: 16 Scan time: 3.190 The best scores are: opt bits E(32554) CCDS519.1 HPDL gene_id:84842|Hs108|chr1 ( 371) 2516 498.3 4.3e-141 CCDS53839.1 HPD gene_id:3242|Hs108|chr12 ( 354) 341 75.9 6.1e-14 CCDS9224.1 HPD gene_id:3242|Hs108|chr12 ( 393) 341 75.9 6.6e-14 >>CCDS519.1 HPDL gene_id:84842|Hs108|chr1 (371 aa) initn: 2516 init1: 2516 opt: 2516 Z-score: 2646.5 bits: 498.3 E(32554): 4.3e-141 Smith-Waterman score: 2516; 100.0% identity (100.0% similar) in 371 aa overlap (1-371:1-371) 10 20 30 40 50 60 pF1KE5 MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVDGWRQLALRSGDAVFLVNEGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS51 MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVDGWRQLALRSGDAVFLVNEGA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 GSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGAATYAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS51 GSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGAATYAV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 VSSPAGILSLTLLERAGYRGPFLPGFRPVSSAPGPGWVSRVDHLTLACTPGSSPTLLRWF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS51 VSSPAGILSLTLLERAGYRGPFLPGFRPVSSAPGPGWVSRVDHLTLACTPGSSPTLLRWF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 HDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLVLAESLPGATTR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS51 HDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLVLAESLPGATTR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 QDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQQPGKERQIRAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS51 QDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQQPGKERQIRAA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 GHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGATGFGQGNIRALWQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS51 GHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGATGFGQGNIRALWQS 310 320 330 340 350 360 370 pF1KE5 VQEQSARSQEA ::::::::::: CCDS51 VQEQSARSQEA 370 >>CCDS53839.1 HPD gene_id:3242|Hs108|chr12 (354 aa) initn: 312 init1: 158 opt: 341 Z-score: 363.8 bits: 75.9 E(32554): 6.1e-14 Smith-Waterman score: 365; 26.4% identity (55.7% similar) in 348 aa overlap (30-364:2-335) 10 20 30 40 50 pF1KE5 MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVD-GWRQLA---LRSGDAVFLV ::.::: : .. : :... ...: ::.. CCDS53 MGFEPLAYRGLETGSREVVSHVIKQGKIVFVL 10 20 30 60 70 80 90 100 110 pF1KE5 NEGAGSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGAA . . . . .: . .. .. :.: : ... : .. : .: : . CCDS53 SSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMREPWVEQDKFGKV 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE5 TYAVVSSPAGILSLTLLERAGYRGPFLPGFR-PVSSAPGPGWVSR-----VDHLTLACTP .::... : . ::.:. .: : ::::.. :. : . . .::.. CCDS53 KFAVLQT-YGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLEMIDHIVGNQPD 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE5 GSSPTLLRWFHDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLVL . .:. : : : : .: .. :... :: .. :: . . CCDS53 QEMVSASEWYLKNLQF-HRFWSV-DDTQVHTEYSS------LRSIVVANYEESI--KMPI 160 170 180 190 200 240 250 260 270 280 290 pF1KE5 AESLPGATTRQDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQQ : :: ...:..... . : :.::..: : .:. : . . : .::. :..::.: CCDS53 NEPAPG--KKKSQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLEFLSVPSTYYKQ 210 220 230 240 250 300 310 320 330 340 pF1KE5 ---PGKERQIRAAGHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGAT : .:.. .. : . ::.: :. .:::.::: . . :.:::.:::.. CCDS53 LREKLKTAKIKVK-ENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTLFLEVIQRHNHQ 260 270 280 290 300 310 350 360 370 pF1KE5 GFGQGNIRALWQSVQEQSARSQEA ::: ::. .:... .:. CCDS53 GFGAGNFNSLFKAFEEEQNLRGNLTNMETNGVVPGM 320 330 340 350 >>CCDS9224.1 HPD gene_id:3242|Hs108|chr12 (393 aa) initn: 312 init1: 158 opt: 341 Z-score: 363.1 bits: 75.9 E(32554): 6.6e-14 Smith-Waterman score: 365; 26.4% identity (55.7% similar) in 348 aa overlap (30-364:41-374) 10 20 30 40 50 pF1KE5 MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVD-GWRQLA---LRSGDAVFL ::.::: : .. : :... ...: ::. CCDS92 PERGRFLHFHSVTFWVGNAKQATSFYCSKMGFEPLAYRGLETGSREVVSHVIKQGKIVFV 20 30 40 50 60 70 60 70 80 90 100 110 pF1KE5 VNEGAGSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGA .. . . . .: . .. .. :.: : ... : .. : .: : CCDS92 LSSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMREPWVEQDKFGK 80 90 100 110 120 130 120 130 140 150 160 pF1KE5 ATYAVVSSPAGILSLTLLERAGYRGPFLPGFR-PVSSAPGPGWVSR-----VDHLTLACT . .::... : . ::.:. .: : ::::.. :. : . . .::.. CCDS92 VKFAVLQT-YGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLEMIDHIVGNQP 140 150 160 170 180 170 180 190 200 210 220 pF1KE5 PGSSPTLLRWFHDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLV . .:. : : : : .: .. :... :: .. :: . CCDS92 DQEMVSASEWYLKNLQF-HRFWSV-DDTQVHTEYSS------LRSIVVANYEESI--KMP 190 200 210 220 230 230 240 250 260 270 280 pF1KE5 LAESLPGATTRQDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQ . : :: ...:..... . : :.::..: : .:. : . . : .::. :..::. CCDS92 INEPAPG--KKKSQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLEFLSVPSTYYK 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE5 Q---PGKERQIRAAGHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGA : : .:.. .. : . ::.: :. .:::.::: . . :.:::.:::.. CCDS92 QLREKLKTAKIKVK-ENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTLFLEVIQRHNH 300 310 320 330 340 350 350 360 370 pF1KE5 TGFGQGNIRALWQSVQEQSARSQEA ::: ::. .:... .:. CCDS92 QGFGAGNFNSLFKAFEEEQNLRGNLTNMETNGVVPGM 360 370 380 390 371 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 02:04:27 2016 done: Tue Nov 8 02:04:27 2016 Total Scan time: 3.190 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]