FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6308, 393 aa 1>>>pF1KE6308 393 - 393 aa - 393 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6446+/-0.000967; mu= 15.1583+/- 0.058 mean_var=67.5546+/-13.236, 0's: 0 Z-trim(104.3): 20 B-trim: 2 in 1/49 Lambda= 0.156044 statistics sampled from 7843 (7847) to 7843 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.613), E-opt: 0.2 (0.241), width: 16 Scan time: 2.360 The best scores are: opt bits E(32554) CCDS9224.1 HPD gene_id:3242|Hs108|chr12 ( 393) 2625 600.1 1.2e-171 CCDS53839.1 HPD gene_id:3242|Hs108|chr12 ( 354) 2357 539.7 1.5e-153 CCDS519.1 HPDL gene_id:84842|Hs108|chr1 ( 371) 345 86.8 3.6e-17 >>CCDS9224.1 HPD gene_id:3242|Hs108|chr12 (393 aa) initn: 2625 init1: 2625 opt: 2625 Z-score: 3195.4 bits: 600.1 E(32554): 1.2e-171 Smith-Waterman score: 2625; 99.7% identity (100.0% similar) in 393 aa overlap (1-393:1-393) 10 20 30 40 50 60 pF1KE6 MTTYSDKGAKPERGRFLHFHSVTFWVGNAKQAASFYCSKMGFEPLAYRGLETGSREVVSH ::::::::::::::::::::::::::::::::.::::::::::::::::::::::::::: CCDS92 MTTYSDKGAKPERGRFLHFHSVTFWVGNAKQATSFYCSKMGFEPLAYRGLETGSREVVSH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VIKQGKIVFVLSSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMRE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 VIKQGKIVFVLSSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMRE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 PWVEQDKFGKVKFAVLQTYGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 PWVEQDKFGKVKFAVLQTYGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLEM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 IDHIVGNQPDQEMVSASEWYLKNLQFHRFWSVDDTQVHTEYSSLRSIVVANYEESIKMPI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 IDHIVGNQPDQEMVSASEWYLKNLQFHRFWSVDDTQVHTEYSSLRSIVVANYEESIKMPI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 NEPAPGKKKSQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLEFLSVPSTYYKQLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 NEPAPGKKKSQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLEFLSVPSTYYKQLR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 EKLKTAKIKVKENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTLFLEVIQRHNHQGFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 EKLKTAKIKVKENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTLFLEVIQRHNHQGFG 310 320 330 340 350 360 370 380 390 pF1KE6 AGNFNSLFKAFEEEQNLRGNLTNMETNGVVPGM ::::::::::::::::::::::::::::::::: CCDS92 AGNFNSLFKAFEEEQNLRGNLTNMETNGVVPGM 370 380 390 >>CCDS53839.1 HPD gene_id:3242|Hs108|chr12 (354 aa) initn: 2357 init1: 2357 opt: 2357 Z-score: 2870.0 bits: 539.7 E(32554): 1.5e-153 Smith-Waterman score: 2357; 100.0% identity (100.0% similar) in 354 aa overlap (40-393:1-354) 10 20 30 40 50 60 pF1KE6 KPERGRFLHFHSVTFWVGNAKQAASFYCSKMGFEPLAYRGLETGSREVVSHVIKQGKIVF :::::::::::::::::::::::::::::: CCDS53 MGFEPLAYRGLETGSREVVSHVIKQGKIVF 10 20 30 70 80 90 100 110 120 pF1KE6 VLSSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMREPWVEQDKFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 VLSSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMREPWVEQDKFG 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 KVKFAVLQTYGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLEMIDHIVGNQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 KVKFAVLQTYGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLEMIDHIVGNQP 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 DQEMVSASEWYLKNLQFHRFWSVDDTQVHTEYSSLRSIVVANYEESIKMPINEPAPGKKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 DQEMVSASEWYLKNLQFHRFWSVDDTQVHTEYSSLRSIVVANYEESIKMPINEPAPGKKK 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE6 SQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLEFLSVPSTYYKQLREKLKTAKIK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 SQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLEFLSVPSTYYKQLREKLKTAKIK 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE6 VKENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTLFLEVIQRHNHQGFGAGNFNSLFK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 VKENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTLFLEVIQRHNHQGFGAGNFNSLFK 280 290 300 310 320 330 370 380 390 pF1KE6 AFEEEQNLRGNLTNMETNGVVPGM :::::::::::::::::::::::: CCDS53 AFEEEQNLRGNLTNMETNGVVPGM 340 350 >>CCDS519.1 HPDL gene_id:84842|Hs108|chr1 (371 aa) initn: 312 init1: 158 opt: 345 Z-score: 421.8 bits: 86.8 E(32554): 3.6e-17 Smith-Waterman score: 369; 26.1% identity (55.1% similar) in 372 aa overlap (17-374:6-364) 10 20 30 40 50 60 pF1KE6 MTTYSDKGAKPERGRFLHFHSVTFWVGNAKQAASFYCSKMGFEPLAYRGLETGSREVVSH :.. ..: : .. : .::.::: : .. : :.. CCDS51 MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVD-GWRQL--- 10 20 30 40 70 80 90 100 110 120 pF1KE6 VIKQGKIVFVLSSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCDYIVQKARERGAKIMRE ....: ::... . . . .: . .. .. :.: : ... : .. CCDS51 ALRSGDAVFLVNEGAGSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVP 50 60 70 80 90 100 130 140 150 160 170 pF1KE6 PWVEQDKFGKVKFAVLQT-YGDTTHTLVEKMNYIGQFLPGYEAPAFMDPLLPKLPKCSLE : .: : . .::... : . ::.:. .: : ::::.. :. : : . CCDS51 PVRVRDAQGAATYAVVSSPAGILSLTLLERAGYRGPFLPGFR-PVSSAPG-PGW----VS 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 MIDHIVGNQPDQEMVSASEWYLKNLQF-HRFWSV-DDTQVHTEYSS------LRSIVVAN .::.. . .:. : : : : .: .. :... :: .. CCDS51 RVDHLTLACTPGSSPTLLRWFHDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQA 160 170 180 190 200 210 240 250 260 270 280 pF1KE6 YEESI--KMPINEPAPGK--KKSQIQEYVDYNGGAGVQHIALKTEDIITAIRHLRERGLE :: . . : :: ...:..... . : :.::..: : .:. : . . : . CCDS51 QPGSIVPTLVLAESLPGATTRQDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQ 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE6 FLSVPSTYYKQLREKLKTAKIKVK-ENIDALEELKILVDYDEKGYLLQIFTKPVQDRPTL ::. :..::.: : .:.. .. : . ::.: :. .:::.::: . . :. CCDS51 FLAPPGAYYQQ---PGKERQIRAAGHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTF 280 290 300 310 320 330 350 360 370 380 390 pF1KE6 FLEVIQRHNHQGFGAGNFNSLFKAFEEEQNLRGNLTNMETNGVVPGM :::.:::.. ::: ::. .:... .:. CCDS51 FLELIQRQGATGFGQGNIRALWQSVQEQSARSQEA 340 350 360 370 393 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:57:15 2016 done: Tue Nov 8 11:57:16 2016 Total Scan time: 2.360 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]