FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7642, 387 aa 1>>>pF1KB7642 387 - 387 aa - 387 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.6453+/-0.000993; mu= -8.4109+/- 0.061 mean_var=442.7729+/-88.611, 0's: 0 Z-trim(117.3): 119 B-trim: 0 in 0/55 Lambda= 0.060951 statistics sampled from 17935 (18055) to 17935 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.82), E-opt: 0.2 (0.555), width: 16 Scan time: 3.370 The best scores are: opt bits E(32554) CCDS730.1 BARHL2 gene_id:343472|Hs108|chr1 ( 387) 2639 245.6 5.8e-65 CCDS6950.1 BARHL1 gene_id:56751|Hs108|chr9 ( 327) 1003 101.6 1e-21 >>CCDS730.1 BARHL2 gene_id:343472|Hs108|chr1 (387 aa) initn: 2639 init1: 2639 opt: 2639 Z-score: 1279.8 bits: 245.6 E(32554): 5.8e-65 Smith-Waterman score: 2639; 100.0% identity (100.0% similar) in 387 aa overlap (1-387:1-387) 10 20 30 40 50 60 pF1KB7 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQESNAVHE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 SFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 SFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 DHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 DHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAE 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 AGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 AGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRV 310 320 330 340 350 360 370 380 pF1KB7 LIHGLGPGGQPALNPLSSPIPGTPHPR ::::::::::::::::::::::::::: CCDS73 LIHGLGPGGQPALNPLSSPIPGTPHPR 370 380 >>CCDS6950.1 BARHL1 gene_id:56751|Hs108|chr9 (327 aa) initn: 1075 init1: 723 opt: 1003 Z-score: 503.2 bits: 101.6 E(32554): 1e-21 Smith-Waterman score: 1075; 53.0% identity (68.4% similar) in 389 aa overlap (3-387:1-327) 10 20 30 40 50 60 pF1KB7 MTMEGASGSSFGIDTILSSASSGSPGMMNGDFRPLGEARTADFRSQATPSPCSEIDTVGT :::..: ::::.::: .:::.. .:: :: .: :: :: :: .. . CCDS69 MEGSNG--FGIDSILSH-RAGSPALPKGD--PL----LGDCRSPLELSPRSESSSDCS 10 20 30 40 70 80 90 100 110 120 pF1KB7 APSSPISVTMEPPEPHLVADATQHHHHLHHSQQPPPPAAAPTQSLQPLPQQQQPLPPQQP .:.:: .: :. : :..: : .. :: CCDS69 SPASPGRDCLETGTPR------------------PGGASGPG-----LDSHLQP------ 50 60 70 80 130 140 150 160 170 pF1KB7 PPPPPQQLGSAASAPRTSTSSFLIKDILGDSKPLAACAPYSTS--VSSPHHTPKQESNAV :: :: . :: ::::::.:::.: ::::::::::.: ..:. . ..:. CCDS69 -----GQL-SAPAQSRTVTSSFLIRDILADCKPLAACAPYSSSGQPAAPEPGGRLAAKAA 90 100 110 120 130 180 190 200 210 220 230 pF1KB7 HESFRPKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTA :.:: ::.. :... :.:. : .::::::::.:::.::::: ::::::::: CCDS69 -EDFRDKLDKSGSNAS------SDSEYK---VKEEGDREISSSRDSPPVRLKKPRKARTA 140 150 160 170 180 240 250 260 270 280 290 pF1KB7 FSDHQLNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELL :.:::: ::::::::::::::::::.:::.:::::::::::::::::::::::::::::: CCDS69 FTDHQLAQLERSFERQKYLSVQDRMELAASLNLTDTQVKTWYQNRRTKWKRQTAVGLELL 190 200 210 220 230 240 300 310 320 330 340 350 pF1KB7 AEAGNYSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVP ::::::::::::::::::: ::....: .::.: .:: : :: : ::::::: CCDS69 AEAGNYSALQRMFPSPYFYPQSLVSNLD------PGAALY--LYRGPSAPPPALQRPLVP 250 260 270 280 290 360 370 380 pF1KB7 RVLIHGLGPGGQPA--LNPLSSPIPGTPHPR :.::::: ...: : ::.. .: . .:: CCDS69 RILIHGLQGASEPPPPLPPLAGVLPRAAQPR 300 310 320 387 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 03:10:57 2016 done: Sun Nov 6 03:10:57 2016 Total Scan time: 3.370 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]