FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1721, 210 aa
  1>>>pF1KE1721 210 - 210 aa - 210 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.8354+/-0.000838; mu= -1.8932+/- 0.051
 mean_var=165.0402+/-32.937, 0's: 0 Z-trim(113.2): 13 B-trim: 96 in 1/50
 Lambda= 0.099834
 statistics sampled from 13880 (13888) to 13880 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.767), E-opt: 0.2 (0.427), width: 16
 Scan time: 2.330

The best scores are:                                 opt  bits E(32554)
CCDS53841.1 BCL7A gene_id:605|Hs108|chr12   ( 210)  1375 209.0 1.7e-54
CCDS9226.1 BCL7A gene_id:605|Hs108|chr12    ( 231)  1233 188.6 2.7e-48
CCDS5550.1 BCL7B gene_id:9275|Hs108|chr7    ( 202)   497  82.5 2e-16
CCDS10693.1 BCL7C gene_id:9274|Hs108|chr16  ( 217)   418  71.2 5.6e-13
CCDS67012.1 BCL7C gene_id:9274|Hs108|chr16  ( 242)   413  70.5 1e-12
CCDS56489.1 BCL7B gene_id:9275|Hs108|chr7   ( 145)   386  66.5 9.6e-12

>>CCDS53841.1 BCL7A gene_id:605|Hs108|chr12 (210 aa)
 initn: 1375 init1: 1375 opt: 1375 Z-score: 1091.6 bits: 209.0 E(32554): 1.7e-54
Smith-Waterman score: 1375; 100.0% identity (100.0% similar) in 210 aa overlap (1-210:1-210)

       10 20 30 40 50 60
pF1KE1 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE1 KNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSPAPEPN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 KNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSPAPEPN
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KE1 SAVPSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEHSMNSSEKVDRQPSGDSGLAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 SAVPSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEHSMNSSEKVDRQPSGDSGLAA
       130 140 150 160 170 180

       190 200 210
pF1KE1 ETSAISQDLEGVPPSKKMKLEASQQNSEEM
       ::::::::::::::::::::::::::::::
CCDS53 ETSAISQDLEGVPPSKKMKLEASQQNSEEM
       190 200 210

>>CCDS9226.1 BCL7A gene_id:605|Hs108|chr12 (231 aa)
 initn: 1227 init1: 1227 opt: 1233 Z-score: 980.5 bits: 188.6 E(32554): 2.7e-48
Smith-Waterman score: 1323; 90.9% identity (90.9% similar) in 231 aa overlap (1-210:1-231)

       10 20 30 40 50 60
pF1KE1 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE1 KNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSPAPEPN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 KNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSPAPEPN
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KE1 SAVPSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEHSMNSSEKVDRQPSGDSGLAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS92 SAVPSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEHSMNSSEKVDRQPSGDSGLAA
       130 140 150 160 170 180

       190 200 210
pF1KE1 ETSAISQ---------------------DLEGVPPSKKMKLEASQQNSEEM
       ::::::: :::::::::::::::::::::::
CCDS92 ETSAISQVPRSRSQRGSQIGREPIGLSGDLEGVPPSKKMKLEASQQNSEEM
       190 200 210 220 230

>>CCDS5550.1 BCL7B gene_id:9275|Hs108|chr7 (202 aa)
 initn: 483 init1: 375 opt: 497 Z-score: 408.5 bits: 82.5 E(32554): 2e-16
Smith-Waterman score: 508; 45.3% identity (70.6% similar) in 201 aa overlap (1-197:1-187)

       10 20 30 40 50 60
pF1KE1 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       :::::::::::::::::::.:::::::::::::::::::::::::.::::::. : .:.
CCDS55 MSGRSVRAETRSRAKDDIKKVMAAIEKVRKWEKKWVTVGDTSLRIFKWVPVTDSKEKEKS
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE1 KNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSPAPEPN
       :..... : : . ::: .....:.::::::..:. .: ..:.::::.:. .
CCDS55 KSNSSAAREPNGFPSDASANSSL--LLEFQDENSNQSSVSDVYQLKVDSSTNSSPSPQQS
       70 80 90 100 110

       130 140 150 160 170
pF1KE1 SAV-PSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEHSMNSSEKVDRQPS--GDSG
       .. :. .. ..:..: : . : : :. ::: .:. :. .
CCDS55 ESLSPAHTSDFRTDDSQP-----PTLGQEILE-------EPSLPSSEVADEPPTLTKEEP
       120 130 140 150 160

       180 190 200 210
pF1KE1 LAAETSAISQDLE-GVPPSKKMKLEASQQNSEEM
       . ::... .. . :.:: :.
CCDS55 VPLETQVVEEEEDSGAPPLKRFCVDQPTVPQTASES
       170 180 190 200

>>CCDS10693.1 BCL7C gene_id:9274|Hs108|chr16 (217 aa)
 initn: 408 init1: 339 opt: 418 Z-score: 346.5 bits: 71.2 E(32554): 5.6e-13
Smith-Waterman score: 429; 37.6% identity (69.0% similar) in 213 aa overlap (1-198:1-210)

       10 20 30 40 50 60
pF1KE1 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       :.::.::::::::::::::.:::.:::::.:::.:::::::::::.:::::..:. ...
CCDS10 MAGRTVRAETRSRAKDDIKKVMATIEKVRRWEKRWVTVGDTSLRIFKWVPVVDPQEEERR
       10 20 30 40 50 60

       70 80 90 100 110
pF1KE1 K-----NKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSP
       . ....:.... .. ..:.... ..:..:.::::: ...: ... . . .
CCDS10 RAGGGAERSRGRERR--GRGASPRGGGPLILLDLNDENSNQSFHSEGS-LQKGTEPSPGG
       70 80 90 100 110

       120 130 140 150 160
pF1KE1 APEPNSAV-PSDGTEAKVDEAQAD--GKEH-PGAEDA--SDEQNSQSSMEH--SMNSSEK
       .:.:. : :. :. .::: :.:. ::. : .:: .. : . .:
CCDS10 TPQPSRPVSPAGPPEGVPEEAQPPRLGQERDPGGITAGSTDEPPMLTKEEPVPELLEAEA
       120 130 140 150 160 170

       170 180 190 200 210
pF1KE1 VDRQPSGDSGLAAETSAI--SQDLEGVPPSKKMKLEASQQNSEEM
       . : . . .: ..: ::.:: :..
CCDS10 PEAYPVFEPVPPVPEAAQGDTEDSEGAPPLKRICPNAPDP
       180 190 200 210

>>CCDS67012.1 BCL7C gene_id:9274|Hs108|chr16 (242 aa)
 initn: 376 init1: 339 opt: 413 Z-score: 341.8 bits: 70.5 E(32554): 1e-12
Smith-Waterman score: 418; 38.4% identity (71.7% similar) in 198 aa overlap (1-181:1-195)

       10 20 30 40 50 60
pF1KE1 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       :.::.::::::::::::::.:::.:::::.:::.:::::::::::.:::::..:. ...
CCDS67 MAGRTVRAETRSRAKDDIKKVMATIEKVRRWEKRWVTVGDTSLRIFKWVPVVDPQEEERR
       10 20 30 40 50 60

       70 80 90 100 110
pF1KE1 K-----NKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSP
       . ....:.... .. ..:.... ..:..:.::::: ...: ... . . .
CCDS67 RAGGGAERSRGRERR--GRGASPRGGGPLILLDLNDENSNQSFHSEGS-LQKGTEPSPGG
       70 80 90 100 110

       120 130 140 150 160
pF1KE1 APEPNSAV-PSDGTEAKVDEAQAD--GKEH-PGAEDA--SDEQNSQSSME------HSMN
       .:.:. : :. :. .::: :.:. ::. : .:: .. : .. .
CCDS67 TPQPSRPVSPAGPPEGVPEEAQPPRLGQERDPGGITAGSTDEPPMLTKEEPVPELLEAED
       120 130 140 150 160 170

       170 180 190 200 210
pF1KE1 SSEKVDRQPSGDSGLAAETSAISQDLEGVPPSKKMKLEASQQNSEEM
       :. .. :. ..:: .:
CCDS67 SGVRMTRRALHEKGLKTEPLRRLLPRRGLRTNVRPSSMAVPDTRAPGGGSKAPRAPRTIP
       180 190 200 210 220 230

>>CCDS56489.1 BCL7B gene_id:9275|Hs108|chr7 (145 aa)
 initn: 388 init1: 368 opt: 386 Z-score: 324.4 bits: 66.5 E(32554): 9.6e-12
Smith-Waterman score: 386; 59.6% identity (78.8% similar) in 104 aa overlap (1-104:1-102)

       10 20 30 40 50 60
pF1KE1 MSGRSVRAETRSRAKDDIKRVMAAIEKVRKWEKKWVTVGDTSLRIYKWVPVTEPKVDDKN
       :::::::::::::::::::.:::::::::::::::::::::::::.::::::. : .:.
CCDS56 MSGRSVRAETRSRAKDDIKKVMAAIEKVRKWEKKWVTVGDTSLRIFKWVPVTDSKEKEKS
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE1 KNKKKGKDEKCGSEVTTPENSSSPGMMDMHDDNSNQSSIADASPIKQENSSNSSPAPEPN
       :..... : : . ::: ...... . .: .:: :
CCDS56 KSNSSAAREPNGFPSDASANSSL--LLEFQEPSLPSSEVADEPPTLTKEEPVPLETQVVE
       70 80 90 100 110

       130 140 150 160 170 180
pF1KE1 SAVPSDGTEAKVDEAQADGKEHPGAEDASDEQNSQSSMEHSMNSSEKVDRQPSGDSGLAA

CCDS56 EEEDSGAPPLKRFCVDQPTVPQTASES
       120 130 140

210 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 21:33:43 2016 done: Sun Nov 6 21:33:43 2016
 Total Scan time: 2.330 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]