FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1609, 114 aa 1>>>pF1KE1609 114 - 114 aa - 114 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7064+/-0.000527; mu= 13.7551+/- 0.032 mean_var=58.4110+/-11.342, 0's: 0 Z-trim(113.4): 6 B-trim: 0 in 0/52 Lambda= 0.167814 statistics sampled from 14005 (14011) to 14005 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.801), E-opt: 0.2 (0.43), width: 16 Scan time: 1.410 The best scores are: opt bits E(32554) CCDS14449.1 SH3BGRL gene_id:6451|Hs108|chrX ( 114) 747 187.8 1.2e-48 CCDS13666.1 SH3BGR gene_id:6450|Hs108|chr21 ( 239) 502 128.7 1.6e-30 CCDS4991.1 SH3BGRL2 gene_id:83699|Hs108|chr6 ( 107) 443 114.2 1.6e-26 CCDS82675.1 SH3BGR gene_id:6450|Hs108|chr21 ( 97) 279 74.5 1.4e-14 CCDS33560.1 SH3BGR gene_id:6450|Hs108|chr21 ( 128) 272 72.9 5.5e-14 >>CCDS14449.1 SH3BGRL gene_id:6451|Hs108|chrX (114 aa) initn: 747 init1: 747 opt: 747 Z-score: 986.8 bits: 187.8 E(32554): 1.2e-48 Smith-Waterman score: 747; 100.0% identity (100.0% similar) in 114 aa overlap (1-114:1-114) 10 20 30 40 50 60 pF1KE1 MVIRVYIASSSGSTAIKKKQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MVIRVYIASSSGSTAIKKKQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPA 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 TGYPLPPQIFNESQYRGDYDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQA :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TGYPLPPQIFNESQYRGDYDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQA 70 80 90 100 110 >>CCDS13666.1 SH3BGR gene_id:6450|Hs108|chr21 (239 aa) initn: 532 init1: 502 opt: 502 Z-score: 661.7 bits: 128.7 E(32554): 1.6e-30 Smith-Waterman score: 502; 65.4% identity (87.9% similar) in 107 aa overlap (1-107:64-170) 10 20 30 pF1KE1 MVIRVYIASSSGSTAIKKKQQDVLGFLEAN :::.:..:.:::: ::.::::.:.:::::: CCDS13 LACLCHCQDLSSGAFPDRGVLGGVLFPTVEMVIKVFVATSSGSIAIRKKQQEVVGFLEAN 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE1 KIGFEEKDIAANEENRKWMRENVPENSRPATGYPLPPQIFNESQYRGDYDAFFEARENNA :: :.: :::..:.::.::::::: ...: .: ::::::::: :: ::.:.:: :.:.: CCDS13 KIDFKELDIAGDEDNRRWMRENVPGEKKPQNGIPLPPQIFNEEQYCGDFDSFFSAKEENI 100 110 120 130 140 150 100 110 pF1KE1 VYAFLGLTAPPGSKEAEVQAKQQA .:.::::. :: :: .: CCDS13 IYSFLGLAPPPDSKGSEKAEEGGETEAQKEGSEDVGNLPEAQEKNEEEGETATEETEEIA 160 170 180 190 200 210 >>CCDS4991.1 SH3BGRL2 gene_id:83699|Hs108|chr6 (107 aa) initn: 442 init1: 442 opt: 443 Z-score: 589.4 bits: 114.2 E(32554): 1.6e-26 Smith-Waterman score: 443; 63.6% identity (86.0% similar) in 107 aa overlap (1-107:1-106) 10 20 30 40 50 60 pF1KE1 MVIRVYIASSSGSTAIKKKQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPA :::::.:::::: .:::::::::. ::::::: ::: ::. .::.:.:: .::: ...:. CCDS49 MVIRVFIASSSGFVAIKKKQQDVVRFLEANKIEFEEVDITMSEEQRQWMYKNVPPEKKPT 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 TGYPLPPQIFNESQYRGDYDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQA : :::::::: ..: ::::.:::..:.:.:..:::: : ...:: CCDS49 QGNPLPPQIFNGDRYCGDYDSFFESKESNTVFSFLGLK-PRLASKAEP 70 80 90 100 >>CCDS82675.1 SH3BGR gene_id:6450|Hs108|chr21 (97 aa) initn: 307 init1: 279 opt: 279 Z-score: 375.5 bits: 74.5 E(32554): 1.4e-14 Smith-Waterman score: 279; 64.4% identity (83.1% similar) in 59 aa overlap (49-107:1-59) 20 30 40 50 60 70 pF1KE1 KQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPATGYPLPPQIFNESQYRGD :::::: ...: .: ::::::::: :: :: CCDS82 MRENVPGEKKPQNGIPLPPQIFNEEQYCGD 10 20 30 80 90 100 110 pF1KE1 YDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQA .:.:: :.:.: .:.::::. :: ::: : CCDS82 FDSFFSAKEENIIYSFLGLAPPPDSKEEEGETATEETEEIAMEGAEGEAEEEEETAEGEE 40 50 60 70 80 90 >>CCDS33560.1 SH3BGR gene_id:6450|Hs108|chr21 (128 aa) initn: 285 init1: 272 opt: 272 Z-score: 364.6 bits: 72.9 E(32554): 5.5e-14 Smith-Waterman score: 272; 62.7% identity (83.1% similar) in 59 aa overlap (49-107:1-59) 20 30 40 50 60 70 pF1KE1 KQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPATGYPLPPQIFNESQYRGD :::::: ...: .: ::::::::: :: :: CCDS33 MRENVPGEKKPQNGIPLPPQIFNEEQYCGD 10 20 30 80 90 100 110 pF1KE1 YDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQA .:.:: :.:.: .:.::::. :: :: .: CCDS33 FDSFFSAKEENIIYSFLGLAPPPDSKGSEKAEEGGETEAQKEGSEDVGNLPEAQEKNEEE 40 50 60 70 80 90 114 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:45:57 2016 done: Sun Nov 6 12:45:58 2016 Total Scan time: 1.410 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]