FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6554, 152 aa
  1>>>pF1KE6554 152 - 152 aa - 152 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4864+/-0.000593; mu= 12.2049+/- 0.036
 mean_var=64.9948+/-13.126, 0's: 0 Z-trim(112.7): 7 B-trim: 109 in 1/49
 Lambda= 0.159087
 statistics sampled from 13398 (13405) to 13398 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.765), E-opt: 0.2 (0.412), width: 16
 Scan time: 1.560

The best scores are:                                       opt  bits E(32554)
CCDS6703.1 NINJ1 gene_id:4814|Hs108|chr9           ( 152)  993 235.4   1e-62
CCDS73418.1 NINJ2 gene_id:4815|Hs108|chr12         ( 135)  431 106.4 6.3e-24
CCDS8505.1 NINJ2 gene_id:4815|Hs108|chr12          ( 188)  431 106.4 8.4e-24
CCDS76499.1 NINJ2 gene_id:4815|Hs108|chr12         ( 106)  333  83.8   3e-17

>>CCDS6703.1 NINJ1 gene_id:4814|Hs108|chr9 (152 aa)
 initn: 993 init1: 993 opt: 993 Z-score: 1239.3 bits: 235.4 E(32554): 1e-62
Smith-Waterman score: 993; 100.0% identity (100.0% similar) in 152 aa overlap (1-152:1-152)

               10        20        30        40        50        60
pF1KE6 MDSGTEEYELNGGLPPGTPGSPDASPARWGWRHGPINVNHYASKKSAAESMLDIALLMAN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 MDSGTEEYELNGGLPPGTPGSPDASPARWGWRHGPINVNHYASKKSAAESMLDIALLMAN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 ASQLKAVVEQGPSFAFYVPLVVLISISLVLQIGVGVLLIFLVKYDLNNPAKHAKLDFLNN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS67 ASQLKAVVEQGPSFAFYVPLVVLISISLVLQIGVGVLLIFLVKYDLNNPAKHAKLDFLNN
               70        80        90       100       110       120

              130       140       150
pF1KE6 LATGLVFIIVVVNIFITAFGVQKPLMDMAPQQ
       ::::::::::::::::::::::::::::::::
CCDS67 LATGLVFIIVVVNIFITAFGVQKPLMDMAPQQ
              130       140       150

>>CCDS73418.1 NINJ2 gene_id:4815|Hs108|chr12 (135 aa)
 initn: 455 init1: 421 opt: 431 Z-score: 543.0 bits: 106.4 E(32554): 6.3e-24
Smith-Waterman score: 431; 57.6% identity (80.8% similar) in 125 aa overlap (19-143:5-122)

               10        20        30        40        50        60
pF1KE6 MDSGTEEYELNGGLPPGTPGSPDASPARWGWRHGPINVNHYASKKSAAESMLDIALLMAN
                         ::: :  :     :  :::.::::.:::.::::::.::.:.:
CCDS73               MMYMPGSSD--P-----RSQPINLNHYATKKSVAESMLDVALFMSN
                               10             20        30

               70        80        90       100       110       120
pF1KE6 ASQLKAVVEQGPSFAFYVPLVVLISISLVLQIGVGVLLIFLVKYDLNNPAKHAKLDFLNN
       : .::::.:::::  .:. ::.:::.::.::. .::::. ... .::.  :. .:. :::
CCDS73 AMRLKAVLEQGPSSHYYTTLVTLISLSLLLQVVIGVLLVVIARLNLNEVEKQWRLNQLNN
      40        50        60        70        80        90

              130       140       150
pF1KE6 LATGLVFIIVVVNIFITAFGVQKPLMDMAPQQ
        :: :::. ::.:.::::::..:
CCDS73 AATILVFFTVVINVFITAFGAHKTGFLAARASRNPL
     100       110       120       130

>>CCDS8505.1 NINJ2 gene_id:4815|Hs108|chr12 (188 aa)
 initn: 455 init1: 421 opt: 431 Z-score: 540.8 bits: 106.4 E(32554): 8.4e-24
Smith-Waterman score: 432; 53.1% identity (76.9% similar) in 143 aa overlap (1-143:47-175)

                                             10        20        30
pF1KE6                               MDSGTEEYELNGGLPPGTPGSPDASPARWG
                                     :.:. :. .:.       ::: :  :
CCDS85 AAETQTAEPGGAHAVCSRHPVRVKGLEGSEMESARENIDLQ-------PGSSD--P----
         20        30        40        50               60

               40        50        60        70        80        90
pF1KE6 WRHGPINVNHYASKKSAAESMLDIALLMANASQLKAVVEQGPSFAFYVPLVVLISISLVL
        :  :::.::::.:::.::::::.::.:.:: .::::.:::::  .:. ::.:::.::.:
CCDS85 -RSQPINLNHYATKKSVAESMLDVALFMSNAMRLKAVLEQGPSSHYYTTLVTLISLSLLL
             70        80        90       100       110       120

              100       110       120       130       140       150
pF1KE6 QIGVGVLLIFLVKYDLNNPAKHAKLDFLNNLATGLVFIIVVVNIFITAFGVQKPLMDMAP
       :. .::::. ... .::.  :. .:. ::: :: :::. ::.:.::::::..:
CCDS85 QVVIGVLLVVIARLNLNEVEKQWRLNQLNNAATILVFFTVVINVFITAFGAHKTGFLAAR
            130       140       150       160       170       180

pF1KE6 QQ

CCDS85 ASRNPL

>>CCDS76499.1 NINJ2 gene_id:4815|Hs108|chr12 (106 aa)
 initn: 351 init1: 333 opt: 333 Z-score: 423.1 bits: 83.8 E(32554): 3e-17
Smith-Waterman score: 333; 57.0% identity (84.9% similar) in 93 aa overlap (51-143:1-93)

               30        40        50        60        70        80
pF1KE6 SPDASPARWGWRHGPINVNHYASKKSAAESMLDIALLMANASQLKAVVEQGPSFAFYVPL
                                     :::.::.:.:: .::::.:::::  .:. :
CCDS76                               MLDVALFMSNAMRLKAVLEQGPSSHYYTTL
                                             10        20        30

               90       100       110       120       130       140
pF1KE6 VVLISISLVLQIGVGVLLIFLVKYDLNNPAKHAKLDFLNNLATGLVFIIVVVNIFITAFG
       :.:::.::.::. .::::. ... .::.  :. .:. ::: :: :::. ::.:.::::::
CCDS76 VTLISLSLLLQVVIGVLLVVIARLNLNEVEKQWRLNQLNNAATILVFFTVVINVFITAFG
               40        50        60        70        80        90

              150
pF1KE6 VQKPLMDMAPQQ
       ..:
CCDS76 AHKTGFLAARASRNPL
              100

152 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 14:22:01 2016 done: Tue Nov 8 14:22:01 2016
 Total Scan time: 1.560 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]