FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6570, 117 aa 1>>>pF1KE6570 117 - 117 aa - 117 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0406+/-0.000566; mu= 8.1546+/- 0.034 mean_var=85.4292+/-16.675, 0's: 0 Z-trim(114.8): 11 B-trim: 69 in 1/52 Lambda= 0.138762 statistics sampled from 15388 (15399) to 15388 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.827), E-opt: 0.2 (0.473), width: 16 Scan time: 1.270 The best scores are: opt bits E(32554) CCDS963.1 ENSA gene_id:2029|Hs108|chr1 ( 117) 763 161.0 1.5e-40 CCDS958.1 ENSA gene_id:2029|Hs108|chr1 ( 121) 763 161.0 1.6e-40 CCDS964.1 ENSA gene_id:2029|Hs108|chr1 ( 113) 643 137.0 2.5e-33 CCDS961.1 ENSA gene_id:2029|Hs108|chr1 ( 117) 643 137.0 2.6e-33 CCDS32242.1 ARPP19 gene_id:10776|Hs108|chr15 ( 112) 554 119.2 5.8e-28 CCDS81883.1 ARPP19 gene_id:10776|Hs108|chr15 ( 96) 536 115.5 6.1e-27 CCDS76757.1 ARPP19 gene_id:10776|Hs108|chr15 ( 131) 537 115.8 6.9e-27 CCDS965.1 ENSA gene_id:2029|Hs108|chr1 ( 105) 389 86.1 4.8e-18 CCDS962.1 ENSA gene_id:2029|Hs108|chr1 ( 133) 389 86.2 5.8e-18 CCDS959.1 ENSA gene_id:2029|Hs108|chr1 ( 137) 389 86.2 6e-18 CCDS960.1 ENSA gene_id:2029|Hs108|chr1 ( 133) 374 83.2 4.7e-17 >>CCDS963.1 ENSA gene_id:2029|Hs108|chr1 (117 aa) initn: 763 init1: 763 opt: 763 Z-score: 841.4 bits: 161.0 E(32554): 1.5e-40 Smith-Waterman score: 763; 99.1% identity (99.1% similar) in 117 aa overlap (1-117:1-117) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: CCDS96 MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG 70 80 90 100 110 >>CCDS958.1 ENSA gene_id:2029|Hs108|chr1 (121 aa) initn: 763 init1: 763 opt: 763 Z-score: 841.2 bits: 161.0 E(32554): 1.6e-40 Smith-Waterman score: 763; 99.1% identity (99.1% similar) in 117 aa overlap (1-117:1-117) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: CCDS95 MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS95 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAGGQV 70 80 90 100 110 120 CCDS95 E >>CCDS964.1 ENSA gene_id:2029|Hs108|chr1 (113 aa) initn: 643 init1: 643 opt: 643 Z-score: 711.8 bits: 137.0 E(32554): 2.5e-33 Smith-Waterman score: 643; 98.0% identity (99.0% similar) in 99 aa overlap (19-117:15-113) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK .:: :::::::::::::::::::::::::::::::::::::: CCDS96 MAGGLGCDVCYWFVEDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG 60 70 80 90 100 110 >>CCDS961.1 ENSA gene_id:2029|Hs108|chr1 (117 aa) initn: 643 init1: 643 opt: 643 Z-score: 711.6 bits: 137.0 E(32554): 2.6e-33 Smith-Waterman score: 643; 98.0% identity (99.0% similar) in 99 aa overlap (19-117:15-113) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK .:: :::::::::::::::::::::::::::::::::::::: CCDS96 MAGGLGCDVCYWFVEDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS96 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAGGQV 60 70 80 90 100 110 CCDS96 E >>CCDS32242.1 ARPP19 gene_id:10776|Hs108|chr15 (112 aa) initn: 552 init1: 535 opt: 554 Z-score: 615.6 bits: 119.2 E(32554): 5.8e-28 Smith-Waterman score: 554; 76.4% identity (88.2% similar) in 110 aa overlap (8-117:4-112) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK : : ..::... .: . ::.::::::::.:: ::::::::::: ::::: CCDS32 MSAEVPEAASAEEQKE-MEDKVTSPEKAEEAKLKARYPHLGQKPGGSDFLRKRLQK 10 20 30 40 50 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG :::::::::::::::::::::::.:.:::. :::::::::::::::: :::.::::: CCDS32 GQKYFDSGDYNMAKAKMKNKQLPTAAPDKTEVTGDHIPTPQDLPQRKPSLVASKLAG 60 70 80 90 100 110 >>CCDS81883.1 ARPP19 gene_id:10776|Hs108|chr15 (96 aa) initn: 535 init1: 535 opt: 536 Z-score: 597.1 bits: 115.5 E(32554): 6.1e-27 Smith-Waterman score: 536; 83.3% identity (91.7% similar) in 96 aa overlap (22-117:1-96) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK .: . ::.::::::::.:: ::::::::::: ::::: CCDS81 MEDKVTSPEKAEEAKLKARYPHLGQKPGGSDFLRKRLQK 10 20 30 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG :::::::::::::::::::::::.:.:::. :::::::::::::::: :::.::::: CCDS81 GQKYFDSGDYNMAKAKMKNKQLPTAAPDKTEVTGDHIPTPQDLPQRKPSLVASKLAG 40 50 60 70 80 90 >>CCDS76757.1 ARPP19 gene_id:10776|Hs108|chr15 (131 aa) initn: 551 init1: 535 opt: 537 Z-score: 596.1 bits: 115.8 E(32554): 6.9e-27 Smith-Waterman score: 537; 81.6% identity (90.8% similar) in 98 aa overlap (20-117:34-131) 10 20 30 40 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPG . .: . ::.::::::::.:: :::::: CCDS76 EVPEAASAEEQKSEHNMLPWSLQPSIPNSLEEMEDKVTSPEKAEEAKLKARYPHLGQKPG 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 GSDFLMKRLQKGQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSS ::::: ::::::::::::::::::::::::::::.:.:::. :::::::::::::::: : CCDS76 GSDFLRKRLQKGQKYFDSGDYNMAKAKMKNKQLPTAAPDKTEVTGDHIPTPQDLPQRKPS 70 80 90 100 110 120 110 pF1KE6 LVTSKLAG ::.::::: CCDS76 LVASKLAG 130 >>CCDS965.1 ENSA gene_id:2029|Hs108|chr1 (105 aa) initn: 389 init1: 389 opt: 389 Z-score: 437.5 bits: 86.1 E(32554): 4.8e-18 Smith-Waterman score: 389; 98.4% identity (98.4% similar) in 61 aa overlap (1-61:1-61) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: CCDS96 MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 GQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLPQRKSSLVTSKLAG : CCDS96 GVWGIVSYPLSLELKEVLRMKSVEVLLDPFLEVLLLNRSRGEFEI 70 80 90 100 >>CCDS962.1 ENSA gene_id:2029|Hs108|chr1 (133 aa) initn: 749 init1: 389 opt: 389 Z-score: 435.9 bits: 86.2 E(32554): 5.8e-18 Smith-Waterman score: 721; 87.2% identity (87.2% similar) in 133 aa overlap (1-117:1-133) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: CCDS96 MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 60 70 80 90 100 pF1KE6 G----------------QKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLP : ::::::::::::::::::::::::::::::::::::::::::: CCDS96 GDYKSLHWSVLLCADEMQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLP 70 80 90 100 110 120 110 pF1KE6 QRKSSLVTSKLAG ::::::::::::: CCDS96 QRKSSLVTSKLAG 130 >>CCDS959.1 ENSA gene_id:2029|Hs108|chr1 (137 aa) initn: 749 init1: 389 opt: 389 Z-score: 435.7 bits: 86.2 E(32554): 6e-18 Smith-Waterman score: 721; 87.2% identity (87.2% similar) in 133 aa overlap (1-117:1-133) 10 20 30 40 50 60 pF1KE6 MSQKQEEENPAEETGEEKQDTLEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: CCDS95 MSQKQEEENPAEETGEEKQDTQEKEGILPERAEEAKLKAKYPSLGQKPGGSDFLMKRLQK 10 20 30 40 50 60 70 80 90 100 pF1KE6 G----------------QKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLP : ::::::::::::::::::::::::::::::::::::::::::: CCDS95 GDYKSLHWSVLLCADEMQKYFDSGDYNMAKAKMKNKQLPSAGPDKNLVTGDHIPTPQDLP 70 80 90 100 110 120 110 pF1KE6 QRKSSLVTSKLAG ::::::::::::: CCDS95 QRKSSLVTSKLAGGQVE 130 117 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:29:41 2016 done: Tue Nov 8 14:29:41 2016 Total Scan time: 1.270 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]