FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2724, 199 aa 1>>>pF1KE2724 199 - 199 aa - 199 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4350+/-0.000698; mu= 12.2665+/- 0.042 mean_var=57.5414+/-11.545, 0's: 0 Z-trim(108.5): 13 B-trim: 0 in 0/51 Lambda= 0.169077 statistics sampled from 10274 (10283) to 10274 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.716), E-opt: 0.2 (0.316), width: 16 Scan time: 0.920 The best scores are: opt bits E(32554) CCDS6650.1 NMRK1 gene_id:54981|Hs108|chr9 ( 199) 1339 334.4 2.8e-92 CCDS83374.1 NMRK1 gene_id:54981|Hs108|chr9 ( 203) 1284 321.0 3.2e-88 CCDS12115.1 NMRK2 gene_id:27231|Hs108|chr19 ( 230) 715 182.2 2.1e-46 CCDS74259.1 NMRK2 gene_id:27231|Hs108|chr19 ( 235) 695 177.3 6.4e-45 CCDS47981.1 NMRK1 gene_id:54981|Hs108|chr9 ( 175) 679 173.4 7.3e-44 >>CCDS6650.1 NMRK1 gene_id:54981|Hs108|chr9 (199 aa) initn: 1339 init1: 1339 opt: 1339 Z-score: 1770.1 bits: 334.4 E(32554): 2.8e-92 Smith-Waterman score: 1339; 100.0% identity (100.0% similar) in 199 aa overlap (1-199:1-199) 10 20 30 40 50 60 pF1KE2 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYDVLEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYDVLEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 LNMEKMMSAISCWMESARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTIWNRSYFLTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 LNMEKMMSAISCWMESARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTIWNRSYFLTI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 PYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEEDLFLQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS66 PYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEEDLFLQ 130 140 150 160 170 180 190 pF1KE2 VYEDLIQELAKQKCLQVTA ::::::::::::::::::: CCDS66 VYEDLIQELAKQKCLQVTA 190 >>CCDS83374.1 NMRK1 gene_id:54981|Hs108|chr9 (203 aa) initn: 1284 init1: 1284 opt: 1284 Z-score: 1697.4 bits: 321.0 E(32554): 3.2e-88 Smith-Waterman score: 1284; 99.5% identity (99.5% similar) in 192 aa overlap (8-199:12-203) 10 20 30 40 50 pF1KE2 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYD :: :::::::::::::::::::::::::::::::::::::::::::::: CCDS83 MGKRRHRRGNVISCVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE2 VLEALNMEKMMSAISCWMESARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTIWNRSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 VLEALNMEKMMSAISCWMESARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTIWNRSY 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE2 FLTIPYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 FLTIPYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEED 130 140 150 160 170 180 180 190 pF1KE2 LFLQVYEDLIQELAKQKCLQVTA ::::::::::::::::::::::: CCDS83 LFLQVYEDLIQELAKQKCLQVTA 190 200 >>CCDS12115.1 NMRK2 gene_id:27231|Hs108|chr19 (230 aa) initn: 697 init1: 434 opt: 715 Z-score: 946.4 bits: 182.2 E(32554): 2.1e-46 Smith-Waterman score: 715; 57.0% identity (77.7% similar) in 193 aa overlap (1-189:1-191) 10 20 30 40 50 60 pF1KE2 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYDVLEA :: .:.::.:.::.:::::...: . :::: :: :::::::...: . ..:: :.::::. CCDS12 MK-LIVGIGGMTNGGKTTLTNSLLRALPNCCVIHQDDFFKPQDQIAVGEDGFKQWDVLES 10 20 30 40 50 70 80 90 100 110 pF1KE2 LNMEKMMSAISCWMES----ARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTIWNRSY :.:: :..... :. : :: ::. : : . ::..:::::..:::: ...: : CCDS12 LDMEAMLDTVQAWLSSPQKFARAHGVSV-QPEASDTHILLLEGFLLYSYKPLVDLYSRRY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 FLTIPYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEED :::.:::::: ::::: : :: :: ::::::::: ::::::. ::::::: ::.:. CCDS12 FLTVPYEECKWRRSTRNYTVPDPPGLFDGHVWPMYQKYRQEMEANGVEVVYLDGMKSREE 120 130 140 150 160 170 180 190 pF1KE2 LFLQVYEDLIQELAKQKCLQVTA :: .: ::. . : CCDS12 LFREVLEDIQNSLLNRSQESAPSPARPARTQGPGRGCGHRTARPAASQQDSM 180 190 200 210 220 230 >>CCDS74259.1 NMRK2 gene_id:27231|Hs108|chr19 (235 aa) initn: 673 init1: 434 opt: 695 Z-score: 919.9 bits: 177.3 E(32554): 6.4e-45 Smith-Waterman score: 695; 55.6% identity (75.8% similar) in 198 aa overlap (1-189:1-196) 10 20 30 40 50 pF1KE2 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFK-----PESEIETDKNGFLQY :: .:.::.:.::.:::::...: . :::: :: :::::: :...: . ..:: :. CCDS74 MK-LIVGIGGMTNGGKTTLTNSLLRALPNCCVIHQDDFFKAPLFQPQDQIAVGEDGFKQW 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 DVLEALNMEKMMSAISCWMES----ARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTI ::::.:.:: :..... :. : :: ::. : : . ::..:::::..:::: . CCDS74 DVLESLDMEAMLDTVQAWLSSPQKFARAHGVSV-QPEASDTHILLLEGFLLYSYKPLVDL 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE2 WNRSYFLTIPYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGT ..: ::::.:::::: ::::: : :: :: ::::::::: ::::::. ::::::: CCDS74 YSRRYFLTVPYEECKWRRSTRNYTVPDPPGLFDGHVWPMYQKYRQEMEANGVEVVYLDGM 120 130 140 150 160 170 180 190 pF1KE2 KSEEDLFLQVYEDLIQELAKQKCLQVTA ::.:.:: .: ::. . : CCDS74 KSREELFREVLEDIQNSLLNRSQESAPSPARPARTQGPGRGCGHRTARPAASQQDSM 180 190 200 210 220 230 >>CCDS47981.1 NMRK1 gene_id:54981|Hs108|chr9 (175 aa) initn: 1147 init1: 679 opt: 679 Z-score: 900.9 bits: 173.4 E(32554): 7.3e-44 Smith-Waterman score: 1103; 87.4% identity (87.9% similar) in 199 aa overlap (1-199:1-175) 10 20 30 40 50 60 pF1KE2 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYDVLEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFFKPESEIETDKNGFLQYDVLEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 LNMEKMMSAISCWMESARHSVVSTDQESAEEIPILIIEGFLLFNYKPLDTIWNRSYFLTI ::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LNMEKMMSAISCWMESARHSVVSTDQESAEEIPILIIEGFLLFNY--------------- 70 80 90 100 130 140 150 160 170 180 pF1KE2 PYEECKRRRSTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEEDLFLQ .:::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ---------NTRVYQPPDSPGYFDGHVWPMYLKYRQEMQDITWEVVYLDGTKSEEDLFLQ 110 120 130 140 150 190 pF1KE2 VYEDLIQELAKQKCLQVTA ::::::::::::::::::: CCDS47 VYEDLIQELAKQKCLQVTA 160 170 199 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Jun 13 19:13:01 2018 done: Wed Jun 13 19:13:02 2018 Total Scan time: 0.920 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]