FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3808, 359 aa 1>>>pF1KE3808 359 - 359 aa - 359 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.0475+/-0.000801; mu= 1.0070+/- 0.048 mean_var=288.2349+/-65.353, 0's: 0 Z-trim(116.4): 264 B-trim: 1020 in 1/53 Lambda= 0.075544 statistics sampled from 16730 (17064) to 16730 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.828), E-opt: 0.2 (0.524), width: 16 Scan time: 3.180 The best scores are: opt bits E(32554) CCDS74457.1 SBK3 gene_id:100130827|Hs108|chr19 ( 359) 2491 284.3 1.1e-76 CCDS42631.1 SBK2 gene_id:646643|Hs108|chr19 ( 348) 760 95.6 6.6e-20 CCDS32416.1 SBK1 gene_id:388228|Hs108|chr16 ( 424) 674 86.4 5e-17 >>CCDS74457.1 SBK3 gene_id:100130827|Hs108|chr19 (359 aa) initn: 2491 init1: 2491 opt: 2491 Z-score: 1490.3 bits: 284.3 E(32554): 1.1e-76 Smith-Waterman score: 2491; 100.0% identity (100.0% similar) in 359 aa overlap (1-359:1-359) 10 20 30 40 50 60 pF1KE3 MERRASETPEDGDPEEDTATALQRLVELTTSRVTPVRSLRDQYHLIRKLGSGSYGRVLLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 MERRASETPEDGDPEEDTATALQRLVELTTSRVTPVRSLRDQYHLIRKLGSGSYGRVLLA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 QPHQGGPAVALKLLRRDLVLRSTFLREFCVGRCVSAHPGLLQTLAGPLQTPRYFAFAQEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 QPHQGGPAVALKLLRRDLVLRSTFLREFCVGRCVSAHPGLLQTLAGPLQTPRYFAFAQEY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 APCGDLSGMLQERGLPELLVKRVVAQLAGALDFLHSRGLVHADVKPDNVLVFDPVCSRVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 APCGDLSGMLQERGLPELLVKRVVAQLAGALDFLHSRGLVHADVKPDNVLVFDPVCSRVA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 LGDLGLTRPEGSPTPAPPVPLPTAPPELCLLLPPDTLPLRPAVDSWGLGVLLFCAATACF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LGDLGLTRPEGSPTPAPPVPLPTAPPELCLLLPPDTLPLRPAVDSWGLGVLLFCAATACF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 PWDVALAPNPEFEAFAGWVTTKPQPPQPPPPWDQFAPPALALLQGLLDLDPETRSPPLAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 PWDVALAPNPEFEAFAGWVTTKPQPPQPPPPWDQFAPPALALLQGLLDLDPETRSPPLAV 250 260 270 280 290 300 310 320 330 340 350 pF1KE3 LDFLGDDWGLQGNREGPGVLGSAVSYEDREEGGSSLEEWTDEGDDSKSGGRTGTDGGAP ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS74 LDFLGDDWGLQGNREGPGVLGSAVSYEDREEGGSSLEEWTDEGDDSKSGGRTGTDGGAP 310 320 330 340 350 >>CCDS42631.1 SBK2 gene_id:646643|Hs108|chr19 (348 aa) initn: 735 init1: 274 opt: 760 Z-score: 470.9 bits: 95.6 E(32554): 6.6e-20 Smith-Waterman score: 760; 42.0% identity (64.8% similar) in 324 aa overlap (8-327:28-345) 10 20 30 40 pF1KE3 MERRASETPEDGDPEEDTATALQRLVELTTSRVTPVRSLR : :. . ...: ::. . .: : : ::. CCDS42 MPGKQSEEGPAEAGASEDSEEEGLGGLTLEELQQGQEAARALEDM--MTLSAQTLVRAEV 10 20 30 40 50 50 60 70 80 90 pF1KE3 DQ-YHLIRKLGSGSYGRVLLAQPHQGGPAVALKLLRRDLVLRSTFLREFCVGRCVSAHPG :. :. .: ::.: ::::::. .: : .::: : . . :: ::::: ..:: . CCDS42 DELYEEVRPLGQGCYGRVLLVTHRQKGTPLALKQLPKPRTSLRGFLYEFCVGLSLGAHSA 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE3 LLQTLAGPLQTPRYFAFAQEYAPCGDLSGMLQER-GLPELLVKRVVAQLAGALDFLHSRG .. . . ... . ..: : . ::: ...: . :::. :.: .::::.::...:.:: CCDS42 IVTAYGIGIESAHSYSFLTEPVLHGDLMAFIQPKVGLPQPAVHRCAAQLASALEYIHARG 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE3 LVHADVKPDNVLVFDPVCSRVALGDLGLTRPEGSPTPAPPVPLPTAPPELCLLLP-PDTL ::. :.::.:::: ::.: : : :.: :::.:. :.: . :::: : :. : CCDS42 LVYRDLKPENVLVCDPACRRFKLTDFGHTRPRGTLLRLAGPPIPYTAPELCAPPPLPEGL 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE3 PLRPAVDSWGLGVLLFCAATACFPWDVALAP-NPEFEAFAGWVTTKPQPPQPPPPWDQFA :..::.:.:.::::::: :. :::: :: .: .: : : .. :: . : :: .: CCDS42 PIQPALDAWALGVLLFCLLTGYFPWDRPLAEADPFYEDFLIWQASG-QPRDRPQPWFGLA 240 250 260 270 280 290 280 290 300 310 320 330 pF1KE3 PPALALLQGLLDLDPETRSPPLAVLDFLGDDWGLQGNREGPGVLGSAVSYEDREEGGSSL : :::.:::: :. :: .:. . :: : .::: . .:: : CCDS42 AAADALLRGLLDPHPRRRSAVIAIREHLGRPWR---QREGEAEAVGAVEEEAGQ 300 310 320 330 340 340 350 pF1KE3 EEWTDEGDDSKSGGRTGTDGGAP >>CCDS32416.1 SBK1 gene_id:388228|Hs108|chr16 (424 aa) initn: 672 init1: 390 opt: 674 Z-score: 419.2 bits: 86.4 E(32554): 5e-17 Smith-Waterman score: 674; 39.9% identity (65.2% similar) in 276 aa overlap (30-304:40-313) 10 20 30 40 50 pF1KE3 MERRASETPEDGDPEEDTATALQRLVELTTSRVTPVRSLRDQYHLIRKLGSGSYGRVLL : :. . .. .:.:.:.::.:.::.: : CCDS32 PPRSLTCCGPGTAPGPGAGVPLLTEDMQALTLRTLAASDVTKHYELVRELGKGTYGKVDL 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 AQPHQGGPAVALKLLRRDLVLRSTFLREFCVGRCVSAHPGLLQTLAGPLQTPRYFAFAQE . . : .:::.. .. . ..:::: . .:. : ..... ..: ..:::: CCDS32 VVYKGTGTKMALKFVNKSKTKLKNFLREVSITNSLSSSPFIIKVFDVVFETEDCYVFAQE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE3 YAPCGDLSGMLQER-GLPELLVKRVVAQLAGALDFLHSRGLVHADVKPDNVLVFDPVCSR ::: ::: .. . :::: ::: : ::. ::::.:.: ::: :.::.:::.:: : : CCDS32 YAPAGDLFDIIPPQVGLPEDTVKRCVQQLGLALDFMHGRQLVHRDIKPENVLLFDRECRR 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE3 VALGDLGLTRPEGSPTPAPPVPLPTAPPELCLLLPPDTLPLRPAVDSWGLGVLLFCAATA : :.:.:.:: : . .: . ::.: : : . .:: :..:::.::. :. CCDS32 VKLADFGMTRRVGCRVKRVSGTIPYTAPEVCQAGRADGLAVDTGVDVWAFGVLIFCVLTG 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE3 CFPWDVALAPNPEFEAFAGWVTTKPQPPQPPPPWDQFAPPALALLQGLLDLDPETRSPPL :::..: . . :: :. : . . : : : .:. ::: ..: :: :.:: :.: CCDS32 NFPWEAASGADAFFEEFVRW--QRGRLPGLPSQWRRFTEPALRMFQRLLALEPERRGPAK 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE3 AVLDFLGDDWGLQGNREGPGVLGSAVSYEDREEGGSSLEEWTDEGDDSKSGGRTGTDGGA :. :: CCDS32 EVFRFLKHELTSELRRRPSHRARKPPGDRPPAAGPLRLEAPGPLKRTVLTESGSGSRPAP 310 320 330 340 350 360 359 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 06:39:15 2016 done: Sun Nov 6 06:39:16 2016 Total Scan time: 3.180 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]