FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1249, 494 aa
  1>>>pF1KE1249 494 - 494 aa - 494 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.3036+/-0.00109; mu= -3.2891+/- 0.066
 mean_var=556.1011+/-114.278, 0's: 0 Z-trim(117.3): 26 B-trim: 0 in 0/53
 Lambda= 0.054387
 statistics sampled from 18013 (18034) to 18013 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.554), width: 16
 Scan time: 3.610

The best scores are:                                      opt  bits E(32554)
CCDS45900.1 ONECUT3 gene_id:390874|Hs108|chr19     ( 494) 3398 281.0 2.1e-75
CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18       ( 504) 1038  95.8 1.2e-19
CCDS10150.1 ONECUT1 gene_id:3175|Hs108|chr15       ( 465) 1028  95.0 1.9e-19

>>CCDS45900.1 ONECUT3 gene_id:390874|Hs108|chr19 (494 aa)
 initn: 3398 init1: 3398 opt: 3398 Z-score: 1467.2 bits: 281.0 E(32554): 2.1e-75
Smith-Waterman score: 3398; 100.0% identity (100.0% similar) in 494 aa overlap (1-494:1-494)

               10        20        30        40        50        60
pF1KE1 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE1 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE1 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA 430 440 450 460 470 480 490 pF1KE1 PGGPAGATATFSKA :::::::::::::: CCDS45 PGGPAGATATFSKA 490 >>CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18 (504 aa) initn: 1167 init1: 949 opt: 1038 Z-score: 466.3 bits: 95.8 E(32554): 1.2e-19 Smith-Waterman score: 1452; 52.4% identity (69.1% similar) in 531 aa overlap (2-494:23-504) 10 20 pF1KE1 MELSLESLGGLHSVAHAQAG------------------E ::..:::: ::. : . .: : CCDS42 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE 10 20 30 40 50 60 30 40 50 60 70 pF1KE1 LL---SPGHA-RSAAAQHRGLVAPGRPGLVAGMASLLDGGGGGGGGGAGGAGGAGSAGGG :: :: :: :.::.. :: : : :. ....... .. . . .: : CCDS42 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAA----AAAAAASRSAMVTSMASILDG 70 80 90 100 110 80 90 100 110 120 130 pF1KE1 ADFRGELAGPLHPAMGMACEA--PGLG--GTYTTLTPLQHLPPLAAVADKFHQHAAAAAV .:.: ::. ::: ::.:.:.. ::.: .::::::::: :::...:.::::. 
CCDS42 GDYRPELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHP------ 120 130 140 150 160 170 140 150 160 170 180 190 pF1KE1 AGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALASVGHLYGPYGKELP ::: ::: :::...::::::::::::. : ....::.:: ::.: CCDS42 ------HPHHHPHHHHH----HHHQRLSGNVSGSFTLMRDERG-LPAMNNLYSPY-KEMP 180 190 200 210 200 210 220 230 240 pF1KE1 AMGSPLSPLPNALP-----PALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLLPPAAFEPH .:.. :::: : : .::.: : : :::::: ::.: : : CCDS42 GMSQSLSPLA-ATPLGNGLGGLHNAQQSLP--------NYGPPGH---DKMLSPNFDAHH 220 230 240 250 260 250 260 270 280 290 300 pF1KE1 AALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAG---AHGPHGGGG-- .:.: :.:. :.::: :. :: ... :.:: : .::: . . CCDS42 TAMLTRGEQHLSRGL------------GTPPAA-MMSHLNGLHHPGHTQSHGPVLAPSRE 270 280 290 300 310 310 320 330 340 350 360 pF1KE1 -GPGGSGGGPSAGAAA-EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLR :..:.:. : .. :::::::::::::::::::::::::::::.::::::::::::: CCDS42 RPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLR 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE1 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLVF :::::::::::::::::::::::::::::::::::::::::::: .:.: . ::.:::: CCDS42 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVF 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE1 TDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTAP ::::::::.::::::::::::::.::::::::::.:::::::::::: ...: .. :: CCDS42 TDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST-- 440 450 460 470 480 490 490 pF1KE1 GGPAGATATFSKA :: .....: .:: CCDS42 GGSSSTSSTCTKA 500 >>CCDS10150.1 ONECUT1 gene_id:3175|Hs108|chr15 (465 aa) initn: 1320 init1: 963 opt: 1028 Z-score: 462.5 bits: 95.0 E(32554): 1.9e-19 Smith-Waterman score: 1500; 54.3% identity (70.6% similar) in 506 aa overlap (2-494:4-465) 10 20 30 40 50 pF1KE1 MELSLESLGGLHSVAH----AQAGELLSPGHARSAAAQHRGL-VAPGRPGLVAGMASL .:..:..: ::.:.: : : : . ::::..: ::: . :..: . 
::::: CCDS10 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVA-HRGSHLPPAHPRSM-GMASL 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 LDGGGGGGGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAP-GLG--GTYTTLTP ::::.::: . : :::::::.: ::::.: :.. ::::::: CCDS10 LDGGSGGGDYHHHHRAPEHS----------LAGPLHPTMTMACETPPGMSMPTTYTTLTP 60 70 80 90 100 120 130 140 150 160 pF1KE1 LQHLPPLAAVADKF-HQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFT :: :::...:.::: :.: : : : ::: ::::..:::::: CCDS10 LQPLPPISTVSDKFPHHH---------HHHHHHHHPHHH---------QRLAGNVSGSFT 110 120 130 140 150 170 180 190 200 210 220 pF1KE1 LMRDERAALASVGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYG ::::::. :::...:: :: :.. .::. :::: .. ..:.. : : :. CCDS10 LMRDERG-LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLPH--------YA 160 170 180 190 200 230 240 250 260 270 280 pF1KE1 PPGH-LAGDKLLPPAAFEPH-AALLGR-AEDALARGLPGGGGGTGSGGAGSGSAAGLLAP :: . ::.: : .:: : :.::: .:. :. : ..: . .: . : CCDS10 HPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLT---PTSAGMVPINGLPPHHPHAHLNA 210 220 230 240 250 290 300 310 320 330 340 pF1KE1 LG-GLAAAGAHGPHGGGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFA : : . :. :. . : :.: : .. :::::::::::::.::::::::::::: CCDS10 QGHGQLLGTAREPNPSVTGAQVSNG--SNSGQMEEINTKEVAQRITTELKRYSIPQAIFA 260 270 280 290 300 310 350 360 370 380 390 400 pF1KE1 QRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE ::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 QRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE 320 330 340 350 360 370 410 420 430 440 450 460 pF1KE1 QQKERALQPKKQRLVFTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNA . :.:. ::: ::::::.::::: ::::::::::::.:.::::::::::.::::::::: CCDS10 HGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQQLGLELSTVSNFFMNA 380 390 400 410 420 430 470 480 490 pF1KE1 RRRCMNRWAEEPSTAPGGPAGATATFSKA ::: ...: .: :. :. 
.....: .:: CCDS10 RRRSLDKWQDEGSSNSGNSSSSSSTCTKA 440 450 460

494 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 04:29:19 2016 done: Tue Nov 8 04:29:19 2016
 Total Scan time: 3.610 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]