FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1249, 494 aa
1>>>pF1KE1249 494 - 494 aa - 494 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.3036+/-0.00109; mu= -3.2891+/- 0.066
mean_var=556.1011+/-114.278, 0's: 0 Z-trim(117.3): 26 B-trim: 0 in 0/53
Lambda= 0.054387
statistics sampled from 18013 (18034) to 18013 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.554), width: 16
Scan time: 3.610
The best scores are: opt bits E(32554)
CCDS45900.1 ONECUT3 gene_id:390874|Hs108|chr19 ( 494) 3398 281.0 2.1e-75
CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18 ( 504) 1038 95.8 1.2e-19
CCDS10150.1 ONECUT1 gene_id:3175|Hs108|chr15 ( 465) 1028 95.0 1.9e-19
>>CCDS45900.1 ONECUT3 gene_id:390874|Hs108|chr19 (494 aa)
initn: 3398 init1: 3398 opt: 3398 Z-score: 1467.2 bits: 281.0 E(32554): 2.1e-75
Smith-Waterman score: 3398; 100.0% identity (100.0% similar) in 494 aa overlap (1-494:1-494)
10 20 30 40 50 60
pF1KE1 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MELSLESLGGLHSVAHAQAGELLSPGHARSAAAQHRGLVAPGRPGLVAGMASLLDGGGGG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAPGLGGTYTTLTPLQHLPPLAAV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 ADKFHQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALAS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 VGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 PPAAFEPHAALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAGAHGPHG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE1 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 RNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLV
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE1 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 FTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTA
430 440 450 460 470 480
490
pF1KE1 PGGPAGATATFSKA
::::::::::::::
CCDS45 PGGPAGATATFSKA
490
>>CCDS42440.1 ONECUT2 gene_id:9480|Hs108|chr18 (504 aa)
initn: 1167 init1: 949 opt: 1038 Z-score: 466.3 bits: 95.8 E(32554): 1.2e-19
Smith-Waterman score: 1452; 52.4% identity (69.1% similar) in 531 aa overlap (2-494:23-504)
10 20
pF1KE1 MELSLESLGGLHSVAHAQAG------------------E
::..:::: ::. : . .: :
CCDS42 MKAAYTAYRCLTKDLEGCAMNPELTMESLGTLHGPAGGGSGGGGGGGGGGGGGGPGHEQE
10 20 30 40 50 60
30 40 50 60 70
pF1KE1 LL---SPGHA-RSAAAQHRGLVAPGRPGLVAGMASLLDGGGGGGGGGAGGAGGAGSAGGG
:: :: :: :.::.. :: : : :. ....... .. . . .: :
CCDS42 LLASPSPHHAGRGAAGSLRGPPPPPTAHQELGTAA----AAAAAASRSAMVTSMASILDG
70 80 90 100 110
80 90 100 110 120 130
pF1KE1 ADFRGELAGPLHPAMGMACEA--PGLG--GTYTTLTPLQHLPPLAAVADKFHQHAAAAAV
.:.: ::. ::: ::.:.:.. ::.: .::::::::: :::...:.::::.
CCDS42 GDYRPELSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQPLPPISTVSDKFHHP------
120 130 140 150 160 170
140 150 160 170 180 190
pF1KE1 AGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFTLMRDERAALASVGHLYGPYGKELP
::: ::: :::...::::::::::::. : ....::.:: ::.:
CCDS42 ------HPHHHPHHHHH----HHHQRLSGNVSGSFTLMRDERG-LPAMNNLYSPY-KEMP
180 190 200 210
200 210 220 230 240
pF1KE1 AMGSPLSPLPNALP-----PALHGAPQPPPPPPPPPLAAYGPPGHLAGDKLLPPAAFEPH
.:.. :::: : : .::.: : : :::::: ::.: : :
CCDS42 GMSQSLSPLA-ATPLGNGLGGLHNAQQSLP--------NYGPPGH---DKMLSPNFDAHH
220 230 240 250 260
250 260 270 280 290 300
pF1KE1 AALLGRAEDALARGLPGGGGGTGSGGAGSGSAAGLLAPLGGLAAAG---AHGPHGGGG--
.:.: :.:. :.::: :. :: ... :.:: : .::: . .
CCDS42 TAMLTRGEQHLSRGL------------GTPPAA-MMSHLNGLHHPGHTQSHGPVLAPSRE
270 280 290 300 310
310 320 330 340 350 360
pF1KE1 -GPGGSGGGPSAGAAA-EEINTKEVAQRITAELKRYSIPQAIFAQRILCRSQGTLSDLLR
:..:.:. : .. :::::::::::::::::::::::::::::.:::::::::::::
CCDS42 RPPSSSSGSQVATSGQLEEINTKEVAQRITAELKRYSIPQAIFAQRVLCRSQGTLSDLLR
320 330 340 350 360 370
370 380 390 400 410 420
pF1KE1 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEQQKERALQPKKQRLVF
:::::::::::::::::::::::::::::::::::::::::::: .:.: . ::.::::
CCDS42 NPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQEPNKDRNNSQKKSRLVF
380 390 400 410 420 430
430 440 450 460 470 480
pF1KE1 TDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNARRRCMNRWAEEPSTAP
::::::::.::::::::::::::.::::::::::.:::::::::::: ...: .. ::
CCDS42 TDLQRRTLFAIFKENKRPSKEMQITISQQLGLELTTVSNFFMNARRRSLEKWQDDLST--
440 450 460 470 480 490
490
pF1KE1 GGPAGATATFSKA
:: .....: .::
CCDS42 GGSSSTSSTCTKA
500
>>CCDS10150.1 ONECUT1 gene_id:3175|Hs108|chr15 (465 aa)
initn: 1320 init1: 963 opt: 1028 Z-score: 462.5 bits: 95.0 E(32554): 1.9e-19
Smith-Waterman score: 1500; 54.3% identity (70.6% similar) in 506 aa overlap (2-494:4-465)
10 20 30 40 50
pF1KE1 MELSLESLGGLHSVAH----AQAGELLSPGHARSAAAQHRGL-VAPGRPGLVAGMASL
.:..:..: ::.:.: : : : . ::::..: ::: . :..: . :::::
CCDS10 MNAQLTMEAIGELHGVSHEPVPAPADLLGGSPHARSSVA-HRGSHLPPAHPRSM-GMASL
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 LDGGGGGGGGGAGGAGGAGSAGGGADFRGELAGPLHPAMGMACEAP-GLG--GTYTTLTP
::::.::: . : :::::::.: ::::.: :.. :::::::
CCDS10 LDGGSGGGDYHHHHRAPEHS----------LAGPLHPTMTMACETPPGMSMPTTYTTLTP
60 70 80 90 100
120 130 140 150 160
pF1KE1 LQHLPPLAAVADKF-HQHAAAAAVAGAHGGHPHAHPHPAAAPPPPPPPQRLAASVSGSFT
:: :::...:.::: :.: : : : ::: ::::..::::::
CCDS10 LQPLPPISTVSDKFPHHH---------HHHHHHHHPHHH---------QRLAGNVSGSFT
110 120 130 140 150
170 180 190 200 210 220
pF1KE1 LMRDERAALASVGHLYGPYGKELPAMGSPLSPLPNALPPALHGAPQPPPPPPPPPLAAYG
::::::. :::...:: :: :.. .::. :::: .. ..:.. : : :.
CCDS10 LMRDERG-LASMNNLYTPYHKDVAGMGQSLSPLSSSGLGSIHNSQQGLPH--------YA
160 170 180 190 200
230 240 250 260 270 280
pF1KE1 PPGH-LAGDKLLPPAAFEPH-AALLGR-AEDALARGLPGGGGGTGSGGAGSGSAAGLLAP
:: . ::.: : .:: : :.::: .:. :. : ..: . .: . :
CCDS10 HPGAAMPTDKMLTPNGFEAHHPAMLGRHGEQHLT---PTSAGMVPINGLPPHHPHAHLNA
210 220 230 240 250
290 300 310 320 330 340
pF1KE1 LG-GLAAAGAHGPHGGGGGPGGSGGGPSAGAAAEEINTKEVAQRITAELKRYSIPQAIFA
: : . :. :. . : :.: : .. :::::::::::::.:::::::::::::
CCDS10 QGHGQLLGTAREPNPSVTGAQVSNG--SNSGQMEEINTKEVAQRITTELKRYSIPQAIFA
260 270 280 290 300 310
350 360 370 380 390 400
pF1KE1 QRILCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE
::.:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 QRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLAACKRKEQE
320 330 340 350 360 370
410 420 430 440 450 460
pF1KE1 QQKERALQPKKQRLVFTDLQRRTLIAIFKENKRPSKEMQVTISQQLGLELNTVSNFFMNA
. :.:. ::: ::::::.::::: ::::::::::::.:.::::::::::.:::::::::
CCDS10 HGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRPSKELQITISQQLGLELSTVSNFFMNA
380 390 400 410 420 430
470 480 490
pF1KE1 RRRCMNRWAEEPSTAPGGPAGATATFSKA
::: ...: .: :. :. .....: .::
CCDS10 RRRSLDKWQDEGSSNSGNSSSSSSTCTKA
440 450 460
494 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 04:29:19 2016 done: Tue Nov 8 04:29:19 2016
Total Scan time: 3.610 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]