FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5667, 337 aa
  1>>>pF1KB5667 337 - 337 aa - 337 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.8126+/-0.00102; mu= -4.7462+/- 0.062
 mean_var=393.9402+/-78.891, 0's: 0 Z-trim(116.5): 3 B-trim: 0 in 0/52
 Lambda= 0.064619
 statistics sampled from 17145 (17147) to 17145 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.805), E-opt: 0.2 (0.527), width: 16
 Scan time: 3.370

The best scores are:                                 opt  bits E(32554)
CCDS72873.1 LIX1L gene_id:128077|Hs108|chr1  ( 337) 2266 224.4 1.1e-58
CCDS4088.1 LIX1 gene_id:167410|Hs108|chr5    ( 282)  980 104.4 1.1e-22

>>CCDS72873.1 LIX1L gene_id:128077|Hs108|chr1 (337 aa)
 initn: 2266 init1: 2266 opt: 2266 Z-score: 1167.4 bits: 224.4 E(32554): 1.1e-58
Smith-Waterman score: 2266; 100.0% identity (100.0% similar) in 337 aa overlap (1-337:1-337)

               10        20        30        40        50        60
pF1KB5 METMRAQRLQPGVGTSGRGTLRALRPGVTGAAAATATPPAGPPPAPPPPAPPPPPLLLSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 METMRAQRLQPGVGTSGRGTLRALRPGVTGAAAATATPPAGPPPAPPPPAPPPPPLLLSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 APGLPLPPGAAGSPAVLREAVEAVVRSFAKHTQGYGRVNVVEALQEFWQMKQSRGADLKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 APGLPLPPGAAGSPAVLREAVEAVVRSFAKHTQGYGRVNVVEALQEFWQMKQSRGADLKN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 GALVVYEMVPSNSPPYVCYVTLPGGSCFGSFQFCPTKAEARRSAAKIALMNSVFNEHPSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 GALVVYEMVPSNSPPYVCYVTLPGGSCFGSFQFCPTKAEARRSAAKIALMNSVFNEHPSR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 RITDEFIEKSVSEALASFNGNREEADNPNTGIGAFRFMLESNKGKSMLEFQELMTVFQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 RITDEFIEKSVSEALASFNGNREEADNPNTGIGAFRFMLESNKGKSMLEFQELMTVFQLL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 HWNGSLKAMRERQCSRQEVLAHYSHRALDDDIRHQMALDWVSREQSVPGALSRELASTER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS72 HWNGSLKAMRERQCSRQEVLAHYSHRALDDDIRHQMALDWVSREQSVPGALSRELASTER
              250       260       270       280       290       300

              310       320       330
pF1KB5 ELDEARLAGKELRFHKEKKDILVLAAGQLGNMHSSNC
       :::::::::::::::::::::::::::::::::::::
CCDS72 ELDEARLAGKELRFHKEKKDILVLAAGQLGNMHSSNC
              310       320       330

>>CCDS4088.1 LIX1 gene_id:167410|Hs108|chr5 (282 aa)
 initn: 1033 init1: 979 opt: 980 Z-score: 520.4 bits: 104.4 E(32554): 1.1e-22
Smith-Waterman score: 980; 61.2% identity (86.2% similar) in 232 aa overlap (98-329:28-259)

        70        80        90       100       110       120
pF1KB5 PGAAGSPAVLREAVEAVVRSFAKHTQGYGRVNVVEALQEFWQMKQSRGADLKNGALVVYE
                                     .:::  :::::. ::.. : . . ..::::
CCDS40    MDRTLESLRHIIAQVLPHRDPALVFKDLNVVSMLQEFWESKQQQKAAFPSEGVVVYE
                  10        20        30        40        50

       130       140       150       160       170       180
pF1KB5 MVPSNSPPYVCYVTLPGGSCFGSFQFCPTKAEARRSAAKIALMNSVFNEHPSRRITDEFI
        .:. .::.: :::::::::::.:: : ..:::::.:::.::.::.::: :::::: :::
CCDS40 SLPAPGPPFVSYVTLPGGSCFGNFQCCLSRAEARRDAAKVALINSLFNELPSRRITKEFI
        60        70        80        90       100       110

       190       200       210       220       230       240
pF1KB5 EKSVSEALASFNGNREEADNPNTGIGAFRFMLESNKGKSMLEFQELMTVFQLLHWNGSLK
        .::.::.:: .:. ..::.:.:..::...::::: ::.:::::::::.:::::::::::
CCDS40 MESVQEAVASTSGTLDDADDPSTSVGAYHYMLESNMGKTMLEFQELMTIFQLLHWNGSLK
       120       130       140       150       160       170

       250       260       270       280       290       300
pF1KB5 AMRERQCSRQEVLAHYSHRALDDDIRHQMALDWVSREQSVPGALSRELASTERELDEARL
       :.:: .::::::...::. .::. .: .:::::. .:.. :: .:.::  . :.:.:::
CCDS40 ALRETKCSRQEVISYYSQYSLDEKMRSHMALDWIMKERDSPGIVSQELRMALRQLEEARK
       180       190       200       210       220       230

       310       320       330
pF1KB5 AGKELRFHKEKKDILVLAAGQLGNMHSSNC
       ::.::::.::::.:: ::  :.
CCDS40 AGQELRFYKEKKEILSLALTQICSDPDTSSPSDDQLSLTALCGYH
       240       250       260       270       280

337 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 04:02:02 2016 done: Mon Nov 7 04:02:02 2016
 Total Scan time: 3.370 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]