FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7580, 284 aa 1>>>pF1KB7580 284 - 284 aa - 284 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.1244+/-0.000882; mu= 5.0435+/- 0.054 mean_var=230.5985+/-48.068, 0's: 0 Z-trim(114.6): 145 B-trim: 821 in 1/51 Lambda= 0.084459 statistics sampled from 14953 (15115) to 14953 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.781), E-opt: 0.2 (0.464), width: 16 Scan time: 2.550 The best scores are: opt bits E(32554) CCDS14424.1 CDX4 gene_id:1046|Hs108|chrX ( 284) 1949 249.5 2e-66 CCDS4304.1 CDX1 gene_id:1044|Hs108|chr5 ( 265) 572 81.7 6.2e-16 CCDS9328.1 CDX2 gene_id:1045|Hs108|chr13 ( 313) 448 66.7 2.5e-11 >>CCDS14424.1 CDX4 gene_id:1046|Hs108|chrX (284 aa) initn: 1949 init1: 1949 opt: 1949 Z-score: 1305.9 bits: 249.5 E(32554): 2e-66 Smith-Waterman score: 1949; 100.0% identity (100.0% similar) in 284 aa overlap (1-284:1-284) 10 20 30 40 50 60 pF1KB7 MYGSCLLEKEAGMYPGTLMSPGGDGTAGTGGTGGGGSPMPASNFAAAPAFSHYMGYPHMP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MYGSCLLEKEAGMYPGTLMSPGGDGTAGTGGTGGGGSPMPASNFAAAPAFSHYMGYPHMP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 SMDPHWPSLGVWGSPYSPPREDWSVYPGPSSTMGTVPVNDVTSSPAAFCSTDYSNLGPVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SMDPHWPSLGVWGSPYSPPREDWSVYPGPSSTMGTVPVNDVTSSPAAFCSTDYSNLGPVG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 GGTSGSSLPGQAGGSLVPTDAGAAKASSPSRSRHSPYAWMRKTVQVTGKTRTKEKYRVVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GGTSGSSLPGQAGGSLVPTDAGAAKASSPSRSRHSPYAWMRKTVQVTGKTRTKEKYRVVY 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 TDHQRLELEKEFHCNRYITIQRKSELAVNLGLSERQVKIWFQNRRAKERKMIKKKISQFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TDHQRLELEKEFHCNRYITIQRKSELAVNLGLSERQVKIWFQNRRAKERKMIKKKISQFE 190 200 210 220 230 240 250 260 270 280 pF1KB7 NSGGSVQSDSDSISPGELPNTFFTTPSAVRGFQPIEIQQVIVSE :::::::::::::::::::::::::::::::::::::::::::: CCDS14 NSGGSVQSDSDSISPGELPNTFFTTPSAVRGFQPIEIQQVIVSE 250 260 270 280 >>CCDS4304.1 CDX1 gene_id:1044|Hs108|chr5 (265 aa) initn: 604 init1: 394 opt: 572 Z-score: 399.5 bits: 81.7 E(32554): 6.2e-16 Smith-Waterman score: 613; 45.1% identity (66.4% similar) in 244 aa overlap (1-238:1-219) 10 20 30 40 50 pF1KB7 MYGSCLLEKEAGMYPGTLMSPGGDGTAGTGGTGGGGSPMPASNFAAAPAFSHYMGYPHM- :: . .:.:.. .::: :. .. : : . : : : : : . . .: :. CCDS43 MYVGYVLDKDSPVYPG----PARPASLGLGPQAYG-PPAPP---PAPPQYPDFSSYSHVE 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 PSMDPHWPSLGVWGSPYSPPREDWSVYPGPSSTMGTVPVNDVTSSPAAFCSTDYSNLGPV :. : :. .::.:. :..::.. ::. . .:. .:::.. ...:: CCDS43 PAPAP--PT--AWGAPFPAPKDDWAAAYGPGPA---APA----ASPASLAFGPPPDFSPV 60 70 80 90 100 120 130 140 150 160 170 pF1KB7 GGGTSGSSLPGQAGGSLVPTDAGAAKASSPSRSRHSPYAWMRKTVQV-----TGKTRTKE . :: . : :. .: . :::. .: .:: :::..: . .::::::. CCDS43 ------PAPPGPGPGLLAQPLGGPGTPSSPGAQRPTPYEWMRRSVAAGGGGGSGKTRTKD 110 120 130 140 150 180 190 200 210 220 230 pF1KB7 KYRVVYTDHQRLELEKEFHCNRYITIQRKSELAVNLGLSERQVKIWFQNRRAKERKMIKK ::::::::::::::::::: .:::::.::::::.::::.:::::::::::::::::. :: CCDS43 KYRVVYTDHQRLELEKEFHYSRYITIRRKSELAANLGLTERQVKIWFQNRRAKERKVNKK 160 170 180 190 200 210 240 250 260 270 280 pF1KB7 KISQFENSGGSVQSDSDSISPGELPNTFFTTPSAVRGFQPIEIQQVIVSE : .: CCDS43 KQQQQQPPQPPMAHDITATPAGPSLGGLCPSNTSLLATSSPMPVKEEFLP 220 230 240 250 260 >>CCDS9328.1 CDX2 gene_id:1045|Hs108|chr13 (313 aa) initn: 551 init1: 384 opt: 448 Z-score: 316.9 bits: 66.7 E(32554): 2.5e-11 Smith-Waterman score: 544; 40.9% identity (63.6% similar) in 286 aa overlap (1-266:1-279) 10 20 30 40 50 pF1KB7 MYGSCLLEKEAGMYPGTLMSPGGDGTAGTGGTGGGGSP-MPASNFAAAPAFSHYMGYPHM :: : ::.:...:::... :: . : . .. : . . . ::: : . . . CCDS93 MYVSYLLDKDVSMYPSSVRHSGGLNLAPQNFVSPPQYPDYGGYHVAAAAAAAANLDSAQS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB7 PSMDPHWPSLGVWGSPYSPPREDWSVY-PGPSSTMGTVPVNDVTS-SPAA---FCS-TDY :. : ::. ..:.: ::::. : :: ... ... .. ... :::: . : .:: CCDS93 PG--PSWPA--AYGAPL---REDWNGYAPGGAAAAANAVAHGLNGGSPAAAMGYSSPADY 70 80 90 100 110 120 130 140 150 160 pF1KB7 S-NLGPVGGGTSGSSLPGQAGG---SLVPTDAG-----AAKASSPSRSRHSPYAWMRKTV . : .. :. :.: .: : : ::. ::. .:.. :::: . CCDS93 HPHHHPHHHPHHPAAAPSCASGLLQTLNPGPPGPAATAAAEQLSPGGQRRNLCEWMRKPA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB7 QVT-G---KTRTKEKYRVVYTDHQRLELEKEFHCNRYITIQRKSELAVNLGLSERQVKIW : . : :::::.::::::::::::::::::: .:::::.::.:::..::::::::::: CCDS93 QQSLGSQVKTRTKDKYRVVYTDHQRLELEKEFHYSRYITIRRKAELAATLGLSERQVKIW 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB7 FQNRRAKERKMIKKKISQFENSGGSVQSDSDSISPGELPNTFFTTPSAVRGFQPIEIQQV ::::::::::. :::..: ... : :. . ..: CCDS93 FQNRRAKERKINKKKLQQQQQQQPPQPPPPPPQPPQPQPGPLRSVPEPLSPVSSLQASVP 240 250 260 270 280 290 pF1KB7 IVSE CCDS93 GSVPGVLGPTGGVLNPTVTQ 300 310 284 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:40:54 2016 done: Sun Nov 6 04:40:54 2016 Total Scan time: 2.550 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]