FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1844, 361 aa
  1>>>pF1KE1844 361 - 361 aa - 361 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4335+/-0.000906; mu= 13.2599+/- 0.055
 mean_var=136.1372+/-26.529, 0's: 0 Z-trim(110.9): 23 B-trim: 0 in 0/51
 Lambda= 0.109922
 statistics sampled from 11967 (11973) to 11967 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.368), width: 16
 Scan time: 3.070

The best scores are:                                  opt  bits E(32554)
CCDS53308.1 EBNA1BP2 gene_id:10969|Hs108|chr1  ( 361) 2404 392.4 3.2e-109
CCDS478.1 EBNA1BP2 gene_id:10969|Hs108|chr1    ( 306) 2020 331.4 6.1e-91

>>CCDS53308.1 EBNA1BP2 gene_id:10969|Hs108|chr1 (361 aa)
 initn: 2404 init1: 2404 opt: 2404 Z-score: 2074.3 bits: 392.4 E(32554): 3.2e-109
Smith-Waterman score: 2404; 99.7% identity (100.0% similar) in 361 aa overlap (1-361:1-361)

               10        20        30        40        50        60
pF1KE1 MYPEALPVGILSNPDTFKRRSGSYSNDKPEVWFAAGSGSPNQKLSSSCVGRACGEMDTPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MYPEALPVGILSNPDTFKRRSGSYSNDKPEVWFAAGSGSPNQKLSSSCVGRACGEMDTPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 LSDSESESDESLVTDRELQDAFSRGLLKPGLNVVLEGPKKAVNDVNGLKQCLAEFKRDLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 LSDSESESDESLVTDRELQDAFSRGLLKPGLNVVLEGPKKAVNDVNGLKQCLAEFKRDLE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 WVERLDVTLGPVPEIGGSEAPAPQNKDQKAVDPEDDFQREMSFYRQAQAAVLAVLPRLHQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 WVERLDVTLGPVPEIGGSEAPAPQNKDQKAVDPEDDFQREMSFYRQAQAAVLAVLPRLHQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 LKVPTKRPTDYFAEMAKSDLQMQKIRQKLQTKQAAMERSEKAKQLRALRKYGKKVQTEVL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 LKVPTKRPTDYFAEMAKSDLQMQKIRQKLQTKQAAMERSEKAKQLRALRKYGKKVQTEVL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 QKRQQEKAHMMNAIKKYQKGFSDKLDFLEGDQKPLAQHKKAGAKGQQMRKGPSAKRRYKN
       :::::::::::::::::::::::::::::::::::::.::::::::::::::::::::::
CCDS53 QKRQQEKAHMMNAIKKYQKGFSDKLDFLEGDQKPLAQRKKAGAKGQQMRKGPSAKRRYKN
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE1 QKFGFGGKKKGSKWNTRESYDDVSSFRAKTAHGRGLKRPGKKGSNKRPGKRTREKMKNRT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 QKFGFGGKKKGSKWNTRESYDDVSSFRAKTAHGRGLKRPGKKGSNKRPGKRTREKMKNRT
              310       320       330       340       350       360

pF1KE1 H
       :
CCDS53 H

>>CCDS478.1 EBNA1BP2 gene_id:10969|Hs108|chr1 (306 aa)
 initn: 2020 init1: 2020 opt: 2020 Z-score: 1746.1 bits: 331.4 E(32554): 6.1e-91
Smith-Waterman score: 2020; 99.7% identity (100.0% similar) in 306 aa overlap (56-361:1-306)

          30        40        50        60        70        80
pF1KE1 NDKPEVWFAAGSGSPNQKLSSSCVGRACGEMDTPPLSDSESESDESLVTDRELQDAFSRG
                                     ::::::::::::::::::::::::::::::
CCDS47                               MDTPPLSDSESESDESLVTDRELQDAFSRG
                                             10        20        30

          90       100       110       120       130       140
pF1KE1 LLKPGLNVVLEGPKKAVNDVNGLKQCLAEFKRDLEWVERLDVTLGPVPEIGGSEAPAPQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LLKPGLNVVLEGPKKAVNDVNGLKQCLAEFKRDLEWVERLDVTLGPVPEIGGSEAPAPQN
               40        50        60        70        80        90

         150       160       170       180       190       200
pF1KE1 KDQKAVDPEDDFQREMSFYRQAQAAVLAVLPRLHQLKVPTKRPTDYFAEMAKSDLQMQKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KDQKAVDPEDDFQREMSFYRQAQAAVLAVLPRLHQLKVPTKRPTDYFAEMAKSDLQMQKI
              100       110       120       130       140       150

         210       220       230       240       250       260
pF1KE1 RQKLQTKQAAMERSEKAKQLRALRKYGKKVQTEVLQKRQQEKAHMMNAIKKYQKGFSDKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RQKLQTKQAAMERSEKAKQLRALRKYGKKVQTEVLQKRQQEKAHMMNAIKKYQKGFSDKL
              160       170       180       190       200       210

         270       280       290       300       310       320
pF1KE1 DFLEGDQKPLAQHKKAGAKGQQMRKGPSAKRRYKNQKFGFGGKKKGSKWNTRESYDDVSS
       ::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DFLEGDQKPLAQRKKAGAKGQQMRKGPSAKRRYKNQKFGFGGKKKGSKWNTRESYDDVSS
              220       230       240       250       260       270

         330       340       350       360
pF1KE1 FRAKTAHGRGLKRPGKKGSNKRPGKRTREKMKNRTH
       ::::::::::::::::::::::::::::::::::::
CCDS47 FRAKTAHGRGLKRPGKKGSNKRPGKRTREKMKNRTH
              280       290       300

361 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 21:30:06 2016 done: Sun Nov 6 21:30:07 2016
 Total Scan time: 3.070 Total Display time: -0.040

Function used was FASTA [36.3.4 Apr, 2011]