FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2733, 348 aa
  1>>>pF1KE2733 348 - 348 aa - 348 aa
Library: human.CCDS.faa
  18921897 residues in 33420 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.8889+/-0.000737; mu= 8.1490+/- 0.045
 mean_var=215.3707+/-42.226, 0's: 0 Z-trim(117.7): 85 B-trim: 51 in 1/54
 Lambda= 0.087394
 statistics sampled from 18640 (18729) to 18640 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.844), E-opt: 0.2 (0.56), width: 16
 Scan time: 1.520

The best scores are:                                       opt  bits E(33420)
CCDS13706.1 AIRE gene_id:326|Hs109|chr21           ( 545) 1881 249.1 6.7e-66

>>CCDS13706.1 AIRE gene_id:326|Hs109|chr21 (545 aa)
 initn: 1851 init1: 1851 opt: 1881 Z-score: 1296.8 bits: 249.1 E(33420): 6.7e-66
Smith-Waterman score: 1881; 83.5% identity (89.8% similar) in 322 aa overlap (29-348:227-545)

                 10        20        30        40        50
pF1KE2   MWLVYSSGAPGTQQPARNRVFFPIGMAPGGVCWRPDGWGTGGQGRISGPGSMGAGQRL
                                     ::  . :. .  .:.:. .. .: :    .
CCDS13 GDVPGARGAVEGILIQQVFESGGSKKCIQVGGEFYTPSKFEDSGSGKNKARSSSGPKPLV
        200       210       220       230       240       250

       60        70         80        90        100       110
pF1KE2 GSSGTQRCCWGSCFGKEVAL-RRVLHPSPVCMGVS-CLCQKNEDECAVCRDGGELICCDG
        ..:.:    :   : :. : ..   :.:. .  .  : :::::::::::::::::::::
CCDS13 RAKGAQGAAPG---GGEARLGQQGSVPAPLALPSDPQLHQKNEDECAVCRDGGELICCDG
        260          270       280       290       300       310

        120       130       140       150       160       170
pF1KE2 CPRAFHLACLSPPLREIPSGTWRCSSCLQATVQEVQPRAEEPRPQEPPVETPLPPGLRSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 CPRAFHLACLSPPLREIPSGTWRCSSCLQATVQEVQPRAEEPRPQEPPVETPLPPGLRSA
           320       330       340       350       360       370

        180       190       200       210       220       230
pF1KE2 GEEVRGPPGEPLAGMDTTLVYKHLPAPPSAAPLPGLDSSALHPLLCVGPEGQQNLAPGAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GEEVRGPPGEPLAGMDTTLVYKHLPAPPSAAPLPGLDSSALHPLLCVGPEGQQNLAPGAR
           380       390       400       410       420       430

        240       250       260       270       280       290
pF1KE2 CGVCGDGTDVLRCTHCAAAFHWRCHFPAGTSRPGTGLRCRSCSGDVTPAPVEGVLAPSPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 CGVCGDGTDVLRCTHCAAAFHWRCHFPAGTSRPGTGLRCRSCSGDVTPAPVEGVLAPSPA
           440       450       460       470       480       490

        300       310       320       330       340
pF1KE2 RLAPGPAKDDTASHEPALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 RLAPGPAKDDTASHEPALHRDDLESLLSEHTFDGILQWAIQSMARPAAPFPS
           500       510       520       530       540

348 residues in 1 query sequences
18921897 residues in 33420 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Sep 11 18:46:18 2018 done: Tue Sep 11 18:46:19 2018
 Total Scan time: 1.520 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]