FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1385, 125 aa 1>>>pF1KE1385 125 - 125 aa - 125 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.5829+/-0.000669; mu= 14.1401+/- 0.040 mean_var=51.3981+/-10.528, 0's: 0 Z-trim(108.2): 17 B-trim: 0 in 0/50 Lambda= 0.178896 statistics sampled from 10050 (10063) to 10050 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.702), E-opt: 0.2 (0.309), width: 16 Scan time: 1.370 The best scores are: opt bits E(32554) CCDS41584.1 IFITM1 gene_id:8519|Hs108|chr11 ( 125) 821 219.0 6.1e-58 CCDS41585.1 IFITM3 gene_id:10410|Hs108|chr11 ( 133) 569 154.0 2.4e-38 CCDS41583.1 IFITM2 gene_id:10581|Hs108|chr11 ( 132) 547 148.3 1.2e-36 CCDS31323.1 IFITM5 gene_id:387733|Hs108|chr11 ( 132) 270 76.8 4.1e-15 CCDS53593.2 IFITM10 gene_id:402778|Hs108|chr11 ( 228) 245 70.5 5.6e-13 >>CCDS41584.1 IFITM1 gene_id:8519|Hs108|chr11 (125 aa) initn: 821 init1: 821 opt: 821 Z-score: 1153.7 bits: 219.0 E(32554): 6.1e-58 Smith-Waterman score: 821; 99.2% identity (99.2% similar) in 125 aa overlap (1-125:1-125) 10 20 30 40 50 60 pF1KE1 MHKEEHEVAVLGAPPSTILPRSTVINIHSETSVPDHVVWSLFNTLFLNWCCLGFIAFAYS :::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 MHKEEHEVAVLGPPPSTILPRSTVINIHSETSVPDHVVWSLFNTLFLNWCCLGFIAFAYS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 VKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILMTIGFILLLVFGSVTVYHIMLQIIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 VKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILMTIGFILLLVFGSVTVYHIMLQIIQ 70 80 90 100 110 120 pF1KE1 EKRGY ::::: CCDS41 EKRGY >>CCDS41585.1 IFITM3 gene_id:10410|Hs108|chr11 (133 aa) initn: 590 init1: 564 opt: 569 Z-score: 801.8 bits: 154.0 E(32554): 2.4e-38 Smith-Waterman score: 569; 83.3% identity (90.7% similar) in 108 aa overlap (1-106:22-129) 10 20 30 pF1KE1 MHKEEHEVAVLGAPPSTILPRSTVINIHSETSVPDHVVW : :::::::::::: . : ::::.:.::::::::::: CCDS41 MNHTVQTFFSPVNSGQPPNYEMLKEEHEVAVLGAPHNPAPPTSTVIHIRSETSVPDHVVW 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE1 SLFNTLFLNWCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILMTIG :::::::.: ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS41 SLFNTLFMNPCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILMTIL 70 80 90 100 110 120 100 110 120 pF1KE1 FILL--LVFGSVTVYHIMLQIIQEKRGY .:.. :.: CCDS41 LIVIPVLIFQAYG 130 >>CCDS41583.1 IFITM2 gene_id:10581|Hs108|chr11 (132 aa) initn: 564 init1: 543 opt: 547 Z-score: 771.2 bits: 148.3 E(32554): 1.2e-36 Smith-Waterman score: 547; 80.2% identity (91.5% similar) in 106 aa overlap (1-106:21-126) 10 20 30 40 pF1KE1 MHKEEHEVAVLGAPPSTILPRSTVINIHSETSVPDHVVWS : :::.:::.::.: . : ::::.:.:::::::::::: CCDS41 MNHIVQTFSPVNSGQPPNYEMLKEEQEVAMLGVPHNPAPPMSTVIHIRSETSVPDHVVWS 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 LFNTLFLNWCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILMTIGF ::::::.: :::::::::::::::::::::::::::::::::::::::::::::.::: . CCDS41 LFNTLFMNTCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTAKCLNIWALILGIFMTILL 70 80 90 100 110 120 110 120 pF1KE1 ILLLVFGSVTVYHIMLQIIQEKRGY :.. :. CCDS41 IIIPVLVVQAQR 130 >>CCDS31323.1 IFITM5 gene_id:387733|Hs108|chr11 (132 aa) initn: 276 init1: 255 opt: 270 Z-score: 384.8 bits: 76.8 E(32554): 4.1e-15 Smith-Waterman score: 270; 42.1% identity (73.7% similar) in 95 aa overlap (13-102:12-105) 10 20 30 40 50 pF1KE1 MHKEEHEVAVLGAPPSTILPRSTVINIHSETSVP-DHVVWSLFNTLFLNWCCLGFIAFAY :: . :.... . : ::..::.:.::.:: :::::.:.:: CCDS31 MDTAYPREDTRAPTPSKAGAHTALTLGAPHPPPRDHLIWSVFSTLYLNLCCLGFLALAY 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SVKSRDRKMVGDVTGAQAYASTAKCLNI----WALILGILMTIGFILLLVFGSVTVYHIM :.:.::.:.:::. .:. ..: ::: :: :.:. .:. .:... CCDS31 SIKARDQKVVGDLEAARRFGSKAKCYNILAAMWTLVPPLLL-LGLVVTGALHLARLAKDS 60 70 80 90 100 110 120 pF1KE1 LQIIQEKRGY CCDS31 AAFFSTKFDDADYD 120 130 >>CCDS53593.2 IFITM10 gene_id:402778|Hs108|chr11 (228 aa) initn: 243 init1: 223 opt: 245 Z-score: 346.5 bits: 70.5 E(32554): 5.6e-13 Smith-Waterman score: 251; 40.7% identity (71.3% similar) in 108 aa overlap (8-106:116-220) 10 20 pF1KE1 MHKEEHEVAVLGAPPS--------TILPRSTVINIHS : . ::::. :. .:::... CCDS53 PAAPAPEPSASPPMAPTLFPMESKSSKTDSVRAAGAPPACKHLAEKKTMTNPTTVIEVYP 90 100 110 120 130 140 30 40 50 60 70 80 pF1KE1 ETS-VPDHVVWSLFNTLFLNWCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTAKCLNIW .:. : :. .::.:: ..::.:::::::.:::.: ::.:...:..:: :.::. .:: CCDS53 DTTEVNDYYLWSIFNFVYLNFCCLGFIALAYSLKVRDKKLLNDLNGAVEDAKTARLFNIT 150 160 170 180 190 200 90 100 110 120 pF1KE1 ALILGILMTIGFILLLVFGSVTVYHIMLQIIQEKRGY . . : . .::...: CCDS53 S---SALAASCIILVFIFLRYPLTDY 210 220 125 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 02:59:26 2016 done: Mon Nov 7 02:59:26 2016 Total Scan time: 1.370 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]