FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4083, 435 aa 1>>>pF1KE4083 435 - 435 aa - 435 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.8853+/-0.00101; mu= 10.5230+/- 0.060 mean_var=150.5295+/-32.336, 0's: 0 Z-trim(108.9): 106 B-trim: 0 in 0/48 Lambda= 0.104535 statistics sampled from 10447 (10553) to 10447 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.688), E-opt: 0.2 (0.324), width: 16 Scan time: 2.900 The best scores are: opt bits E(32554) CCDS11622.2 RNFT1 gene_id:51136|Hs108|chr17 ( 435) 2981 461.7 6.5e-130 CCDS44987.1 RNFT2 gene_id:84900|Hs108|chr12 ( 444) 1087 176.0 6.4e-44 CCDS9180.2 RNFT2 gene_id:84900|Hs108|chr12 ( 420) 844 139.4 6.6e-33 >>CCDS11622.2 RNFT1 gene_id:51136|Hs108|chr17 (435 aa) initn: 2981 init1: 2981 opt: 2981 Z-score: 2445.8 bits: 461.7 E(32554): 6.5e-130 Smith-Waterman score: 2981; 100.0% identity (100.0% similar) in 435 aa overlap (1-435:1-435) 10 20 30 40 50 60 pF1KE4 MPLFLLSLPTPPSASGHERRQRPEAKTSGSEKKYLRAMQANRSQLHSPPGTGSSEDASTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MPLFLLSLPTPPSASGHERRQRPEAKTSGSEKKYLRAMQANRSQLHSPPGTGSSEDASTP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 QCVHTRLTGEGSCPHSGDVHIQINSIPKECAENASSRNIRSGVHSCAHGCVHSRLRGHSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QCVHTRLTGEGSCPHSGDVHIQINSIPKECAENASSRNIRSGVHSCAHGCVHSRLRGHSH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 SEARLTDDTAAESGDHGSSSFSEFRYLFKWLQKSLPYILILSVKLVMQHITGISLGIGLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 SEARLTDDTAAESGDHGSSSFSEFRYLFKWLQKSLPYILILSVKLVMQHITGISLGIGLL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 TTFMYANKSIVNQVFLRERSSKIQCAWLLVFLAGSSVLLYYTFHSQSLYYSLIFLNPTLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TTFMYANKSIVNQVFLRERSSKIQCAWLLVFLAGSSVLLYYTFHSQSLYYSLIFLNPTLD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 HLSFWEVFWIVGITDFILKFFFMGLKCLILLVPSFIMPFKSKGYWYMLLEELCQYYRTFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 HLSFWEVFWIVGITDFILKFFFMGLKCLILLVPSFIMPFKSKGYWYMLLEELCQYYRTFV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 PIPVWFRYLISYGEFGNVTRWSLGILLALLYLILKLLEFFGHLRTFRQVLRIFFTQPSYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PIPVWFRYLISYGEFGNVTRWSLGILLALLYLILKLLEFFGHLRTFRQVLRIFFTQPSYG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 VAASKRQCSDVDDICSICQAEFQKPILLICQHIFCEECMTLWFNREKTCPLCRTVISDHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 VAASKRQCSDVDDICSICQAEFQKPILLICQHIFCEECMTLWFNREKTCPLCRTVISDHI 370 380 390 400 410 420 430 pF1KE4 NKWKDGATSSHLQIY ::::::::::::::: CCDS11 NKWKDGATSSHLQIY 430 >>CCDS44987.1 RNFT2 gene_id:84900|Hs108|chr12 (444 aa) initn: 1092 init1: 595 opt: 1087 Z-score: 902.0 bits: 176.0 E(32554): 6.4e-44 Smith-Waterman score: 1087; 39.8% identity (71.9% similar) in 420 aa overlap (20-435:28-444) 10 20 30 40 50 pF1KE4 MPLFLLSLPTPPSASGHERRQRPEAKTSGSEKKYLRAMQANRSQLHSPPGTG :.: .: .: . .... ... :::. CCDS44 MWLFTVNQVLRKMQRRHSSNTDNIPPERNRSQALSSEASVDEGGVFESLKAEAASPPALF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 SSEDASTPQCVHTRLTGEGSCPHSGDVHIQINSIPKECAE--NASSRNIRSGVHSCAHGC :. ..: : :: .::: ::. . .: . .... . :. : :: CCDS44 SGLSGSLPTSSFPSSLVLGSSAGGGDVFIQMPASREEGGGRGEGGAYHHRQPHHHFHHGG 70 80 90 100 110 120 120 130 140 150 160 pF1KE4 VHS-RLRGHSHSEAR-LTDDTAAESGDHGSSSFSEFRYLFKWLQKSLPYILILSVKLVMQ .. : : .. : ... . :. . ..::.. .. ::::.::.:::: .:: .: CCDS44 HRGGSLLQHVGGDHRGHSEEGGDEQPGTPAPALSELKAVICWLQKGLPFILILLAKLCFQ 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 HITGISLGIGLLTTFMYANKSIVNQVFLRERSSKIQCAWLLVFLAGSSVLLYYTFHSQSL : ::.. ::. .:: :::... .:: :.:. : . :.:.::::... . ::: ::.: CCDS44 HKLGIAVCIGMASTFAYANSTLREQVSLKEKRSVLVILWILAFLAGNTLYVLYTFSSQQL 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 YYSLIFLNPTLDHLSFWEVFWIVGITDFILKFFFMGLKCLILLVPSFIMPFKSKGYWYML : :::::.:.:. :.:....:::::.::.::.. ..:::::. .:..:. :::: .:.. CCDS44 YNSLIFLKPNLEMLDFFDLLWIVGIADFVLKYITIALKCLIVALPKIILAVKSKGKFYLV 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 LEELCQYYRTFVPIPVWFRYLISYGEFGNVTRWSLGILLALLYLILKLLEFFGHLRTFRQ .::: : .:..::: .:..:.. :. . . . :: .: .:: . : ... :.. :. CCDS44 IEELSQLFRSLVPIQLWYKYIM--GD-DSSNSYFLGGVLIVLYSLCKSFDICGRVGGVRK 310 320 330 340 350 350 360 370 380 390 400 pF1KE4 VLRIFFTQPSYGVAASKRQCSDVDDICSICQAEFQKPILLICQHIFCEECMTLWFNREKT .:... :. .::: :. .::... :::.::::::..:..:.:::.:::::. ::..::.: CCDS44 ALKLLCTSQNYGVRATGQQCTEAGDICAICQAEFREPLILLCQHVFCEECLCLWLDRERT 360 370 380 390 400 410 410 420 430 pF1KE4 CPLCRTVISDHINKWKDGATSSHLQIY :::::.: : . :::::::.:.:.: CCDS44 CPLCRSVAVDTLRCWKDGATSAHFQVY 420 430 440 >>CCDS9180.2 RNFT2 gene_id:84900|Hs108|chr12 (420 aa) initn: 849 init1: 595 opt: 844 Z-score: 704.2 bits: 139.4 E(32554): 6.6e-33 Smith-Waterman score: 844; 36.8% identity (70.0% similar) in 380 aa overlap (20-394:28-403) 10 20 30 40 50 pF1KE4 MPLFLLSLPTPPSASGHERRQRPEAKTSGSEKKYLRAMQANRSQLHSPPGTG :.: .: .: . .... ... :::. CCDS91 MWLFTVNQVLRKMQRRHSSNTDNIPPERNRSQALSSEASVDEGGVFESLKAEAASPPALF 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 SSEDASTPQCVHTRLTGEGSCPHSGDVHIQINSIPKECAE--NASSRNIRSGVHSCAHGC :. ..: : :: .::: ::. . .: . .... . :. : :: CCDS91 SGLSGSLPTSSFPSSLVLGSSAGGGDVFIQMPASREEGGGRGEGGAYHHRQPHHHFHHGG 70 80 90 100 110 120 120 130 140 150 160 pF1KE4 VH--SRLRGHSHSEAR-LTDDTAAESGDHGSSSFSEFRYLFKWLQKSLPYILILSVKLVM : . : : .. : ... . :. . ..::.. .. ::::.::.:::: .:: . CCDS91 -HRGGSLLQHVGGDHRGHSEEGGDEQPGTPAPALSELKAVICWLQKGLPFILILLAKLCF 130 140 150 160 170 170 180 190 200 210 220 pF1KE4 QHITGISLGIGLLTTFMYANKSIVNQVFLRERSSKIQCAWLLVFLAGSSVLLYYTFHSQS :: ::.. ::. .:: :::... .:: :.:. : . :.:.::::... . ::: ::. CCDS91 QHKLGIAVCIGMASTFAYANSTLREQVSLKEKRSVLVILWILAFLAGNTLYVLYTFSSQQ 180 190 200 210 220 230 230 240 250 260 270 280 pF1KE4 LYYSLIFLNPTLDHLSFWEVFWIVGITDFILKFFFMGLKCLILLVPSFIMPFKSKGYWYM :: :::::.:.:. :.:....:::::.::.::.. ..:::::. .:..:. :::: .:. CCDS91 LYNSLIFLKPNLEMLDFFDLLWIVGIADFVLKYITIALKCLIVALPKIILAVKSKGKFYL 240 250 260 270 280 290 290 300 310 320 330 340 pF1KE4 LLEELCQYYRTFVPIPVWFRYLISYGEFGNVTRWSLGILLALLYLILKLLEFFGHLRTFR ..::: : .:..::: .:..:.. :. . . . :: .: .:: . : ... :.. : CCDS91 VIEELSQLFRSLVPIQLWYKYIM--GD-DSSNSYFLGGVLIVLYSLCKSFDICGRVGGVR 300 310 320 330 340 350 350 360 370 380 390 400 pF1KE4 QVLRIFFTQPSYGVAASKRQCSDVDDICSICQAEFQKPILLICQHIFCEECMTLWFNREK ..:... :. .::: :. .::... :::.::::::..:..:.:: .. CCDS91 KALKLLCTSQNYGVRATGQQCTEAGDICAICQAEFREPLILLCQMLLKGHKKLELEKIDE 360 370 380 390 400 410 410 420 430 pF1KE4 TCPLCRTVISDHINKWKDGATSSHLQIY CCDS91 SAGV 420 435 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 03:58:15 2016 done: Sun Nov 6 03:58:16 2016 Total Scan time: 2.900 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]