FASTA searches a protein or DNA sequence data bank
36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1700, 246 aa
1>>>pF1KE1700 246 - 246 aa - 246 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4623+/-0.000742; mu= 13.9988+/- 0.045
mean_var=67.6257+/-13.569, 0's: 0 Z-trim(109.1): 17 B-trim: 174 in 2/48
Lambda= 0.155962
statistics sampled from 10679 (10687) to 10679 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.721), E-opt: 0.2 (0.328), width: 16
Scan time: 2.270

The best scores are: opt bits E(32554)
CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 ( 246) 1700 390.9 4.1e-109
CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3 ( 232) 579 138.7 3.3e-33
CCDS33911.1 RTP2 gene_id:344892|Hs108|chr3 ( 225) 319 80.2 1.3e-15
CCDS3287.2 RTP1 gene_id:132112|Hs108|chr3 ( 263) 315 79.3 2.8e-15
CCDS42843.1 RTP5 gene_id:285093|Hs108|chr2 ( 572) 269 69.1 7.2e-12

>>CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 (246 aa)
initn: 1700 init1: 1700 opt: 1700 Z-score: 2072.3 bits: 390.9 E(32554): 4.1e-109
Smith-Waterman score: 1700; 98.8% identity (98.8% similar) in 246 aa overlap (1-246:1-246)

10 20 30 40 50 60
pF1KE1 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDCLAQGWKQYQQRAFGWFRCSSCQR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDCLAQGWKQYQQRAFGWFRCSSCQR
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE1 SWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKCSWSQYEMPEFSSDSTMRILSNLVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 SWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKCSWSQYEMPEFSSDSTMRILSNLVQ
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE1 HILKKYYGNGMRKSPEMPVILEVSLEGSHDTANCEACTLGICGQGLKSYMTKPSKSLLPH
:::::::::: ::::::::::::::::::::::::::::::::::::: :::::::::::
CCDS33 HILKKYYGNGTRKSPEMPVILEVSLEGSHDTANCEACTLGICGQGLKSCMTKPSKSLLPH
130 140 150 160 170 180

190 200 210 220 230 240
pF1KE1 LKTGNSSPGIGAVYLANQAKNQSDEAKEAKGSGYEKLGPSRDPDPLNICVFILLLVFIVV
::::::::::::::::::::::: ::::::::::::::::::::::::::::::::::::
CCDS33 LKTGNSSPGIGAVYLANQAKNQSAEAKEAKGSGYEKLGPSRDPDPLNICVFILLLVFIVV
190 200 210 220 230 240

pF1KE1 KCFTSE
::::::
CCDS33 KCFTSE

>>CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3 (232 aa)
initn: 604 init1: 551 opt: 579 Z-score: 709.6 bits: 138.7 E(32554): 3.3e-33
Smith-Waterman score: 584; 39.4% identity (64.3% similar) in 241 aa overlap (1-241:1-229)

10 20 30 40 50 60
pF1KE1 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDCLAQGWKQYQQRAFGWFRCSSCQR
:. : .:.: ::::..:.:: :::. : .: . : :: :::: .:. :.::::.:
CCDS27 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLLPNVLKPGWMQYQQWTFARFQCSSCSR
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE1 SWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKCSWSQYEMPEFSSDSTMRILSNLVQ
.:::::: .: : : . :.:::.::.: :::.:: .: :::.... :::.:::
CCDS27 NWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRCKKCPQPLFEDPEFTQENISRILKNLVF
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE1 HILKKYYGNGMRKSPEMPVILEVSLEGSHDTANCEACTLGICGQGLKSYMTKPSKSLLPH
.:::: : . .. :.:.: ..:::: :.. ::::: :.:. .. ::..
CCDS27 RILKKCYRGRFQLIEEVPMIKDISLEGPHNSDNCEACLQGFCAGPIQVTSLPPSQT----
130 140 150 160 170

190 200 210 220 230 240
pF1KE1 LKTGNSSPGIGAVYLANQAKNQSDEAKEAKGSGYEKLGPSRDPDPLNICVFILLLVFIVV
: . ..: .... . . : : . :. . . ::.....: :::
CCDS27 -------PRVHSIYKVEEVV-KPWASGENVYSYACQNHICRNLSIFCCCVILIVIVVIVV
180 190 200 210 220

pF1KE1 KCFTSE
:
CCDS27 KTAI
230

>>CCDS33911.1 RTP2 gene_id:344892|Hs108|chr3 (225 aa)
initn: 329 init1: 154 opt: 319 Z-score: 393.6 bits: 80.2 E(32554): 1.3e-15
Smith-Waterman score: 330; 31.2% identity (53.2% similar) in 237 aa overlap (8-240:10-213)

10 20 30 40 50
pF1KE1 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDCLAQGWKQY-QQRAFGWFRCSS
:...: : .. ::: .: : .: ::. . :: ::::: .:.: : :.::
CCDS33 MCTSLTTCEWKKVFYEKMEVAKPADSWELIIDPNLKPSELAPGWKQYLEQHASGRFHCSW
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 CQRSWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKCSWSQYEMPEFSSDSTMRILSN
: ..: ::.: :: : . .. :.::::.: : : .:. .. . . .. ...:
CCDS33 CWHTWQSAHVVILFHMFLDRAQRAGSVRMRVFKQLCYECGTARLDESSMLEENIEGLVDN
70 80 90 100 110 120

120 130 140 150 160 170
pF1KE1 LVQHILKKYY---GNGMRKSPEMPVILEVSLEGSHDTANCEACTLGICGQGLKSYMTKPS
:. . .. : :. .: . : : . :::: :: :::
CCDS33 LITSLREQCYEEDGGQYRIH-----VASRPDSGPHRAEFCEACQEGIVHW-------KPS
130 140 150 160

180 190 200 210 220 230
pF1KE1 KSLLPHLKTGNSSPGIGAVYLANQAKNQSDEAKEAKGSGYEKLGPSRDPDPLNICVFILL
..:: . : .: :.. . :. ::::. :. : :.:
CCDS33 EKLLEEEVTTYTSE-------ASKPRAQA-------GSGYNFLS-------LRWCLFWAS
170 180 190 200

240
pF1KE1 LVFIVVKCFTSE
: ..::
CCDS33 LCLLVVYLQFSFLSPAFF
210 220

>>CCDS3287.2 RTP1 gene_id:132112|Hs108|chr3 (263 aa)
initn: 347 init1: 169 opt: 315 Z-score: 387.7 bits: 79.3 E(32554): 2.8e-15
Smith-Waterman score: 319; 32.6% identity (57.0% similar) in 172 aa overlap (8-178:46-208)

10 20 30
pF1KE1 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDC
:...: : ..:::: .: : .: ::. .
CCDS32 LPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPADSWDLIIDPNLKHNV
20 30 40 50 60 70

40 50 60 70 80 90
pF1KE1 LAQGWKQYQQ-RAFGWFRCSSCQRSWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKC
:. ::::: . .: : :.:: : ..: : : :: : . .. :.::::.: : : .:
CCDS32 LSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRAGSVRMRVFKQLCYEC
80 90 100 110 120 130

100 110 120 130 140 150
pF1KE1 SWSQYEMPEFSSDSTMRILSNLVQHILKKYYGNGMRKSPEMPVILEVSLEGSHDTANCEA
. .. . . .. ...::. . .. ::. : . . . . : :::
CCDS32 GTARLDESSMLEENIEGLVDNLITSLREQCYGE--RGGQYRIHVASRQDNRRHRGEFCEA
140 150 160 170 180 190

160 170 180 190 200 210
pF1KE1 CTLGICGQGLKSYMTKPSKSLLPHLKTGNSSPGIGAVYLANQAKNQSDEAKEAKGSGYEK
: :: :::..::
CCDS32 CQEGIVHW-------KPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIPWCLFWA
200 210 220 230 240

>>CCDS42843.1 RTP5 gene_id:285093|Hs108|chr2 (572 aa)
initn: 348 init1: 226 opt: 269 Z-score: 326.6 bits: 69.1 E(32554): 7.2e-12
Smith-Waterman score: 301; 34.6% identity (57.9% similar) in 159 aa overlap (8-162:9-152)

10 20 30 40 50
pF1KE1 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDCLAQGWKQYQQRAFGWFRCSSCQ
: .:: . : ::. .:.: . .: :: : :: ... ..:. :
CCDS42 MDRAGADMWASTFTLAMAERKPQDVWVLLPEHSLVPGCLDGGGVQYLLVGLSRLQCGHCP
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 RSWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKCSWS---QYEMPEFSSDSTMRILS
.: ::.:..: : .:.. . .: :.::..::::. : : . : . . .::
CCDS42 GTWDSAHVHVLFHLWWDRASHRGLVKMRIWGQRCRLCPAPGDCQVRPP-----GEQPFLS
70 80 90 100 110

120 130 140 150 160 170
pF1KE1 NLVQHILKKYYGNGMRKSP-EMPVILEVSLEGSHDTANCEACTLGICGQGLKSYMTKPSK
:: :::. ::.: .: . : . . :: :::: ::.:
CCDS42 RLVLHILQDCYGDG--PGPARHP---REAYEGC-----CEACELGVCFLQKAPDPAWSAN
120 130 140 150 160

180 190 200 210 220 230
pF1KE1 SLLPHLKTGNSSPGIGAVYLANQAKNQSDEAKEAKGSGYEKLGPSRDPDPLNICVFILLL
CCDS42 ATKGNFPATAWGGTGTVSRGKPLSTPGDDLGKGGVVIAIPFSLVGTSNDQVPIAEGPAPP
170 180 190 200 210 220

246 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 01:35:19 2016 done: Mon Nov 7 01:35:20 2016
Total Scan time: 2.270 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]
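
The "The best scores are:" table in this report can be pulled into structured records with a short script. The following is a minimal sketch, not part of the FASTA package: it assumes the report text keeps one hit per row, as in the native FASTA36 output, and the names `HIT_ROW`, `parse_best_scores`, and the dictionary keys are hypothetical choices made for illustration.

```python
import re

# Hypothetical pattern for one row of the "The best scores are:" table:
#   <accession> <description> ( <length>) <opt> <bits> <E-value>
HIT_ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_best_scores(report_text):
    """Return a list of dicts, one per row of the best-scores table."""
    hits = []
    in_table = False
    for line in report_text.splitlines():
        if line.startswith("The best scores are:"):
            in_table = True  # rows start on the next line
            continue
        if not in_table:
            continue
        if line.startswith(">>"):
            break  # first alignment header ends the table
        match = HIT_ROW.match(line.strip())
        if match:
            hits.append({
                "accession": match.group("acc"),
                "description": match.group("desc"),
                "length": int(match.group("length")),
                "opt": int(match.group("opt")),
                "bits": float(match.group("bits")),
                "evalue": float(match.group("evalue")),
            })
    return hits


# Two rows copied from the report above, used as sample input.
SAMPLE = """The best scores are: opt bits E(32554)
CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 ( 246) 1700 390.9 4.1e-109
CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3 ( 232) 579 138.7 3.3e-33
>>CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 (246 aa)
"""

if __name__ == "__main__":
    for hit in parse_best_scores(SAMPLE):
        print(hit["accession"], hit["opt"], hit["bits"], hit["evalue"])
```

Rows that do not match the pattern are skipped rather than raising an error, so a report with wrapped or truncated descriptions would need a looser pattern.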