FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1699, 232 aa
  1>>>pF1KE1699 232 - 232 aa - 232 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3853+/-0.000631; mu= 14.1346+/- 0.038
 mean_var=64.6535+/-13.043, 0's: 0 Z-trim(111.4): 7 B-trim: 214 in 1/51
 Lambda= 0.159506
 statistics sampled from 12357 (12362) to 12357 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.745), E-opt: 0.2 (0.38), width: 16
 Scan time: 2.240

The best scores are:                               opt  bits E(32554)
CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3   ( 232) 1653 388.3 2.2e-108
CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3  ( 246)  578 141.0 6.8e-34
CCDS33911.1 RTP2 gene_id:344892|Hs108|chr3 ( 225)  380  95.4 3.3e-20
CCDS3287.2 RTP1 gene_id:132112|Hs108|chr3  ( 263)  357  90.1 1.5e-18
CCDS42843.1 RTP5 gene_id:285093|Hs108|chr2 ( 572)  325  82.9 4.7e-16

>>CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3 (232 aa)
 initn: 1653 init1: 1653 opt: 1653 Z-score: 2059.3 bits: 388.3 E(32554): 2.2e-108
Smith-Waterman score: 1653; 100.0% identity (100.0% similar) in 232 aa overlap (1-232:1-232)

               10        20        30        40        50        60
pF1KE1 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLLPNVLKPGWMQYQQWTFARFQCSSCSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLLPNVLKPGWMQYQQWTFARFQCSSCSR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 NWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRCKKCPQPLFEDPEFTQENISRILKNLVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 NWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRCKKCPQPLFEDPEFTQENISRILKNLVF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 RILKKCYRGRFQLIEEVPMIKDISLEGPHNSDNCEACLQGFCAGPIQVTSLPPSQTPRVH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 RILKKCYRGRFQLIEEVPMIKDISLEGPHNSDNCEACLQGFCAGPIQVTSLPPSQTPRVH
              130       140       150       160       170       180

              190       200       210       220       230
pF1KE1 SIYKVEEVVKPWASGENVYSYACQNHICRNLSIFCCCVILIVIVVIVVKTAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 SIYKVEEVVKPWASGENVYSYACQNHICRNLSIFCCCVILIVIVVIVVKTAI
              190       200       210       220       230

>>CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 (246 aa)
 initn: 603 init1: 550 opt: 578 Z-score: 721.9 bits: 141.0 E(32554): 6.8e-34
Smith-Waterman score: 590; 39.0% identity (66.0% similar) in 241 aa overlap (1-229:1-241)

10 20 30 40 50 60
pF1KE1 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLLPNVLKPGWMQYQQWTFARFQCSSCSR
:. : .:.: ::::..:.:: :::. : .: . : :: :::: .:. :.::::.:
CCDS33 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDCLAQGWKQYQQRAFGWFRCSSCQR
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE1 NWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRCKKCPQPLFEDPEFTQENISRILKNLVF
.:::::: .: : : . :.:::.::.: :::.:: .: :::.... :::.:::
CCDS33 SWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKCSWSQYEMPEFSSDSTMRILSNLVQ
70 80 90 100 110 120

130 140 150 160 170
pF1KE1 RILKKCYRGRFQLIEEVPMIKDISLEGPHNSDNCEACLQGFCAGPIQVTSLPPSQT----
.:::: : . . :.:.: ..:::: :.. ::::: :.:. .. ::..
CCDS33 HILKKYYGNGTRKSPEMPVILEVSLEGSHDTANCEACTLGICGQGLKSCMTKPSKSLLPH
130 140 150 160 170 180

180 190 200 210 220
pF1KE1 -------PRVHSIYKVEEVVKPWASGENVYSYACQN-HICRNLSIFCCCVILIVIVVIVV
: . ..: .... . : .... . . .. :. . . ::.....: :::
CCDS33 LKTGNSSPGIGAVYLANQAKNQSAEAKEAKGSGYEKLGPSRDPDPLNICVFILLLVFIVV
190 200 210 220 230 240

230
pF1KE1 KTAI
:
CCDS33 KCFTSE

>>CCDS33911.1 RTP2 gene_id:344892|Hs108|chr3 (225 aa)
 initn: 387 init1: 216 opt: 380 Z-score: 476.3 bits: 95.4 E(32554): 3.3e-20
Smith-Waterman score: 380; 37.6% identity (63.7% similar) in 157 aa overlap (8-160:10-161)

10 20 30 40 50
pF1KE1 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLLPNVLKPGWMQY-QQWTFARFQCSS
::..: : :. .:: : : : .: :. : ::: :: .: . .::.::
CCDS33 MCTSLTTCEWKKVFYEKMEVAKPADSWELIIDPNLKPSELAPGWKQYLEQHASGRFHCSW
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 CSRNWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRCKKCPQPLFEDPEFTQENISRILKN
: ..: ::.:..:::: .. . :.:.:::: : : .: ... . .::: .. :
CCDS33 CWHTWQSAHVVILFHMFLDRAQRAGSVRMRVFKQLCYECGTARLDESSMLEENIEGLVDN
70 80 90 100 110 120

120 130 140 150 160 170
pF1KE1 LVFRILKKCYR---GRFQLIEEVPMIKDISLEGPHNSDNCEACLQGFCAGPIQVTSLPPS
:. . ..::. :.... .: : ::: .. :::: .:
CCDS33 LITSLREQCYEEDGGQYRI--HVASRPD---SGPHRAEFCEACQEGIVHWKPSEKLLEEE
130 140 150 160 170

180 190 200 210 220 230
pF1KE1 QTPRVHSIYKVEEVVKPWASGENVYSYACQNHICRNLSIFCCCVILIVIVVIVVKTAI
CCDS33 VTTYTSEASKPRAQAGSGYNFLSLRWCLFWASLCLLVVYLQFSFLSPAFF
180 190 200 210 220

>>CCDS3287.2 RTP1 gene_id:132112|Hs108|chr3 (263 aa)
 initn: 369 init1: 208 opt: 357 Z-score: 446.6 bits: 90.1 E(32554): 1.5e-18
Smith-Waterman score: 357; 37.7% identity (61.6% similar) in 159 aa overlap (5-160:43-197)

10 20 30
pF1KE1 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLL
:. ::..: : :.:.:: : : : .:
CCDS32 ALHLPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPADSWDLIIDPNLK
20 30 40 50 60 70

40 50 60 70 80 90
pF1KE1 PNVLKPGWMQYQQW-TFARFQCSSCSRNWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRC
:::.::: :: . . .::.:: : ..: : :..:::: .. . :.:.:::: : :
CCDS32 HNVLSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRAGSVRMRVFKQLC
80 90 100 110 120 130

100 110 120 130 140 150
pF1KE1 KKCPQPLFEDPEFTQENISRILKNLVFRILKKCY--RGRFQLIEEVPMIKDISLEGPHNS
.: ... . .::: .. ::. . ..:: :: : .: .: . : .
CCDS32 YECGTARLDESSMLEENIEGLVDNLITSLREQCYGERGG-QYRIHVASRQD---NRRHRG
140 150 160 170 180

160 170 180 190 200 210
pF1KE1 DNCEACLQGFCAGPIQVTSLPPSQTPRVHSIYKVEEVVKPWASGENVYSYACQNHICRNL
. :::: .:
CCDS32 EFCEACQEGIVHWKPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIPWCLFWATV
190 200 210 220 230 240

>>CCDS42843.1 RTP5 gene_id:285093|Hs108|chr2 (572 aa)
 initn: 351 init1: 273 opt: 325 Z-score: 401.6 bits: 82.9 E(32554): 4.7e-16
Smith-Waterman score: 335; 37.3% identity (59.6% similar) in 161 aa overlap (2-162:4-152)

10 20 30 40 50
pF1KE1 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLLPNVLKPGWMQYQQWTFARFQCSSC
:: ...: . : : : :: :.: :...:.:. : : .:: ..:.::. :
CCDS42 MDRAG-ADMWASTFTLAMAERKPQDVWVLLPEHSLVPGCLDGGGVQYLLVGLSRLQCGHC
10 20 30 40 50

60 70 80 90 100 110
pF1KE1 SRNWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRCKKCPQPLFEDPEFTQENISRILKNL
.: ::.: ::::. :.. . :: ::::.. :::. :: : : . . . .:. :
CCDS42 PGTWDSAHVHVLFHLWWDRASHRGLVKMRIWGQRCRLCPAP--GDCQVRPPGEQPFLSRL
60 70 80 90 100 110

120 130 140 150 160 170
pF1KE1 VFRILKKCYRGRFQLIEEVPMIKDISLEGPHNSDNCEACLQGFCAGPIQVTSLPPSQTPR
:..::. :: : . : .. . :: :::: : :
CCDS42 VLHILQDCY-GDGPGPARHP--RE-AYEG-----CCEACELGVCFLQKAPDPAWSANATK
120 130 140 150 160

180 190 200 210 220 230
pF1KE1 VHSIYKVEEVVKPWASGENVYSYACQNHICRNLSIFCCCVILIVIVVIVVKTAI
CCDS42 GNFPATAWGGTGTVSRGKPLSTPGDDLGKGGVVIAIPFSLVGTSNDQVPIAEGPAPPAGA
170 180 190 200 210 220

232 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 17:51:24 2016 done: Sun Nov 6 17:51:24 2016
 Total Scan time: 2.240 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]