FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1697, 263 aa 1>>>pF1KE1697 263 - 263 aa - 263 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6245+/-0.000669; mu= 13.2009+/- 0.040 mean_var=64.6368+/-12.993, 0's: 0 Z-trim(109.7): 11 B-trim: 0 in 0/54 Lambda= 0.159527 statistics sampled from 11051 (11058) to 11051 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.34), width: 16 Scan time: 2.400 The best scores are: opt bits E(32554) CCDS3287.2 RTP1 gene_id:132112|Hs108|chr3 ( 263) 1875 439.7 9.5e-124 CCDS33911.1 RTP2 gene_id:344892|Hs108|chr3 ( 225) 1284 303.7 7.2e-83 CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3 ( 232) 357 90.4 1.2e-18 CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 ( 246) 315 80.7 1.1e-15 CCDS42843.1 RTP5 gene_id:285093|Hs108|chr2 ( 572) 266 69.6 5.7e-12 >>CCDS3287.2 RTP1 gene_id:132112|Hs108|chr3 (263 aa) initn: 1875 init1: 1875 opt: 1875 Z-score: 2335.2 bits: 439.7 E(32554): 9.5e-124 Smith-Waterman score: 1875; 100.0% identity (100.0% similar) in 263 aa overlap (1-263:1-263) 10 20 30 40 50 60 pF1KE1 MRIFRPWRLRCPALHLPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 MRIFRPWRLRCPALHLPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 DSWDLIIDPNLKHNVLSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 DSWDLIIDPNLKHNVLSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 GSVRMRVFKQLCYECGTARLDESSMLEENIEGLVDNLITSLREQCYGERGGQYRIHVASR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 GSVRMRVFKQLCYECGTARLDESSMLEENIEGLVDNLITSLREQCYGERGGQYRIHVASR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 QDNRRHRGEFCEACQEGIVHWKPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 QDNRRHRGEFCEACQEGIVHWKPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIP 190 200 210 220 230 240 250 260 pF1KE1 WCLFWATVLLLIIYLQFSFRSSV ::::::::::::::::::::::: CCDS32 WCLFWATVLLLIIYLQFSFRSSV 250 260 >>CCDS33911.1 RTP2 gene_id:344892|Hs108|chr3 (225 aa) initn: 1273 init1: 1124 opt: 1284 Z-score: 1601.2 bits: 303.7 E(32554): 7.2e-83 Smith-Waterman score: 1284; 82.2% identity (90.2% similar) in 225 aa overlap (37-261:1-221) 10 20 30 40 50 60 pF1KE1 WRLRCPALHLPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPADSWDLI :: :.:: ::::::::::: :::::::.:: CCDS33 MCTSLTTCEWKKVFYEKMEVAKPADSWELI 10 20 30 70 80 90 100 110 120 pF1KE1 IDPNLKHNVLSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRAGSVRMR :::::: . :.:::::::: ::::::::::::::::: .::::::::::::::::::::: CCDS33 IDPNLKPSELAPGWKQYLEQHASGRFHCSWCWHTWQSAHVVILFHMFLDRAQRAGSVRMR 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 VFKQLCYECGTARLDESSMLEENIEGLVDNLITSLREQCYGERGGQYRIHVASRQDNRRH :::::::::::::::::::::::::::::::::::::::: : ::::::::::: :. : CCDS33 VFKQLCYECGTARLDESSMLEENIEGLVDNLITSLREQCYEEDGGQYRIHVASRPDSGPH 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE1 RGEFCEACQEGIVHWKPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIPWCLFWA :.:::::::::::::::::::::::.:::: :.: .: . :.:::.:: :. :::::: CCDS33 RAEFCEACQEGIVHWKPSEKLLEEEVTTYT-SEASKP---RAQAGSGYNFLSLRWCLFWA 160 170 180 190 200 250 260 pF1KE1 TVLLLIIYLQFSFRSSV .. ::..:::::: : CCDS33 SLCLLVVYLQFSFLSPAFF 210 220 >>CCDS2740.1 RTP3 gene_id:83597|Hs108|chr3 (232 aa) initn: 369 init1: 208 opt: 357 Z-score: 447.9 bits: 90.4 E(32554): 1.2e-18 Smith-Waterman score: 357; 37.7% identity (61.6% similar) in 159 aa overlap (43-197:5-160) 20 30 40 50 60 70 pF1KE1 ALHLPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPADSWDLIIDPNLK :. ::..: : :.:.:: : : : .: CCDS27 MAGDTEVWKQMFQELMREVKPWHRWTLRPDKGLL 10 20 30 80 90 100 110 120 130 pF1KE1 HNVLSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRAGSVRMRVFKQLC :::.::: :: . . .::.:: : ..: : :..:::: .. . :.:.:::: : : CCDS27 PNVLKPGWMQYQQW-TFARFQCSSCSRNWASAQVLVLFHMNWSEEKSRGQVKMRVFTQRC 40 50 60 70 80 90 140 150 160 170 180 pF1KE1 YECGTARLDESSMLEENIEGLVDNLITSLREQCYGERGG-QYRIHVASRQD---NRRHRG .: ... . .::: .. ::. . ..:: :: : .: .: . : . CCDS27 KKCPQPLFEDPEFTQENISRILKNLVFRILKKCY--RGRFQLIEEVPMIKDISLEGPHNS 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE1 EFCEACQEGIVHWKPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIPWCLFWATV . :::: .: CCDS27 DNCEACLQGFCAGPIQVTSLPPSQTPRVHSIYKVEEVVKPWASGENVYSYACQNHICRNL 160 170 180 190 200 210 >>CCDS33910.1 RTP4 gene_id:64108|Hs108|chr3 (246 aa) initn: 347 init1: 169 opt: 315 Z-score: 395.3 bits: 80.7 E(32554): 1.1e-15 Smith-Waterman score: 319; 32.6% identity (57.0% similar) in 172 aa overlap (46-208:8-178) 20 30 40 50 60 70 pF1KE1 LPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPADSWDLIIDPNLKHNV :...: : ..:::: .: : .: ::. . CCDS33 MVVDFWTWEQTFQELIQEAKPRATWTLKLDGNLQLDC 10 20 30 80 90 100 110 120 130 pF1KE1 LSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRAGSVRMRVFKQLCYEC :. ::::: . .: : :.:: : ..: : : :: : . .. :.::::.: : : .: CCDS33 LAQGWKQYQQ-RAFGWFRCSSCQRSWASAQVQILCHTYWEHWTSQGQVRMRLFGQRCQKC 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE1 GTARLDESSMLEENIEGLVDNLITSLREQCYGE--RGGQYRIHVASRQDNRRHRGEFCEA . .. . . .. ...::. . .. ::. : . . . . : ::: CCDS33 SWSQYEMPEFSSDSTMRILSNLVQHILKKYYGNGTRKSPEMPVILEVSLEGSHDTANCEA 100 110 120 130 140 150 200 210 220 230 240 pF1KE1 CQEGIVHW-------KPSEKLLEEEATTYTFSRAPSPTKSQDQTGSGWNFCSIPWCLFWA : :: :::..:: CCDS33 CTLGICGQGLKSCMTKPSKSLLPHLKTGNSSPGIGAVYLANQAKNQSAEAKEAKGSGYEK 160 170 180 190 200 210 >>CCDS42843.1 RTP5 gene_id:285093|Hs108|chr2 (572 aa) initn: 164 init1: 130 opt: 266 Z-score: 328.4 bits: 69.6 E(32554): 5.7e-12 Smith-Waterman score: 291; 28.8% identity (53.8% similar) in 212 aa overlap (44-243:7-208) 20 30 40 50 60 70 pF1KE1 LHLPSLSVFSLRWKLPSLTTDETMCKSVTTDEWKKVFYEKMEEAKPADSWDLIIDPNLKH : : ..: : : :: : : :. . .: CCDS42 MDRAGADMWASTFTLAMAERKPQDVWVLLPEHSLVP 10 20 30 80 90 100 110 120 130 pF1KE1 NVLSPGWKQYLELHASGRFHCSWCWHTWQSPYVVILFHMFLDRAQRAGSVRMRVFKQLCY . :. : ::: . : :..:. : ::.: .: .:::.. :::.. : :.::.. : : CCDS42 GCLDGGGVQYLLVGLS-RLQCGHCPGTWDSAHVHVLFHLWWDRASHRGLVKMRIWGQRCR 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE1 ECGTARLDESSMLEENIEGLVDNLITSLREQCYGERGGQYRIHVASRQDNRRHRGEFCEA : . . .. . . ... :. . ..:::. : .:. . ..: ::: CCDS42 LCPAP--GDCQVRPPGEQPFLSRLVLHILQDCYGDGPGP------ARHPREAYEG-CCEA 100 110 120 130 140 200 210 220 230 240 pF1KE1 CQEGIVH--------WKPSEKLLEEEATTY----TFSRAPSPTKSQDQTGSGWNFCSIPW :. :. :. . . ::.. : ::. . :. :.: .::. CCDS42 CELGVCFLQKAPDPAWSANATKGNFPATAWGGTGTVSRGKPLSTPGDDLGKGGVVIAIPF 150 160 170 180 190 200 250 260 pF1KE1 CLFWATVLLLIIYLQFSFRSSV : CCDS42 SLVGTSNDQVPIAEGPAPPAGASLPVTGSCEALVIGQGSIFLSGDSVAMPGGKGFPVAIG 210 220 230 240 250 260 263 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 20:33:34 2016 done: Sun Nov 6 20:33:35 2016 Total Scan time: 2.400 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]