FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4070, 373 aa 1>>>pF1KE4070 373 - 373 aa - 373 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.8331+/-0.00092; mu= 6.7843+/- 0.056 mean_var=188.5480+/-39.104, 0's: 0 Z-trim(112.6): 34 B-trim: 42 in 1/50 Lambda= 0.093404 statistics sampled from 13328 (13361) to 13328 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.41), width: 16 Scan time: 2.820 The best scores are: opt bits E(32554) CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12 ( 373) 2562 357.3 1.3e-98 CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12 ( 372) 2548 355.4 4.7e-98 CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12 ( 180) 1215 175.5 3.3e-44 CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 ( 363) 981 144.2 1.7e-34 >>CCDS9221.1 RNF34 gene_id:80196|Hs108|chr12 (373 aa) initn: 2562 init1: 2562 opt: 2562 Z-score: 1884.0 bits: 357.3 E(32554): 1.3e-98 Smith-Waterman score: 2562; 100.0% identity (100.0% similar) in 373 aa overlap (1-373:1-373) 10 20 30 40 50 60 pF1KE4 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS92 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC 310 320 330 340 350 360 370 pF1KE4 RQYVVRAVHVFKS ::::::::::::: CCDS92 RQYVVRAVHVFKS 370 >>CCDS31915.1 RNF34 gene_id:80196|Hs108|chr12 (372 aa) initn: 2548 init1: 2548 opt: 2548 Z-score: 1873.9 bits: 355.4 E(32554): 4.7e-98 Smith-Waterman score: 2548; 100.0% identity (100.0% similar) in 371 aa overlap (3-373:2-372) 10 20 30 40 50 60 pF1KE4 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGPFRFTPNPEFSTYPPAATEGPN 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 IVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRRCSTCHLLQETAFQRPQLMRLK 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE4 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 VKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMDTSSLNSSRSQTSSFFTRSF 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE4 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 FSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTEDDDDDDDEDDDDEEENA 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE4 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEKWELVEKVNRL 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE4 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 YKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHMVTCTKCGKRMSECPIC 300 310 320 330 340 350 370 pF1KE4 RQYVVRAVHVFKS ::::::::::::: CCDS31 RQYVVRAVHVFKS 360 370 >>CCDS73538.1 RNF34 gene_id:80196|Hs108|chr12 (180 aa) initn: 1215 init1: 1215 opt: 1215 Z-score: 907.2 bits: 175.5 E(32554): 3.3e-44 Smith-Waterman score: 1215; 99.4% identity (100.0% similar) in 179 aa overlap (195-373:2-180) 170 180 190 200 210 220 pF1KE4 LNSSRSQTSSFFTRSFFSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQSEITSANTED .::::::::::::::::::::::::::::: CCDS73 MKGELMDGDQTSRSGVPAQVQSEITSANTED 10 20 30 230 240 250 260 270 280 pF1KE4 DDDDDDEDDDDEEENAEDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 DDDDDDEDDDDEEENAEDRNPGLSKERVRASLSDLSSLDDVEGMSVRQLKEILARNFVNY 40 50 60 70 80 90 290 300 310 320 330 340 pF1KE4 SGCCEKWELVEKVNRLYKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 SGCCEKWELVEKVNRLYKENEENQKSYGERLQLQDEEDDSLCRICMDAVIDCVLLECGHM 100 110 120 130 140 150 350 360 370 pF1KE4 VTCTKCGKRMSECPICRQYVVRAVHVFKS ::::::::::::::::::::::::::::: CCDS73 VTCTKCGKRMSECPICRQYVVRAVHVFKS 160 170 180 >>CCDS11286.1 RFFL gene_id:117584|Hs108|chr17 (363 aa) initn: 939 init1: 432 opt: 981 Z-score: 732.8 bits: 144.2 E(32554): 1.7e-34 Smith-Waterman score: 981; 43.9% identity (70.0% similar) in 380 aa overlap (9-373:1-363) 10 20 30 40 50 pF1KE4 MRKAGATSMWASCCGLLNEVMGTGAVRGQQSAFAGATGP-FRFTPNPEFSTYP-PAATEG :::.::. . . :: : .. :: .:..: :.. : CCDS11 MWATCCNWF-------CLDGQPEEVPPPQGARMQAYSNPGYSSFPSPTGLE- 10 20 30 40 60 70 80 90 100 110 pF1KE4 PNIVCKACGLSFSVFRKKHVCCDCKKDFCSVCSVLQENLRR-CSTCHLLQETAFQRPQLM :. ::.:: :. .:..: ::::.:: .:: : : : :. .. ::::: .:: CCDS11 PS--CKSCGAHFANTARKQTCLDCKKNFCMTCSSQVGNGPRLCLLCQRFRATAFQREELM 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 RLKVKDLRQYLILRNIPIDTCREKEDLVDLVLCHHGLGSEDDMD-TSSLNSSRSQTSSFF ..::::::.:: :..: . :::::.:: ::: .. . :..: .:.:. . . ..:. CCDS11 KMKVKDLRDYLSLHDISTEMCREKEELVLLVLGQQPVISQEDRTRASTLSPDFPEQQAFL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 TRSFFSNYTAPSATMSSFQGELMDGDQTSRSGVPAQVQ-SEITSANTEDDDDDDDEDDDD :. :... :.. .: ... . : ::::: .. ..... .:... .. CCDS11 TQPH-SSMVPPTSP------NLPSSSAQATSVPPAQVQENQQANGHVSQDQEEPVYLESV 170 180 190 200 210 240 250 260 270 280 290 pF1KE4 EEENAEDRNPGLSKERV-----RASLSDLSSLDDVEGMSVRQLKEILARNFVNYSGCCEK . :::.. ....: :::::::..:.:.::..:::::::::::::::.::::: CCDS11 ARVPAEDETQSIDSEDSFVPGRRASLSDLTDLEDIEGLTVRQLKEILARNFVNYKGCCEK 220 230 240 250 260 270 300 310 320 330 340 pF1KE4 WELVEKVNRLYKENEENQKSY-GERLQ----LQDEEDDSLCRICMDAVIDCVLLECGHMV :::.:.:.::::... :. : . : . . ...::.::::. :::::::::::: CCDS11 WELMERVTRLYKDQKGLQHLVSGAEDQNGGAVPSGLEENLCKICMDSPIDCVLLECGHMV 280 290 300 310 320 330 350 360 370 pF1KE4 TCTKCGKRMSECPICRQYVVRAVHVFKS :::::::::.:::::::::.::::::.: CCDS11 TCTKCGKRMNECPICRQYVIRAVHVFRS 340 350 360 373 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 16:54:54 2016 done: Mon Nov 7 16:54:55 2016 Total Scan time: 2.820 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]