FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6773, 600 aa 1>>>pF1KE6773 600 - 600 aa - 600 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.8918+/-0.00103; mu= -3.7638+/- 0.062 mean_var=329.7603+/-68.075, 0's: 0 Z-trim(114.7): 3 B-trim: 246 in 1/51 Lambda= 0.070628 statistics sampled from 15260 (15263) to 15260 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.771), E-opt: 0.2 (0.469), width: 16 Scan time: 4.130 The best scores are: opt bits E(32554) CCDS55051.1 RSPH4A gene_id:345895|Hs108|chr6 ( 600) 4043 425.6 8.8e-119 CCDS34521.1 RSPH4A gene_id:345895|Hs108|chr6 ( 716) 3718 392.6 9.3e-109 CCDS12675.1 RSPH6A gene_id:81492|Hs108|chr19 ( 717) 1245 140.6 6.7e-33 >>CCDS55051.1 RSPH4A gene_id:345895|Hs108|chr6 (600 aa) initn: 4043 init1: 4043 opt: 4043 Z-score: 2246.0 bits: 425.6 E(32554): 8.8e-119 Smith-Waterman score: 4043; 100.0% identity (100.0% similar) in 600 aa overlap (1-600:1-600) 10 20 30 40 50 60 pF1KE6 MEDSTSPKQEKENQEELGETRRPWEGKTAASPQYSEPESSEPLEAKQGPETGRQSRSSRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MEDSTSPKQEKENQEELGETRRPWEGKTAASPQYSEPESSEPLEAKQGPETGRQSRSSRP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 WSPQSRAKTPLGGPAGPETSSPAPVSPREPSSSPSPLAPARQDLAAPPQSDRTTSVIPEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 WSPQSRAKTPLGGPAGPETSSPAPVSPREPSSSPSPLAPARQDLAAPPQSDRTTSVIPEA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GTPYPDPLEQSSDKRESTPHHTSQSEGNTFQQSQQPKPHLCGRRDVSYNNAKQKELRFDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GTPYPDPLEQSSDKRESTPHHTSQSEGNTFQQSQQPKPHLCGRRDVSYNNAKQKELRFDV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 FQEEDSNSDYDLQQPAPGGSEVAPSMLEITIQNAKAYLLKTSSNSGFNLYDHLSNMLTKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 FQEEDSNSDYDLQQPAPGGSEVAPSMLEITIQNAKAYLLKTSSNSGFNLYDHLSNMLTKI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 LNERPENAVDIFENISQDVKMAHFSKKFDALQNENELLPTYEIAEKQKALFLQGHLEGVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LNERPENAVDIFENISQDVKMAHFSKKFDALQNENELLPTYEIAEKQKALFLQGHLEGVD 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 QELEDEIAENALPNVMESAFYFEQAGVGLGTDETYRIFLALKQLTDTHPIQRCRFWGKIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 QELEDEIAENALPNVMESAFYFEQAGVGLGTDETYRIFLALKQLTDTHPIQRCRFWGKIL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 GLEMNYIVAEVEFREGEDEEEVEEEDVAEERDNGESEAHEDEEDELPKSFYKAPQAIPKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 GLEMNYIVAEVEFREGEDEEEVEEEDVAEERDNGESEAHEDEEDELPKSFYKAPQAIPKE 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 ESRTGANKYVYFVCNEPGRPWVKLPPVIPAQIVIARKIKKFFTGRLDAPIISYPPFPGNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ESRTGANKYVYFVCNEPGRPWVKLPPVIPAQIVIARKIKKFFTGRLDAPIISYPPFPGNE 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 SNYLRAQIARISAGTHVSPLGFYQFGEEEGEEEEEAEGGRNSFEENPDFEGIQVIDLVES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 SNYLRAQIARISAGTHVSPLGFYQFGEEEGEEEEEAEGGRNSFEENPDFEGIQVIDLVES 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE6 LSNWVHHVQHILSQRFRIYPPGQHGYPQISFHNMLLQSFNPTFGLEHMPSPMAKSLKIST :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 LSNWVHHVQHILSQRFRIYPPGQHGYPQISFHNMLLQSFNPTFGLEHMPSPMAKSLKIST 550 560 570 580 590 600 >>CCDS34521.1 RSPH4A gene_id:345895|Hs108|chr6 (716 aa) initn: 3718 init1: 3718 opt: 3718 Z-score: 2066.0 bits: 392.6 E(32554): 9.3e-109 Smith-Waterman score: 3718; 100.0% identity (100.0% similar) in 554 aa overlap (1-554:1-554) 10 20 30 40 50 60 pF1KE6 MEDSTSPKQEKENQEELGETRRPWEGKTAASPQYSEPESSEPLEAKQGPETGRQSRSSRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MEDSTSPKQEKENQEELGETRRPWEGKTAASPQYSEPESSEPLEAKQGPETGRQSRSSRP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 WSPQSRAKTPLGGPAGPETSSPAPVSPREPSSSPSPLAPARQDLAAPPQSDRTTSVIPEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 WSPQSRAKTPLGGPAGPETSSPAPVSPREPSSSPSPLAPARQDLAAPPQSDRTTSVIPEA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 GTPYPDPLEQSSDKRESTPHHTSQSEGNTFQQSQQPKPHLCGRRDVSYNNAKQKELRFDV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 GTPYPDPLEQSSDKRESTPHHTSQSEGNTFQQSQQPKPHLCGRRDVSYNNAKQKELRFDV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 FQEEDSNSDYDLQQPAPGGSEVAPSMLEITIQNAKAYLLKTSSNSGFNLYDHLSNMLTKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 FQEEDSNSDYDLQQPAPGGSEVAPSMLEITIQNAKAYLLKTSSNSGFNLYDHLSNMLTKI 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 LNERPENAVDIFENISQDVKMAHFSKKFDALQNENELLPTYEIAEKQKALFLQGHLEGVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LNERPENAVDIFENISQDVKMAHFSKKFDALQNENELLPTYEIAEKQKALFLQGHLEGVD 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 QELEDEIAENALPNVMESAFYFEQAGVGLGTDETYRIFLALKQLTDTHPIQRCRFWGKIL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 QELEDEIAENALPNVMESAFYFEQAGVGLGTDETYRIFLALKQLTDTHPIQRCRFWGKIL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 GLEMNYIVAEVEFREGEDEEEVEEEDVAEERDNGESEAHEDEEDELPKSFYKAPQAIPKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 GLEMNYIVAEVEFREGEDEEEVEEEDVAEERDNGESEAHEDEEDELPKSFYKAPQAIPKE 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE6 ESRTGANKYVYFVCNEPGRPWVKLPPVIPAQIVIARKIKKFFTGRLDAPIISYPPFPGNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 ESRTGANKYVYFVCNEPGRPWVKLPPVIPAQIVIARKIKKFFTGRLDAPIISYPPFPGNE 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE6 SNYLRAQIARISAGTHVSPLGFYQFGEEEGEEEEEAEGGRNSFEENPDFEGIQVIDLVES :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 SNYLRAQIARISAGTHVSPLGFYQFGEEEGEEEEEAEGGRNSFEENPDFEGIQVIDLVES 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE6 LSNWVHHVQHILSQRFRIYPPGQHGYPQISFHNMLLQSFNPTFGLEHMPSPMAKSLKIST :::::::::::::: CCDS34 LSNWVHHVQHILSQGRCNWFNSIQKNEEEEEEEDEEKDDSDYIEQEVGLPLLTPISEDLE 550 560 570 580 590 600 >>CCDS12675.1 RSPH6A gene_id:81492|Hs108|chr19 (717 aa) initn: 1504 init1: 810 opt: 1245 Z-score: 704.1 bits: 140.6 E(32554): 6.7e-33 Smith-Waterman score: 1572; 48.3% identity (73.4% similar) in 549 aa overlap (42-554:8-551) 20 30 40 50 60 70 pF1KE6 ENQEELGETRRPWEGKTAASPQYSEPESSEPLEAKQGPETGRQSRSSRPWSPQSRAKTPL : . : : : :..:. ...:.. CCDS12 MGDLPPYPERPAQQPPGRRTSQASQRRHSRDQAQALA 10 20 30 80 90 100 110 120 pF1KE6 GGPAGPETSSPAPVSPRE-PSSSPSPLAPARQDLAAPP--QSD--RTTSV-IPEAGTPYP . : : .. : . :. :. : ...: : :.. : .. : ..: .: CCDS12 ADPE--ERQQIPPDAQRNAPGWSQRGSLSQQENLLMPQVFQAEEARLGGMEYPSVNTGFP 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 DPLEQSSDKRESTPHHTSQSEGNTFQQSQQPKPHLCGRRDVSYNNAKQKEL-RFDVFQEE . .. . . :: . . . . .:. :: . : . : .... . : .:...: . CCDS12 SEFQPQPYSDESRMQVAELTTSLMLQRLQQGQSSLFQQLDPTFQEPPVNPLGQFNLYQTD 100 110 120 130 140 150 190 200 210 220 pF1KE6 D-----SNSDYDLQQPA----PG-------GSEVA-PSMLEITIQNAKAYLLKTSSNSGF . ... : ..:: :. ...: : ::...::::::::.:: : . CCDS12 QFSEGAQHGPYIRDDPALQFLPSELGFPHYSAQVPEPEPLELAVQNAKAYLLQTSINCDL 160 170 180 190 200 210 230 240 250 260 270 280 pF1KE6 NLYDHLSNMLTKILNERPENAVDIFENISQDVKMAHFSKKFDALQNENELLPTYEIAEKQ .::.:: :.::::::.:::. ....:.... .. : :.:.:... :. :::..:::: CCDS12 SLYEHLVNLLTKILNQRPEDPLSVLESLNRTTQWEWFHPKLDTLRDDPEMQPTYKMAEKQ 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE6 KALFLQ--GHLEGVDQELEDEIAENALPNVMESAFYFEQAGVGLGTDETYRIFLALKQLT :::: . : :: .::.:.:..:. .::.::.:::::::::::..::..:::::.:::. CCDS12 KALFTRSGGGTEG-EQEMEEEVGETPVPNIMETAFYFEQAGVGLSSDESFRIFLAMKQLV 280 290 300 310 320 330 350 360 370 380 390 400 pF1KE6 DTHPIQRCRFWGKILGLEMNYIVAEVEFREGEDEEEVEEEDVAEERDNGES-EAH---ED . .::. :::::::::.. .:.:::::::::: ::.:::.: : ..:: ::: : CCDS12 EQQPIHTCRFWGKILGIKRSYLVAEVEFREGE--EEAEEEEVEEMTEGGEVMEAHGEEEG 340 350 360 370 380 390 410 420 430 440 450 pF1KE6 EEDE------LPKSFYKAPQAIPKEESRTGANKYVYFVCNEPGRPWVKLPPVIPAQIVIA :::: .::: .: : .:::::::.:::::.:::::::: ::..:: : ::::: : CCDS12 EEDEEKAVDIVPKSVWKPPPVIPKEESRSGANKYLYFVCNEPGLPWTRLPHVTPAQIVNA 400 410 420 430 440 450 460 470 480 490 500 510 pF1KE6 RKIKKFFTGRLDAPIISYPPFPGNESNYLRAQIARISAGTHVSPLGFYQFGEEEGEEEEE ::::::::: ::.:..:::::::::.::::::::::::.:.:::::::::.::::.:::: CCDS12 RKIKKFFTGYLDTPVVSYPPFPGNEANYLRAQIARISAATQVSPLGFYQFSEEEGDEEEE 460 470 480 490 500 510 520 530 540 550 560 570 pF1KE6 AEGGRNSFEENPDFEGIQVIDLVESLSNWVHHVQHILSQRFRIYPPGQHGYPQISFHNML . .::.:.::::::::: :..::.:..:::::.:::: : CCDS12 GGAGRDSYEENPDFEGIPVLELVDSMANWVHHTQHILPQGRCTWVNPLQKTEEEEDLGEE 520 530 540 550 560 570 580 590 600 pF1KE6 LQSFNPTFGLEHMPSPMAKSLKIST CCDS12 EEKADEGPEEVEQEVGPPLLTPLSEDAEIMHLAPWTTRLSCSLCPQYSVAVVRSNLWPGA 580 590 600 610 620 630 600 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 16:14:41 2016 done: Tue Nov 8 16:14:42 2016 Total Scan time: 4.130 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]