FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5728, 926 aa 1>>>pF1KE5728 926 - 926 aa - 926 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.7680+/-0.000947; mu= 8.8276+/- 0.058 mean_var=261.6886+/-53.646, 0's: 0 Z-trim(114.0): 13 B-trim: 49 in 1/54 Lambda= 0.079283 statistics sampled from 14593 (14603) to 14593 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.748), E-opt: 0.2 (0.449), width: 16 Scan time: 4.120 The best scores are: opt bits E(32554) CCDS10026.1 OTUD7A gene_id:161725|Hs108|chr15 ( 926) 6231 726.6 5.4e-209 CCDS72903.1 OTUD7B gene_id:56957|Hs108|chr1 ( 843) 2156 260.4 1e-68 CCDS5187.1 TNFAIP3 gene_id:7128|Hs108|chr6 ( 790) 555 77.3 1.3e-13 >>CCDS10026.1 OTUD7A gene_id:161725|Hs108|chr15 (926 aa) initn: 6231 init1: 6231 opt: 6231 Z-score: 3865.6 bits: 726.6 E(32554): 5.4e-209 Smith-Waterman score: 6231; 100.0% identity (100.0% similar) in 926 aa overlap (1-926:1-926) 10 20 30 40 50 60 pF1KE5 MVSSVLPNPTSAECWAALLHDPMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MVSSVLPNPTSAECWAALLHDPMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAAL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 SDYEQLRQVHTANLPHVFNEGRGPKQPEREPQPGHKVERPCLQRQDDIAQEKRLSRGISH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 SDYEQLRQVHTANLPHVFNEGRGPKQPEREPQPGHKVERPCLQRQDDIAQEKRLSRGISH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ASSAIVSLARSHVASECNNEQFPLEMPIYTFQLPDLSVYSEDFRSFIERDLIEQATMVAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ASSAIVSLARSHVASECNNEQFPLEMPIYTFQLPDLSVYSEDFRSFIERDLIEQATMVAL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 EQAGRLNWWSTVCTSCKRLLPLATTGDGNCLLHAASLGMWGFHDRDLVLRKALYTMMRTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 EQAGRLNWWSTVCTSCKRLLPLATTGDGNCLLHAASLGMWGFHDRDLVLRKALYTMMRTG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 AEREALKRRWRWQQTQQNKEEEWEREWTELLKLASSEPRTHFSKNGGTGGGVDNSEDPVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AEREALKRRWRWQQTQQNKEEEWEREWTELLKLASSEPRTHFSKNGGTGGGVDNSEDPVY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 ESLEEFHVFVLAHILRRPIVVVADTMLRDSGGEAFAPIPFGGIYLPLEVPPNRCHCSPLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ESLEEFHVFVLAHILRRPIVVVADTMLRDSGGEAFAPIPFGGIYLPLEVPPNRCHCSPLV 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 LAYDQAHFSALVSMEQRDQQREQAVIPLTDSEHKLLPLHFAVDPGKDWEWGKDDNDNARL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LAYDQAHFSALVSMEQRDQQREQAVIPLTDSEHKLLPLHFAVDPGKDWEWGKDDNDNARL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 AHLILSLEAKLNLLHSYMNVTWIRIPSETRAPLAQPESPTASAGEDVQSLADSLDSDRDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AHLILSLEAKLNLLHSYMNVTWIRIPSETRAPLAQPESPTASAGEDVQSLADSLDSDRDS 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE5 VCSNSNSNNGKNGKDKEKEKQRKEKDKTRADSVANKLGSFSKTLGIKLKKNMGGLGGLVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 VCSNSNSNNGKNGKDKEKEKQRKEKDKTRADSVANKLGSFSKTLGIKLKKNMGGLGGLVH 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE5 GKMGRANSANGKNGDSAERGKEKKAKSRKGSKEESGASASTSPSEKTTPSPTDKAAGASP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 GKMGRANSANGKNGDSAERGKEKKAKSRKGSKEESGASASTSPSEKTTPSPTDKAAGASP 550 560 570 580 590 600 610 620 630 640 650 660 pF1KE5 AEKGGGPRGDAWKYSTDVKLSLNILRAAMQGERKFIFAGLLLTSHRHQFHEEMIGYYLTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AEKGGGPRGDAWKYSTDVKLSLNILRAAMQGERKFIFAGLLLTSHRHQFHEEMIGYYLTS 610 620 630 640 650 660 670 680 690 700 710 720 pF1KE5 AQERFSAEQEQRRRDAATAAAAAAAAAAATAKRPPRRPETEGVPVPERASPGPPTQLVLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AQERFSAEQEQRRRDAATAAAAAAAAAAATAKRPPRRPETEGVPVPERASPGPPTQLVLK 670 680 690 700 710 720 730 740 750 760 770 780 pF1KE5 LKERPSPGPAAGRAARAAAGGTASPGGGARRASASGPVPGRSPPAPARQSVIHVQASGAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 LKERPSPGPAAGRAARAAAGGTASPGGGARRASASGPVPGRSPPAPARQSVIHVQASGAR 730 740 750 760 770 780 790 800 810 820 830 840 pF1KE5 DEACAPAVGALRPCATYPQQNRSLSSQSYSPARAAALRTVNTVESLARAVPGALPGAAGT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 DEACAPAVGALRPCATYPQQNRSLSSQSYSPARAAALRTVNTVESLARAVPGALPGAAGT 790 800 810 820 830 840 850 860 870 880 890 900 pF1KE5 AGAAEHKSQTYTNGFGALRDGLEFADADAPTARSNGECGRGGPGPVQRRCQRENCAFYGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AGAAEHKSQTYTNGFGALRDGLEFADADAPTARSNGECGRGGPGPVQRRCQRENCAFYGR 850 860 870 880 890 900 910 920 pF1KE5 AETEHYCSYCYREELRRRREARGARP :::::::::::::::::::::::::: CCDS10 AETEHYCSYCYREELRRRREARGARP 910 920 >>CCDS72903.1 OTUD7B gene_id:56957|Hs108|chr1 (843 aa) initn: 2011 init1: 1246 opt: 2156 Z-score: 1347.1 bits: 260.4 E(32554): 1e-68 Smith-Waterman score: 3065; 57.6% identity (74.4% similar) in 922 aa overlap (23-923:1-836) 10 20 30 40 50 60 pF1KE5 MVSSVLPNPTSAECWAALLHDPMTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDLTAAL :::::::::::::::::::::::::::::::::..::: CCDS72 MTLDMDAVLSDFVRSTGAEPGLARDLLEGKNWDVNAAL 10 20 30 70 80 90 100 110 pF1KE5 SDYEQLRQVHTANLPHVFNEGRG-PKQPE-----REPQPGHKVERPCLQRQDDIAQEKRL ::.:::::::..::: :.:: : . :: ::: . :: :::::::.::::: CCDS72 SDFEQLRQVHAGNLPPSFSEGSGGSRTPEKGFSDREPT---RPPRPILQRQDDIVQEKRL 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE5 SRGISHASSAIVSLARSHVASECN---NEQFPLEMPIYTFQLPDLSVYSEDFRSFIERDL :::::::::.:::::::::.:. . ... :::::: .::::::.::.::::::::::: CCDS72 SRGISHASSSIVSLARSHVSSNGGGGGSNEHPLEMPICAFQLPDLTVYNEDFRSFIERDL 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE5 IEQATMVALEQAGRLNWWSTVCTSCKRLLPLATTGDGNCLLHAASLGMWGFHDRDLVLRK :::. .:::::::::::: .: . .::::::::::::::::::::::::::::::.::: CCDS72 IEQSMLVALEQAGRLNWWVSVDPTSQRLLPLATTGDGNCLLHAASLGMWGFHDRDLMLRK 160 170 180 190 200 210 240 250 260 270 280 pF1KE5 ALYTMMRTGAEREALKRRWRWQQTQQNKE-------EEWEREWTELLKLASSEPRTHFSK :::..:. :.:.::::::::::::::::: .::..::.::.:::::::: :.. CCDS72 ALYALMEKGVEKEALKRRWRWQQTQQNKESGLVYTEDEWQKEWNELIKLASSEPRMHLGT 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE5 NGGTGGGVDNSEDPVYESLEEFHVFVLAHILRRPIVVVADTMLRDSGGEAFAPIPFGGIY ::.. :::..::.::::::::::::::::.:::::::::::::::::::::::::::::: CCDS72 NGANCGGVESSEEPVYESLEEFHVFVLAHVLRRPIVVVADTMLRDSGGEAFAPIPFGGIY 280 290 300 310 320 330 350 360 370 380 390 400 pF1KE5 LPLEVPPNRCHCSPLVLAYDQAHFSALVSMEQRDQQREQAVIPLTDSEHKLLPLHFAVDP :::::: ..:: ::::::::::::::::::::... .:::::::::::.::::::::::: CCDS72 LPLEVPASQCHRSPLVLAYDQAHFSALVSMEQKENTKEQAVIPLTDSEYKLLPLHFAVDP 340 350 360 370 380 390 410 420 430 440 450 460 pF1KE5 GKDWEWGKDDNDNARLAHLILSLEAKLNLLHSYMNVTWIRIPSETRAPLAQPESPTASAG :: :::::::.::.::: .:::::.::.:::::::: :: . :...:::::::::::::: CCDS72 GKGWEWGKDDSDNVRLASVILSLEVKLHLLHSYMNVKWIPLSSDAQAPLAQPESPTASAG 400 410 420 430 440 450 470 480 490 500 510 520 pF1KE5 EDVQSLADSLDSDRDSVCSNSNSNNGKNGKDKEKEKQRKEKDKTRADSVANKLGSFSKTL .. .: .: :::..:: :.:.::.: :. ::: :. .:::: ::::::::::::.::: CCDS72 DEPRSTPESGDSDKESVGSSSTSNEG--GRRKEKSKRDREKDKKRADSVANKLGSFGKTL 460 470 480 490 500 510 530 540 550 560 570 580 pF1KE5 GIKLKKNMGGLGGLVHGKMGRANSA-NGKNG-DSAERGKEKKAKSRKGSKEESGASASTS : ::::::::: .: : .... .:..: .. :. :... :: ::.::: .:. . CCDS72 GSKLKKNMGGLMHSKGSKPGGVGTGLGGSSGTETLEKKKKNSLKSWKGGKEE---AAGDG 520 530 540 550 560 570 590 600 610 620 630 640 pF1KE5 P-SEKTTPSPTDKAAGASPAEKGGGPRGDAWKYSTDVKLSLNILRAAMQGERKFIFAGLL : ::: : ...: .::. ::: .: ::.:::.::::: ::::.: : CCDS72 PVSEK----PPAESVG-----NGGS------KYSQEVMQSLSILRTAMQGEGKFIFVGTL 580 590 600 610 650 660 670 680 690 700 pF1KE5 LTSHRHQFHEEMIGYYLTSAQERFSAEQEQRRRDAATAAAAAAAAAAATAKRPPRRPETE .::::..:::: ::..:.::: :::.:.. :.: CCDS72 KMGHRHQYQEEMIQRYLSDAEERFLAEQKQKE-----------------AERKIMNGGIG 620 630 640 650 710 720 730 740 750 760 pF1KE5 GVPVPERASPGPPTQLVLKLKERPSPGPAAGRAARAAAGGTASPGGGARRASASGPVPGR : : : . .: : .. .:.:. :: : .:: : .:. :: CCDS72 GGPPPAK-KPEPDAR-----EEQPT-GPPA--ESRAMAFSTGYPGD-------------F 660 670 680 690 770 780 790 800 810 820 pF1KE5 SPPAPARQSVIHVQASGARDEACAPAVGALRPCATYPQQNRSLSSQSYSPARAAALR-TV . : :. .: : : :. : .: ::.: : ::.:.: :.: . .. CCDS72 TIPRPSGGGV-HCQEP-RRQLAGGPCVGGLPPYATFPRQC--------PPGRPYPHQDSI 700 710 720 730 740 830 840 850 860 870 880 pF1KE5 NTVESLARAVPGALPGAAGTAGAAEHKSQTYTNGFGALRDGLEFADADAPTARSNGECGR ..: ... : :: . ...:.::. :. : : . ..: : CCDS72 PSLEPGSHSKDGLHRGA--LLPPPYRVADSYSNGY---REPPE------PDGWAGGL--R 750 760 770 780 790 890 900 910 920 pF1KE5 GGPGPVQRRCQRENCAFYGRAETEHYCSYCYREELRRR-REARGARP : : :.: .:.. ::.:::. ::...:: ::::::::: :: : CCDS72 GLP-PTQTKCKQPNCSFYGHPETNNFCSCCYREELRRREREPDGELLVHRF 800 810 820 830 840 >>CCDS5187.1 TNFAIP3 gene_id:7128|Hs108|chr6 (790 aa) initn: 531 init1: 162 opt: 555 Z-score: 357.7 bits: 77.3 E(32554): 1.3e-13 Smith-Waterman score: 593; 37.9% identity (65.4% similar) in 272 aa overlap (146-404:43-294) 120 130 140 150 160 170 pF1KE5 RGISHASSAIVSLARSHVASECNNEQFPLEMPIYTFQLPDLSVYSEDFRSFIERDLIEQA : ::... . .:: .:.. ::.. CCDS51 SNMRKAVKIRERTPEDIFKPTNGIIHHFKTMHRYTLEMFRTCQFCPQFREIIHKALIDRN 20 30 40 50 60 70 180 190 200 210 220 230 pF1KE5 TMVALEQAGRLNWWSTVCTSCKRLLPLATTGDGNCLLHAASLGMWGFHDRDLVLRKALYT ...::. .::: : ..:. : :.::::::.::.: ::: .: ::::::::.. CCDS51 IQATLESQKKLNW----CREVRKLVALKTNGDGNCLMHATSQYMWGVQDTDLVLRKALFS 80 90 100 110 120 240 250 260 270 280 pF1KE5 MMRTGAEREALKRRWRWQQTQQNKEEE---------WEREWTELLKLASSEPRTHFSKNG .. :. .: ::. .. .... : :. :: .:.:.::.. : ....: CCDS51 TLKETDTRN-FKFRWQLESLKSQEFVETGLCYDTRNWNDEWDNLIKMASTD--TPMARSG 130 140 150 160 170 180 290 300 310 320 330 340 pF1KE5 GTGGGVDNSEDPVYESLEEFHVFVLAHILRRPIVVVADTMLRD-SGGEAFAPIPFGGIYL :.::::.:.::: .::::::.:..: :::. .: :::. ::::: CCDS51 LQ-----------YNSLEEIHIFVLCNILRRPIIVISDKMLRSLESGSNFAPLKVGGIYL 190 200 210 220 230 350 360 370 380 390 400 pF1KE5 PLEVPPNRCHCSPLVLAYDQAHFSALVSMEQRDQQREQAVIPLTDSEH---KLLPLHFAV ::. : ..:. :.::.::. :: ::.. .:. : ..::.. .. . : .:: . CCDS51 PLHWPAQECYRYPIVLGYDSHHFVPLVTL--KDSGPEIRAVPLVNRDRGRFEDLKVHFLT 240 250 260 270 280 290 410 420 430 440 450 460 pF1KE5 DPGKDWEWGKDDNDNARLAHLILSLEAKLNLLHSYMNVTWIRIPSETRAPLAQPESPTAS :: CCDS51 DPENEMKEKLLKEYLMVIEIPVQGWDHGTTHLINAAKLDEANLPKEINLVDDYFELVQHE 300 310 320 330 340 350 926 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 06:08:06 2016 done: Tue Nov 8 06:08:06 2016 Total Scan time: 4.120 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]