FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1142, 420 aa
  1>>>pF1KE1142 420 - 420 aa - 420 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.8154+/-0.000852; mu= 11.4593+/- 0.051
 mean_var=121.4867+/-24.147, 0's: 0 Z-trim(110.7): 33 B-trim: 0 in 0/51
 Lambda= 0.116362
 statistics sampled from 11804 (11831) to 11804 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.363), width: 16
 Scan time: 2.890

The best scores are:                                  opt  bits E(32554)
CCDS47007.1 TADA2B gene_id:93624|Hs108|chr4    ( 420) 2832 486.3 2.3e-137
CCDS11319.1 TADA2A gene_id:6871|Hs108|chr17    ( 443)  512  96.9 4.2e-20
CCDS45656.1 TADA2A gene_id:6871|Hs108|chr17    ( 305)  434  83.6 2.7e-16

>>CCDS47007.1 TADA2B gene_id:93624|Hs108|chr4 (420 aa)
 initn: 2832 init1: 2832 opt: 2832 Z-score: 2579.5 bits: 486.3 E(32554): 2.3e-137
Smith-Waterman score: 2832; 100.0% identity (100.0% similar) in 420 aa overlap (1-420:1-420)

               10        20        30        40        50        60
pF1KE1 MAELGKKYCVYCLAEVSPLRFRCTECQDIELCPECFSAGAEIGHHRRYHGYQLVDGGRFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MAELGKKYCVYCLAEVSPLRFRCTECQDIELCPECFSAGAEIGHHRRYHGYQLVDGGRFT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 LWGPEAEGGWTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPQEVMEHYVSMYIHGNLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LWGPEAEGGWTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPQEVMEHYVSMYIHGNLG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 KACIPDTIPNRVTDHTCPSGGPLSPSLTTPLPPLDISVAEQQQLGYMPLRDDYEIEYDQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KACIPDTIPNRVTDHTCPSGGPLSPSLTTPLPPLDISVAEQQQLGYMPLRDDYEIEYDQD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 AETLISGLSVNYDDDDVEIELKRAHVDMYVRKLKERQRRKNIARDYNLVPAFLGKDKKEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 AETLISGLSVNYDDDDVEIELKRAHVDMYVRKLKERQRRKNIARDYNLVPAFLGKDKKEK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 EKALKRKITKEEKELRLKLRPLYQFMSCKEFDDLFENMHKEKMLRAKIRELQRYRRNGIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 EKALKRKITKEEKELRLKLRPLYQFMSCKEFDDLFENMHKEKMLRAKIRELQRYRRNGIT
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE1 KMEESAEYEAARHKREKRKENKNLAGSKRGKEDGKDSEFAAIENLPGFELLSDREKVLCS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KMEESAEYEAARHKREKRKENKNLAGSKRGKEDGKDSEFAAIENLPGFELLSDREKVLCS
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE1 SLNLSPARYVTVKTIIIKDHLQKRQGIPSKSRLPSYLDKVLKKRILNFLTESGWISRDAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SLNLSPARYVTVKTIIIKDHLQKRQGIPSKSRLPSYLDKVLKKRILNFLTESGWISRDAS
              370       380       390       400       410       420

>>CCDS11319.1 TADA2A gene_id:6871|Hs108|chr17 (443 aa)
 initn: 571 init1: 189 opt: 512 Z-score: 474.3 bits: 96.9 E(32554): 4.2e-20
Smith-Waterman score: 557; 27.6% identity (56.6% similar) in 445 aa overlap (6-417:14-442)

10 20 30 40 50
pF1KE1 MAELGKKYCVYCLAEVSPLRFRCTEC--QDIELCPECFSAGAEIGHHRRYHG
: : : . . ..:.:: . :: .::. : : .:. :
CCDS11 MDRLGSFSNDPSDKPPCRGCSSYLMEPYIKCAECGPPPFFLCLQCFTRGFEYKKHQSDHT
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 YQLVDGGRFTLWGPEAEGGWTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPQEVMEHY
:... . : . : .::..::. ::.:. . :::::.:.: .. ..: .: .::
CCDS11 YEIMTSD-FPVLDP----SWTAQEEMALLEAVMDCGFGNWQDVANQM-CTKTKEECEKHY
70 80 90 100 110

120 130 140 150 160 170
pF1KE1 VSMYIHGNLGKACIPDTIPNRVTDHTCPSGGPLSPSLTTPLPPLDISVAEQQQLGYMPLR
.. .:.. : . . . . . .: .. :. . : : .: :. ... :::: :
CCDS11 MKHFINNPLFASTLLN-LKQAEEAKTADTAIPFHSTDDPPRPTFD-SLLSRDMAGYMPAR
120 130 140 150 160 170

180 190 200 210 220 230
pF1KE1 DDYEIEYDQDAETLISGLSVNYDDDDVEIELKRAHVDMYVRKLKERQRRKNIARDYNLVP
:. :.:. :: . .. ::.:. :: : ::.: .::::::::.: ::..:.
CCDS11 ADFIEEFDNYAEWDLRDIDFVEDDSDILHALKMAVVDIYHSRLKERQRRKKIIRDHGLI-
180 190 200 210 220 230

240 250 260 270 280 290
pF1KE1 AFLGKDKKEKEKALKRKITKEEKELRLKLRPLYQFMSCKEFDDLFENMHKEKMLRAKIRE
. .: . ..:. :: ..: .: . .... : : ..:. : :: .:..
CCDS11 ------NLRKFQLMERRYPKEVQDLYETMRRFARIVGPVEHDKFIESHALEFELRREIKR
240 250 260 270 280

300 310 320 330
pF1KE1 LQRYRRNGITKMEESAEYEAARHKREKRKENKNLAGSK--------------RGKED---
::.:: :::.. . :. .. ::... .... . : . :
CCDS11 LQEYRTAGITNFCSARTYDHLKKTREEERLKRTMLSEVLQYIQDSSACQQWLRRQADIDS
290 300 310 320 330 340

340 350 360 370 380
pF1KE1 ------------GKDSEFA-AIENLPGFELLSDREKVLCSSLNLSPARYVTVKTIIIKDH
:. : . .::: : :...:: ::. . : :. :. :. .. ..
CCDS11 GLSPSIPMASNSGRRSAPPLNLTGLPGTEKLNEKEKELCQMVRLVPGAYLEYKSALL-NE
350 360 370 380 390 400

390 400 410 420
pF1KE1 LQKRQGIP-SKSRLPSYLDKVLKKRILNFLTESGWISRDAS
.:. :. ...: .: ..: .:: . :.:..
CCDS11 CNKQGGLRLAQARALIKIDVNKTRKIYDFLIREGYITKG
410 420 430 440

>>CCDS45656.1 TADA2A gene_id:6871|Hs108|chr17 (305 aa)
 initn: 399 init1: 187 opt: 434 Z-score: 405.9 bits: 83.6 E(32554): 2.7e-16
Smith-Waterman score: 434; 30.3% identity (59.9% similar) in 274 aa overlap (6-277:14-272)

10 20 30 40 50
pF1KE1 MAELGKKYCVYCLAEVSPLRFRCTEC--QDIELCPECFSAGAEIGHHRRYHG
: : : . . ..:.:: . :: .::. : : .:. :
CCDS45 MDRLGSFSNDPSDKPPCRGCSSYLMEPYIKCAECGPPPFFLCLQCFTRGFEYKKHQSDHT
10 20 30 40 50 60

60 70 80 90 100 110
pF1KE1 YQLVDGGRFTLWGPEAEGGWTSREEQLLLDAIEQFGFGNWEDMAAHVGASRTPQEVMEHY
:... . : . : .::..::. ::.:. . :::::.:.: .. ..: .: .::
CCDS45 YEIMTSD-FPVLDP----SWTAQEEMALLEAVMDCGFGNWQDVANQM-CTKTKEECEKHY
70 80 90 100 110

120 130 140 150 160 170
pF1KE1 VSMYIHGNLGKACIPDTIPNRVTDHTCPSGGPLSPSLTTPLPPLDISVAEQQQLGYMPLR
.. .:.. : . . . . . .: .. :. . : : .: :. ... :::: :
CCDS45 MKHFINNPLFASTLLN-LKQAEEAKTADTAIPFHSTDDPPRPTFD-SLLSRDMAGYMPAR
120 130 140 150 160 170

180 190 200 210 220 230
pF1KE1 DDYEIEYDQDAETLISGLSVNYDDDDVEIELKRAHVDMYVRKLKERQRRKNIARDYNLVP
:. :.:. :: . .. ::.:. :: : ::.: .::::::::.: ::..:.
CCDS45 ADFIEEFDNYAEWDLRDIDFVEDDSDILHALKMAVVDIYHSRLKERQRRKKIIRDHGLI-
180 190 200 210 220 230

240 250 260 270 280 290
pF1KE1 AFLGKDKKEKEKALKRKITKEEKELRLKLRPLYQFMSCKEFDDLFENMHKEKMLRAKIRE
. .: . ..:. :: ..: .: . .... : : ..:.
CCDS45 ------NLRKFQLMERRYPKEVQDLYETMRRFARIVGPVEHDKFIESHACRWFLSLEQYL
240 250 260 270 280

300 310 320 330 340 350
pF1KE1 LQRYRRNGITKMEESAEYEAARHKREKRKENKNLAGSKRGKEDGKDSEFAAIENLPGFEL
CCDS45 CVYIYINRRDNGVFYVKFYK
290 300

420 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 10:29:19 2016 done: Sun Nov 6 10:29:20 2016
 Total Scan time: 2.890 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]