FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4548, 647 aa 1>>>pF1KE4548 647 - 647 aa - 647 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.7873+/-0.00105; mu= -3.1476+/- 0.064 mean_var=330.1819+/-65.630, 0's: 0 Z-trim(114.1): 14 B-trim: 7 in 1/52 Lambda= 0.070583 statistics sampled from 14675 (14681) to 14675 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.739), E-opt: 0.2 (0.451), width: 16 Scan time: 4.520 The best scores are: opt bits E(32554) CCDS8354.1 DLAT gene_id:1737|Hs108|chr11 ( 647) 4303 452.0 1.1e-126 CCDS44569.1 PDHX gene_id:8050|Hs108|chr11 ( 486) 781 93.3 8.4e-19 CCDS7896.1 PDHX gene_id:8050|Hs108|chr11 ( 501) 781 93.3 8.6e-19 >>CCDS8354.1 DLAT gene_id:1737|Hs108|chr11 (647 aa) initn: 4303 init1: 4303 opt: 4303 Z-score: 2387.6 bits: 452.0 E(32554): 1.1e-126 Smith-Waterman score: 4303; 100.0% identity (100.0% similar) in 647 aa overlap (1-647:1-647) 10 20 30 40 50 60 pF1KE4 MWRVCARRAQNVAPWAGLEARWTALQEVPGTPRVTSRSGPAPARRNSVTTGYGGVRALCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 MWRVCARRAQNVAPWAGLEARWTALQEVPGTPRVTSRSGPAPARRNSVTTGYGGVRALCG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 WTPSSGATPRNRLLLQLLGSPGRRYYSLPPHQKVPLPSLSPTMQAGTIARWEKKEGDKIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 WTPSSGATPRNRLLLQLLGSPGRRYYSLPPHQKVPLPSLSPTMQAGTIARWEKKEGDKIN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 EGDLIAEVETDKATVGFESLEECYMAKILVAEGTRDVPIGAIICITVGKPEDIEAFKNYT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 EGDLIAEVETDKATVGFESLEECYMAKILVAEGTRDVPIGAIICITVGKPEDIEAFKNYT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 LDSSAAPTPQAAPAPTPAATASPPTPSAQAPGSSYPPHMQVLLPALSPTMTMGTVQRWEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 LDSSAAPTPQAAPAPTPAATASPPTPSAQAPGSSYPPHMQVLLPALSPTMTMGTVQRWEK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 KVGEKLSEGDLLAEIETDKATIGFEVQEEGYLAKILVPEGTRDVPLGTPLCIIVEKEADI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 KVGEKLSEGDLLAEIETDKATIGFEVQEEGYLAKILVPEGTRDVPLGTPLCIIVEKEADI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 SAFADYRPTEVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSAPCPATPAGPKGRVFVSPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 SAFADYRPTEVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSAPCPATPAGPKGRVFVSPL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 AKKLAVEKGIDLTQVKGTGPDGRITKKDIDSFVPSKVAPAPAAVVPPTGPGMAPVPTGVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 AKKLAVEKGIDLTQVKGTGPDGRITKKDIDSFVPSKVAPAPAAVVPPTGPGMAPVPTGVF 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 TDIPISNIRRVIAQRLMQSKQTIPHYYLSIDVNMGEVLLVRKELNKILEGRSKISVNDFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 TDIPISNIRRVIAQRLMQSKQTIPHYYLSIDVNMGEVLLVRKELNKILEGRSKISVNDFI 430 440 450 460 470 480 490 500 510 520 530 540 pF1KE4 IKASALACLKVPEANSSWMDTVIRQNHVVDVSVAVSTPAGLITPIVFNAHIKGVETIAND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 IKASALACLKVPEANSSWMDTVIRQNHVVDVSVAVSTPAGLITPIVFNAHIKGVETIAND 490 500 510 520 530 540 550 560 570 580 590 600 pF1KE4 VVSLATKAREGKLQPHEFQGGTFTISNLGMFGIKNFSAIINPPQACILAIGASEDKLVPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 VVSLATKAREGKLQPHEFQGGTFTISNLGMFGIKNFSAIINPPQACILAIGASEDKLVPA 550 560 570 580 590 600 610 620 630 640 pF1KE4 DNEKGFDVASMMSVTLSCDHRVVDGAVGAQWLAEFRKYLEKPITMLL ::::::::::::::::::::::::::::::::::::::::::::::: CCDS83 DNEKGFDVASMMSVTLSCDHRVVDGAVGAQWLAEFRKYLEKPITMLL 610 620 630 640 >>CCDS44569.1 PDHX gene_id:8050|Hs108|chr11 (486 aa) initn: 1014 init1: 456 opt: 781 Z-score: 451.0 bits: 93.3 E(32554): 8.4e-19 Smith-Waterman score: 1100; 40.9% identity (66.7% similar) in 484 aa overlap (197-645:19-485) 170 180 190 200 210 220 pF1KE4 VGKPEDIEAFKNYTLDSSAAPTPQAAPAPTPAATASPPTP-SAQAPGSSYPPHMQVLLPA :.. .::. :. :: ...:.:. CCDS44 MQSGGAEGSPGAGRTGRGPGSGKAPPAEISSGAPDFPGGDPIKILMPS 10 20 30 40 230 240 250 260 270 280 pF1KE4 LSPTMTMGTVQRWEKKVGEKLSEGDLLAEIETDKATIGFEVQEEGYLAKILVPEGTRDVP ::::: :.. .: :: :: .: :: : :::::::.. ......: ::::.: ::.... CCDS44 LSPTMEEGNIVKWLKKEGEAVSAGDALCEIETDKAVVTLDASDDGILAKIVVEEGSKNIR 50 60 70 80 90 100 290 300 310 320 330 340 pF1KE4 LGTPLCIIVEKEADISAFADYRPTEVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSAPCP ::. . .::: :.. :.. .:. :. : ::::. : :.: .: :. : CCDS44 LGSLIGLIVE-EGE-----DWKHVEI----PKDVGP-PPPVSK-PSEPRP-SPEPQISIP 110 120 130 140 150 350 360 370 380 390 pF1KE4 ATPAGPKG--RVFVSPLAKKLAVEKGIDLTQVKGTGPDGRITKKDIDSFVP--------- . : : .:: :... ....: .: .::: : .::.: ..: CCDS44 VKKEHIPGTLRFRLSPAARNILEKHSLDASQGTATGPRGIFTKEDALKLVQLKQTGKITE 160 170 180 190 200 210 400 410 420 430 pF1KE4 SKVAPAPAA-------------------VVPPTG-PGMAPVPTGVFTDIPISNIRRVIAQ :. .:::.: :.::.. ::. : .:.::.:: ::::::::. CCDS44 SRPTPAPTATPTAPSPLQATAGPSYPRPVIPPVSTPGQ-PNAVGTFTEIPASNIRRVIAK 220 230 240 250 260 270 440 450 460 470 480 490 pF1KE4 RLMQSKQTIPHYYLSIDVNMGEVLLVRKELNKILEGRSKISVNDFIIKASALACLKVPEA :: .::.:.:: : . : ..: :: ::..: : :.:::::::::.:.. ..:.. CCDS44 RLTESKSTVPHAYATADCDLGAVLKVRQDLVK---DDIKVSVNDFIIKAAAVTLKQMPDV 280 290 300 310 320 330 500 510 520 530 540 550 pF1KE4 NSSWMDTVIRQNHVVDVSVAVSTPAGLITPIVFNAHIKGVETIANDVVSLATKAREGKLQ : :: .: .:.::::.: ::.:::. .: ::.. ::..: .:. :::.::: CCDS44 NVSWDGEGPKQLPFIDISVAVATDKGLLTPIIKDAAAKGIQEIADSVKALSKKARDGKLL 340 350 360 370 380 390 560 570 580 590 600 610 pF1KE4 PHEFQGGTFTISNLGMFGIKNFSAIINPPQACILAIGASEDKLVPADNEKG---FDVASM :.:.:::.:.::::::::: .:.:.::::::::::.: . : ...:.: .. .. CCDS44 PEEYQGGSFSISNLGMFGIDEFTAVINPPQACILAVGRFRPVLKLTEDEEGNAKLQQRQL 400 410 420 430 440 450 620 630 640 pF1KE4 MSVTLSCDHRVVDGAVGAQWLAEFRKYLEKPITMLL ..::.: : :::: .....: :. ::.:: . CCDS44 ITVTMSSDSRVVDDELATRFLKSFKANLENPIRLA 460 470 480 >>CCDS7896.1 PDHX gene_id:8050|Hs108|chr11 (501 aa) initn: 1052 init1: 456 opt: 781 Z-score: 450.8 bits: 93.3 E(32554): 8.6e-19 Smith-Waterman score: 1078; 41.6% identity (67.7% similar) in 461 aa overlap (219-645:57-500) 190 200 210 220 230 240 pF1KE4 PQAAPAPTPAATASPPTPSAQAPGSSYPPHMQVLLPALSPTMTMGTVQRWEKKVGEKLSE ...:.:.::::: :.. .: :: :: .: CCDS78 GLVKGALGWSVSRGANWRWFHSTQWLRGDPIKILMPSLSPTMEEGNIVKWLKKEGEAVSA 30 40 50 60 70 80 250 260 270 280 290 300 pF1KE4 GDLLAEIETDKATIGFEVQEEGYLAKILVPEGTRDVPLGTPLCIIVEKEADISAFADYRP :: : :::::::.. ......: ::::.: ::.... ::. . .::: :.. :.. CCDS78 GDALCEIETDKAVVTLDASDDGILAKIVVEEGSKNIRLGSLIGLIVE-EGE-----DWKH 90 100 110 120 130 140 310 320 330 340 350 360 pF1KE4 TEVTDLKPQVPPPTPPPVAAVPPTPQPLAPTPSAPCPATPAGPKG--RVFVSPLAKKLAV .:. :. : ::::. : :.: .: :. :. : : .:: :... CCDS78 VEI----PKDVGP-PPPVSK-PSEPRP-SPEPQISIPVKKEHIPGTLRFRLSPAARNILE 150 160 170 180 190 370 380 390 400 pF1KE4 EKGIDLTQVKGTGPDGRITKKDIDSFVP---------SKVAPAPAA-------------- ....: .: .::: : .::.: ..: :. .:::.: CCDS78 KHSLDASQGTATGPRGIFTKEDALKLVQLKQTGKITESRPTPAPTATPTAPSPLQATAGP 200 210 220 230 240 250 410 420 430 440 450 pF1KE4 -----VVPPTG-PGMAPVPTGVFTDIPISNIRRVIAQRLMQSKQTIPHYYLSIDVNMGEV :.::.. ::. : .:.::.:: ::::::::.:: .::.:.:: : . : ..: : CCDS78 SYPRPVIPPVSTPGQ-PNAVGTFTEIPASNIRRVIAKRLTESKSTVPHAYATADCDLGAV 260 270 280 290 300 310 460 470 480 490 500 510 pF1KE4 LLVRKELNKILEGRSKISVNDFIIKASALACLKVPEANSSWMDTVIRQNHVVDVSVAVST : ::..: : :.:::::::::.:.. ..:..: :: .: .:.::::.: CCDS78 LKVRQDLVK---DDIKVSVNDFIIKAAAVTLKQMPDVNVSWDGEGPKQLPFIDISVAVAT 320 330 340 350 360 520 530 540 550 560 570 pF1KE4 PAGLITPIVFNAHIKGVETIANDVVSLATKAREGKLQPHEFQGGTFTISNLGMFGIKNFS ::.:::. .: ::.. ::..: .:. :::.::: :.:.:::.:.::::::::: .:. CCDS78 DKGLLTPIIKDAAAKGIQEIADSVKALSKKARDGKLLPEEYQGGSFSISNLGMFGIDEFT 370 380 390 400 410 420 580 590 600 610 620 630 pF1KE4 AIINPPQACILAIGASEDKLVPADNEKG---FDVASMMSVTLSCDHRVVDGAVGAQWLAE :.::::::::::.: . : ...:.: .. ....::.: : :::: .....: CCDS78 AVINPPQACILAVGRFRPVLKLTEDEEGNAKLQQRQLITVTMSSDSRVVDDELATRFLKS 430 440 450 460 470 480 640 pF1KE4 FRKYLEKPITMLL :. ::.:: . CCDS78 FKANLENPIRLA 490 500 647 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:03:03 2016 done: Sun Nov 6 00:03:04 2016 Total Scan time: 4.520 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]