FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4437, 453 aa 1>>>pF1KE4437 453 - 453 aa - 453 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3172+/-0.000899; mu= 17.4975+/- 0.054 mean_var=68.0395+/-13.314, 0's: 0 Z-trim(106.1): 43 B-trim: 124 in 1/50 Lambda= 0.155487 statistics sampled from 8774 (8815) to 8774 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.642), E-opt: 0.2 (0.271), width: 16 Scan time: 2.570 The best scores are: opt bits E(32554) CCDS11212.1 ALDH3A1 gene_id:218|Hs108|chr17 ( 453) 2987 679.2 2.3e-195 CCDS82090.1 ALDH3A1 gene_id:218|Hs108|chr17 ( 380) 2529 576.4 1.7e-164 CCDS11210.1 ALDH3A2 gene_id:224|Hs108|chr17 ( 485) 2113 483.2 2.6e-136 CCDS32589.1 ALDH3A2 gene_id:224|Hs108|chr17 ( 508) 2113 483.2 2.7e-136 CCDS73335.1 ALDH3B1 gene_id:221|Hs108|chr11 ( 468) 1670 383.8 2.1e-106 CCDS73336.1 ALDH3B1 gene_id:221|Hs108|chr11 ( 431) 1453 335.1 8.7e-92 CCDS31622.1 ALDH3B2 gene_id:222|Hs108|chr11 ( 385) 1346 311.0 1.3e-84 CCDS76443.1 ALDH3B1 gene_id:221|Hs108|chr11 ( 351) 683 162.3 7.2e-40 CCDS10389.1 ALDH1A3 gene_id:220|Hs108|chr15 ( 512) 586 140.6 3.5e-33 CCDS6644.1 ALDH1A1 gene_id:216|Hs108|chr9 ( 501) 571 137.3 3.5e-32 CCDS9155.1 ALDH2 gene_id:217|Hs108|chr12 ( 517) 557 134.1 3.2e-31 CCDS6615.1 ALDH1B1 gene_id:219|Hs108|chr9 ( 517) 552 133.0 7e-31 CCDS55968.1 ALDH1A2 gene_id:8854|Hs108|chr15 ( 497) 547 131.9 1.5e-30 CCDS10163.1 ALDH1A2 gene_id:8854|Hs108|chr15 ( 518) 547 131.9 1.5e-30 CCDS45266.1 ALDH1A2 gene_id:8854|Hs108|chr15 ( 422) 516 124.9 1.6e-28 CCDS55885.1 ALDH2 gene_id:217|Hs108|chr12 ( 470) 512 124.0 3.2e-28 CCDS31891.1 ALDH1L2 gene_id:160428|Hs108|chr12 ( 923) 503 122.2 2.3e-27 CCDS4555.1 ALDH5A1 gene_id:7915|Hs108|chr6 ( 535) 484 117.8 2.8e-26 CCDS58850.1 ALDH1L1 gene_id:10840|Hs108|chr3 ( 801) 447 109.6 1.2e-23 CCDS3034.1 ALDH1L1 gene_id:10840|Hs108|chr3 ( 902) 447 109.6 1.4e-23 CCDS58851.1 ALDH1L1 gene_id:10840|Hs108|chr3 ( 912) 447 109.6 1.4e-23 CCDS1250.2 ALDH9A1 gene_id:223|Hs108|chr1 ( 518) 441 108.1 2.2e-23 CCDS76794.1 ALDH1A3 gene_id:220|Hs108|chr15 ( 405) 432 106.0 7.2e-23 CCDS10164.1 ALDH1A2 gene_id:8854|Hs108|chr15 ( 480) 376 93.5 5e-19 CCDS4137.2 ALDH7A1 gene_id:501|Hs108|chr5 ( 539) 371 92.4 1.2e-18 CCDS4556.1 ALDH5A1 gene_id:7915|Hs108|chr6 ( 548) 350 87.7 3.2e-17 >>CCDS11212.1 ALDH3A1 gene_id:218|Hs108|chr17 (453 aa) initn: 2987 init1: 2987 opt: 2987 Z-score: 3620.8 bits: 679.2 E(32554): 2.3e-195 Smith-Waterman score: 2987; 99.8% identity (100.0% similar) in 453 aa overlap (1-453:1-453) 10 20 30 40 50 60 pF1KE4 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 TIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER :::::::::::::.:::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TIQPMVGAIAAGNSVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE 370 380 390 400 410 420 430 440 450 pF1KE4 TFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH ::::::::::::::::::::::::::::::::: CCDS11 TFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH 430 440 450 >>CCDS82090.1 ALDH3A1 gene_id:218|Hs108|chr17 (380 aa) initn: 2529 init1: 2529 opt: 2529 Z-score: 3066.7 bits: 576.4 E(32554): 1.7e-164 Smith-Waterman score: 2529; 99.7% identity (100.0% similar) in 380 aa overlap (74-453:1-380) 50 60 70 80 90 100 pF1KE4 LVGALAADLHKNEWNAYYEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSE :::::::::::::::::::::::::::::: CCDS82 MIQKLPEWAADEPVEKTPQTQQDELYIHSE 10 20 30 110 120 130 140 150 160 pF1KE4 PLGVVLVIGTWNYPFNLTIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLY ::::::::::::::::::::::::::::::.::::::::::::::::::::::::::::: CCDS82 PLGVVLVIGTWNYPFNLTIQPMVGAIAAGNSVVLKPSELSENMASLLATIIPQYLDKDLY 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE4 PVINGGVPETTELLKERFDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 PVINGGVPETTELLKERFDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNC 100 110 120 130 140 150 230 240 250 260 270 280 pF1KE4 DLDVACRRIAWGKFMNSGQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 DLDVACRRIAWGKFMNSGQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYG 160 170 180 190 200 210 290 300 310 320 330 340 pF1KE4 RIISARHFQRVMGLIEGQKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 RIISARHFQRVMGLIEGQKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIV 220 230 240 250 260 270 350 360 370 380 390 400 pF1KE4 CVRSLEEAIQFINQREKPLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 CVRSLEEAIQFINQREKPLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFG 280 290 300 310 320 330 410 420 430 440 450 pF1KE4 GVGNSGMGSYHGKKSFETFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS82 GVGNSGMGSYHGKKSFETFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH 340 350 360 370 380 >>CCDS11210.1 ALDH3A2 gene_id:224|Hs108|chr17 (485 aa) initn: 2092 init1: 2092 opt: 2113 Z-score: 2560.8 bits: 483.2 E(32554): 2.6e-136 Smith-Waterman score: 2113; 67.9% identity (90.9% similar) in 439 aa overlap (8-446:5-443) 10 20 30 40 50 60 pF1KE4 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY :.:.: :: :::.:::.::.::::::.:..::.:.... :.:::: :.:.:.: CCDS11 MELEVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTAIAADLCKSEFNVY 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL .::. :: ::..:...::::.. .::.:. :. :: ::. .::::::.::.::::: : CCDS11 SQEVITVLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPLGVVLIIGAWNYPFVL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE4 TIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER ::::..:::::::::..:::::::: :..:: ..:::::.::: :::::: :::::::.: CCDS11 TIQPLIGAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLYIVINGGVEETTELLKQR 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE4 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS ::::.:::.:.::::.: ::::::::::::::::::::.::.::::..::::.:::.:: CCDS11 FDHIFYTGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDKDCDLDIVCRRITWGKYMNC 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE4 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG ::::.:::::::. :.::::: :.:...::::::. :.: :: :::. :::.:...:.:: CCDS11 GQTCIAPDYILCEASLQNQIVWKIKETVKEFYGENIKESPDYERIINLRHFKRILSLLEG 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE4 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK ::.:.:: : ::::::::.::::::.. :::::::::.:::: :....:::.:::.::: CCDS11 QKIAFGGETDEATRYIAPTVLTDVDPKTKVMQEEIFGPILPIVPVKNVDEAINFINEREK 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE4 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE :::::.:: : :.::.:: :::::::..::::.:.::.:.::::::.::::.::::.::. CCDS11 PLALYVFSHNHKLIKRMIDETSSGGVTGNDVIMHFTLNSFPFGGVGSSGMGAYHGKHSFD 360 370 380 390 400 410 430 440 450 pF1KE4 TFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH ::::.: ::.. : . . :.::::. CCDS11 TFSHQRPCLLKSLKREGANKLRYPPNSQSKVDWGKFFLLKRFNKEKLGLLLLTFLGIVAA 420 430 440 450 460 470 >>CCDS32589.1 ALDH3A2 gene_id:224|Hs108|chr17 (508 aa) initn: 2092 init1: 2092 opt: 2113 Z-score: 2560.5 bits: 483.2 E(32554): 2.7e-136 Smith-Waterman score: 2113; 67.9% identity (90.9% similar) in 439 aa overlap (8-446:5-443) 10 20 30 40 50 60 pF1KE4 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY :.:.: :: :::.:::.::.::::::.:..::.:.... :.:::: :.:.:.: CCDS32 MELEVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTAIAADLCKSEFNVY 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL .::. :: ::..:...::::.. .::.:. :. :: ::. .::::::.::.::::: : CCDS32 SQEVITVLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPLGVVLIIGAWNYPFVL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE4 TIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER ::::..:::::::::..:::::::: :..:: ..:::::.::: :::::: :::::::.: CCDS32 TIQPLIGAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLYIVINGGVEETTELLKQR 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE4 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS ::::.:::.:.::::.: ::::::::::::::::::::.::.::::..::::.:::.:: CCDS32 FDHIFYTGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDKDCDLDIVCRRITWGKYMNC 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE4 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG ::::.:::::::. :.::::: :.:...::::::. :.: :: :::. :::.:...:.:: CCDS32 GQTCIAPDYILCEASLQNQIVWKIKETVKEFYGENIKESPDYERIINLRHFKRILSLLEG 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE4 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK ::.:.:: : ::::::::.::::::.. :::::::::.:::: :....:::.:::.::: CCDS32 QKIAFGGETDEATRYIAPTVLTDVDPKTKVMQEEIFGPILPIVPVKNVDEAINFINEREK 300 310 320 330 340 350 370 380 390 400 410 420 pF1KE4 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE :::::.:: : :.::.:: :::::::..::::.:.::.:.::::::.::::.::::.::. CCDS32 PLALYVFSHNHKLIKRMIDETSSGGVTGNDVIMHFTLNSFPFGGVGSSGMGAYHGKHSFD 360 370 380 390 400 410 430 440 450 pF1KE4 TFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH ::::.: ::.. : . . :.::::. CCDS32 TFSHQRPCLLKSLKREGANKLRYPPNSQSKVDWGKFFLLKRFNKEKLGLLLLTFLGIVAA 420 430 440 450 460 470 >>CCDS73335.1 ALDH3B1 gene_id:221|Hs108|chr11 (468 aa) initn: 1689 init1: 1639 opt: 1670 Z-score: 2023.9 bits: 383.8 E(32554): 2.1e-106 Smith-Waterman score: 1670; 53.7% identity (80.5% similar) in 451 aa overlap (1-450:1-451) 10 20 30 40 50 60 pF1KE4 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY :. ......: : :: .::::: .:: ::..: :..::..: : ::: ::::. ... CCDS73 MDPLGDTLRRLREAFHAGRTRPAEFRAAQLQGLGRFLQENKQLLHDALAQDLHKSAFESE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL ::. :. ...: : :: : :. :: : .:..::.:.::.:. ::::.:: CCDS73 VSEVAISQGEVTLALRNLRAWMKDERVPKNLATQLDSAFIRKEPFGLVLIIAPWNYPLNL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 TIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER :. :.:::.:::: :::::::.:.:. ..:: ..:::.:.. . :. :: :: .::..: CCDS73 TLVPLVGALAAGNCVVLKPSEISKNVEKILAEVLPQYVDQSCFAVVLGGPQETGQLLEHR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS ::.:..::: ::::.::::::::::::::::::.::::: ::: ... :.:: ...:. CCDS73 FDYIFFTGSPRVGKIVMTAAAKHLTPVTLELGGKNPCYVDDNCDPQTVANRVAWFRYFNA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG :::::::::.::.: .:.... :.... .:::.: ..: . ::::. ..:::. .:. CCDS73 GQTCVAPDYVLCSPEMQERLLPALQSTITRFYGDDPQSSPNLGRIINQKQFQRLRALLGC 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK .:: :: .: . ::::::.:.::. . ::::::::::.:::: :.::.:::.:::.::: CCDS73 GRVAIGGQSDESDRYIAPTVLVDVQEMEPVMQEEIFGPILPIVNVQSLDEAIEFINRREK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE ::::: ::....:.:.....::::: .:: ..:.:: :::::::: :::: :::: ::. CCDS73 PLALYAFSNSSQVVKRVLTQTSSGGFCGNDGFMHMTLASLPFGGVGASGMGRYHGKFSFD 370 380 390 400 410 420 430 440 450 pF1KE4 TFSHRRSCLVRPLMNDEGLKVRYPP-SPAKMTQH ::::.:.::.: .. .:::: :: .. CCDS73 TFSHHRACLLRSPGMEKLNALRYPPQSPRRLRMLLVAMEAQGCSCTLL 430 440 450 460 >>CCDS73336.1 ALDH3B1 gene_id:221|Hs108|chr11 (431 aa) initn: 1634 init1: 1447 opt: 1453 Z-score: 1761.4 bits: 335.1 E(32554): 8.7e-92 Smith-Waterman score: 1544; 51.7% identity (76.3% similar) in 451 aa overlap (1-450:1-414) 10 20 30 40 50 60 pF1KE4 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY :. ......: : :: .::::: .:: ::..: :..::..: : ::: :::: CCDS73 MDPLGDTLRRLREAFHAGRTRPAEFRAAQLQGLGRFLQENKQLLHDALAQDLHKA----- 10 20 30 40 50 70 80 90 100 110 120 pF1KE4 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL :: : .:..::.:.::.:. ::::.:: CCDS73 --------------------------------TQLDSAFIRKEPFGLVLIIAPWNYPLNL 60 70 80 130 140 150 160 170 180 pF1KE4 TIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER :. :.:::.:::: :::::::.:.:. ..:: ..:::.:.. . :. :: :: .::..: CCDS73 TLVPLVGALAAGNCVVLKPSEISKNVEKILAEVLPQYVDQSCFAVVLGGPQETGQLLEHR 90 100 110 120 130 140 190 200 210 220 230 240 pF1KE4 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS ::.:..::: ::::.::::::::::::::::::.::::: ::: ... :.:: ...:. CCDS73 FDYIFFTGSPRVGKIVMTAAAKHLTPVTLELGGKNPCYVDDNCDPQTVANRVAWFRYFNA 150 160 170 180 190 200 250 260 270 280 290 300 pF1KE4 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG :::::::::.::.: .:.... :.... .:::.: ..: . ::::. ..:::. .:. CCDS73 GQTCVAPDYVLCSPEMQERLLPALQSTITRFYGDDPQSSPNLGRIINQKQFQRLRALLGC 210 220 230 240 250 260 310 320 330 340 350 360 pF1KE4 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK .:: :: .: . ::::::.:.::. . ::::::::::.:::: :.::.:::.:::.::: CCDS73 GRVAIGGQSDESDRYIAPTVLVDVQEMEPVMQEEIFGPILPIVNVQSLDEAIEFINRREK 270 280 290 300 310 320 370 380 390 400 410 420 pF1KE4 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE ::::: ::....:.:.....::::: .:: ..:.:: :::::::: :::: :::: ::. CCDS73 PLALYAFSNSSQVVKRVLTQTSSGGFCGNDGFMHMTLASLPFGGVGASGMGRYHGKFSFD 330 340 350 360 370 380 430 440 450 pF1KE4 TFSHRRSCLVRPLMNDEGLKVRYPP-SPAKMTQH ::::.:.::.: .. .:::: :: .. CCDS73 TFSHHRACLLRSPGMEKLNALRYPPQSPRRLRMLLVAMEAQGCSCTLL 390 400 410 420 430 >>CCDS31622.1 ALDH3B2 gene_id:222|Hs108|chr11 (385 aa) initn: 1346 init1: 1346 opt: 1346 Z-score: 1632.4 bits: 311.0 E(32554): 1.3e-84 Smith-Waterman score: 1346; 52.5% identity (82.9% similar) in 362 aa overlap (84-445:3-364) 60 70 80 90 100 110 pF1KE4 KNEWNAYYEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGT ::: . . : ..: .::.:.::.:. CCDS31 MKDEPRSTNLFMKLDSVFIWKEPFGLVLIIAP 10 20 30 120 130 140 150 160 170 pF1KE4 WNYPFNLTIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPET ::::.:::. .:::.:::. :::::::.:.. ..:: ..:::::.. . :. :: :: CCDS31 WNYPLNLTLVLLVGALAAGSCVVLKPSEISQGTEKVLAEVLPQYLDQSCFAVVLGGPQET 40 50 60 70 80 90 180 190 200 210 220 230 pF1KE4 TELLKERFDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIA .::....:.:..::: ::::.::::.:::::::::::::.::::: ::: ... :.: CCDS31 GQLLEHKLDYIFFTGSPRVGKIVMTAATKHLTPVTLELGGKNPCYVDDNCDPQTVANRVA 100 110 120 130 140 150 240 250 260 270 280 290 pF1KE4 WGKFMNSGQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQR : ..:.:::::::::.::.: .:.... :.... .:::.: ..: . :.::. ..::: CCDS31 WFCYFNAGQTCVAPDYVLCSPEMQERLLPALQSTITRFYGDDPQSSPNLGHIINQKQFQR 160 170 180 190 200 210 300 310 320 330 340 350 pF1KE4 VMGLIEGQKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQ . .:. ..:: :: .. . ::::::.:.::. ::::::::::.:::: :.:..:::. CCDS31 LRALLGCSRVAIGGQSNESDRYIAPTVLVDVQETEPVMQEEIFGPILPIVNVQSVDEAIK 220 230 240 250 260 270 360 370 380 390 400 410 pF1KE4 FINQREKPLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSY :::..::::::: ::....:...:. .::::. ..:. ...:.: :.::::::.:::: : CCDS31 FINRQEKPLALYAFSNSSQVVNQMLERTSSGSFGGNEGFTYISLLSVPFGGVGHSGMGRY 280 290 300 310 320 330 420 430 440 450 pF1KE4 HGKKSFETFSHRRSCLVRPLMNDEGLKVRYPPSPAKMTQH ::: .:.::::.:.::. : .. ...::: CCDS31 HGKFTFDTFSHHRTCLLAPSGLEKLKEIHYPPYTDWNQQLLRWGMGSQSCTLL 340 350 360 370 380 >>CCDS76443.1 ALDH3B1 gene_id:221|Hs108|chr11 (351 aa) initn: 1179 init1: 660 opt: 683 Z-score: 829.2 bits: 162.3 E(32554): 7.2e-40 Smith-Waterman score: 937; 40.0% identity (59.2% similar) in 453 aa overlap (1-450:1-334) 10 20 30 40 50 60 pF1KE4 MSKISEAVKRARAAFSSGRTRPLQFRIQQLEALQRLIQEQEQELVGALAADLHKNEWNAY :. ......: : :: .::::: .:: ::..: :..::..: : ::: ::::. ... CCDS76 MDPLGDTLRRLREAFHAGRTRPAEFRAAQLQGLGRFLQENKQLLHDALAQDLHKSAFESE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 YEEVVYVLEEIEYMIQKLPEWAADEPVEKTPQTQQDELYIHSEPLGVVLVIGTWNYPFNL ::. :. ...: : :: : :. :: : .:..::.:.::.:. ::::.:: CCDS76 VSEVAISQGEVTLALRNLRAWMKDERVPKNLATQLDSAFIRKEPFGLVLIIAPWNYPLNL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 TIQPMVGAIAAGNAVVLKPSELSENMASLLATIIPQYLDKDLYPVINGGVPETTELLKER :. :.:::.:::: :::::::.:.:. ..:: ..:::.:.. CCDS76 TLVPLVGALAAGNCVVLKPSEISKNVEKILAEVLPQYVDQS------------------- 130 140 150 160 190 200 210 220 230 240 pF1KE4 FDHILYTGSTGVGKIIMTAAAKHLTPVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNS :: CCDS76 ----------------------------------SP------------------------ 250 260 270 280 290 300 pF1KE4 GQTCVAPDYILCDPSIQNQIVEKLKKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIEG . ::::. ..:::. .:. CCDS76 ----------------------------------------NLGRIINQKQFQRLRALLGC 170 180 310 320 330 340 350 360 pF1KE4 QKVAYGGTGDAATRYIAPTILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREK .:: :: .: . ::::::.:.::. . ::::::::::.:::: :.::.:::.:::.::: CCDS76 GRVAIGGQSDESDRYIAPTVLVDVQEMEPVMQEEIFGPILPIVNVQSLDEAIEFINRREK 190 200 210 220 230 240 370 380 390 400 410 420 pF1KE4 PLALYMFSSNDKVIKKMIAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFE ::::: ::....:.:.....::::: .:: ..:.:: :::::::: :::: :::: ::. CCDS76 PLALYAFSNSSQVVKRVLTQTSSGGFCGNDGFMHMTLASLPFGGVGASGMGRYHGKFSFD 250 260 270 280 290 300 430 440 450 pF1KE4 TFSHRRSCLVR-PLMNDEGLK-VRYPP-SPAKMTQH ::::.:.::.: : : : :. .:::: :: .. CCDS76 TFSHHRACLLRSPGM--EKLNALRYPPQSPRRLRMLLVAMEAQGCSCTLL 310 320 330 340 350 >>CCDS10389.1 ALDH1A3 gene_id:220|Hs108|chr15 (512 aa) initn: 491 init1: 159 opt: 586 Z-score: 709.2 bits: 140.6 E(32554): 3.5e-33 Smith-Waterman score: 586; 28.2% identity (59.5% similar) in 444 aa overlap (4-431:71-506) 10 20 30 pF1KE4 MSKISEAVKRARAAFSSG---RTRPLQFRIQQL ...::. :..::. : : : . : CCDS10 HESKSGKKFATCNPSTREQICEVEEGDKPDVDKAVEAAQVAFQRGSPWRRLDALSRGRLL 50 60 70 80 90 100 40 50 60 70 80 90 pF1KE4 EALQRLIQEQEQELVGALAADLHKNEWNAYYEEVVYVLEEIEYMIQKLPEWAADEPVEKT . : :..... :.. . : : .:.. .. .. ..:. :: :. :: CCDS10 HQLADLVERDRATLAALETMDTGKPFLHAFFIDLEGCIRTLRYFAG----WA-DKIQGKT 110 120 130 140 150 100 110 120 130 140 pF1KE4 PQTQQDEL-YIHSEPLGVVLVIGTWNYPFNLTIQPMVGAIAAGNAVVLKPSELSENMASL :... . . . ::.:: .: ::.:. . . .. :. ::..::::.: . : CCDS10 IPTDDNVVCFTRHEPIGVCGAITPWNFPLLMLVWKLAPALCCGNTMVLKPAEQTPLTALY 160 170 180 190 200 210 150 160 170 180 190 200 pF1KE4 LATIIPQY-LDKDLYPVINGGVPETTELLKE--RFDHILYTGSTGVGKIIMTAAAK-HLT :...: . . . .. : : . .. ....: .:::: :::.. ::.. .: CCDS10 LGSLIKEAGFPPGVVNIVPGFGPTVGAAISSHPQINKIAFTGSTEVGKLVKEAASRSNLK 220 230 240 250 260 270 210 220 230 240 250 260 pF1KE4 PVTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNSGQTCVAPDYILCDPSIQNQIVEK-L ::::::::.:: : . :::.: . : :.:.:: :.: . .. . .. ...:.. . CCDS10 RVTLELGGKNPCIVCADADLDLAVECAHQGVFFNQGQCCTAASRVFVEEQVYSEFVRRSV 280 290 300 310 320 330 270 280 290 300 310 pF1KE4 KKSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIE-----GQKVAYGGTG-DAATRYIAP . . :. :. . . : :. ..:.... ::: : :. ::.. . .: : CCDS10 EYAKKRPVGDPFDVKTEQGPQIDQKQFDKILELIESGKKEGAKLECGGSAMEDKGLFIKP 340 350 360 370 380 390 320 330 340 350 360 370 pF1KE4 TILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREKPLALYMFSSN-DKVIKKM :....: . . .::::::: ::. .:.::.:. :. . :. .:..: ::..: . CCDS10 TVFSEVTDNMRIAKEEIFGPVQPILKFKSIEEVIKRANSTDYGLTAAVFTKNLDKALK-L 400 410 420 430 440 450 380 390 400 410 420 430 pF1KE4 IAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFETFSHRRSCLVRPLMNDE . :: : : . . :::: :: : :. .. ... .. .. CCDS10 ASALESGTVWIN--CYNALYAQAPFGGFKMSGNGRELGEYALAEYTEVKTVTIKLGDKNP 460 470 480 490 500 510 440 450 pF1KE4 GLKVRYPPSPAKMTQH >>CCDS6644.1 ALDH1A1 gene_id:216|Hs108|chr9 (501 aa) initn: 490 init1: 164 opt: 571 Z-score: 691.1 bits: 137.3 E(32554): 3.5e-32 Smith-Waterman score: 571; 29.5% identity (57.9% similar) in 444 aa overlap (4-431:60-495) 10 20 30 pF1KE4 MSKISEAVKRARAAFSSG---RTRPLQFRIQQL ...::: :: ::. : :: . : . : CCDS66 HDSVSGKKFPVFNPATEEELCQVEEGDKEDVDKAVKAARQAFQIGSPWRTMDASERGRLL 30 40 50 60 70 80 40 50 60 70 80 90 pF1KE4 EALQRLIQEQEQELVGALAADLHKNEWNAYYEEVVYVLEEIEYMIQKLPEWAADEPVEKT : ::.... :. . . : ::: .... .. ..: :: . CCDS66 YKLADLIERDRLLLATMESMNGGKLYSNAYLNDLAGCIKTLRYCAG----WADKIQGRTI 90 100 110 120 130 140 100 110 120 130 140 150 pF1KE4 PQTQQDELYIHSEPLGVVLVIGTWNYPFNLTIQPMVGAIAAGNAVVLKPSELSENMASLL : . : . ::.:: : ::.:. . : . :.. ::.::.::.: . : . CCDS66 PIDGNFFTYTRHEPIGVCGQIIPWNFPLVMLIWKIGPALSCGNTVVVKPAEQTPLTALHV 150 160 170 180 190 200 160 170 180 190 200 pF1KE4 ATIIPQY-LDKDLYPVINGGVPETTELLKERFD--HILYTGSTGVGKIIMTAAAK-HLTP :..: . . . .. : : . .. ..: .. .:::: :::.: ::.: .: CCDS66 ASLIKEAGFPPGVVNIVPGYGPTAGAAISSHMDIDKVAFTGSTEVGKLIKEAAGKSNLKR 210 220 230 240 250 260 210 220 230 240 250 260 pF1KE4 VTLELGGKSPCYVDKNCDLDVACRRIAWGKFMNSGQTCVAPDYILCDPSIQNQIVEK-LK ::::::::::: : . ::: : . : :...:: :.: . :. . :: ...:.. .. CCDS66 VTLELGGKSPCIVLADADLDNAVEFAHHGVFYHQGQCCIAASRIFVEESIYDEFVRRSVE 270 280 290 300 310 320 270 280 290 300 310 pF1KE4 KSLKEFYGEDAKKSRDYGRIISARHFQRVMGLIE-----GQKVAYGGTGDAATR--YIAP .. : . :. . : :. ....... ::: : :. :: : ... .. : CCDS66 RAKKYILGNPLTPGVTQGPQIDKEQYDKILDLIESGKKEGAKLECGG-GPWGNKGYFVQP 330 340 350 360 370 380 320 330 340 350 360 370 pF1KE4 TILTDVDPQSPVMQEEIFGPVLPIVCVRSLEEAIQFINQREKPLALYMFSSN-DKVIKKM :....: . . .::::::: :. .::...:. :. :. .:... ::.: . CCDS66 TVFSNVTDEMRIAKEEIFGPVQQIMKFKSLDDVIKRANNTFYGLSAGVFTKDIDKAIT-I 390 400 410 420 430 440 380 390 400 410 420 430 pF1KE4 IAETSSGGVAANDVIVHITLHSLPFGGVGNSGMGSYHGKKSFETFSHRRSCLVRPLMNDE . ..: : .: .. . :::: :: : :. .:. ... .. :. CCDS66 SSALQAGTVWVN--CYGVVSAQCPFGGFKMSGNGRELGEYGFHEYTEVKTVTVKISQKNS 450 460 470 480 490 500 440 450 pF1KE4 GLKVRYPPSPAKMTQH 453 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 17:40:28 2016 done: Mon Nov 7 17:40:29 2016 Total Scan time: 2.570 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]