FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4445, 461 aa 1>>>pF1KE4445 461 - 461 aa - 461 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9258+/-0.000747; mu= 19.8784+/- 0.045 mean_var=66.1627+/-12.880, 0's: 0 Z-trim(108.8): 11 B-trim: 7 in 1/50 Lambda= 0.157677 statistics sampled from 10467 (10473) to 10467 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.686), E-opt: 0.2 (0.322), width: 16 Scan time: 2.960 The best scores are: opt bits E(32554) CCDS244.2 FUCA1 gene_id:2517|Hs108|chr1 ( 466) 3321 764.2 0 CCDS5200.1 FUCA2 gene_id:2519|Hs108|chr6 ( 467) 1860 431.9 6.8e-121 >>CCDS244.2 FUCA1 gene_id:2517|Hs108|chr1 (466 aa) initn: 3321 init1: 3321 opt: 3321 Z-score: 4080.0 bits: 764.2 E(32554): 0 Smith-Waterman score: 3321; 100.0% identity (100.0% similar) in 461 aa overlap (1-461:6-466) 10 20 30 40 50 pF1KE4 MRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGV ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 MRAPGMRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 FIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 FIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEW 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 ADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 ADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYG 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYW 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE4 NSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 NSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID 250 260 270 280 290 300 300 310 320 330 340 350 pF1KE4 KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGK 310 320 330 340 350 360 360 370 380 390 400 410 pF1KE4 WLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS24 WLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTK 370 380 390 400 410 420 420 430 440 450 460 pF1KE4 ITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK :::::::::::::::::::::::::::::::::::::::::::::: CCDS24 ITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK 430 440 450 460 >>CCDS5200.1 FUCA2 gene_id:2519|Hs108|chr6 (467 aa) initn: 1074 init1: 929 opt: 1860 Z-score: 2283.9 bits: 431.9 E(32554): 6.8e-121 Smith-Waterman score: 1860; 54.9% identity (77.7% similar) in 461 aa overlap (4-460:8-466) 10 20 30 40 50 pF1KE4 MRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGVF : : : ::::::.: :. :. : : :::.: ::::::.::::.: CCDS52 MRPQELPRLAFPLLLLLLLLLPPPPC--PAHSATRFDPTWESLDARQLPAWFDQAKFGIF 10 20 30 40 50 60 70 80 90 100 110 pF1KE4 IHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEWA :::::::::..:::::::.:: : :.: .::.:::::.:.: :::: :::.::. ..:: CCDS52 IHWGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWA 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE4 DLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKR-NIRYG :.:::.::::.:::.::::::: : : :::::. : ::.::.: :: .:.:.: ..:.: CCDS52 DIFQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE4 LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYW ::.::.::::::.: :....:. ..: .::.::::.:::.:.:...::::. :: :: CCDS52 LYYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYW 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE4 NSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID :::.::.::::.:::. ::.::::: . :.:::.:.: :...: : :::: : .:: CCDS52 NSTGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTID 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE4 KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGK :.::::::. ..:: :....::.::: ::: :.::::: :: : .:.::: .:. CCDS52 KLSWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQMGS 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE4 WLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSA--VYAIFLHWPENGVLNLESP-ITTS ::..:::::: .. :: : . : .:::::: . ::::::.:: .: : : : . CCDS52 WLKVNGEAIYETHTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILG 360 370 380 390 400 410 420 430 440 450 460 pF1KE4 TTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK .:.. .:: :.: . ..:... :::: .: ...:.. ::.: CCDS52 ATEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI 420 430 440 450 460 461 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:05:43 2016 done: Sun Nov 6 01:05:44 2016 Total Scan time: 2.960 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]