FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6892, 361 aa 1>>>pF1KE6892 361 - 361 aa - 361 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4200+/-0.000639; mu= 16.8197+/- 0.039 mean_var=77.5532+/-15.742, 0's: 0 Z-trim(112.1): 13 B-trim: 0 in 0/49 Lambda= 0.145638 statistics sampled from 12936 (12945) to 12936 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.398), width: 16 Scan time: 2.340 The best scores are: opt bits E(32554) CCDS12153.1 FUT3 gene_id:2525|Hs108|chr19 ( 361) 2566 548.1 4.4e-156 CCDS12152.1 FUT6 gene_id:2528|Hs108|chr19 ( 359) 2160 462.8 2.1e-130 CCDS12154.1 FUT5 gene_id:2527|Hs108|chr19 ( 374) 2051 439.9 1.7e-123 CCDS7022.1 FUT7 gene_id:2529|Hs108|chr9 ( 342) 927 203.7 2e-52 CCDS5033.1 FUT9 gene_id:10690|Hs108|chr6 ( 359) 873 192.3 5.3e-49 CCDS8301.1 FUT4 gene_id:2526|Hs108|chr11 ( 530) 859 189.5 5.5e-48 >>CCDS12153.1 FUT3 gene_id:2525|Hs108|chr19 (361 aa) initn: 2566 init1: 2566 opt: 2566 Z-score: 2915.6 bits: 548.1 E(32554): 4.4e-156 Smith-Waterman score: 2566; 99.4% identity (99.4% similar) in 361 aa overlap (1-361:1-361) 10 20 30 40 50 60 pF1KE6 MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGSPRAPSGSSRQDTTPTR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGSPRAPSGSSRQDTTPTR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PTLLILLWTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQADTVIVHHWDIMSNPKSR ::::::: :::::::::::::::::::::::::::::::::::: ::::::::::::::: CCDS12 PTLLILLRTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQADMVIVHHWDIMSNPKSR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 LPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYRSDSDIFTPYGWLEPWSGQPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYRSDSDIFTPYGWLEPWSGQPA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 HPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHKPLPKGTMMETLSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 HPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHKPLPKGTMMETLSR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 YKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 YKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 DLARYLQELDKDHARYLSYFRWRETLRPRSFSWALDFCKACWKLQQESRYQTVRSIAAWF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 DLARYLQELDKDHARYLSYFRWRETLRPRSFSWALDFCKACWKLQQESRYQTVRSIAAWF 310 320 330 340 350 360 pF1KE6 T : CCDS12 T >>CCDS12152.1 FUT6 gene_id:2528|Hs108|chr19 (359 aa) initn: 2114 init1: 1882 opt: 2160 Z-score: 2454.6 bits: 462.8 E(32554): 2.1e-130 Smith-Waterman score: 2160; 84.3% identity (91.7% similar) in 363 aa overlap (1-361:1-359) 10 20 30 40 50 pF1KE6 MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGSPRAPSGSSRQDTT--P ::::: ::::: :: ::..::::::.:::::::::::.:: : :.:: :.: : CCDS12 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPT---VYPNGSRFPDSTGTP 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 TRPTLLILLWTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQADTVIVHHWDIMSNPK .. :::::::::. :.:: ::::::::::::.::::::::::::.::::: ..: ::. CCDS12 AHSIPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 SRLPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYRSDSDIFTPYGWLEPWSGQ ..:: ::: ::::::::..: : .: .:.:.: ::::::::::::::::::::::::::: CCDS12 AQLPRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQ 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 PAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHKPLPKGTMMETL :::::::::::::::::::::: :.:::::::::::::::::::::::::::.::::::: CCDS12 PAHPPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 SRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 SRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQS 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 PKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALDFCKACWKLQQESRYQTVRSIAA ::::::::::::::::::::::::::::::::::::: :::::::::.:::::: :.::: CCDS12 PKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQT-RGIAA 300 310 320 330 340 350 360 pF1KE6 WFT ::: CCDS12 WFT >>CCDS12154.1 FUT5 gene_id:2527|Hs108|chr19 (374 aa) initn: 2315 init1: 2024 opt: 2051 Z-score: 2330.6 bits: 439.9 E(32554): 1.7e-123 Smith-Waterman score: 2324; 88.2% identity (92.5% similar) in 374 aa overlap (1-361:1-374) 10 20 30 40 pF1KE6 MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGSPR-----------APS ::::: ::::: ::::::.::::::::::::::::::::::::::: ::. CCDS12 MDPLGPAKPQWLWRRCLAGLLFQLLVAVCFFSYLRVSRDDATGSPRPGLMAVEPVTGAPN 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 GSSRQDT--TPTRPTLLILLWTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQADTVI :: ::. ::..::::::::::::. :::: ::::::::.:::.:::: .::::::.:: CCDS12 GSRCQDSMATPAHPTLLILLWTWPFNTPVALPRCSEMVPGAADCNITADSSVYPQADAVI 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 VHHWDIMSNPKSRLPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYRSDSDIFT ::::::: ::.. ::: ::::::::::..: : ::.:::::: :::::::::::::::: CCDS12 VHHWDIMYNPSANLPPPTRPQGQRWIWFSMESPSNCRHLEALDGYFNLTMSYRSDSDIFT 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 PYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHK 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE6 PLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPP 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE6 DAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALDFCKACWKLQQE :::::::::::::::::::::::::::::::::.:::::::::::::: ::::::::::: CCDS12 DAFIHVDDFQSPKDLARYLQELDKDHARYLSYFHWRETLRPRSFSWALAFCKACWKLQQE 310 320 330 340 350 360 350 360 pF1KE6 SRYQTVRSIAAWFT :::::::::::::: CCDS12 SRYQTVRSIAAWFT 370 >>CCDS7022.1 FUT7 gene_id:2529|Hs108|chr9 (342 aa) initn: 629 init1: 336 opt: 927 Z-score: 1054.8 bits: 203.7 E(32554): 2e-52 Smith-Waterman score: 927; 44.8% identity (69.8% similar) in 315 aa overlap (50-360:34-340) 20 30 40 50 60 70 pF1KE6 LLFQLLVAVCFFSYLRVSRDDATGSPRAPSGSSRQDTTPTRPTLLILLWTWPF-HIPVAL ::. . : .::. ::.: ::: : : CCDS70 AGHGPTRRLRGLGVLAGVALLAALWLLWLLGSAPRGTPAPQPTITILVWHWPFTDQPPEL 10 20 30 40 50 60 80 90 100 110 120 130 pF1KE6 SRCSEMVPGTADCHITADRKVYPQADTVIVHHWDIMSNPKSRLPPSPRPQGQRWIWFNLE . : : ::..:.:.. .::.:. :: .... .:.:: . ::.:: :.: ..: CCDS70 PSDTCTRYGIARCHLSANRSLLASADAVVFHHRELQTR-RSHLPLAQRPRGQPWVWASME 70 80 90 100 110 120 140 150 160 170 180 190 pF1KE6 PPPNCQHLEALDRYFNLTMSYRSDSDIFTPYGWLEP-WSGQPAHPPLNLSAKTELVAWAV : . . : : :: ..::: :::::.::: ::: :. :. ::: ::....::.: CCDS70 SPSHTHGLSHLRGIFNWVLSYRRDSDIFVPYGRLEPHWG--PS-PPL--PAKSRVAAWVV 130 140 150 160 170 200 210 220 230 240 250 pF1KE6 SNWKPDSARVRYYQSLQAHLKVDVYGRSH-KPLPKGTMMETLSRYKFYLAFENSLHPDYI ::.. . :.: :..: ::.:::.::.. .:: . .. :...:.:::.:::: : ::: CCDS70 SNFQERQLRARLYRQLAPHLRVDVFGRANGRPLCASCLVPTVAQYRFYLSFENSQHRDYI 180 190 200 210 220 230 260 270 280 290 300 310 pF1KE6 TEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARY :::.::::: : .::::::: :..:: :.: :::.::::: : ..:: .: ... .:: CCDS70 TEKFWRNALVAGTVPVVLGPPRATYEAFVPADAFVHVDDFGSARELAAFLTGMNE--SRY 240 250 260 270 280 290 320 330 340 350 360 pF1KE6 LSYFRWRETLRPRSFS-WALDFCKACWKLQQESRYQTVRSIAAWFT .: ::. :: : :. : :: : . . : :. ... .:: CCDS70 QRFFAWRDRLRVRLFTDWRERFCAICDRYPHLPRSQVYEDLEGWFQA 300 310 320 330 340 >>CCDS5033.1 FUT9 gene_id:10690|Hs108|chr6 (359 aa) initn: 769 init1: 355 opt: 873 Z-score: 993.2 bits: 192.3 E(32554): 5.3e-49 Smith-Waterman score: 873; 44.3% identity (72.0% similar) in 300 aa overlap (65-360:66-357) 40 50 60 70 80 90 pF1KE6 RVSRDDATGSPRAPSGSSRQDTTPTRPTLLILLWTWPFHIPVALSRCSEMVPGTADCHIT ::.:.::: :. :. : . ::.: CCDS50 WIFSPMESASSVLKMKNFFSTKTDYFNETTILVWVWPFGQTFDLTSCQAMF-NIQGCHLT 40 50 60 70 80 90 100 110 120 130 140 150 pF1KE6 ADRKVYPQADTVIVHHWDIMSNPKSRLPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFN .::..: .. .:..:: :: : . :: . :: :.:::.::: : . . .... :: CCDS50 TDRSLYNKSHAVLIHHRDI-SWDLTNLPQQARPPFQKWIWMNLESPTHTPQKSGIEHLFN 100 110 120 130 140 150 160 170 180 190 200 210 pF1KE6 LTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQ ::..:: :::: .:::.: : .: ... .: .:: :.::::.:. :::.::. :. CCDS50 LTLTYRRDSDIQVPYGFLTV-STNPF--VFEVPSKEKLVCWVVSNWNPEHARVKYYNELS 160 170 180 190 200 210 220 230 240 250 260 270 pF1KE6 AHLKVDVYGRSH-KPLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVV ... .::.. . . ... :.: ::::.::::.: :::::::. ::. : .:::: CCDS50 KSIEIHTYGQAFGEYVNDKNLIPTISTCKFYLSFENSIHKDYITEKLY-NAFLAGSVPVV 220 230 240 250 260 280 290 300 310 320 330 pF1KE6 LGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLR---PRS ::::: ::: ..: :.::::.:..::..::.::.:.::.. ::::: ::. . :: CCDS50 LGPSRENYENYIPADSFIHVEDYNSPSELAKYLKEVDKNNKLYLSYFNWRKDFTVNLPR- 270 280 290 300 310 320 340 350 360 pF1KE6 FSWALDFCKACWKLQQESRYQTVRSIAAWFT : : : :: .......:..: .. :: CCDS50 F-WESHACLACDHVKRHQEYKSVGNLEKWFWN 330 340 350 >>CCDS8301.1 FUT4 gene_id:2526|Hs108|chr11 (530 aa) initn: 829 init1: 401 opt: 859 Z-score: 974.9 bits: 189.5 E(32554): 5.5e-48 Smith-Waterman score: 914; 44.1% identity (65.6% similar) in 349 aa overlap (57-360:184-528) 30 40 50 60 70 80 pF1KE6 AVCFFSYLRVSRDDATGSPRAPSGSSRQDTTPTRPTLLILLWTWPF----HIPVALSRCS ::.:: . .::: :: : : CCDS83 CVLAAAGLTCTALITYACWGQLPPLPWASPTPSRP-VGVLLWWEPFGGRDSAPRPPPDC- 160 170 180 190 200 210 90 100 110 120 pF1KE6 EMVPGTADCHITADRKVYPQADTVIVHHWDIMSNPKSRLPP------------------- .. . . :.. .:: : .:..:. :: :....: . :: CCDS83 RLRFNISGCRLLTDRASYGEAQAVLFHHRDLVKGPPDWPPPWGIQAHTAEEVDLRVLDYE 220 230 240 250 260 270 130 140 150 160 170 pF1KE6 ------------SPRPQGQRWIWFNLEPPPNCQHLEAL-DRYFNLTMSYRSDSDIFTPYG :::: ::::.:.:.: : . :..: . :: :.:::.:::.:.::: CCDS83 EAAAAAEALATSSPRPPGQRWVWMNFESPSHSPGLRSLASNLFNWTLSYRADSDVFVPYG 280 290 300 310 320 330 180 190 200 210 220 pF1KE6 WLEPWSGQPAHPPLNL----SAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSH .: : : .:. :: .: : : ::::.::.: .::::::..:. :. :::.::. CCDS83 YLYPRS-HPGDPPSGLAPPLSRKQGLVAWVVSHWDERQARVRYYHQLSQHVTVDVFGRGG 340 350 360 370 380 390 230 240 250 260 270 280 pF1KE6 --KPLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERF .:.:. ...:..::::::::::: : :::::::::::: : ::::::::.:.::::: CCDS83 PGQPVPEIGLLHTVARYKFYLAFENSQHLDYITEKLWRNALLAGAVPVVLGPDRANYERF 400 410 420 430 440 450 290 300 310 320 330 340 pF1KE6 LPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRET--LRPRSFSWALDFCKACW .: :::::::: : ..:: :: ::.. : : ::.::.. .. :: : .:..: CCDS83 VPRGAFIHVDDFPSASSLASYLLFLDRNPAVYRRYFHWRRSYAVHITSF-WDEPWCRVCQ 460 470 480 490 500 350 360 pF1KE6 KLQQES-RYQTVRSIAAWFT .:. . : ...:..:.:: CCDS83 AVQRAGDRPKSIRNLASWFER 510 520 530 361 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 03:18:11 2016 done: Tue Nov 8 03:18:12 2016 Total Scan time: 2.340 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]