FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2614, 530 aa 1>>>pF1KE2614 530 - 530 aa - 530 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5042+/-0.000898; mu= 17.6030+/- 0.054 mean_var=69.6520+/-14.192, 0's: 0 Z-trim(106.2): 23 B-trim: 0 in 0/51 Lambda= 0.153677 statistics sampled from 8843 (8854) to 8843 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.629), E-opt: 0.2 (0.272), width: 16 Scan time: 3.200 The best scores are: opt bits E(32554) CCDS5340.1 SLC29A4 gene_id:222962|Hs108|chr7 ( 530) 3540 794.1 0 CCDS75561.1 SLC29A4 gene_id:222962|Hs108|chr7 ( 516) 2355 531.4 9.8e-151 CCDS8137.1 SLC29A2 gene_id:3177|Hs108|chr11 ( 456) 272 69.5 9.3e-12 CCDS4908.1 SLC29A1 gene_id:2030|Hs108|chr6 ( 456) 263 67.5 3.7e-11 >>CCDS5340.1 SLC29A4 gene_id:222962|Hs108|chr7 (530 aa) initn: 3540 init1: 3540 opt: 3540 Z-score: 4239.4 bits: 794.1 E(32554): 0 Smith-Waterman score: 3540; 100.0% identity (100.0% similar) in 530 aa overlap (1-530:1-530) 10 20 30 40 50 60 pF1KE2 MGSVGSQRLEEPSVAGTPDPGVVMSFTFDSHQLEEAAEAAQGQGLRARGVPAFTDTTLDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MGSVGSQRLEEPSVAGTPDPGVVMSFTFDSHQLEEAAEAAQGQGLRARGVPAFTDTTLDE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 PVPDDRYHAIYFAMLLAGVGFLLPYNSFITDVDYLHHKYPGTSIVFDMSLTYILVALAAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 PVPDDRYHAIYFAMLLAGVGFLLPYNSFITDVDYLHHKYPGTSIVFDMSLTYILVALAAV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LLNNVLVERLTLHTRITAGYLLALGPLLFISICDVWLQLFSRDQAYAINLAAVGTVAFGC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 LLNNVLVERLTLHTRITAGYLLALGPLLFISICDVWLQLFSRDQAYAINLAAVGTVAFGC 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 TVQQSSFYGYTGMLPKRYTQGVMTGESTAGVMISLSRILTKLLLPDERASTLIFFLVSVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TVQQSSFYGYTGMLPKRYTQGVMTGESTAGVMISLSRILTKLLLPDERASTLIFFLVSVA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 LELLCFLLHLLVRRSRFVLFYTTRPRDSHRGRPGLGRGYGYRVHHDVVAGDVHFEHPAPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 LELLCFLLHLLVRRSRFVLFYTTRPRDSHRGRPGLGRGYGYRVHHDVVAGDVHFEHPAPA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE2 LAPNESPKDSPAHEVTGSGGAYMRFDVPRPRVQRSWPTFRALLLHRYVVARVIWADMLSI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 LAPNESPKDSPAHEVTGSGGAYMRFDVPRPRVQRSWPTFRALLLHRYVVARVIWADMLSI 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE2 AVTYFITLCLFPGLESEIRHCILGEWLPILIMAVFNLSDFVGKILAALPVDWRGTHLLAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 AVTYFITLCLFPGLESEIRHCILGEWLPILIMAVFNLSDFVGKILAALPVDWRGTHLLAC 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE2 SCLRVVFIPLFILCVYPSGMPALRHPAWPCIFSLLMGISNGYFGSVPMILAAGKVSPKQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 SCLRVVFIPLFILCVYPSGMPALRHPAWPCIFSLLMGISNGYFGSVPMILAAGKVSPKQR 430 440 450 460 470 480 490 500 510 520 530 pF1KE2 ELAGNTMTVSYMSGLTLGSAVAYCTYSLTRDAHGSCLHASTANGSILAGL :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 ELAGNTMTVSYMSGLTLGSAVAYCTYSLTRDAHGSCLHASTANGSILAGL 490 500 510 520 530 >>CCDS75561.1 SLC29A4 gene_id:222962|Hs108|chr7 (516 aa) initn: 3234 init1: 2355 opt: 2355 Z-score: 2819.7 bits: 531.4 E(32554): 9.8e-151 Smith-Waterman score: 3206; 92.6% identity (94.2% similar) in 530 aa overlap (1-530:1-516) 10 20 30 40 50 60 pF1KE2 MGSVGSQRLEEPSVAGTPDPGVVMSFTFDSHQLEEAAEAAQGQGLRARGVPAFTDTTLDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MGSVGSQRLEEPSVAGTPDPGVVMSFTFDSHQLEEAAEAAQGQGLRARGVPAFTDTTLDE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 PVPDDRYHAIYFAMLLAGVGFLLPYNSFITDVDYLHHKYPGTSIVFDMSLTYILVALAAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 PVPDDRYHAIYFAMLLAGVGFLLPYNSFITDVDYLHHKYPGTSIVFDMSLTYILVALAAV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 LLNNVLVERLTLHTRITAGYLLALGPLLFISICDVWLQLFSRDQAYAINLAAVGTVAFGC ::::::::::::::::::. . : .: .: . : : . CCDS75 LLNNVLVERLTLHTRITAAS----------ATCGCSSSLGTRPTPSTWPLWA----PWPS 130 140 150 160 190 200 210 220 230 240 pF1KE2 TVQQSSFYGYTGMLPKRYTQGVMTGESTAGVMISLSRILTKLLLPDERASTLIFFLVSVA ..:::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 AAQQSSFYGYTGMLPKRYTQGVMTGESTAGVMISLSRILTKLLLPDERASTLIFFLVSVA 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE2 LELLCFLLHLLVRRSRFVLFYTTRPRDSHRGRPGLGRGYGYRVHHDVVAGDVHFEHPAPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LELLCFLLHLLVRRSRFVLFYTTRPRDSHRGRPGLGRGYGYRVHHDVVAGDVHFEHPAPA 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE2 LAPNESPKDSPAHEVTGSGGAYMRFDVPRPRVQRSWPTFRALLLHRYVVARVIWADMLSI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LAPNESPKDSPAHEVTGSGGAYMRFDVPRPRVQRSWPTFRALLLHRYVVARVIWADMLSI 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE2 AVTYFITLCLFPGLESEIRHCILGEWLPILIMAVFNLSDFVGKILAALPVDWRGTHLLAC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 AVTYFITLCLFPGLESEIRHCILGEWLPILIMAVFNLSDFVGKILAALPVDWRGTHLLAC 350 360 370 380 390 400 430 440 450 460 470 480 pF1KE2 SCLRVVFIPLFILCVYPSGMPALRHPAWPCIFSLLMGISNGYFGSVPMILAAGKVSPKQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 SCLRVVFIPLFILCVYPSGMPALRHPAWPCIFSLLMGISNGYFGSVPMILAAGKVSPKQR 410 420 430 440 450 460 490 500 510 520 530 pF1KE2 ELAGNTMTVSYMSGLTLGSAVAYCTYSLTRDAHGSCLHASTANGSILAGL :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ELAGNTMTVSYMSGLTLGSAVAYCTYSLTRDAHGSCLHASTANGSILAGL 470 480 490 500 510 >>CCDS8137.1 SLC29A2 gene_id:3177|Hs108|chr11 (456 aa) initn: 349 init1: 128 opt: 272 Z-score: 324.6 bits: 69.5 E(32554): 9.3e-12 Smith-Waterman score: 447; 26.1% identity (54.4% similar) in 476 aa overlap (63-503:7-450) 40 50 60 70 80 90 pF1KE2 LEEAAEAAQGQGLRARGVPAFTDTTLDEPVPDDRYHAIYFAMLLAGVGFLLPYNSFITDV : : :: . ..... :.: :::.: ::: . CCDS81 MARGDAPRDSYHLVGISFFILGLGTLLPWNFFITAI 10 20 30 100 110 120 130 pF1KE2 DYLH-----------------HKYPGTSIVFD--MSLTYILVALAAVLLNNVLVERLTLH :.. : : .. :. ..: : : .:::. : . . CCDS81 PYFQARLAGAGNSTARILSTNHTGPEDAFNFNNWVTLLSQLPLLLFTLLNSFLYQCVPET 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE2 TRITAGYLLALGPLLFISICDVWLQL-FSRDQAYAINLAAVGTVAFGCTVQQSSFYGYTG .:: : :::. ::.... . ... .: ..:..:.: . .: :.:..: : CCDS81 VRIL-GSLLAI--LLLFALTAALVKVDMSPGPFFSITMASVCFINSFSAVLQGSLFGQLG 100 110 120 130 140 150 200 210 220 230 240 pF1KE2 MLPKRYTQGVMTGESTAGVMISLSRILTKLLLPDERASTLIFFL---VSVALELLCFLLH .:. :. ..:.. ::.. .:. .:. : ..:.: .:. :.. . ..:.: CCDS81 TMPSTYSTLFLSGQGLAGIFAALAMLLSMASGVDAETSALGYFITPCVGILMSIVCYL-- 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE2 LLVRRSRFVLFYTTRPRDSHRGRPGLGRGYGYRVHHDVVAGDVHFEHPAPALAPNESPKD . : : . : .... : ..: : : .:. CCDS81 ---------------------SLPHLKFARYYLANKSSQAQAQELETKAELLQSDENGIP 220 230 240 250 310 320 330 340 350 360 pF1KE2 SPAHEVTGSGGAYMRFDVPR-PRVQRSWPTFRALLLHRYVVARVIWADMLSIAVTYFITL : ..:. . . .:. . :. . . : . ..: . :: : ..... .:: CCDS81 SSPQKVALT----LDLDLEKEPESEPDEPQ-KPGKPSVFTVFQKIWLTALCLVLVFTVTL 260 270 280 290 300 370 380 390 400 410 420 pF1KE2 CLFPGLESEIRHCIL-GEWL----PILIMAVFNLSDFVGKILAALPVDW--RGTHLLAC- .::.. . . :.: :: . .::. :..:. :.. . : . ..:: CCDS81 SVFPAITAMVTSSTSPGKWSQFFNPICCFLLFNIMDWLGRSLTSYFL-WPDEDSRLLPLL 310 320 330 340 350 360 430 440 450 460 470 pF1KE2 SCLRVVFIPLFILCVYP--SGMPALR-HPAWPCIFSLLMGISNGYFGSVPMILAAGKVSP ::: .:.:::.:: : : .: : . :. : ::...::::. :. : :: .: : CCDS81 VCLRFLFVPLFMLCHVPQRSRLPILFPQDAYFITFMLLFAVSNGYLVSLTMCLAPRQVLP 370 380 390 400 410 420 480 490 500 510 520 530 pF1KE2 KQRELAGNTMTVSYMSGLTLGSAVAYCTYSLTRDAHGSCLHASTANGSILAGL ..::.:: :: ::. :..... CCDS81 HEREVAGALMTFFLALGLSCGASLSFLFKALL 430 440 450 >>CCDS4908.1 SLC29A1 gene_id:2030|Hs108|chr6 (456 aa) initn: 340 init1: 114 opt: 263 Z-score: 313.8 bits: 67.5 E(32554): 3.7e-11 Smith-Waterman score: 382; 24.9% identity (54.3% similar) in 486 aa overlap (63-503:7-450) 40 50 60 70 80 90 pF1KE2 LEEAAEAAQGQGLRARGVPAFTDTTLDEPVPDDRYHAIYFAMLLAGVGFLLPYNSFITDV :.:::.:... ... :.: :::.: :.: . CCDS49 MTTSHQPQDRYKAVWLIFFMLGLGTLLPWNFFMTAT 10 20 30 100 110 120 pF1KE2 DYLHHKY----------------------PGT--------SIVFD--MSLTYILVALAAV .:. .. :.. : .:. :.: .: : . CCDS49 QYFTNRLDMSQNVSLVTAELSKDAQASAAPAAPLPERNSLSAIFNNVMTLCAMLPLLLFT 40 50 60 70 80 90 130 140 150 160 170 pF1KE2 LLNNVLVERLTLHTRITAGYLLALGPLLFISICDVWLQLFSRDQAYAINLAAVGTV-AFG ::. : .:. .:: : :.:. ...:. : .:: . ..:.. . . .:: CCDS49 YLNSFLHQRIPQSVRIL-GSLVAILLVFLITAILVKVQL-DALPFFVITMIKIVLINSFG 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE2 CTVQQSSFYGYTGMLPKRYTQGVMTGESTAGVMISLSRILTKLLLPDERASTLIFFLVSV .. :.:..: .:.:: :: .:.:.. :: . :.. : . . :.. .:... CCDS49 -AILQGSLFGLAGLLPASYTAPIMSGQGLAGFFASVAMICAIASGSELSESAFGYFITAC 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE2 ALELLCFLLHLLVRRSRFVLFYTTRPRDSHRGRPGLGRGYGYRVHHDVVAGDVHFEHPAP :. .: .. .: . : .: .: .. :: ... :... :.: CCDS49 AVIILTIICYLGLPRLEFYRYYQQLKLEG----PGE-----QETKLDLISKG---EEP-- 220 230 240 250 300 310 320 330 340 350 pF1KE2 ALAPNESPKDSPAHEVTGSGGAYMRFDVPRPRVQRSWPTFRALLLHRYVVARVIWADMLS .. :. . :..: .: . ...:.: . :.: .: CCDS49 -----RAGKEESGVSVSNS----------QPTNESH--SIKAILKNISVLA-------FS 260 270 280 290 360 370 380 390 400 410 pF1KE2 IAVTYFITLCLFPGLESEIRHCILGE--W----LPILIMAVFNLSDFVGKILAALPVDWR . . ::. .::.. :.. : : : .:. . .::. :..:. :.:. . : CCDS49 VCFIFTITIGMFPAVTVEVKSSIAGSSTWERYFIPVSCFLTFNIFDWLGRSLTAVFM-WP 300 310 320 330 340 350 420 430 440 450 460 pF1KE2 G--THLLACSCL-RVVFIPLFILC-VYPSGM--PALRHPAWPCIFSLLMGISNGYFGSVP : .. : : :.::.::..:: . : . ...: :: .: ...::::..:. CCDS49 GKDSRWLPSLVLARLVFVPLLLLCNIKPRRYLTVVFEHDAWFIFFMAAFAFSNGYLASLC 360 370 380 390 400 410 470 480 490 500 510 520 pF1KE2 MILAAGKVSPKQRELAGNTMTVSYMSGLTLGSAVAYCTYSLTRDAHGSCLHASTANGSIL : .. ::.: . : :: :. ::.::.. .. CCDS49 MCFGPKKVKPAEAETAGAIMAFFLCLGLALGAVFSFLFRAIV 420 430 440 450 530 pF1KE2 AGL 530 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 17:08:04 2016 done: Tue Nov 8 17:08:04 2016 Total Scan time: 3.200 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]