FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4467, 489 aa 1>>>pF1KE4467 489 - 489 aa - 489 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8176+/-0.000851; mu= 20.7861+/- 0.051 mean_var=62.9507+/-12.792, 0's: 0 Z-trim(106.4): 23 B-trim: 0 in 0/51 Lambda= 0.161649 statistics sampled from 8963 (8980) to 8963 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.645), E-opt: 0.2 (0.276), width: 16 Scan time: 3.360 The best scores are: opt bits E(32554) CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 489) 3213 758.1 4.9e-219 CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 591) 2816 665.5 4.2e-191 CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 551) 2382 564.3 1.2e-160 CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2 ( 496) 679 167.1 3.9e-41 CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1 ( 497) 641 158.2 1.8e-38 CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 ( 296) 508 127.1 2.6e-29 >>CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 (489 aa) initn: 3213 init1: 3213 opt: 3213 Z-score: 4045.8 bits: 758.1 E(32554): 4.9e-219 Smith-Waterman score: 3213; 99.8% identity (100.0% similar) in 489 aa overlap (1-489:1-489) 10 20 30 40 50 60 pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT ::::::::::::::::::::::::::.::::::::::::::::::::::::::::::::: CCDS56 MVPSSPAVEKQVPVEPGPDPELRSWRHLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 VRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGP 430 440 450 460 470 480 pF1KE4 VCPSEVCPS ::::::::: CCDS56 VCPSEVCPS >>CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 (591 aa) initn: 2845 init1: 2816 opt: 2816 Z-score: 3544.3 bits: 665.5 E(32554): 4.2e-191 Smith-Waterman score: 2816; 99.8% identity (100.0% similar) in 431 aa overlap (1-431:1-431) 10 20 30 40 50 60 pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT ::::::::::::::::::::::::::.::::::::::::::::::::::::::::::::: CCDS13 MVPSSPAVEKQVPVEPGPDPELRSWRHLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 VRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGP ::::::::::: CCDS13 VRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLRSAAEEKA 430 440 450 460 470 480 >>CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 (551 aa) initn: 2411 init1: 2382 opt: 2382 Z-score: 2997.7 bits: 564.3 E(32554): 1.2e-160 Smith-Waterman score: 2382; 95.1% identity (96.7% similar) in 391 aa overlap (41-431:1-391) 20 30 40 50 60 70 pF1KE4 QVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFTREQVTNEITP .:: . .: : . . .::::::: CCDS56 MRPQPAEPAPGGRGNEACSIHSEVTNEITP 10 20 30 80 90 100 110 120 130 pF1KE4 VLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQLMELFYSVTM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQLMELFYSVTM 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE4 AARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRVSFSTLNYISL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRVSFSTLNYISL 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE4 AFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGHALRVACGDSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGHALRVACGDSV 160 170 180 190 200 210 260 270 280 290 300 310 pF1KE4 LARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 LARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAAD 220 230 240 250 260 270 320 330 340 350 360 370 pF1KE4 AASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 AASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFV 280 290 300 310 320 330 380 390 400 410 420 430 pF1KE4 LFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 LFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRK 340 350 360 370 380 390 440 450 460 470 480 pF1KE4 QNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGPVCPSEVCPS : CCDS56 QFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLRSAAEEKAAQALSVQDKG 400 410 420 430 440 450 >>CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2 (496 aa) initn: 1169 init1: 676 opt: 679 Z-score: 852.0 bits: 167.1 E(32554): 3.9e-41 Smith-Waterman score: 1156; 43.1% identity (72.9% similar) in 432 aa overlap (24-431:11-438) 10 20 30 40 50 60 pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT :: . ::..::....::.: :. ::: :::::.: CCDS24 MDCYRTSLSSSWIYPTVILCLFGFFSMMRPSEPFLIPYLSGPDKNLT 10 20 30 40 70 80 90 100 110 120 pF1KE4 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ ..:::: :: .::::..:.:::.::::.:: ::..:::.::. .:::::.:..: :: CCDS24 SAEITNEIFPVWTYSYLVLLLPVFVLTDYVRYKPVIILQGISFIITWLLLLFGQGVKTMQ 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE4 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV ..:.::... ::..:: .::.:.: : .::::.:: :...: . ..:::.::::... . CCDS24 VVEFFYGMVTAAEVAYYAYIYSVVSPEHYQRVSGYCRSVTLAAYTAGSVLAQLLVSLANM 110 120 130 140 150 160 190 200 210 220 230 pF1KE4 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASE---LERMNPG--PG :. :: :::: .. . ...::: ::.:.::. . . :.: ::. . : :: CCDS24 SYFYLNVISLASVSVAFLFSLFLPMPKKSMFFHAKPSREIKKSSSVNPVLEETHEGEAPG 170 180 190 200 210 220 240 250 260 270 pF1KE4 --------------GKLGHALRVACGDS-----VLARMLRELGDSLRRPQLRLWSLWWVF :::... . : :... ...: . .: :::::.: CCDS24 CEEQKPTSEILSTSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKECYSSKRLFYWSLWWAF 230 240 250 260 270 280 280 290 300 310 320 330 pF1KE4 NSAGYYLVVYYVHILWNEVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSK .::. :. ::.:::. :. .:. .::::..: .:. ::...::.:.::. : .. CCDS24 ATAGFNQVLNYVQILWDYKAPSQDSS-IYNGAVEAIATFGGAVAAFAVGYVKVNWDLLGE 290 300 310 320 330 340 340 350 360 370 380 390 pF1KE4 LLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELC : .. ....:: .::. .: ..:: :::....:..::..:. ::.:::: .:. : CCDS24 LALVVFSVVNAGSLFLMHYT---ANIWACYAGYLIFKSSYMLLITIAVFQIAVNLNVERY 350 360 370 380 390 400 400 410 420 430 440 450 pF1KE4 ALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSS :::::.:::.: ...::.: :: : :::.::: : CCDS24 ALVFGINTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVYGSYFAVIAGIFLMRSMYITYS 410 420 430 440 450 460 460 470 480 pF1KE4 EGSSGSGPRSWFLSPTLRAALHGPVCPSEVCPS CCDS24 TKSQKDVQSPAPSENPDVSHPEEESNIIMSTKL 470 480 490 >>CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1 (497 aa) initn: 1118 init1: 638 opt: 641 Z-score: 804.0 bits: 158.2 E(32554): 1.8e-38 Smith-Waterman score: 1075; 42.3% identity (71.8% similar) in 433 aa overlap (25-431:30-455) 10 20 30 40 50 pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGP : . :: :::.:..::.: :.::::::: CCDS12 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE4 DKNFTREQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHS :::.:...: ::: :: .::::..: :::: :::::: ::.::::::.. .:..:: ... CCDS12 DKNLTEREVFNEIYPVWTYSYLVLLFPVFLATDYLRYKPVVLLQGLSLIVTWFMLLYAQG 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE4 VAHMQLMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLV . .:..:.::... :..::: :::.:.: . ::.:..: :.:.:.: ..:::::.:: CCDS12 LLAIQFLEFFYGIATATEIAYYSYIYSVVDLGMYQKVTSYCRSATLVGFTVGSVLGQILV 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE4 TVGRVSFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPG-- .:. :. .:: :::. .. . ..: :: :..::::.. . :. .. .. .: : CCDS12 SVAGWSLFSLNVISLTCVSVAFAVAWFLPMPQKSLFFHHIP-STCQR-VNGIKVQNGGIV 190 200 210 220 230 240 250 260 pF1KE4 ---------PG-----GKLGHALR---VACGDSVLARML--RELGDSL-----RRPQLRL :: .:. .. : . :.: . : ... :: : CCDS12 TDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWNDFLMCYSSRPLL-C 240 250 260 270 280 290 270 280 290 300 310 320 pF1KE4 WSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKI ::.::.... ::. :: :.. ::..: :. .: .:::...:.::::::.. ::.:..:: CCDS12 WSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKI 300 310 320 330 340 350 330 340 350 360 370 380 pF1KE4 RWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIAS :. :... .. . :. :... . ..::.:::..:.:: :..:. :::::::. CCDS12 SWSTWGEMTLSLFSLLIAAAVYIMDTV---GNIWVCYASYVVFRIIYMLLITIATFQIAA 360 370 380 390 400 410 390 400 410 420 430 440 pF1KE4 SLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRL .:: : :::::::::.: ..:..:.:: :. :::: . : CCDS12 NLSMERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLAS 420 430 440 450 460 470 450 460 470 480 pF1KE4 AADTLSSEGSSGSGPRSWFLSPTLRAALHGPVCPSEVCPS CCDS12 GAVSVMKKCRKLEDPQSSSQVTTS 480 490 >>CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 (296 aa) initn: 638 init1: 260 opt: 508 Z-score: 639.6 bits: 127.1 E(32554): 2.6e-29 Smith-Waterman score: 508; 45.2% identity (76.8% similar) in 168 aa overlap (264-431:92-254) 240 250 260 270 280 290 pF1KE4 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN :: : ::.::.... ::. :: :.. ::. CCDS81 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE 70 80 90 100 110 120 300 310 320 330 340 350 pF1KE4 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL .: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :... CCDS81 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM 130 140 150 160 170 360 370 380 390 400 410 pF1KE4 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI . ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:. CCDS81 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL 180 190 200 210 220 230 420 430 440 450 460 470 pF1KE4 ITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTL .:.:: :. :::: . : CCDS81 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS 240 250 260 270 280 290 489 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:30:20 2016 done: Sun Nov 6 00:30:21 2016 Total Scan time: 3.360 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]