FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6287, 296 aa
  1>>>pF1KE6287 296 - 296 aa - 296 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.8770+/-0.000941; mu= 17.7316+/- 0.057
 mean_var=62.1588+/-13.044, 0's: 0 Z-trim(104.6): 21 B-trim: 0 in 0/45
 Lambda= 0.162676
 statistics sampled from 7962 (7973) to 7962 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.604), E-opt: 0.2 (0.245), width: 16
 Scan time: 1.920

The best scores are:                                 opt  bits E(32554)
CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 ( 296) 1933 462.3  2e-130
CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1  ( 497) 1479 355.9 3.6e-98
CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2  ( 496)  767 188.8 7.1e-48
CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 551)  578 144.4 1.8e-34
CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 591)  578 144.5 1.9e-34
CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 489)  508 128.0 1.4e-29

>>CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 (296 aa)
 initn: 1933 init1: 1933 opt: 1933 Z-score: 2455.1 bits: 462.3 E(32554): 2e-130
Smith-Waterman score: 1933; 100.0% identity (100.0% similar) in 296 aa overlap (1-296:1-296)

               10        20        30        40        50        60
pF1KE6 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 DKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 KVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 KVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIMD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 TVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTLLTLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 TVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTLLTLI
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE6 VVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 VVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
              250       260       270       280       290

>>CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1 (497 aa)
 initn: 1479 init1: 1479 opt: 1479 Z-score: 1876.0 bits: 355.9 E(32554): 3.6e-98
Smith-Waterman score: 1479; 99.1% identity (99.6% similar) in 231 aa overlap (66-296:267-497)

          40        50        60        70        80        90
pF1KE6 LLCAYGFFASLRPSEPFLTPYLLGPDKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL
                                     :. :::::::::::::::::::::::::::
CCDS12 IVTDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWNDFLMCYSSRPLL
        240       250       260       270       280       290

         100       110       120       130       140       150
pF1KE6 CWSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 CWSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKI
        300       310       320       330       340       350

         160       170       180       190       200       210
pF1KE6 SWSTWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SWSTWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLS
        360       370       380       390       400       410

         220       230       240       250       260       270
pF1KE6 MERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAV
        420       430       440       450       460       470

         280       290
pF1KE6 SVMKKCRKLEDPQSSSQVTTS
       :::::::::::::::::::::
CCDS12 SVMKKCRKLEDPQSSSQVTTS
        480       490

>--
 initn: 494 init1: 459 opt: 486 Z-score: 616.5 bits: 122.8 E(32554): 5.1e-28
Smith-Waterman score: 486; 44.8% identity (64.2% similar) in 232 aa overlap (1-223:1-217)

               10        20
30 40 50 60 pF1KE6 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 DKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWE :::::::: . . .:. :: :: . :. : . : :: ::: CCDS12 DKNLTEREVFNE-----IYPVWT-----YSYLVLL-FPVFLATDYLRYKPVV-LLQGL-- 70 80 90 100 130 140 150 160 170 pF1KE6 KVMPSRYAAIYNGGVEAVSTL---LGAVAVFAVGYIKISWSTWGE-MTLSLFSLLIAAAV ... . . .: :. :.. : : ... ..: . .:. : .. : .:.. CCDS12 SLIVTWFMLLYAQGLLAIQFLEFFYGIATATEIAYYSYIYSVVDLGMYQKVTSYCRSATL 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE6 Y---IMDTVGNIWVCYASYVVFR--IIYMLLITIATFQIAANLSMERYALVFGVNTFIAL . ...:.: : :.. .: .: . ...: : .: : : . .: : CCDS12 VGFTVGSVLGQILVSVAGWSLFSLNVISLTCVSVA-FAVAWFLPMPQKSLFFHHIPSTCQ 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE6 ALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSS CCDS12 RVNGIKVQNGGIVTDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWND 230 240 250 260 270 280 >>CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2 (496 aa) initn: 918 init1: 747 opt: 767 Z-score: 972.9 bits: 188.8 E(32554): 7.1e-48 Smith-Waterman score: 767; 51.6% identity (81.9% similar) in 221 aa overlap (71-289:253-472) 50 60 70 80 90 pF1KE6 GFFASLRPSEPFLTPYLLGPDKNLTEREEPKPDRLLV-LKVLW-NDFLMCYSSRPLLCWS ::. . : . : : .:. ::::. :. :: CCDS24 GEAPGCEEQKPTSEILSTSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKECYSSKRLFYWS 230 240 250 260 270 280 100 110 120 130 140 150 pF1KE6 VWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWS .:::..: :. ::.::.: ::. ::. ..::::.:::..:. ::::.:::::.:..:. CCDS24 LWWAFATAGFNQVLNYVQILWDYKAPSQDSSIYNGAVEAIATFGGAVAAFAVGYVKVNWD 290 300 310 320 330 340 160 170 180 190 200 210 pF1KE6 TWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMER ::..: .::.. :.....: ..:::.:::.:..:. 
::::::::.::::.::..:: CCDS24 LLGELALVVFSVVNAGSLFLMHYTANIWACYAGYLIFKSSYMLLITIAVFQIAVNLNVER 350 360 370 380 390 400 220 230 240 250 260 270 pF1KE6 YALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVM ::::::.::::::..::..:.:::: ::.: .. :::.:.::::.:: .:: .. . CCDS24 YALVFGINTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVYGSYFAVIAGIFLMR-SMYIT 410 420 430 440 450 460 280 290 pF1KE6 KKCRKLEDPQSSSQVTTS . .. .: :: CCDS24 YSTKSQKDVQSPAPSENPDVSHPEEESNIIMSTKL 470 480 490 >>CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 (551 aa) initn: 541 init1: 307 opt: 578 Z-score: 732.6 bits: 144.4 E(32554): 1.8e-34 Smith-Waterman score: 578; 40.6% identity (76.7% similar) in 202 aa overlap (92-288:224-425) 70 80 90 100 110 120 pF1KE6 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE :: : ::.::.... ::. :: :.. ::. CCDS56 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN 200 210 220 230 240 250 130 140 150 160 170 pF1KE6 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM .: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :... CCDS56 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL 260 270 280 290 300 310 180 190 200 210 220 230 pF1KE6 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL . ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:. CCDS56 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI 320 330 340 350 360 370 240 250 260 270 280 290 pF1KE6 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS .:.:: :. :::: . :: .:. :: ....... .. .. ...:.. . :. 
CCDS56 ITFIVSDVRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLR 380 390 400 410 420 430 CCDS56 SAAEEKAAQALSVQDKGLGGLQPAQSPPLSPEDSLGAVGPASLEQRQSDPYLAQAPAPQA 440 450 460 470 480 490 >>CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 (591 aa) initn: 685 init1: 307 opt: 578 Z-score: 732.1 bits: 144.5 E(32554): 1.9e-34 Smith-Waterman score: 578; 40.6% identity (76.7% similar) in 202 aa overlap (92-288:264-465) 70 80 90 100 110 120 pF1KE6 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE :: : ::.::.... ::. :: :.. ::. CCDS13 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN 240 250 260 270 280 290 130 140 150 160 170 pF1KE6 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM .: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :... CCDS13 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL 300 310 320 330 340 350 180 190 200 210 220 230 pF1KE6 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL . ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:. CCDS13 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI 360 370 380 390 400 410 240 250 260 270 280 290 pF1KE6 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS .:.:: :. :::: . :: .:. :: ....... .. .. ...:.. . :. CCDS13 ITFIVSDVRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLR 420 430 440 450 460 470 CCDS13 SAAEEKAAQALSVQDKGLGGLQPAQSPPLSPEDSLGAVGPASLEQRQSDPYLAQAPAPQA 480 490 500 510 520 530 >>CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 (489 aa) initn: 638 init1: 260 opt: 508 Z-score: 644.5 bits: 128.0 E(32554): 1.4e-29 Smith-Waterman score: 508; 45.2% identity (76.8% similar) in 168 aa overlap (92-254:264-431) 70 80 90 100 110 120 pF1KE6 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE :: : ::.::.... ::. :: :.. ::. CCDS56 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN 240 250 260 270 280 290 130 140 150 160 170 pF1KE6 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM .: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. 
:... CCDS56 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL 300 310 320 330 340 350 180 190 200 210 220 230 pF1KE6 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL . ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:. CCDS56 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI 360 370 380 390 400 410 240 250 260 270 280 290 pF1KE6 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS .:.:: :. :::: . : CCDS56 ITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTL 420 430 440 450 460 470

296 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:48:11 2016 done: Tue Nov 8 11:48:12 2016
 Total Scan time: 1.920 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]