FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6287, 296 aa
1>>>pF1KE6287 296 - 296 aa - 296 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8770+/-0.000941; mu= 17.7316+/- 0.057
mean_var=62.1588+/-13.044, 0's: 0 Z-trim(104.6): 21 B-trim: 0 in 0/45
Lambda= 0.162676
statistics sampled from 7962 (7973) to 7962 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.604), E-opt: 0.2 (0.245), width: 16
Scan time: 1.920
The best scores are:                                 opt  bits E(32554)
CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 ( 296) 1933 462.3  2e-130
CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1  ( 497) 1479 355.9 3.6e-98
CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2  ( 496)  767 188.8 7.1e-48
CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 551)  578 144.4 1.8e-34
CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 591)  578 144.5 1.9e-34
CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 489)  508 128.0 1.4e-29
>>CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 (296 aa)
initn: 1933 init1: 1933 opt: 1933 Z-score: 2455.1 bits: 462.3 E(32554): 2e-130
Smith-Waterman score: 1933; 100.0% identity (100.0% similar) in 296 aa overlap (1-296:1-296)
               10        20        30        40        50        60
pF1KE6 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 DKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 KVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 KVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIMD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 TVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTLLTLI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 TVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTLLTLI
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE6 VVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 VVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
              250       260       270       280       290
>>CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1 (497 aa)
initn: 1479 init1: 1479 opt: 1479 Z-score: 1876.0 bits: 355.9 E(32554): 3.6e-98
Smith-Waterman score: 1479; 99.1% identity (99.6% similar) in 231 aa overlap (66-296:267-497)
          40        50        60        70        80        90
pF1KE6 LLCAYGFFASLRPSEPFLTPYLLGPDKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL
                                     :. :::::::::::::::::::::::::::
CCDS12 IVTDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWNDFLMCYSSRPLL
        240       250       260       270       280       290

         100       110       120       130       140       150
pF1KE6 CWSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 CWSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKI
        300       310       320       330       340       350

         160       170       180       190       200       210
pF1KE6 SWSTWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SWSTWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLS
        360       370       380       390       400       410

         220       230       240       250       260       270
pF1KE6 MERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAV
        420       430       440       450       460       470

         280       290
pF1KE6 SVMKKCRKLEDPQSSSQVTTS
       :::::::::::::::::::::
CCDS12 SVMKKCRKLEDPQSSSQVTTS
        480       490
>--
initn: 494 init1: 459 opt: 486 Z-score: 616.5 bits: 122.8 E(32554): 5.1e-28
Smith-Waterman score: 486; 44.8% identity (64.2% similar) in 232 aa overlap (1-223:1-217)
10 20 30 40 50 60
pF1KE6 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 DKNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLLCWSVWWALSTCGYFQVVNYTQGLWE
:::::::: . . .:. :: :: . :. : . : :: :::
CCDS12 DKNLTEREVFNE-----IYPVWT-----YSYLVLL-FPVFLATDYLRYKPVV-LLQGL--
70 80 90 100
130 140 150 160 170
pF1KE6 KVMPSRYAAIYNGGVEAVSTL---LGAVAVFAVGYIKISWSTWGE-MTLSLFSLLIAAAV
... . . .: :. :.. : : ... ..: . .:. : .. : .:..
CCDS12 SLIVTWFMLLYAQGLLAIQFLEFFYGIATATEIAYYSYIYSVVDLGMYQKVTSYCRSATL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE6 Y---IMDTVGNIWVCYASYVVFR--IIYMLLITIATFQIAANLSMERYALVFGVNTFIAL
. ...:.: : :.. .: .: . ...: : .: : : . .: :
CCDS12 VGFTVGSVLGQILVSVAGWSLFSLNVISLTCVSVA-FAVAWFLPMPQKSLFFHHIPSTCQ
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE6 ALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSS
CCDS12 RVNGIKVQNGGIVTDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWND
230 240 250 260 270 280
>>CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2 (496 aa)
initn: 918 init1: 747 opt: 767 Z-score: 972.9 bits: 188.8 E(32554): 7.1e-48
Smith-Waterman score: 767; 51.6% identity (81.9% similar) in 221 aa overlap (71-289:253-472)
50 60 70 80 90
pF1KE6 GFFASLRPSEPFLTPYLLGPDKNLTEREEPKPDRLLV-LKVLW-NDFLMCYSSRPLLCWS
::. . : . : : .:. ::::. :. ::
CCDS24 GEAPGCEEQKPTSEILSTSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKECYSSKRLFYWS
230 240 250 260 270 280
100 110 120 130 140 150
pF1KE6 VWWALSTCGYFQVVNYTQGLWEKVMPSRYAAIYNGGVEAVSTLLGAVAVFAVGYIKISWS
.:::..: :. ::.::.: ::. ::. ..::::.:::..:. ::::.:::::.:..:.
CCDS24 LWWAFATAGFNQVLNYVQILWDYKAPSQDSSIYNGAVEAIATFGGAVAAFAVGYVKVNWD
290 300 310 320 330 340
160 170 180 190 200 210
pF1KE6 TWGEMTLSLFSLLIAAAVYIMDTVGNIWVCYASYVVFRIIYMLLITIATFQIAANLSMER
::..: .::.. :.....: ..:::.:::.:..:. ::::::::.::::.::..::
CCDS24 LLGELALVVFSVVNAGSLFLMHYTANIWACYAGYLIFKSSYMLLITIAVFQIAVNLNVER
350 360 370 380 390 400
220 230 240 250 260 270
pF1KE6 YALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVM
::::::.::::::..::..:.:::: ::.: .. :::.:.::::.:: .:: .. .
CCDS24 YALVFGINTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVYGSYFAVIAGIFLMR-SMYIT
410 420 430 440 450 460
280 290
pF1KE6 KKCRKLEDPQSSSQVTTS
. .. .: ::
CCDS24 YSTKSQKDVQSPAPSENPDVSHPEEESNIIMSTKL
470 480 490
>>CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 (551 aa)
initn: 541 init1: 307 opt: 578 Z-score: 732.6 bits: 144.4 E(32554): 1.8e-34
Smith-Waterman score: 578; 40.6% identity (76.7% similar) in 202 aa overlap (92-288:224-425)
70 80 90 100 110 120
pF1KE6 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE
:: : ::.::.... ::. :: :.. ::.
CCDS56 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN
200 210 220 230 240 250
130 140 150 160 170
pF1KE6 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM
.: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :...
CCDS56 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL
260 270 280 290 300 310
180 190 200 210 220 230
pF1KE6 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL
. ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:.
CCDS56 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI
320 330 340 350 360 370
240 250 260 270 280 290
pF1KE6 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
.:.:: :. :::: . :: .:. :: ....... .. .. ...:.. . :.
CCDS56 ITFIVSDVRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLR
380 390 400 410 420 430
CCDS56 SAAEEKAAQALSVQDKGLGGLQPAQSPPLSPEDSLGAVGPASLEQRQSDPYLAQAPAPQA
440 450 460 470 480 490
>>CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 (591 aa)
initn: 685 init1: 307 opt: 578 Z-score: 732.1 bits: 144.5 E(32554): 1.9e-34
Smith-Waterman score: 578; 40.6% identity (76.7% similar) in 202 aa overlap (92-288:264-465)
70 80 90 100 110 120
pF1KE6 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE
:: : ::.::.... ::. :: :.. ::.
CCDS13 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN
240 250 260 270 280 290
130 140 150 160 170
pF1KE6 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM
.: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :...
CCDS13 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL
300 310 320 330 340 350
180 190 200 210 220 230
pF1KE6 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL
. ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:.
CCDS13 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI
360 370 380 390 400 410
240 250 260 270 280 290
pF1KE6 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
.:.:: :. :::: . :: .:. :: ....... .. .. ...:.. . :.
CCDS13 ITFIVSDVRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLR
420 430 440 450 460 470
CCDS13 SAAEEKAAQALSVQDKGLGGLQPAQSPPLSPEDSLGAVGPASLEQRQSDPYLAQAPAPQA
480 490 500 510 520 530
>>CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 (489 aa)
initn: 638 init1: 260 opt: 508 Z-score: 644.5 bits: 128.0 E(32554): 1.4e-29
Smith-Waterman score: 508; 45.2% identity (76.8% similar) in 168 aa overlap (92-254:264-431)
70 80 90 100 110 120
pF1KE6 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE
:: : ::.::.... ::. :: :.. ::.
CCDS56 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN
240 250 260 270 280 290
130 140 150 160 170
pF1KE6 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM
.: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :...
CCDS56 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL
300 310 320 330 340 350
180 190 200 210 220 230
pF1KE6 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL
. ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:.
CCDS56 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI
360 370 380 390 400 410
240 250 260 270 280 290
pF1KE6 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
.:.:: :. :::: . :
CCDS56 ITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTL
420 430 440 450 460 470
296 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 11:48:11 2016 done: Tue Nov 8 11:48:12 2016
Total Scan time: 1.920 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]