FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4467, 489 aa
1>>>pF1KE4467 489 - 489 aa - 489 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8176+/-0.000851; mu= 20.7861+/- 0.051
mean_var=62.9507+/-12.792, 0's: 0 Z-trim(106.4): 23 B-trim: 0 in 0/51
Lambda= 0.161649
statistics sampled from 8963 (8980) to 8963 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.645), E-opt: 0.2 (0.276), width: 16
Scan time: 3.360
The best scores are: opt bits E(32554)
CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 489) 3213 758.1 4.9e-219
CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 591) 2816 665.5 4.2e-191
CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 ( 551) 2382 564.3 1.2e-160
CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2 ( 496) 679 167.1 3.9e-41
CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1 ( 497) 641 158.2 1.8e-38
CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 ( 296) 508 127.1 2.6e-29
>>CCDS56218.1 SLC19A1 gene_id:6573|Hs108|chr21 (489 aa)
initn: 3213 init1: 3213 opt: 3213 Z-score: 4045.8 bits: 758.1 E(32554): 4.9e-219
Smith-Waterman score: 3213; 99.8% identity (100.0% similar) in 489 aa overlap (1-489:1-489)
10 20 30 40 50 60
pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT
::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::::
CCDS56 MVPSSPAVEKQVPVEPGPDPELRSWRHLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 VRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGP
430 440 450 460 470 480
pF1KE4 VCPSEVCPS
:::::::::
CCDS56 VCPSEVCPS
>>CCDS13725.1 SLC19A1 gene_id:6573|Hs108|chr21 (591 aa)
initn: 2845 init1: 2816 opt: 2816 Z-score: 3544.3 bits: 665.5 E(32554): 4.2e-191
Smith-Waterman score: 2816; 99.8% identity (100.0% similar) in 431 aa overlap (1-431:1-431)
10 20 30 40 50 60
pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT
::::::::::::::::::::::::::.:::::::::::::::::::::::::::::::::
CCDS13 MVPSSPAVEKQVPVEPGPDPELRSWRHLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE4 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE4 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE4 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGH
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE4 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 ALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE4 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPS
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE4 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSD
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE4 VRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGP
:::::::::::
CCDS13 VRGLGLPVRKQFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLRSAAEEKA
430 440 450 460 470 480
>>CCDS56217.1 SLC19A1 gene_id:6573|Hs108|chr21 (551 aa)
initn: 2411 init1: 2382 opt: 2382 Z-score: 2997.7 bits: 564.3 E(32554): 1.2e-160
Smith-Waterman score: 2382; 95.1% identity (96.7% similar) in 391 aa overlap (41-431:1-391)
20 30 40 50 60 70
pF1KE4 QVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFTREQVTNEITP
.:: . .: : . . .:::::::
CCDS56 MRPQPAEPAPGGRGNEACSIHSEVTNEITP
10 20 30
80 90 100 110 120 130
pF1KE4 VLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQLMELFYSVTM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQLMELFYSVTM
40 50 60 70 80 90
140 150 160 170 180 190
pF1KE4 AARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRVSFSTLNYISL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRVSFSTLNYISL
100 110 120 130 140 150
200 210 220 230 240 250
pF1KE4 AFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGHALRVACGDSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPGPGGKLGHALRVACGDSV
160 170 180 190 200 210
260 270 280 290 300 310
pF1KE4 LARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAAD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAAD
220 230 240 250 260 270
320 330 340 350 360 370
pF1KE4 AASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFV
280 290 300 310 320 330
380 390 400 410 420 430
pF1KE4 LFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRK
340 350 360 370 380 390
440 450 460 470 480
pF1KE4 QNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTLRAALHGPVCPSEVCPS
:
CCDS56 QFQLYSVYFLILSIIYFLGAMLDGLRHCQRGHHPRQPPAQGLRSAAEEKAAQALSVQDKG
400 410 420 430 440 450
>>CCDS2468.1 SLC19A3 gene_id:80704|Hs108|chr2 (496 aa)
initn: 1169 init1: 676 opt: 679 Z-score: 852.0 bits: 167.1 E(32554): 3.9e-41
Smith-Waterman score: 1156; 43.1% identity (72.9% similar) in 432 aa overlap (24-431:11-438)
10 20 30 40 50 60
pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGPDKNFT
:: . ::..::....::.: :. ::: :::::.:
CCDS24 MDCYRTSLSSSWIYPTVILCLFGFFSMMRPSEPFLIPYLSGPDKNLT
10 20 30 40
70 80 90 100 110 120
pF1KE4 REQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHSVAHMQ
..:::: :: .::::..:.:::.::::.:: ::..:::.::. .:::::.:..: ::
CCDS24 SAEITNEIFPVWTYSYLVLLLPVFVLTDYVRYKPVIILQGISFIITWLLLLFGQGVKTMQ
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE4 LMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLVTVGRV
..:.::... ::..:: .::.:.: : .::::.:: :...: . ..:::.::::... .
CCDS24 VVEFFYGMVTAAEVAYYAYIYSVVSPEHYQRVSGYCRSVTLAAYTAGSVLAQLLVSLANM
110 120 130 140 150 160
190 200 210 220 230
pF1KE4 SFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASE---LERMNPG--PG
:. :: :::: .. . ...::: ::.:.::. . . :.: ::. . : ::
CCDS24 SYFYLNVISLASVSVAFLFSLFLPMPKKSMFFHAKPSREIKKSSSVNPVLEETHEGEAPG
170 180 190 200 210 220
240 250 260 270
pF1KE4 --------------GKLGHALRVACGDS-----VLARMLRELGDSLRRPQLRLWSLWWVF
:::... . : :... ...: . .: :::::.:
CCDS24 CEEQKPTSEILSTSGKLNKGQLNSLKPSNVTVDVFVQWFQDLKECYSSKRLFYWSLWWAF
230 240 250 260 270 280
280 290 300 310 320 330
pF1KE4 NSAGYYLVVYYVHILWNEVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSK
.::. :. ::.:::. :. .:. .::::..: .:. ::...::.:.::. : ..
CCDS24 ATAGFNQVLNYVQILWDYKAPSQDSS-IYNGAVEAIATFGGAVAAFAVGYVKVNWDLLGE
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE4 LLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELC
: .. ....:: .::. .: ..:: :::....:..::..:. ::.:::: .:. :
CCDS24 LALVVFSVVNAGSLFLMHYT---ANIWACYAGYLIFKSSYMLLITIAVFQIAVNLNVERY
350 360 370 380 390 400
400 410 420 430 440 450
pF1KE4 ALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSS
:::::.:::.: ...::.: :: : :::.::: :
CCDS24 ALVFGINTFIALVIQTIMTVIVVDQRGLNLPVSIQFLVYGSYFAVIAGIFLMRSMYITYS
410 420 430 440 450 460
460 470 480
pF1KE4 EGSSGSGPRSWFLSPTLRAALHGPVCPSEVCPS
CCDS24 TKSQKDVQSPAPSENPDVSHPEEESNIIMSTKL
470 480 490
>>CCDS1280.1 SLC19A2 gene_id:10560|Hs108|chr1 (497 aa)
initn: 1118 init1: 638 opt: 641 Z-score: 804.0 bits: 158.2 E(32554): 1.8e-38
Smith-Waterman score: 1075; 42.3% identity (71.8% similar) in 433 aa overlap (25-431:30-455)
10 20 30 40 50
pF1KE4 MVPSSPAVEKQVPVEPGPDPELRSWRRLVCYLCFYGFMAQIRPGESFITPYLLGP
: . :: :::.:..::.: :.:::::::
CCDS12 MDVPGPVSRRAAAAAATVLLRTARVRRECWFLPTALLCAYGFFASLRPSEPFLTPYLLGP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE4 DKNFTREQVTNEITPVLSYSYLAVLVPVFLLTDYLRYTPVLLLQGLSFVSVWLLLLLGHS
:::.:...: ::: :: .::::..: :::: :::::: ::.::::::.. .:..:: ...
CCDS12 DKNLTEREVFNEIYPVWTYSYLVLLFPVFLATDYLRYKPVVLLQGLSLIVTWFMLLYAQG
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE4 VAHMQLMELFYSVTMAARIAYSSYIFSLVRPARYQRVAGYSRAAVLLGVFTSSVLGQLLV
. .:..:.::... :..::: :::.:.: . ::.:..: :.:.:.: ..:::::.::
CCDS12 LLAIQFLEFFYGIATATEIAYYSYIYSVVDLGMYQKVTSYCRSATLVGFTVGSVLGQILV
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE4 TVGRVSFSTLNYISLAFLTFSVVLALFLKRPKRSLFFNRDDRGRCETSASELERMNPG--
.:. :. .:: :::. .. . ..: :: :..::::.. . :. .. .. .: :
CCDS12 SVAGWSLFSLNVISLTCVSVAFAVAWFLPMPQKSLFFHHIP-STCQR-VNGIKVQNGGIV
190 200 210 220 230
240 250 260
pF1KE4 ---------PG-----GKLGHALR---VACGDSVLARML--RELGDSL-----RRPQLRL
:: .:. .. : . :.: . : ... :: :
CCDS12 TDTPASNHLPGWEDIESKIPLNMEEPPVEEPEPKPDRLLVLKVLWNDFLMCYSSRPLL-C
240 250 260 270 280 290
270 280 290 300 310 320
pF1KE4 WSLWWVFNSAGYYLVVYYVHILWNEVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKI
::.::.... ::. :: :.. ::..: :. .: .:::...:.::::::.. ::.:..::
CCDS12 WSVWWALSTCGYFQVVNYTQGLWEKVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKI
300 310 320 330 340 350
330 340 350 360 370 380
pF1KE4 RWARWSKLLIAGVTATQAGLVFLLAHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIAS
:. :... .. . :. :... . ..::.:::..:.:: :..:. :::::::.
CCDS12 SWSTWGEMTLSLFSLLIAAAVYIMDTV---GNIWVCYASYVVFRIIYMLLITIATFQIAA
360 370 380 390 400 410
390 400 410 420 430 440
pF1KE4 SLSKELCALVFGVNTFFATIVKTIITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRL
.:: : :::::::::.: ..:..:.:: :. :::: . :
CCDS12 NLSMERYALVFGVNTFIALALQTLLTLIVVDASGLGLEITTQFLIYASYFALIAVVFLAS
420 430 440 450 460 470
450 460 470 480
pF1KE4 AADTLSSEGSSGSGPRSWFLSPTLRAALHGPVCPSEVCPS
CCDS12 GAVSVMKKCRKLEDPQSSSQVTTS
480 490
>>CCDS81398.1 SLC19A2 gene_id:10560|Hs108|chr1 (296 aa)
initn: 638 init1: 260 opt: 508 Z-score: 639.6 bits: 127.1 E(32554): 2.6e-29
Smith-Waterman score: 508; 45.2% identity (76.8% similar) in 168 aa overlap (264-431:92-254)
240 250 260 270 280 290
pF1KE4 PGGKLGHALRVACGDSVLARMLRELGDSLRRPQLRLWSLWWVFNSAGYYLVVYYVHILWN
:: : ::.::.... ::. :: :.. ::.
CCDS81 KNLTEREEPKPDRLLVLKVLWNDFLMCYSSRPLL-CWSVWWALSTCGYFQVVNYTQGLWE
70 80 90 100 110 120
300 310 320 330 340 350
pF1KE4 EVDPTTNSARVYNGAADAASTLLGAITSFAAGFVKIRWARWSKLLIAGVTATQAGLVFLL
.: :. .: .:::...:.::::::.. ::.:..:: :. :... .. . :. :...
CCDS81 KVMPSRYAA-IYNGGVEAVSTLLGAVAVFAVGYIKISWSTWGEMTLSLFSLLIAAAVYIM
130 140 150 160 170
360 370 380 390 400 410
pF1KE4 AHTRHPSSIWLCYAAFVLFRGSYQFLVPIATFQIASSLSKELCALVFGVNTFFATIVKTI
. ..::.:::..:.:: :..:. :::::::..:: : :::::::::.: ..:.
CCDS81 DTV---GNIWVCYASYVVFRIIYMLLITIATFQIAANLSMERYALVFGVNTFIALALQTL
180 190 200 210 220 230
420 430 440 450 460 470
pF1KE4 ITFIVSDVRGLGLPVRKQNEELHVASLSLWKSHLRLAADTLSSEGSSGSGPRSWFLSPTL
.:.:: :. :::: . :
CCDS81 LTLIVVDASGLGLEITTQFLIYASYFALIAVVFLASGAVSVMKKCRKLEDPQSSSQVTTS
240 250 260 270 280 290
489 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 00:30:20 2016 done: Sun Nov 6 00:30:21 2016
Total Scan time: 3.360 Total Display time: 0.050
Function used was FASTA [36.3.4 Apr, 2011]