FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3313, 553 aa
1>>>pF1KB3313 553 - 553 aa - 553 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.9843+/-0.00104; mu= 12.2504+/- 0.062
mean_var=154.8290+/-30.913, 0's: 0 Z-trim(108.9): 33 B-trim: 250 in 1/52
Lambda= 0.103074
statistics sampled from 10470 (10496) to 10470 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.683), E-opt: 0.2 (0.322), width: 16
Scan time: 3.520
The best scores are: opt bits E(32554)
CCDS10524.1 GLYR1 gene_id:84656|Hs108|chr16 ( 553) 3668 557.8 1.3e-158
CCDS81945.1 GLYR1 gene_id:84656|Hs108|chr16 ( 547) 3600 547.6 1.4e-155
CCDS5414.1 HIBADH gene_id:11112|Hs108|chr7 ( 336) 381 68.8 1.2e-11
>>CCDS10524.1 GLYR1 gene_id:84656|Hs108|chr16 (553 aa)
initn: 3668 init1: 3668 opt: 3668 Z-score: 2961.4 bits: 557.8 E(32554): 1.3e-158
Smith-Waterman score: 3668; 100.0% identity (100.0% similar) in 553 aa overlap (1-553:1-553)
10 20 30 40 50 60
pF1KB3 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 EKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 EKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB3 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB3 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB3 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
490 500 510 520 530 540
550
pF1KB3 DNDMSAVYRAYIH
:::::::::::::
CCDS10 DNDMSAVYRAYIH
550
>>CCDS81945.1 GLYR1 gene_id:84656|Hs108|chr16 (547 aa)
initn: 2028 init1: 2028 opt: 3600 Z-score: 2906.8 bits: 547.6 E(32554): 1.4e-155
Smith-Waterman score: 3600; 98.9% identity (98.9% similar) in 553 aa overlap (1-553:1-547)
10 20 30 40 50 60
pF1KB3 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MAAVSLRLGDLVWGKLGRYPPWPGKIVNPPKDLKKPRGKKCFFVKFFGTEDHAWIKVEQL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 KPYHAHKEEMIKINKGKRFQQAVDAVEEFLRRAKGKDQTSSHNSSDDKNRRNSSEERSRP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 NSGDEKRKLSLSEGKVKKNMGEGKKRVSSGSSERGSKSPLKRAQEQSPRKRGRPPKDEKD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LTIPESSTVKGMMAGPMAAFKWQPTASEPVKDADPHFHHFLLSQTEKPAVCYQAITKKLK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 ICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGIVSNLLKMGHTVTVWNRTA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 EKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
:: ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 EK------EGARLGRTPAEVVSTCDITFACVSDPKAAKDLVLGPSGVLQGIRPGKCYVDM
310 320 330 340 350
370 380 390 400 410 420
pF1KB3 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 STVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVILAAGDRGLYEDCSSCFQAM
360 370 380 390 400 410
430 440 450 460 470 480
pF1KB3 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 GKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTGQSQQTLLDILNQGQLASI
420 430 440 450 460 470
490 500 510 520 530 540
pF1KB3 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FLDQKCQNILQGNFKPDFYLKYIQKDLRLAIALGDAVNHPTPMAAAANEVYKRAKALDQS
480 490 500 510 520 530
550
pF1KB3 DNDMSAVYRAYIH
:::::::::::::
CCDS81 DNDMSAVYRAYIH
540
>>CCDS5414.1 HIBADH gene_id:11112|Hs108|chr7 (336 aa)
initn: 324 init1: 324 opt: 381 Z-score: 322.5 bits: 68.8 E(32554): 1.2e-11
Smith-Waterman score: 381; 25.3% identity (56.9% similar) in 304 aa overlap (253-549:25-328)
230 240 250 260 270 280
pF1KB3 SQTEKPAVCYQAITKKLKICEEETGSTSIQAADSTAVNGSITPTDKKIGFLGLGLMGSGI
:.. .:: . . . .::.::: ::. .
CCDS54 MAASLRLLGAASGLRYWSRRLRPAAGSFAAVCSRSVASKTPVGFIGLGNMGNPM
10 20 30 40 50
290 300 310 320 330 340
pF1KB3 VSNLLKMGHTVTVWNRTAEKCDLFIQEGARLGRTPAEVVSTCDITFACVSDPKAAKDLVL
..::.: :. . ... . : : . : .. .::.:. : .. . : .
CCDS54 AKNLMKHGYPLIIYDVFPDACKEFQDAGEQVVSSPADVAEKADRIITMLPTSINAIEAYS
60 70 80 90 100 110
350 360 370 380 390 400
pF1KB3 GPSGVLQGIRPGKCYVDMSTVDADTVTELAQVIVSRGGRFLEAPVSGNQQLSNDGMLVIL
: .:.:. .. :. .: ::.: . :::. . . :. :..:::::. . .: :...
CCDS54 GANGILKKVKKGSLLIDSSTIDPAVSKELAKEVEKMGAVFMDAPVSGGVGAARSGNLTFM
120 130 140 150 160 170
410 420 430 440 450 460
pF1KB3 AAGDRGLYEDCSSCFQAMGKTSFFLGEVGNAAKMMLIVNMVQGSFMATIAEGLTLAQVTG
..: . . . . ::.. . : ::.. . ::. . : ::...:. :
CCDS54 VGGVEDEFAAAQELLGCMGSNVVYCGAVGTGQAAKICNNMLLAISMIGTAEAMNLGIRLG
180 190 200 210 220 230
470 480 490 500 510
pF1KB3 QSQQTLLDILNQ--GQLASIFLDQKCQNILQG-----NFKPDFYLKYIQKDLRLAIALGD
. . : :::. :. : . ....: :.. : . ::: :: .
CCDS54 LDPKLLAKILNMSSGRCWSSDTYNPVPGVMDGVPSANNYQGGFGTTLMAKDLGLAQDSAT
240 250 260 270 280 290
520 530 540 550
pF1KB3 AVNHPTPMAAAANEVYKRAKALDQSDNDMSAVYRAYIH
... : ... :...:. : : .:.:.:..
CCDS54 STKSPILLGSLAHQIYRMMCAKGYSKKDFSSVFQFLREEETF
300 310 320 330
553 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 04:54:48 2016 done: Sat Nov 5 04:54:49 2016
Total Scan time: 3.520 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]