FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA1983, 406 aa
1>>>pF1KA1983 406 - 406 aa - 406 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.5293+/-0.00044; mu= -2.9356+/- 0.028
mean_var=533.0750+/-108.245, 0's: 0 Z-trim(124.4): 406 B-trim: 0 in 0/60
Lambda= 0.055549
statistics sampled from 45459 (45920) to 45459 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.538), width: 16
Scan time: 8.300
The best scores are: opt bits E(85289)
NP_597716 (OMIM: 235510,612753) collagen and calci ( 406) 2959 251.4 3e-66
XP_016881047 (OMIM: 235510,612753) PREDICTED: coll ( 366) 2419 208.0 3e-53
XP_016881045 (OMIM: 235510,612753) PREDICTED: coll ( 435) 1877 164.7 4e-40
XP_016881046 (OMIM: 235510,612753) PREDICTED: coll ( 435) 1877 164.7 4e-40
NP_000486 (OMIM: 301050,303630) collagen alpha-5(I (1685) 344 42.6 0.0085
XP_016884749 (OMIM: 301050,303630) PREDICTED: coll (1690) 344 42.6 0.0085
>>NP_597716 (OMIM: 235510,612753) collagen and calcium-b (406 aa)
initn: 2959 init1: 2959 opt: 2959 Z-score: 1310.3 bits: 251.4 E(85289): 3e-66
Smith-Waterman score: 2959; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406)
10 20 30 40 50 60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA1 RRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 RRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHR
310 320 330 340 350 360
370 380 390 400
pF1KA1 THSSAEEFPLPQEFPSYPEAMDLGSGDDHPRRTETRDLRAPRDFYP
::::::::::::::::::::::::::::::::::::::::::::::
NP_597 THSSAEEFPLPQEFPSYPEAMDLGSGDDHPRRTETRDLRAPRDFYP
370 380 390 400
>>XP_016881047 (OMIM: 235510,612753) PREDICTED: collagen (366 aa)
initn: 2650 init1: 2419 opt: 2419 Z-score: 1076.9 bits: 208.0 E(85289): 3e-53
Smith-Waterman score: 2419; 100.0% identity (100.0% similar) in 329 aa overlap (1-329:1-329)
10 20 30 40 50 60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA1 RRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHR
:::::::::::::::::::::::::::::
XP_016 RRGPVGPPGAPGRDGSKGERGAPGPRGSPVSSTLCPASPGERSQGCSSDEPIGTPWFFRL
310 320 330 340 350 360
>>XP_016881045 (OMIM: 235510,612753) PREDICTED: collagen (435 aa)
initn: 1877 init1: 1877 opt: 1877 Z-score: 841.4 bits: 164.7 E(85289): 4e-40
Smith-Waterman score: 1877; 99.2% identity (99.2% similar) in 262 aa overlap (1-262:1-262)
10 20 30 40 50 60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
::::::::::::::::::: :
XP_016 SNTYLPGPPGLPGGQGPPGECGRGHRRVITNRKSLPKAHICWGLDNVLQLPSRRNKAHQD
250 260 270 280 290 300
>>XP_016881046 (OMIM: 235510,612753) PREDICTED: collagen (435 aa)
initn: 1877 init1: 1877 opt: 1877 Z-score: 841.4 bits: 164.7 E(85289): 4e-40
Smith-Waterman score: 1877; 99.2% identity (99.2% similar) in 262 aa overlap (1-262:1-262)
10 20 30 40 50 60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
::::::::::::::::::: :
XP_016 SNTYLPGPPGLPGGQGPPGECGRGHRRVITNRKSLPKAHICWGLDNVLQLPSRRNKAHQD
250 260 270 280 290 300
>>NP_000486 (OMIM: 301050,303630) collagen alpha-5(IV) c (1685 aa)
initn: 806 init1: 333 opt: 344 Z-score: 171.2 bits: 42.6 E(85289): 0.0085
Smith-Waterman score: 349; 51.9% identity (57.5% similar) in 106 aa overlap (246-333:1192-1297)
220 230 240 250 260
pF1KA1 KQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLPG--GQ----------GPPGSPGP
:::::::: :: : :: :::
NP_000 PPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGDGGLPGIPGNPGLPGP
1170 1180 1190 1200 1210 1220
270 280 290 300 310
pF1KA1 KGSPGFPGMPGPPGQPGPRGSMGPM------GPSPDLSHIKQGRRGPVGPPGAPGRDGSK
:: ::: :.:: : ::: :: :: .:.:. . : :: :::: :: : :
NP_000 KGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPPGRPGLPGPEGPPGLPGNGGIK
1230 1240 1250 1260 1270 1280
320 330 340 350 360 370
pF1KA1 GERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSY
::.: :: : :: ::
NP_000 GEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMKGDPGLPGVPGFPGMKGPSGVP
1290 1300 1310 1320 1330 1340
>>XP_016884749 (OMIM: 301050,303630) PREDICTED: collagen (1690 aa)
initn: 806 init1: 333 opt: 344 Z-score: 171.2 bits: 42.6 E(85289): 0.0085
Smith-Waterman score: 349; 51.9% identity (57.5% similar) in 106 aa overlap (246-333:1197-1302)
220 230 240 250 260
pF1KA1 KQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLPG--GQ----------GPPGSPGP
:::::::: :: : :: :::
XP_016 PPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGDGGLPGIPGNPGLPGP
1170 1180 1190 1200 1210 1220
270 280 290 300 310
pF1KA1 KGSPGFPGMPGPPGQPGPRGSMGPM------GPSPDLSHIKQGRRGPVGPPGAPGRDGSK
:: ::: :.:: : ::: :: :: .:.:. . : :: :::: :: : :
XP_016 KGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPPGRPGLPGPEGPPGLPGNGGIK
1230 1240 1250 1260 1270 1280
320 330 340 350 360 370
pF1KA1 GERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSY
::.: :: : :: ::
XP_016 GEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMKGDPGLPGVPGFPGMKGPSGVP
1290 1300 1310 1320 1330 1340
406 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 22:21:20 2016 done: Wed Nov 2 22:21:21 2016
Total Scan time: 8.300 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]