FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0963, 193 aa
1>>>pF1KB0963 193 - 193 aa - 193 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4479+/-0.000364; mu= 15.6010+/- 0.023
mean_var=196.1994+/-46.524, 0's: 0 Z-trim(118.8): 259 B-trim: 1901 in 1/48
Lambda= 0.091564
statistics sampled from 31734 (32100) to 31734 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.72), E-opt: 0.2 (0.376), width: 16
Scan time: 5.850
The best scores are:                                       opt bits E(85289)
NP_001312 (OMIM: 601871) cysteine and glycine-rich ( 193) 1417 198.8 4.7e-51
NP_001287894 (OMIM: 601871) cysteine and glycine-r ( 193) 1417 198.8 4.7e-51
NP_001180500 (OMIM: 123876) cysteine and glycine-r ( 193) 1159 164.7 8.5e-41
NP_004069 (OMIM: 123876) cysteine and glycine-rich ( 193) 1159 164.7 8.5e-41
NP_001180501 (OMIM: 123876) cysteine and glycine-r ( 193) 1159 164.7 8.5e-41
NP_001180499 (OMIM: 123876) cysteine and glycine-r ( 187) 1101 157.0 1.7e-38
NP_003467 (OMIM: 600824,607482,612124) cysteine an ( 194)  977 140.6 1.5e-33
NP_001302 (OMIM: 123875) cysteine-rich protein 1 [ (  77)  275 47.2 7.6e-06
NP_001303 (OMIM: 601183) cysteine-rich protein 2 i ( 208)  262 46.2 4.1e-05
NP_001257766 (OMIM: 601183) cysteine-rich protein  ( 282)  262 46.5 4.8e-05
NP_001257770 (OMIM: 601183) cysteine-rich protein  (  87)  255 44.7 5.1e-05
>>NP_001312 (OMIM: 601871) cysteine and glycine-rich pro (193 aa)
initn: 1417 init1: 1417 opt: 1417 Z-score: 1037.6 bits: 198.8 E(85289): 4.7e-51
Smith-Waterman score: 1417; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GYGQGAGALVHAQ
:::::::::::::
NP_001 GYGQGAGALVHAQ
190
>>NP_001287894 (OMIM: 601871) cysteine and glycine-rich (193 aa)
initn: 1417 init1: 1417 opt: 1417 Z-score: 1037.6 bits: 198.8 E(85289): 4.7e-51
Smith-Waterman score: 1417; 100.0% identity (100.0% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GYGQGAGALVHAQ
:::::::::::::
NP_001 GYGQGAGALVHAQ
190
>>NP_001180500 (OMIM: 123876) cysteine and glycine-rich (193 aa)
initn: 1511 init1: 1159 opt: 1159 Z-score: 853.4 bits: 164.7 E(85289): 8.5e-41
Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
:: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: :::::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
::...::::::.::::: ::: :::::::::.::::::..:.::::::::::::::::::
NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GYGQGAGALVHAQ
:.:::::::::..
NP_001 GFGQGAGALVHSE
190
>>NP_004069 (OMIM: 123876) cysteine and glycine-rich pro (193 aa)
initn: 1511 init1: 1159 opt: 1159 Z-score: 853.4 bits: 164.7 E(85289): 8.5e-41
Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
:: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: :::::::
NP_004 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.:
NP_004 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
::...::::::.::::: ::: :::::::::.::::::..:.::::::::::::::::::
NP_004 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GYGQGAGALVHAQ
:.:::::::::..
NP_004 GFGQGAGALVHSE
190
>>NP_001180501 (OMIM: 123876) cysteine and glycine-rich (193 aa)
initn: 1511 init1: 1159 opt: 1159 Z-score: 853.4 bits: 164.7 E(85289): 8.5e-41
Smith-Waterman score: 1159; 79.3% identity (91.7% similar) in 193 aa overlap (1-193:1-193)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
:: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: :::::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
::...::::::.::::: ::: :::::::::.::::::..:.::::::::::::::::::
NP_001 RCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170 180
190
pF1KB0 GYGQGAGALVHAQ
:.:::::::::..
NP_001 GFGQGAGALVHSE
190
>>NP_001180499 (OMIM: 123876) cysteine and glycine-rich (187 aa)
initn: 1139 init1: 757 opt: 1101 Z-score: 812.2 bits: 157.0 E(85289): 1.7e-38
Smith-Waterman score: 1101; 76.7% identity (88.6% similar) in 193 aa overlap (1-193:1-187)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
:: ::::.:::.: .::: ::::::.: :::. :::::::.:::::::::.: :::::::
NP_001 MPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKSCFLCMVCKKNLDSTTVAVHGEEIYCKS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCS
::::::::::::::::::::. :.:: :::: : . ::::::::.:::::: ::.:.:
NP_001 CYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHEEAPGHRPTTNPNASKFAQKIGGSERCP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 RCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKGF
::...:::::: ::: :::::::::.::::::..:.::::::::::::::::::
NP_001 RCSQAVYAAEK------SWHKACFRCAKCGKGLESTTLADKDGEIYCKGCYAKNFGPKGF
130 140 150 160 170
190
pF1KB0 GYGQGAGALVHAQ
:.:::::::::..
NP_001 GFGQGAGALVHSE
180
>>NP_003467 (OMIM: 600824,607482,612124) cysteine and gl (194 aa)
initn: 989 init1: 523 opt: 977 Z-score: 723.5 bits: 140.6 E(85289): 1.5e-33
Smith-Waterman score: 977; 69.0% identity (85.3% similar) in 184 aa overlap (1-183:1-184)
10 20 30 40 50 60
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYCKS
:: :::: ::::: .:::::::.::.:::::. :: ::.::: ::::::: :. :::::
NP_003 MPNWGGGAKCGACEKTVYHAEEIQCNGRSFHKTCFHCMACRKALDSTTVAAHESEIYCKV
10 20 30 40 50 60
70 80 90 100 110
pF1KB0 CYGKKYGPKGYGYGQGAGTLNMDRGERLGIK-PESVQPHRPTTNPNTSKFAQKYGGAEKC
:::..::::: ::::::: :. : ::.::.. .: .: : .:. : :::. :.: .:::
NP_003 CYGRRYGPKGIGYGQGAGCLSTDTGEHLGLQFQQSPKPARSVTTSNPSKFTAKFGESEKC
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB0 SRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKNFGPKG
::: :::::::..:.::::::.::::: ::::::::..:.:.::.::: :::::::: :
NP_003 PRCGKSVYAAEKVMGGGKPWHKTCFRCAICGKSLESTNVTDKDGELYCKVCYAKNFGPTG
130 140 150 160 170 180
180 190
pF1KB0 FGYGQGAGALVHAQ
.:.:
NP_003 IGFGGLTQQVEKKE
190
>>NP_001302 (OMIM: 123875) cysteine-rich protein 1 [Homo (77 aa)
initn: 274 init1: 201 opt: 275 Z-score: 225.9 bits: 47.2 E(85289): 7.6e-06
Smith-Waterman score: 275; 50.7% identity (68.0% similar) in 75 aa overlap (118-191:3-74)
90 100 110 120 130 140
pF1KB0 LGIKPESVQPHRPTTNPNTSKFAQKYGGAEKCSRCGDSVYAAEKIIGAGKPWHKNCFRCA
:: .:. :: ::.. . :: ::. :..:
NP_001 MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCE
10 20 30
150 160 170 180 190
pF1KB0 KCGKSLESTTLTEKEGEIYCKG-CYAKNFGPKGFGYGQGAGALVHAQ
::::.: : .:.::. ::. ::: ::::::: : :: :
NP_001 KCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRG---GAESHTFK
40 50 60 70
>>NP_001303 (OMIM: 601183) cysteine-rich protein 2 isofo (208 aa)
initn: 282 init1: 195 opt: 262 Z-score: 212.8 bits: 46.2 E(85289): 4.1e-05
Smith-Waterman score: 437; 35.9% identity (59.1% similar) in 198 aa overlap (8-191:3-198)
10 20 30 40 50
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDEEIYC-K
.:: : .::: ::.:. :...:. :. : : :.: : :: . .: :
NP_001 MASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFCHK
10 20 30 40 50
60 70 80 90 100
pF1KB0 SCYGKKYGPKGYGYGQGAGTLNMDRGERLGIK---PESVQPHR--------PTTNPNTSK
::. .:::: . : :::. ... : . : : : : .:. ..
NP_001 PCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKASGPPKGPSRAS
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB0 FAQKYGGAEK-CSRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYC
. . : . : ::. .:: :::. . :: ::. :.:: .:::.: .:..:. ::
NP_001 SVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPGGHAEHDGQPYC
120 130 140 150 160 170
170 180 190
pF1KB0 -KGCYAKNFGPKGFGYGQGAGALVHAQ
: ::. ::::: . : ..:. ..
NP_001 HKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP
180 190 200
>>NP_001257766 (OMIM: 601183) cysteine-rich protein 2 is (282 aa)
initn: 240 init1: 195 opt: 262 Z-score: 211.6 bits: 46.5 E(85289): 4.8e-05
Smith-Waterman score: 388; 34.9% identity (58.6% similar) in 186 aa overlap (20-191:89-272)
10 20 30 40
pF1KB0 MPVWGGGNKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTV
::.:. :...:. :. : : :.:
NP_001 EPSQDHHESQEHRGPLVGSQTCLVHQAEGTAEKVSSLGKDWHKFCLKCERCSKTLTPGGH
60 70 80 90 100 110
50 60 70 80 90
pF1KB0 AIHDEEIYC-KSCYGKKYGPKGYGYGQGAGTLNMDRGERLGIK---PESVQPHR------
: :: . .: : ::. .:::: . : :::. ... : . : : :
NP_001 AEHDGKPFCHKPCYATLFGPKGVNIG-GAGSYIYEKPLAEGPQVTGPIEVPAARAEERKA
120 130 140 150 160 170
100 110 120 130 140 150
pF1KB0 --PTTNPNTSKFAQKYGGAEK-CSRCGDSVYAAEKIIGAGKPWHKNCFRCAKCGKSLEST
: .:. .. . . : . : ::. .:: :::. . :: ::. :.:: .:::.:
NP_001 SGPPKGPSRASSVTTFTGEPNTCPRCSKKVYFAEKVTSLGKDWHRPCLRCERCGKTLTPG
180 190 200 210 220 230
160 170 180 190
pF1KB0 TLTEKEGEIYC-KGCYAKNFGPKGFGYGQGAGALVHAQ
.:..:. :: : ::. ::::: . : ..:. ..
NP_001 GHAEHDGQPYCHKPCYGILFGPKGVNTG-AVGSYIYDRDPEGKVQP
240 250 260 270 280
193 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 17:31:06 2016 done: Sat Nov 5 17:31:07 2016
Total Scan time: 5.850 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]