FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1743, 245 aa
1>>>pF1KE1743 245 - 245 aa - 245 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6551+/-0.000926; mu= 6.1639+/- 0.056
mean_var=209.0342+/-41.909, 0's: 0 Z-trim(114.2): 131 B-trim: 0 in 0/53
Lambda= 0.088709
statistics sampled from 14617 (14754) to 14617 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.453), width: 16
Scan time: 2.040
The best scores are: opt bits E(32554)
CCDS226.1 C1QA gene_id:712|Hs108|chr1 ( 245) 1718 231.5 3.9e-61
CCDS228.1 C1QB gene_id:713|Hs108|chr1 ( 253) 601 88.6 4.3e-18
CCDS227.1 C1QC gene_id:714|Hs108|chr1 ( 245) 539 80.7 1e-15
CCDS3414.1 C1QTNF7 gene_id:114905|Hs108|chr4 ( 289) 450 69.3 3.1e-12
CCDS47025.1 C1QTNF7 gene_id:114905|Hs108|chr4 ( 296) 450 69.3 3.1e-12
CCDS31793.1 C1QL4 gene_id:338761|Hs108|chr12 ( 238) 420 65.4 3.9e-11
CCDS8420.1 C1QTNF5 gene_id:114902|Hs108|chr11 ( 243) 416 64.9 5.6e-11
CCDS3284.1 ADIPOQ gene_id:9370|Hs108|chr3 ( 244) 415 64.8 6.1e-11
>>CCDS226.1 C1QA gene_id:712|Hs108|chr1 (245 aa)
initn: 1718 init1: 1718 opt: 1718 Z-score: 1211.0 bits: 231.5 E(32554): 3.9e-61
Smith-Waterman score: 1718; 100.0% identity (100.0% similar) in 245 aa overlap (1-245:1-245)
10 20 30 40 50 60
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 TGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 TGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 RRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 RRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSR
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 GQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 GQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFL
190 200 210 220 230 240
pF1KE1 IFPSA
:::::
CCDS22 IFPSA
>>CCDS228.1 C1QB gene_id:713|Hs108|chr1 (253 aa)
initn: 342 init1: 202 opt: 601 Z-score: 438.3 bits: 88.6 E(32554): 4.3e-18
Smith-Waterman score: 601; 41.3% identity (67.1% similar) in 252 aa overlap (1-243:3-249)
10 20 30 40 50
pF1KE1 MEGPRGWL--VLCVLAISLASMVTEDL-CRAPD---GKKGEAGRPGRRGRPGLKGEQG
:. : : . .. .: ..: .. .: : .: : : : :: :.:: : .:
CCDS22 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 EPGAPGIRTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ
: : ::. .: .: :..:.:: ::::::: :: :: :. : :: : :: :. :
CCDS22 EKGLPGL-AGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKAT
70 80 90 100 110
120 130 140 150 160 170
pF1KE1 PRPAFSAIRR-NPPMGGNVVI-FDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWE
. :::: : : :. . .: :: ::::... :. .::.:.: ::: ::::... :. .
CCDS22 QKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRGN
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE1 ICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGS
.:.... . : .... . ::: . . :::..:::::.:.::..:... :.. . :
CCDS22 LCVNLMRG-RERAQKVVTFCDYAYN-TFQVTTGGMVLKLEQGENVFLQATDKNSLL--GM
180 190 200 210 220 230
240
pF1KE1 E-ADSVFSGFLIFPSA
: :.:.:::::.::
CCDS22 EGANSIFSGFLLFPDMEA
240 250
>>CCDS227.1 C1QC gene_id:714|Hs108|chr1 (245 aa)
initn: 318 init1: 169 opt: 539 Z-score: 395.6 bits: 80.7 E(32554): 1e-15
Smith-Waterman score: 539; 38.8% identity (63.2% similar) in 242 aa overlap (8-243:15-244)
10 20 30 40 50
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGE
:.: .: . : .... : . : : : ::. : :: : .::
CCDS22 MDVGPSSLPHLGLKLLLLLLLLPLRGQANTG-CYGIPGMPGLPGAPGKDGYDGLPGPKGE
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 PGAPGIRTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGN---IK
:: :.: ::.: ::..:::: :.::: :: :: : :.:: : : ::. :
CCDS22 PGIPAI-PGIRGPKGQKGEPGLPGHPGK---NGPMGPPGMPGVPGPMGIPGEPGEEGRYK
60 70 80 90 100 110
120 130 140 150 160
pF1KE1 DQPRPAFSAIRRN--PPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQ
.. . .:.. :.. :: .... :..:.:: . :.. .:.:.: ::: :::....
CCDS22 QKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQGDYDTSTGKFTCKVPGLYYFVYHASHT
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE1 WEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHI-Y
..:. . :. . . :: :.: :: :::..:.:: :..::. . .
CCDS22 ANLCVLLYRSGV----KVVTFCGHTSK-TNQVNSGGVLLRLQVGEEVWLAVNDYYDMVGI
180 190 200 210 220 230
230 240
pF1KE1 QGSEADSVFSGFLIFPSA
::: ::::::::.::
CCDS22 QGS--DSVFSGFLLFPD
240
>>CCDS3414.1 C1QTNF7 gene_id:114905|Hs108|chr4 (289 aa)
initn: 318 init1: 150 opt: 450 Z-score: 333.2 bits: 69.3 E(32554): 3.1e-12
Smith-Waterman score: 450; 38.3% identity (60.4% similar) in 227 aa overlap (31-242:53-274)
10 20 30 40 50 60
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
: .:. : ::: :: : :::.:: :. :.:
CCDS34 LKGENYSPRYICSIPGLPGPPGPPGANGSPGPHGRIGLPGRDGRDGRKGEKGEKGTAGLR
30 40 50 60 70 80
70 80 90 100 110
pF1KE1 -----TGIQGLKGDQGEPG---PSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ
:. : :::::: : : : :. : :: :: : .: : .: : :: .
CCDS34 GKTGPLGLAGEKGDQGETGKKGPIGPEGEKGEVGPIGPPGPKGDRGEQGDPGLPGVCRCG
90 100 110 120 130 140
120 130 140 150 160
pF1KE1 P---RPAFSA-IRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQV-LS
. :::. : . : .::. :. :. : :. .:.:.:. :: :::.... :.
CCDS34 SIVLKSAFSVGITTSYPEERLPIIFNKVLFNEGEHYNPATGKFICAFPGIYYFSYDITLA
150 160 170 180 190 200
170 180 190 200 210 220
pF1KE1 QWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEK--DPKKGH
. .. ...: . :: : . :. : : .:.::. :. :: :.::.: ..:
CCDS34 NKHLAIGLVHN--GQYR--IKTFDA-NTGNHDVASGSTVIYLQPEDEVWLEIFFTDQNGL
210 220 230 240 250
230 240
pF1KE1 IYQGSEADSVFSGFLIFPSA
. . . :::.:::::..
CCDS34 FSDPGWADSLFSGFLLYVDTDYLDSISEDDEL
260 270 280
>>CCDS47025.1 C1QTNF7 gene_id:114905|Hs108|chr4 (296 aa)
initn: 318 init1: 150 opt: 450 Z-score: 333.0 bits: 69.3 E(32554): 3.1e-12
Smith-Waterman score: 450; 38.3% identity (60.4% similar) in 227 aa overlap (31-242:60-281)
10 20 30 40 50 60
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
: .:. : ::: :: : :::.:: :. :.:
CCDS47 LKGENYSPRYICSIPGLPGPPGPPGANGSPGPHGRIGLPGRDGRDGRKGEKGEKGTAGLR
30 40 50 60 70 80
70 80 90 100 110
pF1KE1 -----TGIQGLKGDQGEPG---PSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ
:. : :::::: : : : :. : :: :: : .: : .: : :: .
CCDS47 GKTGPLGLAGEKGDQGETGKKGPIGPEGEKGEVGPIGPPGPKGDRGEQGDPGLPGVCRCG
90 100 110 120 130 140
120 130 140 150 160
pF1KE1 P---RPAFSA-IRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQV-LS
. :::. : . : .::. :. :. : :. .:.:.:. :: :::.... :.
CCDS47 SIVLKSAFSVGITTSYPEERLPIIFNKVLFNEGEHYNPATGKFICAFPGIYYFSYDITLA
150 160 170 180 190 200
170 180 190 200 210 220
pF1KE1 QWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEK--DPKKGH
. .. ...: . :: : . :. : : .:.::. :. :: :.::.: ..:
CCDS47 NKHLAIGLVHN--GQYR--IKTFDA-NTGNHDVASGSTVIYLQPEDEVWLEIFFTDQNGL
210 220 230 240 250 260
230 240
pF1KE1 IYQGSEADSVFSGFLIFPSA
. . . :::.:::::..
CCDS47 FSDPGWADSLFSGFLLYVDTDYLDSISEDDEL
270 280 290
>>CCDS31793.1 C1QL4 gene_id:338761|Hs108|chr12 (238 aa)
initn: 297 init1: 167 opt: 420 Z-score: 313.4 bits: 65.4 E(32554): 3.9e-11
Smith-Waterman score: 421; 37.2% identity (58.4% similar) in 226 aa overlap (26-243:28-237)
10 20 30 40 50
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRA---PDGKKGEAGRPGRRGRPGLKGEQGEPG
:: : : .: :: : :. . ::
CCDS31 MVLLLLVAIPLLVHSSRGPAHYEMLGRCRMVCDPHGPRG----PGPDGAPA-SVPPFPPG
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 APGI--RTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQP
: : : : ::.: : ::: : ::. : ::: :: : :: :. . : . :
CCDS31 AKGEVGRRGKAGLRGPPGPPGPRGPPGEPGRPGPPGPPG----PGPGGVAPAAGYV---P
60 70 80 90 100
120 130 140 150 160 170
pF1KE1 RPAFSAIRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICL
: :: : : : : .:. :: :.:: . :. ::.:.: .:: :.:...:: .
CCDS31 RIAFYAGLRRPHEGYEVLRFDDVVTNVGNAYEAASGKFTCPMPGVYFFAYHVLMRGGDGT
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 SIVSS--SRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSE
:. .. . :::: : :. .. .. .:....:.:. ::.:... : :... :.
CCDS31 SMWADLMKNGQVRASAIAQDADQN--YDYASNSVILHLDVGDEVFIKLD--GGKVHGGNT
170 180 190 200 210 220
240
pF1KE1 AD-SVFSGFLIFPSA
:.::::.:.:
CCDS31 NKYSTFSGFIIYPD
230
>>CCDS8420.1 C1QTNF5 gene_id:114902|Hs108|chr11 (243 aa)
initn: 279 init1: 210 opt: 416 Z-score: 310.6 bits: 64.9 E(32554): 5.6e-11
Smith-Waterman score: 416; 35.4% identity (56.0% similar) in 243 aa overlap (5-242:2-233)
10 20 30 40 50 60
pF1KE1 MEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR
: ::: .:... .: .: . :. :. : :: :: .: :: :: :
CCDS84 MRPLLVLLLLGLAAGSPPLDD-NKIPSLCPGHPGLPGT---PGHHGSQGLPG----R
10 20 30 40
70 80 90 100 110 120
pF1KE1 TGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI
: .: : : :: .:. :. : ::: : : :: : : : :. . :: ::::
CCDS84 DGRDGRDGAPGAPGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAK
50 60 70 80 90 100
130 140 150 160 170
pF1KE1 R---RNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVS
: : :: . . :: :..:.. :. .:.:.: ::: :::. .. . .. :..
CCDS84 RSESRVPPPSDAPLPFDRVLVNEQGHYDAVTGKFTCQVPGVYYFAVHA-TVYRASLQFDL
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 SSRGQ-VRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKG-HIYQGSEADSV
. :. . . : : .::: ...:. :::::. :: . ..::.
CCDS84 VKNGESIASFFQFFGGWPKP--ASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDST
170 180 190 200 210 220
240
pF1KE1 FSGFLIFPSA
:::::..
CCDS84 FSGFLVYSDWHSSPVFA
230 240
>>CCDS3284.1 ADIPOQ gene_id:9370|Hs108|chr3 (244 aa)
initn: 404 init1: 180 opt: 415 Z-score: 309.8 bits: 64.8 E(32554): 6.1e-11
Smith-Waterman score: 416; 34.3% identity (62.9% similar) in 213 aa overlap (34-242:42-240)
10 20 30 40 50 60
pF1KE1 PRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIRTGI
: :.::. : :: :..: ::
CCDS32 ALPGHDQETTTQGPGVLLPLPKGACTGWMAGIPGHPGHNGAPGRDGRDGTPGE-------
20 30 40 50 60
70 80 90 100 110 120
pF1KE1 QGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAIRRN
.: ::: : ::.:. :..: :: :: ::.:::.: :: ::. : :::. ..
CCDS32 KGEKGDPGLIGPKGDIGETGVPGAEGP---RGFPGIQGRKGEPGEGAYVYRSAFSVGLET
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 PPMGGNVVI-FDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSRGQ
:. : : .. ::.. :.. .:.: :..:: :::.... . .. :. : .
CCDS32 YVTIPNMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYFAYHI----TVYMKDVKVSLFK
130 140 150 160 170
190 200 210 220 230
pF1KE1 VRRSLGFC-DTTNKGLFQVVSGGMVLQLQQGDQVWVE--KDPKKGHIYQGSEADSVFSGF
... : : ... . .::...:.:. :::::.. . ... .: .. ::.:.::
CCDS32 KDKAMLFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGEGERNGLYADNDNDSTFTGF
180 190 200 210 220 230
240
pF1KE1 LIFPSA
:..
CCDS32 LLYHDTN
240
245 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 19:27:53 2016 done: Sun Nov 6 19:27:53 2016
Total Scan time: 2.040 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]