FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1436, 253 aa
1>>>pF1KE1436 253 - 253 aa - 253 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.0708+/-0.000981; mu= 5.8010+/- 0.060
mean_var=255.7788+/-52.251, 0's: 0 Z-trim(114.5): 136 B-trim: 133 in 1/52
Lambda= 0.080194
statistics sampled from 14916 (15056) to 14916 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.77), E-opt: 0.2 (0.462), width: 16
Scan time: 2.320
The best scores are:                                  opt  bits E(32554)
CCDS228.1 C1QB gene_id:713|Hs108|chr1         ( 253) 1761 215.8 2.3e-56
CCDS227.1 C1QC gene_id:714|Hs108|chr1         ( 245)  816 106.5 1.8e-23
CCDS226.1 C1QA gene_id:712|Hs108|chr1         ( 245)  601  81.6 5.5e-16
CCDS8420.1 C1QTNF5 gene_id:114902|Hs108|chr11 ( 243)  474  66.9 1.5e-11
CCDS3284.1 ADIPOQ gene_id:9370|Hs108|chr3     ( 244)  469  66.3 2.2e-11
CCDS31793.1 C1QL4 gene_id:338761|Hs108|chr12  ( 238)  467  66.1 2.5e-11
>>CCDS228.1 C1QB gene_id:713|Hs108|chr1 (253 aa)
initn: 1761 init1: 1761 opt: 1761 Z-score: 1125.6 bits: 215.8 E(32554): 2.3e-56
Smith-Waterman score: 1761; 100.0% identity (100.0% similar) in 253 aa overlap (1-253:1-253)
               10        20        30        40        50        60
pF1KE1 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 EKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 EKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 KIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRGNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 KIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRGNL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 CVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQATDKNSLLGMEGANS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 CVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQATDKNSLLGMEGANS
              190       200       210       220       230       240

              250
pF1KE1 IFSGFLLFPDMEA
       :::::::::::::
CCDS22 IFSGFLLFPDMEA
              250
>>CCDS227.1 C1QC gene_id:714|Hs108|chr1 (245 aa)
initn: 593 init1: 593 opt: 816 Z-score: 534.8 bits: 106.5 E(32554): 1.8e-23
Smith-Waterman score: 816; 50.0% identity (74.0% similar) in 246 aa overlap (5-250:8-245)
10 20 30 40 50
pF1KE1 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPG
.: .. .:.::::: : .::. .: : :::.::.::.:: :: : ::
CCDS22 MDVGPSSLPHLGLKLLLLLLLLPL--RGQANTGCYG---IPGMPGLPGAPGKDGYDGLPG
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 IKGEKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYK
::: :.:.. : .: :.::.::.::.::: :: :: : : :: : :: :: : ::
CCDS22 PKGEPGIPAIPGIRGPKGQKGEPGLPGHPGKNGPMGPPGMPGVPGPMGIPGEPGEEGRYK
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 ATQKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSR
. .:..:: . : .. :::. :.:: ...:. .:::::::::::::.::::
CCDS22 QKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQGDYDTSTGKFTCKVPGLYYFVYHASHT
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE1 GNLCVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQATDKNSLLGMEG
.:::: :.:. . :::::: .. .: ::..::..:.:. ::.:.: ..: ...:..:
CCDS22 ANLCVLLYRS---GVKVVTFCGHTSKTNQVNSGGVLLRLQVGEEVWLAVNDYYDMVGIQG
180 190 200 210 220 230
240 250
pF1KE1 ANSIFSGFLLFPDMEA
..:.:::::::::
CCDS22 SDSVFSGFLLFPD
240
>>CCDS226.1 C1QA gene_id:712|Hs108|chr1 (245 aa)
initn: 366 init1: 202 opt: 601 Z-score: 400.4 bits: 81.6 E(32554): 5.5e-16
Smith-Waterman score: 601; 41.3% identity (67.1% similar) in 252 aa overlap (3-249:1-243)
10 20 30 40 50 60
pF1KE1 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG
:. : : . .. .: ..: .. .: : .: : : : :: :.:: : .:
CCDS22 MEGPRGWL--VLCVLAISLASMVTEDL-CRAPD---GKKGEAGRPGRRGRPGLKGEQG
10 20 30 40 50
70 80 90 100 110
pF1KE1 EKGLPGL-AGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKAT
: : ::. .: .: :..:.:: ::::::: :: :: :. : :: : :: :. :
CCDS22 EPGAPGIRTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQ
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE1 QKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRGN
. :::: : : :. . .: :: ::::... :. .::.:.: ::: ::::... :. .
CCDS22 PRPAFSAIRR-NPPMGGNVVI-FDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWE
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE1 LCVNLMRG-RERAQKVVTFCDYAYN-TFQVTTGGMVLKLEQGENVFLQATDKNSLL--GM
.:.... . : .... . ::: . . :::..:::::.:.::..:... :.. . :
CCDS22 ICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGS
180 190 200 210 220 230
240 250
pF1KE1 EGANSIFSGFLLFPDMEA
: :.:.:::::.::
CCDS22 E-ADSVFSGFLIFPSA
240
>>CCDS8420.1 C1QTNF5 gene_id:114902|Hs108|chr11 (243 aa)
initn: 859 init1: 268 opt: 474 Z-score: 321.0 bits: 66.9 E(32554): 1.5e-11
Smith-Waterman score: 495; 37.2% identity (59.1% similar) in 242 aa overlap (13-250:4-235)
10 20 30 40 50 60
pF1KE1 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG
:..:::::: : . : :: ::.::::: :. : :: :
CCDS84 MRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDG
10 20 30 40 50
70 80 90 100 110 120
pF1KE1 EKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQ
. : : : :: :: : ::.:: : ::.: :: : : : .:. ..
CCDS84 RDGRDGAPGAPGEKGEGGRPGLPGPRGDPGPRGEAGPAG---------PTGPAGECSVPP
60 70 80 90 100
130 140 150 160 170
pF1KE1 KIAFSATRTIN-VPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASS-RG
. :::: :. . :: : . ::.:..: ...:. .:::::.:::.:::. ::. :.
CCDS84 RSAFSAKRSESRVPPPSDAPLPFDRVLVNEQGHYDAVTGKFTCQVPGVYYFAVHATVYRA
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE1 NLCVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQAT--DKNSLLGME
.: .:... : . : .. .:: ...:: ..:..:. : .. .
CCDS84 SLQFDLVKNGESIASFFQFFG-GWPKPASLSGGAMVRLEPEDQVWVQVGVGDYIGIYASI
170 180 190 200 210 220
240 250
pF1KE1 GANSIFSGFLLFPDMEA
..: :::::.. :
CCDS84 KTDSTFSGFLVYSDWHSSPVFA
230 240
>>CCDS3284.1 ADIPOQ gene_id:9370|Hs108|chr3 (244 aa)
initn: 366 init1: 232 opt: 469 Z-score: 317.9 bits: 66.3 E(32554): 2.2e-11
Smith-Waterman score: 480; 38.1% identity (64.6% similar) in 226 aa overlap (30-250:35-242)
10 20 30 40 50
pF1KE1 MMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPA-IPGIPGIPGTPGPDGQPGTPGI
.::: : ::: :: :.:: ::. ::::
CCDS32 GAVLLLLALPGHDQETTTQGPGVLLPLPKGACTGWMAGIPGHPGHNGAPGRDGRDGTPGE
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE1 KGEKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKA
::::: ::: : .:..:: : :: : :.: :: : : :: : :
CCDS32 KGEKGDPGLIGPKGDIGETGVPGAEG------------PRGFPGIQGRKGEPGE-GAYVY
70 80 90 100 110
120 130 140 150 160 170
pF1KE1 TQKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSR-
. .. . ...: .. ::: ... :..:.:. .::: :..::::::.:: .
CCDS32 RSAFSVGLETYVTIP---NMPIRFTKIFYNQQNHYDGSTGKFHCNIPGLYYFAYHITVYM
120 130 140 150 160
180 190 200 210 220 230
pF1KE1 GNLCVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQAT---DKNSLLG
.. :.:.. ...:. . :. .: :. . ..:...:.:: :..:.::. ..:.: .
CCDS32 KDVKVSLFK-KDKAM-LFTYDQYQENNVDQASGSVLLHLEVGDQVWLQVYGEGERNGLYA
170 180 190 200 210 220
240 250
pF1KE1 MEGANSIFSGFLLFPDMEA
. .: :.::::. :
CCDS32 DNDNDSTFTGFLLYHDTN
230 240
>>CCDS31793.1 C1QL4 gene_id:338761|Hs108|chr12 (238 aa)
initn: 317 init1: 317 opt: 467 Z-score: 316.8 bits: 66.1 E(32554): 2.5e-11
Smith-Waterman score: 467; 36.3% identity (59.0% similar) in 251 aa overlap (12-250:2-238)
10 20 30 40 50
pF1KE1 MMMKIPWGSIPVLMLLLLLGLIDISQ---AQLSCTGPPAIPGIPGIPGTPGPDGQPGT-P
::.::. . :. :. :. : . : : ::::: :.. :
CCDS31 MVLLLLVAIPLLVHSSRGPAHYEMLGRCRMVCDPHGPRGPGPDGAPASVP
10 20 30 40 50
60 70 80 90 100 110
pF1KE1 GIKGEKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDY
: : .:: :..: :. : :: ::.:: : : :: :: ::: : .:
CCDS31 --------PFPPGAKGEVGRRGKAGLRGPPGPPGPRGPPGEPGRPGPPGPPGP-GPGGVA
60 70 80 90 100
120 130 140 150 160 170
pF1KE1 KAT---QKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYH
:. .::: : . : . ...::: :.::..: :: :::::: .::.:.:.::
CCDS31 PAAGYVPRIAFYA--GLRRPHEGYEVLRFDDVVTNVGNAYEAASGKFTCPMPGVYFFAYH
110 120 130 140 150
180 190 200 210 220
pF1KE1 ASSRG----NLCVNLMR-GRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQATD
. :: .. ..::. :. ::. .. : :. ......:.:. :..::..
CCDS31 VLMRGGDGTSMWADLMKNGQVRASAIAQDADQNYD---YASNSVILHLDVGDEVFIKLDG
160 170 180 190 200 210
230 240 250
pF1KE1 KNSLLGMEGANSIFSGFLLFPDMEA
. : . : ::::...::
CCDS31 GKVHGGNTNKYSTFSGFIIYPD
220 230
253 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 23:08:46 2016 done: Sun Nov 6 23:08:46 2016
Total Scan time: 2.320 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]