FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6604, 243 aa
1>>>pF1KE6604 243 - 243 aa - 243 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6230+/-0.000793; mu= 13.8779+/- 0.048
mean_var=92.8118+/-17.834, 0's: 0 Z-trim(110.4): 34 B-trim: 45 in 1/50
Lambda= 0.133129
statistics sampled from 11536 (11566) to 11536 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.735), E-opt: 0.2 (0.355), width: 16
Scan time: 1.520
The best scores are: opt bits E(32554)
CCDS4928.1 CRISP2 gene_id:7180|Hs108|chr6 ( 243) 1722 340.3 6.9e-94
CCDS4929.2 CRISP3 gene_id:10321|Hs108|chr6 ( 258) 1284 256.2 1.5e-68
CCDS55019.1 CRISP3 gene_id:10321|Hs108|chr6 ( 268) 1284 256.2 1.6e-68
CCDS4931.1 CRISP1 gene_id:167|Hs108|chr6 ( 249) 759 155.4 3.4e-38
CCDS4932.1 CRISP1 gene_id:167|Hs108|chr6 ( 178) 506 106.6 1.1e-23
CCDS9011.1 GLIPR1 gene_id:11010|Hs108|chr12 ( 266) 415 89.3 2.7e-18
CCDS9009.1 GLIPR1L1 gene_id:256710|Hs108|chr12 ( 233) 395 85.4 3.5e-17
CCDS76578.1 GLIPR1L1 gene_id:256710|Hs108|chr12 ( 242) 389 84.3 8.1e-17
CCDS34440.1 PI16 gene_id:221476|Hs108|chr6 ( 463) 380 82.8 4.4e-16
CCDS58258.1 GLIPR1L2 gene_id:144321|Hs108|chr12 ( 344) 300 67.3 1.5e-11
CCDS9010.1 GLIPR1L2 gene_id:144321|Hs108|chr12 ( 253) 291 65.5 3.9e-11
CCDS32484.1 CLEC18B gene_id:497190|Hs108|chr16 ( 455) 292 65.9 5.3e-11
CCDS10886.1 CLEC18A gene_id:348174|Hs108|chr16 ( 446) 291 65.7 5.9e-11
CCDS32473.1 CLEC18C gene_id:283971|Hs108|chr16 ( 446) 290 65.5 6.8e-11
>>CCDS4928.1 CRISP2 gene_id:7180|Hs108|chr6 (243 aa)
initn: 1722 init1: 1722 opt: 1722 Z-score: 1799.0 bits: 340.3 E(32554): 6.9e-94
Smith-Waterman score: 1722; 100.0% identity (100.0% similar) in 243 aa overlap (1-243:1-243)
10 20 30 40 50 60
pF1KE6 MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 MEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDPTSWSSAIQSWYDEILD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDPTSWSSAIQSWYDEILD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 FVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGNNMNRKN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 FVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGNNMNRKN
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE6 TPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 TPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCEN
190 200 210 220 230 240
pF1KE6 KIY
:::
CCDS49 KIY
>>CCDS4929.2 CRISP3 gene_id:10321|Hs108|chr6 (258 aa)
initn: 1267 init1: 1220 opt: 1284 Z-score: 1344.0 bits: 256.2 E(32554): 1.5e-68
Smith-Waterman score: 1284; 71.4% identity (87.8% similar) in 245 aa overlap (1-243:14-258)
10 20 30 40
pF1KE6 MALLPVL-FLVTVLLPSLPA-EGKDPAFTALLTTQLQVQREIVNKHN
:.:.::: :::. ::::.:: : :::::::::::: :::::::::::
CCDS49 MKQILHPALETTAMTLFPVLLFLVAGLLPSFPANEDKDPAFTALLTTQTQVQREIVNKHN
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE6 ELRKAVSPPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDPT
:::.:::::: :::::::..:...:::.:::.:. .::.:.:: :: .::::::::: .
CCDS49 ELRRAVSPPARNMLKMEWNKEAAANAQKWANQCNYRHSNPKDRMTSLKCGENLYMSSASS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE6 SWSSAIQSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKYYY
:::.:::::.:: :: .:::::.::::::::::.::::.: :::: :::::: :::::
CCDS49 SWSQAIQSWFDEYNDFDFGVGPKTPNAVVGHYTQVVWYSSYLVGCGNAYCPNQKVLKYYY
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE6 VCQYCPAGNNMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEH
::::::::: :: .::.::.:::.:::.:: :::::.:.:.:: ::: ::: : :.:
CCDS49 VCQYCPAGNWANRLYVPYEQGAPCASCPDNCDDGLCTNGCKYEDLYSNCKSLKLTLTCKH
190 200 210 220 230 240
230 240
pF1KE6 ELLKEKCKATCLCENKIY
.:....:::.: : :.::
CCDS49 QLVRDSCKASCNCSNSIY
250
>>CCDS55019.1 CRISP3 gene_id:10321|Hs108|chr6 (268 aa)
initn: 1267 init1: 1220 opt: 1284 Z-score: 1343.8 bits: 256.2 E(32554): 1.6e-68
Smith-Waterman score: 1284; 71.4% identity (87.8% similar) in 245 aa overlap (1-243:24-268)
10 20 30
pF1KE6 MALLPVL-FLVTVLLPSLPA-EGKDPAFTALLTTQLQ
:.:.::: :::. ::::.:: : :::::::::::: :
CCDS55 MKQILHPALETTDPCSTGFVFPAMTLFPVLLFLVAGLLPSFPANEDKDPAFTALLTTQTQ
10 20 30 40 50 60
40 50 60 70 80 90
pF1KE6 VQREIVNKHNELRKAVSPPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCG
:::::::::::::.:::::: :::::::..:...:::.:::.:. .::.:.:: :: .::
CCDS55 VQREIVNKHNELRRAVSPPARNMLKMEWNKEAAANAQKWANQCNYRHSNPKDRMTSLKCG
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE6 ENLYMSSDPTSWSSAIQSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYC
::::::: .:::.:::::.:: :: .:::::.::::::::::.::::.: :::: :::
CCDS55 ENLYMSSASSSWSQAIQSWFDEYNDFDFGVGPKTPNAVVGHYTQVVWYSSYLVGCGNAYC
130 140 150 160 170 180
160 170 180 190 200 210
pF1KE6 PNQDSLKYYYVCQYCPAGNNMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCD
::: :::::::::::::: :: .::.::.:::.:::.:: :::::.:.:.:: :::
CCDS55 PNQKVLKYYYVCQYCPAGNWANRLYVPYEQGAPCASCPDNCDDGLCTNGCKYEDLYSNCK
190 200 210 220 230 240
220 230 240
pF1KE6 SLKNTAGCEHELLKEKCKATCLCENKIY
::: : :.:.:....:::.: : :.::
CCDS55 SLKLTLTCKHQLVRDSCKASCNCSNSIY
250 260
>>CCDS4931.1 CRISP1 gene_id:167|Hs108|chr6 (249 aa)
initn: 723 init1: 421 opt: 759 Z-score: 799.3 bits: 155.4 E(32554): 3.4e-38
Smith-Waterman score: 759; 45.2% identity (68.5% similar) in 248 aa overlap (1-242:1-248)
10 20 30 40 50
pF1KE6 MALLPVLFLVTV--LLPSLPAEGKDP--AFTALLTTQLQVQREIVNKHNELRKAVSPPAS
: . .::::.. ::: : . :. :. :.: .::.:::: :: ::. : ::::
CCDS49 MEIKHLLFLVAAACLLPMLSMKKKSARDQFNKLVTDLPNVQEEIVNIHNALRRRVVPPAS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE6 NMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKT-STRCGENLYMSSDPTSWSSAIQSWY
::::: ::.:.. ::. ... : . .:.: .:. .: ::::..:.: :.::::.: ::
CCDS49 NMLKMSWSEEAAQNARIFSKYCDMTESNPLERRLPNTFCGENMHMTSYPVSWSSVIGVWY
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE6 DEILDFVYGVGPKSPNAVV-GHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGN
.: .: .: . . .. ::::.:: ..: .::.:: : .: : .: :::.:: ::
CCDS49 SESTSFKHGEWTTTDDDITTDHYTQIVWATSYLIGCAIASCRQQGSPRYLYVCHYCHEGN
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE6 NMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKA
. . :: ::. :.:: .::..:. :::: : : : .:: . ::.: :::
CCDS49 DPETKNEPYKTGVPCEACPSNCEDKLCTNPCIYYDEYFDCDIQVHYLGCNHSTTILFCKA
190 200 210 220 230 240
240
pF1KE6 TCLCENKIY
::::...:
CCDS49 TCLCDTEIK
>>CCDS4932.1 CRISP1 gene_id:167|Hs108|chr6 (178 aa)
initn: 470 init1: 199 opt: 506 Z-score: 538.6 bits: 106.6 E(32554): 1.1e-23
Smith-Waterman score: 506; 45.5% identity (70.5% similar) in 176 aa overlap (1-170:1-176)
10 20 30 40 50
pF1KE6 MALLPVLFLVTV--LLPSLPAEGKDP--AFTALLTTQLQVQREIVNKHNELRKAVSPPAS
: . .::::.. ::: : . :. :. :.: .::.:::: :: ::. : ::::
CCDS49 MEIKHLLFLVAAACLLPMLSMKKKSARDQFNKLVTDLPNVQEEIVNIHNALRRRVVPPAS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE6 NMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKT-STRCGENLYMSSDPTSWSSAIQSWY
::::: ::.:.. ::. ... : . .:.: .:. .: ::::..:.: :.::::.: ::
CCDS49 NMLKMSWSEEAAQNARIFSKYCDMTESNPLERRLPNTFCGENMHMTSYPVSWSSVIGVWY
70 80 90 100 110 120
120 130 140 150 160 170
pF1KE6 DEILDFVYGVGPKSPNAVV-GHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGN
.: .: .: . . .. ::::.:: ..: .::.:: : .: : .: :::.::
CCDS49 SESTSFKHGEWTTTDDDITTDHYTQIVWATSYLIGCAIASCRQQGSPRYLYVCHYCHD
130 140 150 160 170
180 190 200 210 220 230
pF1KE6 NMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKA
>>CCDS9011.1 GLIPR1 gene_id:11010|Hs108|chr12 (266 aa)
initn: 309 init1: 129 opt: 415 Z-score: 441.8 bits: 89.3 E(32554): 2.7e-18
Smith-Waterman score: 415; 37.5% identity (63.6% similar) in 184 aa overlap (38-208:35-213)
10 20 30 40 50 60
pF1KE6 FLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREV
.. : ::..:. :.: ::.:: : :. .
CCDS90 LATIAWMVSFVSNYSHTANILPDIENEDFIKDCVRIHNKFRSEVKPTASDMLYMTWDPAL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE6 TTNAQRWANKCTLQHSD---PEDR--KTSTRCGENLYMSSDPT-SWSSAIQSWYDEILDF
. :. ::..: ..:. : . . : :::.. .: : : :::: .::::: :
CCDS90 AQIAKAWASNCQFSHNTRLKPPHKLHPNFTSLGENIWTGSVPIFSVSSAITNWYDEIQD-
70 80 90 100 110 120
130 140 150 160 170
pF1KE6 VYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQ---DSLKY--YYVCQYCPAGNNM
: . . : :::::.:: ..:.:::.. .::. :.:. ...:.: :.::
CCDS90 -YDFKTRICKKVCGHYTQVVWADSYKVGCAVQFCPKVSGFDALSNGAHFICNYGPGGN--
130 140 150 160 170 180
180 190 200 210 220 230
pF1KE6 NRKNTPYQQGTPCAGCP--DDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKA
. ::..:. :..:: : : .::.: . :
CCDS90 -YPTWPYKRGATCSACPNNDKCLDNLCVNRQRDQVKRYYSVVYPGWPIYPRNRYTSLFLI
190 200 210 220 230
240
pF1KE6 TCLCENKIY
CCDS90 VNSVILILSVIITILVQHKYPNLVLLD
240 250 260
>>CCDS9009.1 GLIPR1L1 gene_id:256710|Hs108|chr12 (233 aa)
initn: 359 init1: 145 opt: 395 Z-score: 421.9 bits: 85.4 E(32554): 3.5e-17
Smith-Waterman score: 395; 37.9% identity (61.5% similar) in 174 aa overlap (41-203:38-205)
20 30 40 50 60 70
pF1KE6 TVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREVTTN
.. ::: : :.:::..: : :.. ..
CCDS90 SCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKGLAKM
10 20 30 40 50 60
80 90 100 110 120
pF1KE6 AQRWANKCTLQHSDPEDRKTSTRC-------GENLYMSSDPT-SWSSAIQSWYDEILDFV
:. :::.: ..:.: :. : .: :::..... . . :: .::.: .:
CCDS90 AKAWANQCKFEHNDCLDK--SYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNET-QF-
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 YGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY-YYVCQYCPAGNNMNRKNT
: : . : :::::::: ... :::..:.::: . . .::.: :::: :
CCDS90 YDFDSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANMP--
130 140 150 160 170 180
190 200 210 220 230
pF1KE6 PYQQGTPCAGCPDD--CDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCE
:: .: :. : . : :.:: :
CCDS90 PYVRGESCSLCSKEEKCVKNLCKNPFLKPTGRAPQQTAFNPFSLGFLLLRIF
190 200 210 220 230
>>CCDS76578.1 GLIPR1L1 gene_id:256710|Hs108|chr12 (242 aa)
initn: 353 init1: 145 opt: 389 Z-score: 415.4 bits: 84.3 E(32554): 8.1e-17
Smith-Waterman score: 389; 37.8% identity (61.6% similar) in 172 aa overlap (41-201:38-203)
20 30 40 50 60 70
pF1KE6 TVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREVTTN
.. ::: : :.:::..: : :.. ..
CCDS76 SCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKGLAKM
10 20 30 40 50 60
80 90 100 110 120
pF1KE6 AQRWANKCTLQHSDPEDRKTSTRC-------GENLYMSSDPT-SWSSAIQSWYDEILDFV
:. :::.: ..:.: :. : .: :::..... . . :: .::.: .:
CCDS76 AKAWANQCKFEHNDCLDK--SYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNET-QF-
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE6 YGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY-YYVCQYCPAGNNMNRKNT
: : . : :::::::: ... :::..:.::: . . .::.: :::: :
CCDS76 YDFDSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANM--P
130 140 150 160 170 180
190 200 210 220 230
pF1KE6 PYQQGTPCAGCPDD--CDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCE
:: .: :. : . : :.::
CCDS76 PYVRGESCSLCSKEEKCVKNLCRTPQLIIPNQNPFLKPTGRAPQQTAFNPFSLGFLLLRI
190 200 210 220 230 240
>>CCDS34440.1 PI16 gene_id:221476|Hs108|chr6 (463 aa)
initn: 301 init1: 136 opt: 380 Z-score: 402.3 bits: 82.8 E(32554): 4.4e-16
Smith-Waterman score: 382; 33.2% identity (60.6% similar) in 208 aa overlap (1-201:9-197)
10 20 30 40 50
pF1KE6 MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVS
: :::.:.:... . : :.: . .: .:. :: : ::
CCDS34 MHGSCSFLMLLLPLLLLLVA------TTGPVGALTD------EEKRLMVELHNLYRAQVS
10 20 30 40
60 70 80 90 100 110
pF1KE6 PPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDP-TSWSSAI
: ::.::.:.:..:... :. .: .:. :. . :. ::::. .: . :.
CCDS34 PTASDMLHMRWDEELAAFAKAYARQCVWGHNKERGRR-----GENLFAITDEGMDVPLAM
50 60 70 80 90 100
120 130 140 150 160
pF1KE6 QSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY----YYVC
. :. : . ... ::. . :::::.:: .: ..::: .: . .... ::
CCDS34 EEWHHEREHYNLSAATCSPGQMCGHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVC
110 120 130 140 150 160
170 180 190 200 210 220
pF1KE6 QYCPAGNNMNRKNTPYQQGTPCAGCPDD--CDKGLCTNSCQYQDLLSNCDSLKNTAGCEH
.: : :: ... :::.::::. ::. : ..::
CCDS34 NYEPPGNVKGKR--PYQEGTPCSQCPSGYHCKNSLCEPIGSPEDAQDLPYLVTEAPSFRA
170 180 190 200 210 220
230 240
pF1KE6 ELLKEKCKATCLCENKIY
CCDS34 TEASDSRKMGTPSSLATGIPAFLVTEVSGSLATKALPAVETQAPTSLATKDPPSMATEAP
230 240 250 260 270 280
>>CCDS58258.1 GLIPR1L2 gene_id:144321|Hs108|chr12 (344 aa)
initn: 218 init1: 129 opt: 300 Z-score: 321.0 bits: 67.3 E(32554): 1.5e-11
Smith-Waterman score: 300; 31.7% identity (58.3% similar) in 180 aa overlap (39-208:56-230)
10 20 30 40 50 60
pF1KE6 LVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREVT
: :: ::::: : : .::. : :. ..
CCDS58 LRLCELWLLLLGSSLNARFLPDEEDVDFINEYVNLHNELRGDVIPRGSNLRFMTWDVALS
30 40 50 60 70 80
70 80 90 100 110 120
pF1KE6 TNAQRWANKCTLQHS----DPEDRKTSTR-CGENLYMSSDPT-SWSSAIQSWYDEILDFV
.:. :..:: . :. : . . . :::.... . . : ::.::. : .
CCDS58 RTARAWGKKCLFTHNIYLQDVQMVHPKFYGIGENMWVGPENEFTASIAIRSWHAEKKMYN
90 100 110 120 130 140
130 140 150 160 170 180
pF1KE6 YGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY--YYVCQYCPAGNNMNRKN
. : : . ..: :::: .:.:::... : . . . ..:.: : :....:.
CCDS58 FENGSCSGD--CSNYIQLVWDHSYKVGCAVTPCSKIGHIIHAAIFICNYAP-GGTLTRR-
150 160 170 180 190 200
190 200 210 220 230
pF1KE6 TPYQQGTPCAGCP--DDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLC
::. : :. : : : ::.:. . :
CCDS58 -PYEPGIFCTRCGRRDKCTDFLCSNADRDQATYYRFWYPKWEMPRPVVCDPLCTFILLLR
210 220 230 240 250 260
243 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 14:41:46 2016 done: Tue Nov 8 14:41:46 2016
Total Scan time: 1.520 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]