FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6604, 243 aa 1>>>pF1KE6604 243 - 243 aa - 243 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6230+/-0.000793; mu= 13.8779+/- 0.048 mean_var=92.8118+/-17.834, 0's: 0 Z-trim(110.4): 34 B-trim: 45 in 1/50 Lambda= 0.133129 statistics sampled from 11536 (11566) to 11536 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.735), E-opt: 0.2 (0.355), width: 16 Scan time: 1.520 The best scores are: opt bits E(32554) CCDS4928.1 CRISP2 gene_id:7180|Hs108|chr6 ( 243) 1722 340.3 6.9e-94 CCDS4929.2 CRISP3 gene_id:10321|Hs108|chr6 ( 258) 1284 256.2 1.5e-68 CCDS55019.1 CRISP3 gene_id:10321|Hs108|chr6 ( 268) 1284 256.2 1.6e-68 CCDS4931.1 CRISP1 gene_id:167|Hs108|chr6 ( 249) 759 155.4 3.4e-38 CCDS4932.1 CRISP1 gene_id:167|Hs108|chr6 ( 178) 506 106.6 1.1e-23 CCDS9011.1 GLIPR1 gene_id:11010|Hs108|chr12 ( 266) 415 89.3 2.7e-18 CCDS9009.1 GLIPR1L1 gene_id:256710|Hs108|chr12 ( 233) 395 85.4 3.5e-17 CCDS76578.1 GLIPR1L1 gene_id:256710|Hs108|chr12 ( 242) 389 84.3 8.1e-17 CCDS34440.1 PI16 gene_id:221476|Hs108|chr6 ( 463) 380 82.8 4.4e-16 CCDS58258.1 GLIPR1L2 gene_id:144321|Hs108|chr12 ( 344) 300 67.3 1.5e-11 CCDS9010.1 GLIPR1L2 gene_id:144321|Hs108|chr12 ( 253) 291 65.5 3.9e-11 CCDS32484.1 CLEC18B gene_id:497190|Hs108|chr16 ( 455) 292 65.9 5.3e-11 CCDS10886.1 CLEC18A gene_id:348174|Hs108|chr16 ( 446) 291 65.7 5.9e-11 CCDS32473.1 CLEC18C gene_id:283971|Hs108|chr16 ( 446) 290 65.5 6.8e-11 >>CCDS4928.1 CRISP2 gene_id:7180|Hs108|chr6 (243 aa) initn: 1722 init1: 1722 opt: 1722 Z-score: 1799.0 bits: 340.3 E(32554): 6.9e-94 Smith-Waterman score: 1722; 100.0% identity (100.0% similar) in 243 aa overlap (1-243:1-243) 10 20 30 40 50 60 pF1KE6 MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 MEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDPTSWSSAIQSWYDEILD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 MEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDPTSWSSAIQSWYDEILD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 FVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGNNMNRKN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 FVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGNNMNRKN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 TPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS49 TPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCEN 190 200 210 220 230 240 pF1KE6 KIY ::: CCDS49 KIY >>CCDS4929.2 CRISP3 gene_id:10321|Hs108|chr6 (258 aa) initn: 1267 init1: 1220 opt: 1284 Z-score: 1344.0 bits: 256.2 E(32554): 1.5e-68 Smith-Waterman score: 1284; 71.4% identity (87.8% similar) in 245 aa overlap (1-243:14-258) 10 20 30 40 pF1KE6 MALLPVL-FLVTVLLPSLPA-EGKDPAFTALLTTQLQVQREIVNKHN :.:.::: :::. ::::.:: : :::::::::::: ::::::::::: CCDS49 MKQILHPALETTAMTLFPVLLFLVAGLLPSFPANEDKDPAFTALLTTQTQVQREIVNKHN 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 ELRKAVSPPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDPT :::.:::::: :::::::..:...:::.:::.:. .::.:.:: :: .::::::::: . CCDS49 ELRRAVSPPARNMLKMEWNKEAAANAQKWANQCNYRHSNPKDRMTSLKCGENLYMSSASS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 SWSSAIQSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKYYY :::.:::::.:: :: .:::::.::::::::::.::::.: :::: :::::: ::::: CCDS49 SWSQAIQSWFDEYNDFDFGVGPKTPNAVVGHYTQVVWYSSYLVGCGNAYCPNQKVLKYYY 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 VCQYCPAGNNMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEH ::::::::: :: .::.::.:::.:::.:: :::::.:.:.:: ::: ::: : :.: CCDS49 VCQYCPAGNWANRLYVPYEQGAPCASCPDNCDDGLCTNGCKYEDLYSNCKSLKLTLTCKH 190 200 210 220 230 240 230 240 pF1KE6 ELLKEKCKATCLCENKIY .:....:::.: : :.:: CCDS49 QLVRDSCKASCNCSNSIY 250 >>CCDS55019.1 CRISP3 gene_id:10321|Hs108|chr6 (268 aa) initn: 1267 init1: 1220 opt: 1284 Z-score: 1343.8 bits: 256.2 E(32554): 1.6e-68 Smith-Waterman score: 1284; 71.4% identity (87.8% similar) in 245 aa overlap (1-243:24-268) 10 20 30 pF1KE6 MALLPVL-FLVTVLLPSLPA-EGKDPAFTALLTTQLQ :.:.::: :::. ::::.:: : :::::::::::: : CCDS55 MKQILHPALETTDPCSTGFVFPAMTLFPVLLFLVAGLLPSFPANEDKDPAFTALLTTQTQ 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 VQREIVNKHNELRKAVSPPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCG :::::::::::::.:::::: :::::::..:...:::.:::.:. .::.:.:: :: .:: CCDS55 VQREIVNKHNELRRAVSPPARNMLKMEWNKEAAANAQKWANQCNYRHSNPKDRMTSLKCG 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 ENLYMSSDPTSWSSAIQSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYC ::::::: .:::.:::::.:: :: .:::::.::::::::::.::::.: :::: ::: CCDS55 ENLYMSSASSSWSQAIQSWFDEYNDFDFGVGPKTPNAVVGHYTQVVWYSSYLVGCGNAYC 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE6 PNQDSLKYYYVCQYCPAGNNMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCD ::: :::::::::::::: :: .::.::.:::.:::.:: :::::.:.:.:: ::: CCDS55 PNQKVLKYYYVCQYCPAGNWANRLYVPYEQGAPCASCPDNCDDGLCTNGCKYEDLYSNCK 190 200 210 220 230 240 220 230 240 pF1KE6 SLKNTAGCEHELLKEKCKATCLCENKIY ::: : :.:.:....:::.: : :.:: CCDS55 SLKLTLTCKHQLVRDSCKASCNCSNSIY 250 260 >>CCDS4931.1 CRISP1 gene_id:167|Hs108|chr6 (249 aa) initn: 723 init1: 421 opt: 759 Z-score: 799.3 bits: 155.4 E(32554): 3.4e-38 Smith-Waterman score: 759; 45.2% identity (68.5% similar) in 248 aa overlap (1-242:1-248) 10 20 30 40 50 pF1KE6 MALLPVLFLVTV--LLPSLPAEGKDP--AFTALLTTQLQVQREIVNKHNELRKAVSPPAS : . .::::.. ::: : . :. :. :.: .::.:::: :: ::. : :::: CCDS49 MEIKHLLFLVAAACLLPMLSMKKKSARDQFNKLVTDLPNVQEEIVNIHNALRRRVVPPAS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 NMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKT-STRCGENLYMSSDPTSWSSAIQSWY ::::: ::.:.. ::. ... : . .:.: .:. .: ::::..:.: :.::::.: :: CCDS49 NMLKMSWSEEAAQNARIFSKYCDMTESNPLERRLPNTFCGENMHMTSYPVSWSSVIGVWY 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 DEILDFVYGVGPKSPNAVV-GHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGN .: .: .: . . .. ::::.:: ..: .::.:: : .: : .: :::.:: :: CCDS49 SESTSFKHGEWTTTDDDITTDHYTQIVWATSYLIGCAIASCRQQGSPRYLYVCHYCHEGN 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE6 NMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKA . . :: ::. :.:: .::..:. :::: : : : .:: . ::.: ::: CCDS49 DPETKNEPYKTGVPCEACPSNCEDKLCTNPCIYYDEYFDCDIQVHYLGCNHSTTILFCKA 190 200 210 220 230 240 240 pF1KE6 TCLCENKIY ::::...: CCDS49 TCLCDTEIK >>CCDS4932.1 CRISP1 gene_id:167|Hs108|chr6 (178 aa) initn: 470 init1: 199 opt: 506 Z-score: 538.6 bits: 106.6 E(32554): 1.1e-23 Smith-Waterman score: 506; 45.5% identity (70.5% similar) in 176 aa overlap (1-170:1-176) 10 20 30 40 50 pF1KE6 MALLPVLFLVTV--LLPSLPAEGKDP--AFTALLTTQLQVQREIVNKHNELRKAVSPPAS : . .::::.. ::: : . :. :. :.: .::.:::: :: ::. : :::: CCDS49 MEIKHLLFLVAAACLLPMLSMKKKSARDQFNKLVTDLPNVQEEIVNIHNALRRRVVPPAS 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE6 NMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKT-STRCGENLYMSSDPTSWSSAIQSWY ::::: ::.:.. ::. ... : . .:.: .:. .: ::::..:.: :.::::.: :: CCDS49 NMLKMSWSEEAAQNARIFSKYCDMTESNPLERRLPNTFCGENMHMTSYPVSWSSVIGVWY 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE6 DEILDFVYGVGPKSPNAVV-GHYTQLVWYSTYQVGCGIAYCPNQDSLKYYYVCQYCPAGN .: .: .: . . .. ::::.:: ..: .::.:: : .: : .: :::.:: CCDS49 SESTSFKHGEWTTTDDDITTDHYTQIVWATSYLIGCAIASCRQQGSPRYLYVCHYCHD 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 NMNRKNTPYQQGTPCAGCPDDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKA >>CCDS9011.1 GLIPR1 gene_id:11010|Hs108|chr12 (266 aa) initn: 309 init1: 129 opt: 415 Z-score: 441.8 bits: 89.3 E(32554): 2.7e-18 Smith-Waterman score: 415; 37.5% identity (63.6% similar) in 184 aa overlap (38-208:35-213) 10 20 30 40 50 60 pF1KE6 FLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREV .. : ::..:. :.: ::.:: : :. . CCDS90 LATIAWMVSFVSNYSHTANILPDIENEDFIKDCVRIHNKFRSEVKPTASDMLYMTWDPAL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 TTNAQRWANKCTLQHSD---PEDR--KTSTRCGENLYMSSDPT-SWSSAIQSWYDEILDF . :. ::..: ..:. : . . : :::.. .: : : :::: .::::: : CCDS90 AQIAKAWASNCQFSHNTRLKPPHKLHPNFTSLGENIWTGSVPIFSVSSAITNWYDEIQD- 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 VYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQ---DSLKY--YYVCQYCPAGNNM : . . : :::::.:: ..:.:::.. .::. :.:. ...:.: :.:: CCDS90 -YDFKTRICKKVCGHYTQVVWADSYKVGCAVQFCPKVSGFDALSNGAHFICNYGPGGN-- 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE6 NRKNTPYQQGTPCAGCP--DDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKA . ::..:. :..:: : : .::.: . : CCDS90 -YPTWPYKRGATCSACPNNDKCLDNLCVNRQRDQVKRYYSVVYPGWPIYPRNRYTSLFLI 190 200 210 220 230 240 pF1KE6 TCLCENKIY CCDS90 VNSVILILSVIITILVQHKYPNLVLLD 240 250 260 >>CCDS9009.1 GLIPR1L1 gene_id:256710|Hs108|chr12 (233 aa) initn: 359 init1: 145 opt: 395 Z-score: 421.9 bits: 85.4 E(32554): 3.5e-17 Smith-Waterman score: 395; 37.9% identity (61.5% similar) in 174 aa overlap (41-203:38-205) 20 30 40 50 60 70 pF1KE6 TVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREVTTN .. ::: : :.:::..: : :.. .. CCDS90 SCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKGLAKM 10 20 30 40 50 60 80 90 100 110 120 pF1KE6 AQRWANKCTLQHSDPEDRKTSTRC-------GENLYMSSDPT-SWSSAIQSWYDEILDFV :. :::.: ..:.: :. : .: :::..... . . :: .::.: .: CCDS90 AKAWANQCKFEHNDCLDK--SYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNET-QF- 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 YGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY-YYVCQYCPAGNNMNRKNT : : . : :::::::: ... :::..:.::: . . .::.: :::: : CCDS90 YDFDSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANMP-- 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 PYQQGTPCAGCPDD--CDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCE :: .: :. : . : :.:: : CCDS90 PYVRGESCSLCSKEEKCVKNLCKNPFLKPTGRAPQQTAFNPFSLGFLLLRIF 190 200 210 220 230 >>CCDS76578.1 GLIPR1L1 gene_id:256710|Hs108|chr12 (242 aa) initn: 353 init1: 145 opt: 389 Z-score: 415.4 bits: 84.3 E(32554): 8.1e-17 Smith-Waterman score: 389; 37.8% identity (61.6% similar) in 172 aa overlap (41-201:38-203) 20 30 40 50 60 70 pF1KE6 TVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREVTTN .. ::: : :.:::..: : :.. .. CCDS76 SCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKGLAKM 10 20 30 40 50 60 80 90 100 110 120 pF1KE6 AQRWANKCTLQHSDPEDRKTSTRC-------GENLYMSSDPT-SWSSAIQSWYDEILDFV :. :::.: ..:.: :. : .: :::..... . . :: .::.: .: CCDS76 AKAWANQCKFEHNDCLDK--SYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNET-QF- 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 YGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY-YYVCQYCPAGNNMNRKNT : : . : :::::::: ... :::..:.::: . . .::.: :::: : CCDS76 YDFDSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANM--P 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 PYQQGTPCAGCPDD--CDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCE :: .: :. : . : :.:: CCDS76 PYVRGESCSLCSKEEKCVKNLCRTPQLIIPNQNPFLKPTGRAPQQTAFNPFSLGFLLLRI 190 200 210 220 230 240 >>CCDS34440.1 PI16 gene_id:221476|Hs108|chr6 (463 aa) initn: 301 init1: 136 opt: 380 Z-score: 402.3 bits: 82.8 E(32554): 4.4e-16 Smith-Waterman score: 382; 33.2% identity (60.6% similar) in 208 aa overlap (1-201:9-197) 10 20 30 40 50 pF1KE6 MALLPVLFLVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVS : :::.:.:... . : :.: . .: .:. :: : :: CCDS34 MHGSCSFLMLLLPLLLLLVA------TTGPVGALTD------EEKRLMVELHNLYRAQVS 10 20 30 40 60 70 80 90 100 110 pF1KE6 PPASNMLKMEWSREVTTNAQRWANKCTLQHSDPEDRKTSTRCGENLYMSSDP-TSWSSAI : ::.::.:.:..:... :. .: .:. :. . :. ::::. .: . :. CCDS34 PTASDMLHMRWDEELAAFAKAYARQCVWGHNKERGRR-----GENLFAITDEGMDVPLAM 50 60 70 80 90 100 120 130 140 150 160 pF1KE6 QSWYDEILDFVYGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY----YYVC . :. : . ... ::. . :::::.:: .: ..::: .: . .... :: CCDS34 EEWHHEREHYNLSAATCSPGQMCGHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVC 110 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 QYCPAGNNMNRKNTPYQQGTPCAGCPDD--CDKGLCTNSCQYQDLLSNCDSLKNTAGCEH .: : :: ... :::.::::. ::. : ..:: CCDS34 NYEPPGNVKGKR--PYQEGTPCSQCPSGYHCKNSLCEPIGSPEDAQDLPYLVTEAPSFRA 170 180 190 200 210 220 230 240 pF1KE6 ELLKEKCKATCLCENKIY CCDS34 TEASDSRKMGTPSSLATGIPAFLVTEVSGSLATKALPAVETQAPTSLATKDPPSMATEAP 230 240 250 260 270 280 >>CCDS58258.1 GLIPR1L2 gene_id:144321|Hs108|chr12 (344 aa) initn: 218 init1: 129 opt: 300 Z-score: 321.0 bits: 67.3 E(32554): 1.5e-11 Smith-Waterman score: 300; 31.7% identity (58.3% similar) in 180 aa overlap (39-208:56-230) 10 20 30 40 50 60 pF1KE6 LVTVLLPSLPAEGKDPAFTALLTTQLQVQREIVNKHNELRKAVSPPASNMLKMEWSREVT : :: ::::: : : .::. : :. .. CCDS58 LRLCELWLLLLGSSLNARFLPDEEDVDFINEYVNLHNELRGDVIPRGSNLRFMTWDVALS 30 40 50 60 70 80 70 80 90 100 110 120 pF1KE6 TNAQRWANKCTLQHS----DPEDRKTSTR-CGENLYMSSDPT-SWSSAIQSWYDEILDFV .:. :..:: . :. : . . . :::.... . . : ::.::. : . CCDS58 RTARAWGKKCLFTHNIYLQDVQMVHPKFYGIGENMWVGPENEFTASIAIRSWHAEKKMYN 90 100 110 120 130 140 130 140 150 160 170 180 pF1KE6 YGVGPKSPNAVVGHYTQLVWYSTYQVGCGIAYCPNQDSLKY--YYVCQYCPAGNNMNRKN . : : . ..: :::: .:.:::... : . . . ..:.: : :....:. CCDS58 FENGSCSGD--CSNYIQLVWDHSYKVGCAVTPCSKIGHIIHAAIFICNYAP-GGTLTRR- 150 160 170 180 190 200 190 200 210 220 230 pF1KE6 TPYQQGTPCAGCP--DDCDKGLCTNSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLC ::. : :. : : : ::.:. . : CCDS58 -PYEPGIFCTRCGRRDKCTDFLCSNADRDQATYYRFWYPKWEMPRPVVCDPLCTFILLLR 210 220 230 240 250 260 243 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:41:46 2016 done: Tue Nov 8 14:41:46 2016 Total Scan time: 1.520 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]