FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1741, 244 aa 1>>>pF1KE1741 244 - 244 aa - 244 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2574+/-0.000613; mu= 15.0460+/- 0.037 mean_var=64.0877+/-12.844, 0's: 0 Z-trim(111.1): 36 B-trim: 0 in 0/54 Lambda= 0.160209 statistics sampled from 12069 (12105) to 12069 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.754), E-opt: 0.2 (0.372), width: 16 Scan time: 2.130 The best scores are: opt bits E(32554) CCDS5225.1 ULBP3 gene_id:79465|Hs108|chr6 ( 244) 1700 400.9 4.1e-112 CCDS5223.1 ULBP1 gene_id:80329|Hs108|chr6 ( 244) 900 216.0 1.9e-56 CCDS43514.1 RAET1G gene_id:353091|Hs108|chr6 ( 334) 848 204.0 1e-52 CCDS5224.1 RAET1L gene_id:154064|Hs108|chr6 ( 246) 823 198.2 4.3e-51 CCDS5222.1 ULBP2 gene_id:80328|Hs108|chr6 ( 246) 816 196.6 1.3e-50 CCDS5221.1 RAET1E gene_id:135250|Hs108|chr6 ( 263) 528 130.0 1.5e-30 CCDS59042.1 RAET1E gene_id:135250|Hs108|chr6 ( 209) 511 126.0 1.9e-29 CCDS59043.1 RAET1E gene_id:135250|Hs108|chr6 ( 212) 511 126.0 1.9e-29 CCDS59044.1 RAET1E gene_id:135250|Hs108|chr6 ( 227) 416 104.1 8.3e-23 CCDS1342.1 MR1 gene_id:3140|Hs108|chr1 ( 341) 282 73.2 2.5e-13 CCDS53440.1 MR1 gene_id:3140|Hs108|chr1 ( 249) 267 69.7 2.1e-12 CCDS53441.1 MR1 gene_id:3140|Hs108|chr1 ( 214) 263 68.7 3.5e-12 CCDS34373.1 A gene_id:3105|Hs108|chr6 ( 365) 251 66.1 3.7e-11 >>CCDS5225.1 ULBP3 gene_id:79465|Hs108|chr6 (244 aa) initn: 1700 init1: 1700 opt: 1700 Z-score: 2126.4 bits: 400.9 E(32554): 4.1e-112 Smith-Waterman score: 1700; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244) 10 20 30 40 50 60 pF1KE1 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS52 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS52 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS52 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLIILCFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS52 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLIILCFI 190 200 210 220 230 240 pF1KE1 LPGI :::: CCDS52 LPGI >>CCDS5223.1 ULBP1 gene_id:80329|Hs108|chr6 (244 aa) initn: 921 init1: 828 opt: 900 Z-score: 1127.0 bits: 216.0 E(32554): 1.9e-56 Smith-Waterman score: 900; 54.9% identity (76.6% similar) in 244 aa overlap (1-243:1-243) 10 20 30 40 50 60 pF1KE1 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK ::::::::.: : .: .:: :: .: .:.: : :.: : : ::::::. ::.. CCDS52 MAAAASPAFLLCLPLL-HLLSGWSRAGWVDTHCLCYDFIITPKSRPEPQWCEVQGLVDER 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL :: ::: . :. ... : ... .: .: .: : ::.: . :. .: : ..:.. : :: CCDS52 PFLHYDCVNHKAKAFASLGKKVNVTKTWEEQTETLRDVVDFLKGQLLDIQVENLIPIEPL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT :::.::::: :: :. :::::: :.:.:::::::::::::..: ::..: :::::. .: CCDS52 TLQARMSCEHEAHGHGRGSWQFLFNGQKFLLFDSNNRKWTALHPGAKKMTEKWEKNRDVT 120 130 140 150 160 170 190 200 210 220 230 pF1KE1 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLII-LCF ::. .:. ::: ::..:::. .. :.:: ::..::: .::::.::::::::.::: ::: CCDS52 MFFQKISLGDCKMWLEEFLMYWEQMLDPTKPPSLAPGTTQPKAMATTLSPWSLLIIFLCF 180 190 200 210 220 230 240 pF1KE1 ILPGI :: : CCDS52 ILAGR 240 >>CCDS43514.1 RAET1G gene_id:353091|Hs108|chr6 (334 aa) initn: 708 init1: 708 opt: 848 Z-score: 1060.0 bits: 204.0 E(32554): 1e-52 Smith-Waterman score: 848; 54.7% identity (74.5% similar) in 243 aa overlap (1-241:1-241) 10 20 30 40 50 60 pF1KE1 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK ::::::::.: :: .: :: .: :: :: ::: :..:.: : : .:: ::.:::.: CCDS43 MAAAASPAFLLRLPLL-LLLSSWCRTGLADPHSLCYDITVIPKFRPGPRWCAVQGQVDEK 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL .:: ::::: : .. : ..: .: :: : .:::: . : .: : .::.. :. :: CCDS43 TFLHYDCGSKTVTPVSPLGKKLNVTTAWKAQNPVLREVVDILTEQLLDIQLENYIPKEPL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT :::.::::: .:.:. ::::.::::. ::::::.:: ::.:: :::.::::::.:. .: CCDS43 TLQARMSCEQKAEGHGSGSWQLSFDGQIFLLFDSENRMWTTVHPGARKMKEKWENDKDMT 120 130 140 150 160 170 190 200 210 220 230 pF1KE1 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTA--PPTMAPGLAQPKAIATTLSPWSFLIILC :...:: :: .::.:::: . :::.: ::::. : :::.: :::: :.:.: CCDS43 MSFHYISMGDCTGWLEDFLMGMDSTLEPSAGAPPTMSSGTAQPRATATTLI-LCCLLIMC 180 190 200 210 220 230 240 pF1KE1 FILPGI ... CCDS43 LLICSRHSLTQSHGHHPQSLQPPPHPPLLHPTWLLRRVLWSDSYQIAKRPLSGGHVTRVT 240 250 260 270 280 290 >>CCDS5224.1 RAET1L gene_id:154064|Hs108|chr6 (246 aa) initn: 753 init1: 707 opt: 823 Z-score: 1030.8 bits: 198.2 E(32554): 4.3e-51 Smith-Waterman score: 823; 53.8% identity (74.1% similar) in 247 aa overlap (1-244:1-246) 10 20 30 40 50 60 pF1KE1 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK ::::: ::.: : .: .::: :: . : : ::: :..:.: : : .:: ::.:::.: CCDS52 MAAAAIPALLLCLPLL-FLLFGWSRARRDDPHSLCYDITVIPKFRPGPRWCAVQGQVDEK 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL .:: ::::. : .. : ..: .: :: : .:::: . : .: : .::..::. :: CCDS52 TFLHYDCGNKTVTPVSPLGKKLNVTMAWKAQNPVLREVVDILTEQLLDIQLENYTPKEPL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT :::.::::: .:.:. ::::::.::. ::::::..: ::.:: :::.::::::.:. .. CCDS52 TLQARMSCEQKAEGHSSGSWQFSIDGQTFLLFDSEKRMWTTVHPGARKMKEKWENDKDVA 120 130 140 150 160 170 190 200 210 220 230 pF1KE1 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTA--PPTMAPGLAQPKAIATTLSPWSFLIIL- :...:: :: .::.:::: . :::.: : .:. : .: .: :::: .:::: CCDS52 MSFHYISMGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILP 180 190 200 210 220 230 240 pF1KE1 CFILPGI ::::::: CCDS52 CFILPGI 240 >>CCDS5222.1 ULBP2 gene_id:80328|Hs108|chr6 (246 aa) initn: 754 init1: 708 opt: 816 Z-score: 1022.1 bits: 196.6 E(32554): 1.3e-50 Smith-Waterman score: 816; 54.3% identity (73.3% similar) in 247 aa overlap (1-244:1-246) 10 20 30 40 50 60 pF1KE1 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK :::::. :: : .: :: :: .:::: ::: :..:.: : : .:: ::.:::.: CCDS52 MAAAAATKILLCLPLL-LLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEK 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL .:: ::::. : .. : ..: .: :: : .:::: . : .: : .::..::. :: CCDS52 TFLHYDCGNKTVTPVSPLGKKLNVTTAWKAQNPVLREVVDILTEQLRDIQLENYTPKEPL 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT :::.::::: .:.:. :::::::::. ::::::..: ::.:: :::.::::::.:. .. CCDS52 TLQARMSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDKVVA 120 130 140 150 160 170 190 200 210 220 230 pF1KE1 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTA--PPTMAPGLAQPKAIATTLSPWSFLIIL- :.. :: :: .::.:::: . :::.: : .:. : .: .: :::: .:::: CCDS52 MSFHYFSMGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILP 180 190 200 210 220 230 240 pF1KE1 CFILPGI ::::::: CCDS52 CFILPGI 240 >>CCDS5221.1 RAET1E gene_id:135250|Hs108|chr6 (263 aa) initn: 524 init1: 318 opt: 528 Z-score: 661.9 bits: 130.0 E(32554): 1.5e-30 Smith-Waterman score: 528; 41.0% identity (64.8% similar) in 210 aa overlap (32-241:31-235) 10 20 30 40 50 60 pF1KE1 AAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQKN ::: .:::: : : :: :::.: .... CCDS52 MRRISLTSSPVRLLLFLLLLLIALEIMVGGHSLCFNFTIKSLSRPGQPWCEAQVFLNKNL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 FLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPLT ::.:. .. : .: : ...:::..::. . : :::. ::. : : . .. : : : CCDS52 FLQYNSDNNMVKPLGLLGKKVYATSTWGELTQTLGEVGRDLRMLLCDIK-PQIKTSDPST 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 LQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLTT :::.: :. ::. .::::. .:.: ::::. : :::.. : ..:: :.:: :: CCDS52 LQVEMFCQREAERCTGASWQFATNGEKSLLFDAMNMTWTVINHEASKIKETWKKDRGLEK 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 FFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLIILCFIL .:. .: :: :::.:: : . :::. :. : . ... : ..:. ::: CCDS52 YFRKLSKGDCDHWLREFLGHWEAMPEPTVSPVNASDIHW----SSSSLPDRWIILGAFIL 180 190 200 210 220 230 pF1KE1 PGI CCDS52 LVLMGIVLICVWWQNGEWQAGLWPLRTS 240 250 260 >>CCDS59042.1 RAET1E gene_id:135250|Hs108|chr6 (209 aa) initn: 512 init1: 306 opt: 511 Z-score: 642.1 bits: 126.0 E(32554): 1.9e-29 Smith-Waterman score: 511; 44.4% identity (67.4% similar) in 178 aa overlap (32-209:31-207) 10 20 30 40 50 60 pF1KE1 AAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQKN ::: .:::: : : :: :::.: .... CCDS59 MRRISLTSSPVRLLLFLLLLLIALEIMVGGHSLCFNFTIKSLSRPGQPWCEAQVFLNKNL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 FLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPLT ::.:. .. : .: : ...:::..::. . : :::. ::. : : . .. : : : CCDS59 FLQYNSDNNMVKPLGLLGKKVYATSTWGELTQTLGEVGRDLRMLLCDIK-PQIKTSDPST 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 LQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLTT :::.: :. ::. .::::. .:.: ::::. : :::.. : ..:: :.:: :: CCDS59 LQVEMFCQREAERCTGASWQFATNGEKSLLFDAMNMTWTVINHEASKIKETWKKDRGLEK 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 FFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLIILCFIL .:. .: :: :::.:: : . ::: CCDS59 YFRKLSKGDCDHWLREFLGHWEAMPEPTGN 180 190 200 >>CCDS59043.1 RAET1E gene_id:135250|Hs108|chr6 (212 aa) initn: 512 init1: 306 opt: 511 Z-score: 642.1 bits: 126.0 E(32554): 1.9e-29 Smith-Waterman score: 511; 44.4% identity (67.4% similar) in 178 aa overlap (32-209:31-207) 10 20 30 40 50 60 pF1KE1 AAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQKN ::: .:::: : : :: :::.: .... CCDS59 MRRISLTSSPVRLLLFLLLLLIALEIMVGGHSLCFNFTIKSLSRPGQPWCEAQVFLNKNL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 FLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPLT ::.:. .. : .: : ...:::..::. . : :::. ::. : : . .. : : : CCDS59 FLQYNSDNNMVKPLGLLGKKVYATSTWGELTQTLGEVGRDLRMLLCDIK-PQIKTSDPST 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 LQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLTT :::.: :. ::. .::::. .:.: ::::. : :::.. : ..:: :.:: :: CCDS59 LQVEMFCQREAERCTGASWQFATNGEKSLLFDAMNMTWTVINHEASKIKETWKKDRGLEK 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 FFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLIILCFIL .:. .: :: :::.:: : . ::: CCDS59 YFRKLSKGDCDHWLREFLGHWEAMPEPTGRRST 180 190 200 210 >>CCDS59044.1 RAET1E gene_id:135250|Hs108|chr6 (227 aa) initn: 388 init1: 318 opt: 416 Z-score: 522.9 bits: 104.1 E(32554): 8.3e-23 Smith-Waterman score: 416; 39.5% identity (63.8% similar) in 177 aa overlap (65-241:28-199) 40 50 60 70 80 90 pF1KE1 WYNFTIIHLPRHGQQWCEVQSQVDQKNFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEM :. .. : .: : ...:::..::. . CCDS59 MRRISLTSSPVRLLLFLLLLLIALEIMYNSDNNMVKPLGLLGKKVYATSTWGELTQT 10 20 30 40 50 100 110 120 130 140 150 pF1KE1 LREVGQRLRLELADTELEDFTPSGPLTLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDS : :::. ::. : : . .. : : ::::.: :. ::. .::::. .:.: ::::. CCDS59 LGEVGRDLRMLLCDIK-PQIKTSDPSTLQVEMFCQREAERCTGASWQFATNGEKSLLFDA 60 70 80 90 100 110 160 170 180 190 200 210 pF1KE1 NNRKWTVVHAGARRMKEKWEKDSGLTTFFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTM : :::.. : ..:: :.:: :: .:. .: :: :::.:: : . :::. :. CCDS59 MNMTWTVINHEASKIKETWKKDRGLEKYFRKLSKGDCDHWLREFLGHWEAMPEPTVSPVN 120 130 140 150 160 170 220 230 240 pF1KE1 APGLAQPKAIATTLSPWSFLIILCFILPGI : . ...: : ..:. ::: CCDS59 ASDIHWS---SSSL-PDRWIILGAFILLVLMGIVLICVWWQNGEWQAGLWPLRTS 180 190 200 210 220 >>CCDS1342.1 MR1 gene_id:3140|Hs108|chr1 (341 aa) initn: 283 init1: 216 opt: 282 Z-score: 352.9 bits: 73.2 E(32554): 2.5e-13 Smith-Waterman score: 282; 26.8% identity (52.5% similar) in 198 aa overlap (15-212:8-204) 10 20 30 40 50 60 pF1KE1 MAAAASPAILPRLAILPYLLFDWSGTGRADAHSLWYNFTIIHLPRHGQQWCEVQSQVDQK .:: .. . . .::: : . : :: . ::.. CCDS13 MGELMAFLLPLIIVLMVKHSDSRTHSLRYFRLGVSDPIHGVPEFISVGYVDSH 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 NFLSYDCGSDKVLSMGHLEEQLYATDAWGKQLEMLREVGQRLRLELADTELEDFTPSGPL . .:: . . . . : : : . ..:: : ...:: . . .. :: CCDS13 PITTYDSVTRQKEPRAPWMAENLAPDHWERYTQLLRGWQQMFKVELKRLQ-RHYNHSGSH 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 TLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNRKWTVVHAGARRMKEKWEKDSGLT : : ..:: :: : :...::. ::.:.... .: .: :. .:. :: .. CCDS13 TYQRMIGCELLEDGSTTGFLQYAYDGQDFLIFNKDTLSWLAVDNVAHTIKQAWEANQHEL 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 TFFKMVSMRDCKSWLRDFLMHRKKRLEPTAPPTMAPGLAQPKAIATTLSPWSFLIILCFI . : ..: .::. :: . : :. : :: CCDS13 LYQKNWLEEECIAWLKRFLEYGKDTLQRTEPPLVRVNRKETFPGVTALFCKAHGFYPPEI 180 190 200 210 220 230 pF1KE1 LPGI CCDS13 YMTWMKNGEEIVQEIDYGDILPSGDGTYQAWASIELDPQSSNLYSCHVEHCGVHMVLQVP 240 250 260 270 280 290 244 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 16:29:16 2016 done: Sun Nov 6 16:29:16 2016 Total Scan time: 2.130 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]