FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0259, 250 aa 1>>>pF1KE0259 250 - 250 aa - 250 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.1626+/-0.000859; mu= 11.6365+/- 0.051 mean_var=135.3937+/-28.104, 0's: 0 Z-trim(111.2): 138 B-trim: 229 in 1/51 Lambda= 0.110224 statistics sampled from 11988 (12165) to 11988 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.741), E-opt: 0.2 (0.374), width: 16 Scan time: 2.280 The best scores are: opt bits E(32554) CCDS34665.1 TRIM73 gene_id:375593|Hs108|chr7 ( 250) 1739 287.5 6e-78 CCDS5545.1 TRIM74 gene_id:378108|Hs108|chr7 ( 250) 1729 285.9 1.8e-77 CCDS34654.1 TRIM50 gene_id:135892|Hs108|chr7 ( 487) 1578 262.2 4.8e-70 CCDS32437.1 TRIM72 gene_id:493829|Hs108|chr16 ( 477) 387 72.8 4.8e-13 CCDS4654.1 TRIM27 gene_id:5987|Hs108|chr6 ( 513) 360 68.5 1e-11 CCDS34378.1 TRIM39 gene_id:56658|Hs108|chr6 ( 488) 347 66.4 4e-11 CCDS78121.1 RPP21 gene_id:202658|Hs108|chr6 ( 503) 347 66.4 4.1e-11 CCDS34377.1 TRIM39 gene_id:56658|Hs108|chr6 ( 518) 347 66.4 4.2e-11 CCDS31392.1 TRIM5 gene_id:85363|Hs108|chr11 ( 326) 344 65.7 4.3e-11 CCDS31394.1 TRIM5 gene_id:85363|Hs108|chr11 ( 347) 344 65.8 4.5e-11 CCDS31393.1 TRIM5 gene_id:85363|Hs108|chr11 ( 493) 344 65.9 5.7e-11 >>CCDS34665.1 TRIM73 gene_id:375593|Hs108|chr7 (250 aa) initn: 1739 init1: 1739 opt: 1739 Z-score: 1512.9 bits: 287.5 E(32554): 6e-78 Smith-Waterman score: 1739; 100.0% identity (100.0% similar) in 250 aa overlap (1-250:1-250) 10 20 30 40 50 60 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 DGSSSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 DGSSSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRREFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 PVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRREFQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 ELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 ELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHEF 190 200 210 220 230 240 250 pF1KE0 IWKFHSMASR :::::::::: CCDS34 IWKFHSMASR 250 >>CCDS5545.1 TRIM74 gene_id:378108|Hs108|chr7 (250 aa) initn: 1729 init1: 1729 opt: 1729 Z-score: 1504.4 bits: 285.9 E(32554): 1.8e-77 Smith-Waterman score: 1729; 99.6% identity (99.6% similar) in 250 aa overlap (1-250:1-250) 10 20 30 40 50 60 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV :::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MAWQVSLLELEDWLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 DGSSSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 DGSSSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRREFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 PVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRREFQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 ELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHEF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHEF 190 200 210 220 230 240 250 pF1KE0 IWKFHSMASR :::::::::: CCDS55 IWKFHSMASR 250 >>CCDS34654.1 TRIM50 gene_id:135892|Hs108|chr7 (487 aa) initn: 1578 init1: 1578 opt: 1578 Z-score: 1371.1 bits: 262.2 E(32554): 4.8e-70 Smith-Waterman score: 1578; 93.2% identity (96.8% similar) in 250 aa overlap (1-250:1-250) 10 20 30 40 50 60 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV ::::::: ::::::::::::::::: :::::::::::::::::: :::...:::.: :.: CCDS34 MAWQVSLPELEDRLQCPICLEVFKEPLMLQCGHSYCKGCLVSLSCHLDAELRCPVCRQAV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 DGSSSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH :::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 DGSSSLPNVSLARVIEALRLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSHQHH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 PVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRREFQ :::::::: ::::::::::.::::::::::::::::::.::::::::::::::::::::: CCDS34 PVTPVSTVYSRMKEELAALISELKQEQKKVDELIAKLVNNRTRIVNESDVFSWVIRREFQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 ELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHEF ::.: :::::::::::::::::::::::::::::::::::::::::::::::::::::.: CCDS34 ELHHLVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFGNEDHHKF 190 200 210 220 230 240 250 pF1KE0 IWKFHSMASR : :::::::: CCDS34 IRKFHSMASRAEMPQARPLEGAFSPISFKPGLHQADIKLTVWKRLFRKVLPAPEPLKLDP 250 260 270 280 290 300 >>CCDS32437.1 TRIM72 gene_id:493829|Hs108|chr16 (477 aa) initn: 331 init1: 210 opt: 387 Z-score: 347.6 bits: 72.8 E(32554): 4.8e-13 Smith-Waterman score: 387; 29.1% identity (58.6% similar) in 251 aa overlap (10-250:8-247) 10 20 30 40 50 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCL--VSLSYHLDTKVRCPMCWQ :...:.::.::..: . .::::.:..:: :. : : :: : CCDS32 MSAAPGLLHQELSCPLCLQLFDAPVTAECGHSFCRACLGRVAGEPAADGTVLCPCCQA 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 VVDGSSSLPNVSLAWVIEAL-RLPGDPEPKVCVHHRNPLSLFCEKDQELICGLCGLLGSH . .. :..:: ..:.: ..: : .: .:::..::.:. :.::.:. :::: CCDS32 PTRPQALSTNLQLARLVEGLAQVPQGH----CEEHLDPLSIYCEQDRALVCGVCASLGSH 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 QHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSWVIRR . : . :.. . .:.: .: .:.. . .. .: : .. ... . : .. . CCDS32 RGHRLLPAAEAHARLKTQLPQQKLQLQEACMRKEKSVAVLEHQLVEVEETVRQFRGAVGE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE0 EFQELR-------HPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLE .. ..: .:.: : : : ..::. ::: : : : ::: CCDS32 QLGKMRVFLAALEGSLDREAERVRGEAGVALRRELGSLNSYLEQ-------LRQMEKVLE 180 190 200 210 220 240 250 pF1KE0 QFGNEDHHEFIWKFHSMASR . ... . ::. :. ..:: CCDS32 EVADKPQTEFLMKYCLVTSRLQKILAESPPPARLDIQLPIISDDFKFQVWRKMFRALMPA 230 240 250 260 270 280 >>CCDS4654.1 TRIM27 gene_id:5987|Hs108|chr6 (513 aa) initn: 382 init1: 187 opt: 360 Z-score: 324.0 bits: 68.5 E(32554): 1e-11 Smith-Waterman score: 360; 28.2% identity (56.5% similar) in 248 aa overlap (10-250:10-257) 10 20 30 40 50 60 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHLDTKVRCPMCWQVV :... ::.::. : : .::.:::. : .::. .:.: ::.: .. CCDS46 MASGSVAECLQQETTCPVCLQYFAEPMMLDCGHNICCACLARCWGTAETNVSCPQCRETF 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 DGSSSLPNVSLAWV---IEALRL--PGDP--EPKVCVHHRNPLSLFCEKDQELICGLCGL :: :: : .. :: :. : : :: .::.::.:.::.:: :: .: CCDS46 PQRHMRPNRHLANVTQLVKQLRTERPSGPGGEMGVCEKHREPLKLYCEEDQMPICVVCDR 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE0 LGSHQHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSW :. : : :. . .::.. ...::. . . :. . :..... ... CCDS46 SREHRGHSVLPLEEAVEGFKEQIQNQLDHLKRVKDLKKRRRAQGEQARAELLSLTQMERE 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE0 VIRREFQELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERLAQAECVLEQFG : ::..: : . :.. : : . .. :.. . : . . .:.. ::. CCDS46 KIVWEFEQLYHSLKEHEYRLLARLEELDLAIYNSINGAITQFSCNISHLSSLIAQLEEKQ 190 200 210 220 230 240 240 250 pF1KE0 NEDHHEFIWKFHSMASR .. .:.. . . :: CCDS46 QQPTRELLQDIGDTLSRAERIRIPEPWITPPDLQEKIHIFAQKCLFLTESLKQFTEKMQS 250 260 270 280 290 300 >>CCDS34378.1 TRIM39 gene_id:56658|Hs108|chr6 (488 aa) initn: 356 init1: 198 opt: 347 Z-score: 313.1 bits: 66.4 E(32554): 4e-11 Smith-Waterman score: 347; 28.9% identity (62.1% similar) in 190 aa overlap (6-190:19-208) 10 20 30 40 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHL .: .:. . .: .::: .:: ....:::..::.:.. : CCDS34 MAETSLLEAGASAASTAAALENLQVEASCSVCLEYLKEPVIIECGHNFCKACITRWWEDL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 DTKVRCPMCWQVVDGSSSLPNVSLAWVIEALRLPGDPEPKV-----CVHHRNPLSLFCEK . ::.: .. : :: .:. ..: . . :. : .:.. ::::: . CCDS34 ERDFPCPVCRKTSRYRSLRPNRQLGSMVEIAKQLQAVKRKIRDESLCPQHHEALSLFCYE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE0 DQELICGLCGLLGSHQHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRT ::: .: .:.. .:. : :.:.. . ...::.: . :.:. ... . .. :. CCDS34 DQEAVCLICAISHTHRAHTVVPLDDATQEYKEKLQKCLEPLEQKLQEITRCKSSEEKKPG 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE0 RIVNESDVFSWVIRREFQELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERL .. . : :::.::.. .:::. CCDS34 ELKRLVESRRQQILREFEELHRRLDEEQQVLLSRLEEEEQDILQRLRENAAHLGDKRRDL 190 200 210 220 230 240 >>CCDS78121.1 RPP21 gene_id:202658|Hs108|chr6 (503 aa) initn: 356 init1: 198 opt: 347 Z-score: 312.9 bits: 66.4 E(32554): 4.1e-11 Smith-Waterman score: 347; 28.9% identity (62.1% similar) in 190 aa overlap (6-190:19-208) 10 20 30 40 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHL .: .:. . .: .::: .:: ....:::..::.:.. : CCDS78 MAETSLLEAGASAASTAAALENLQVEASCSVCLEYLKEPVIIECGHNFCKACITRWWEDL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 DTKVRCPMCWQVVDGSSSLPNVSLAWVIEALRLPGDPEPKV-----CVHHRNPLSLFCEK . ::.: .. : :: .:. ..: . . :. : .:.. ::::: . CCDS78 ERDFPCPVCRKTSRYRSLRPNRQLGSMVEIAKQLQAVKRKIRDESLCPQHHEALSLFCYE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE0 DQELICGLCGLLGSHQHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRT ::: .: .:.. .:. : :.:.. . ...::.: . :.:. ... . .. :. CCDS78 DQEAVCLICAISHTHRAHTVVPLDDATQEYKEKLQKCLEPLEQKLQEITRCKSSEEKKPG 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE0 RIVNESDVFSWVIRREFQELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERL .. . : :::.::.. .:::. CCDS78 ELKRLVESRRQQILREFEELHRRLDEEQQVLLSRLEEEEQDILQRLRENAAHLGDKRRDL 190 200 210 220 230 240 >>CCDS34377.1 TRIM39 gene_id:56658|Hs108|chr6 (518 aa) initn: 356 init1: 198 opt: 347 Z-score: 312.8 bits: 66.4 E(32554): 4.2e-11 Smith-Waterman score: 347; 28.9% identity (62.1% similar) in 190 aa overlap (6-190:19-208) 10 20 30 40 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVSLSYHL .: .:. . .: .::: .:: ....:::..::.:.. : CCDS34 MAETSLLEAGASAASTAAALENLQVEASCSVCLEYLKEPVIIECGHNFCKACITRWWEDL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE0 DTKVRCPMCWQVVDGSSSLPNVSLAWVIEALRLPGDPEPKV-----CVHHRNPLSLFCEK . ::.: .. : :: .:. ..: . . :. : .:.. ::::: . CCDS34 ERDFPCPVCRKTSRYRSLRPNRQLGSMVEIAKQLQAVKRKIRDESLCPQHHEALSLFCYE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE0 DQELICGLCGLLGSHQHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRT ::: .: .:.. .:. : :.:.. . ...::.: . :.:. ... . .. :. CCDS34 DQEAVCLICAISHTHRAHTVVPLDDATQEYKEKLQKCLEPLEQKLQEITRCKSSEEKKPG 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE0 RIVNESDVFSWVIRREFQELRHPVDEEKARCLEGIGGHTRGLVASLDMQLEQAQGTRERL .. . : :::.::.. .:::. CCDS34 ELKRLVESRRQQILREFEELHRRLDEEQQVLLSRLEEEEQDILQRLRENAAHLGDKRRDL 190 200 210 220 230 240 >>CCDS31392.1 TRIM5 gene_id:85363|Hs108|chr11 (326 aa) initn: 341 init1: 163 opt: 344 Z-score: 312.7 bits: 65.7 E(32554): 4.3e-11 Smith-Waterman score: 344; 28.5% identity (62.1% similar) in 235 aa overlap (7-230:6-240) 10 20 30 40 50 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVS--LSYHLDT-KVRCPMCW :....... ::::::.. . : :.::::.:..::.. . :: . ::.: CCDS31 MASGILVNVKEEVTCPICLELLTQPLSLDCGHSFCQACLTANHKKSMLDKGESSCPVCR 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 QVVDGSSSLPNVSLAWVIEALR-LPGDPE-PKV--CVHHRNPLSLFCEKDQELICGLCGL . . :: .: ..: :: . .:: :: :..: . : :::..: ..:: :: CCDS31 ISYQPENIRPNRHVANIVEKLREVKLSPEGQKVDHCARHGEKLLLFCQEDGKVICWLCER 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 LGSHQHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSW :. : . . : ... .: : . :.:.:....:: : . .... .. . . CCDS31 SQEHRGHHTFLTEEVAREYQVKLQAALEMLRQKQQEAEELEADIREEKASWKTQIQYDKT 120 130 140 150 160 170 180 190 200 210 220 pF1KE0 VIRREFQELRHPVDEEKARCLEGIGGHTRGLVASLDMQ----LEQAQGTRERLAQAECVL . .:..:: .: :.. :... . . .. :: . ..:.:. :: ... : : CCDS31 NVLADFEQLRDILDWEESNELQNLEKEEEDILKSLTNSETEMVQQTQSLRELISDLEHRL 180 190 200 210 220 230 230 240 250 pF1KE0 EQFGNEDHHEFIWKFHSMASR . CCDS31 QGSVMELLQGVDGVIKRTENVTLKKPETFPKNQRRVFRAPDLKGMLEVFRELTDVRRYWG 240 250 260 270 280 290 >>CCDS31394.1 TRIM5 gene_id:85363|Hs108|chr11 (347 aa) initn: 341 init1: 163 opt: 344 Z-score: 312.3 bits: 65.8 E(32554): 4.5e-11 Smith-Waterman score: 344; 28.5% identity (62.1% similar) in 235 aa overlap (7-230:6-240) 10 20 30 40 50 pF1KE0 MAWQVSLLELEDRLQCPICLEVFKESLMLQCGHSYCKGCLVS--LSYHLDT-KVRCPMCW :....... ::::::.. . : :.::::.:..::.. . :: . ::.: CCDS31 MASGILVNVKEEVTCPICLELLTQPLSLDCGHSFCQACLTANHKKSMLDKGESSCPVCR 10 20 30 40 50 60 70 80 90 100 110 pF1KE0 QVVDGSSSLPNVSLAWVIEALR-LPGDPE-PKV--CVHHRNPLSLFCEKDQELICGLCGL . . :: .: ..: :: . .:: :: :..: . : :::..: ..:: :: CCDS31 ISYQPENIRPNRHVANIVEKLREVKLSPEGQKVDHCARHGEKLLLFCQEDGKVICWLCER 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 LGSHQHHPVTPVSTVCSRMKEELAALFSELKQEQKKVDELIAKLVKNRTRIVNESDVFSW :. : . . : ... .: : . :.:.:....:: : . .... .. . . CCDS31 SQEHRGHHTFLTEEVAREYQVKLQAALEMLRQKQQEAEELEADIREEKASWKTQIQYDKT 120 130 140 150 160 170 180 190 200 210 220 pF1KE0 VIRREFQELRHPVDEEKARCLEGIGGHTRGLVASLDMQ----LEQAQGTRERLAQAECVL . .:..:: .: :.. :... . . .. :: . ..:.:. :: ... : : CCDS31 NVLADFEQLRDILDWEESNELQNLEKEEEDILKSLTNSETEMVQQTQSLRELISDLEHRL 180 190 200 210 220 230 230 240 250 pF1KE0 EQFGNEDHHEFIWKFHSMASR . CCDS31 QGSVMELLQGVDGVIKRTENVTLKKPETFPKNQRRVFRAPDLKGMLEVFRELTDVRRYWG 240 250 260 270 280 290 250 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 18:44:15 2016 done: Thu Nov 3 18:44:16 2016 Total Scan time: 2.280 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]