FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2234, 263 aa 1>>>pF1KE2234 263 - 263 aa - 263 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.4313+/-0.000787; mu= 14.4260+/- 0.047 mean_var=78.2714+/-15.890, 0's: 0 Z-trim(109.5): 65 B-trim: 2 in 1/50 Lambda= 0.144968 statistics sampled from 10830 (10901) to 10830 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.717), E-opt: 0.2 (0.335), width: 16 Scan time: 2.000 The best scores are: opt bits E(32554) CCDS4760.1 DMB gene_id:3109|Hs108|chr6 ( 263) 1824 390.6 5.8e-109 CCDS4754.1 DOB gene_id:3112|Hs108|chr6 ( 273) 286 69.0 4e-12 CCDS4765.1 DPB1 gene_id:3115|Hs108|chr6 ( 258) 281 67.9 8e-12 CCDS59006.1 DQB1 gene_id:3119|Hs108|chr6 ( 269) 276 66.9 1.7e-11 CCDS78128.1 DQB2 gene_id:3120|Hs108|chr6 ( 264) 275 66.7 1.9e-11 CCDS47409.1 DRB1 gene_id:3123|Hs108|chr6 ( 266) 273 66.2 2.6e-11 CCDS43451.1 DQB1 gene_id:3119|Hs108|chr6 ( 261) 266 64.8 7.1e-11 >>CCDS4760.1 DMB gene_id:3109|Hs108|chr6 (263 aa) initn: 1824 init1: 1824 opt: 1824 Z-score: 2069.7 bits: 390.6 E(32554): 5.8e-109 Smith-Waterman score: 1824; 99.6% identity (99.6% similar) in 263 aa overlap (1-263:1-263) 10 20 30 40 50 60 pF1KE2 MITFLPLLLGLSLGCTGAGGFVAHVESTCLLDDAGTPKDFTYCISFNKDLLTCWDPEENK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MITFLPLLLGLSLGCTGAGGFVAHVESTCLLDDAGTPKDFTYCISFNKDLLTCWDPEENK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 MAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGLQNCATHTQPFWGSLTNRTRPPSVQVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGLQNCATHTQPFWGSLTNRTRPPSVQVA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 KTTPFNTREPVMLACYVWGFYPAEVTITWRKNGKLVMPHSSAHKTAQPNGDWTYQTLSHL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 KTTPFNTREPVMLACYVWGFYPAEVTITWRKNGKLVMPHSSAHKTAQPNGDWTYQTLSHL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 ALTPSYGDTYTCVVEHIGAPEPILRDWTPGLSPMQTLKVSVSAVTLGLGLIIFSLGVISW :::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::: CCDS47 ALTPSYGDTYTCVVEHTGAPEPILRDWTPGLSPMQTLKVSVSAVTLGLGLIIFSLGVISW 190 200 210 220 230 240 250 260 pF1KE2 RRAGHSSYTPLPGSNYSEGWHIS ::::::::::::::::::::::: CCDS47 RRAGHSSYTPLPGSNYSEGWHIS 250 260 >>CCDS4754.1 DOB gene_id:3112|Hs108|chr6 (273 aa) initn: 246 init1: 143 opt: 286 Z-score: 331.1 bits: 69.0 E(32554): 4e-12 Smith-Waterman score: 286; 26.5% identity (59.1% similar) in 264 aa overlap (1-257:13-264) 10 20 30 40 pF1KE2 MITFLPLLLGLSLGCTGAGGFVAHVESTCLLDDAGTPK-DFTYCISFN .... : ... : . :: .... : . . :: : .:. . :: CCDS47 MGSGWVPWVVALLVNLTRLDSSMTQGTDSPEDFVIQAKADCYFTN-GTEKVQFVVRFIFN 10 20 30 40 50 50 60 70 80 90 100 pF1KE2 KDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGLQNCATHTQPFWGS . . .: . . : .:..:.. ... :.. :..: :..... :. . . CCDS47 LEEYVRFDSDVG-----MFVALTKLGQPDAEQWNSRLDLLERSRQAVDGVCRHNYRLGAP 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 LT-NRTRPPSVQV-AKTTPFNTREPVMLACYVWGFYPAEVTITWRKNGKLVMPHSSAHKT .: .: : : : . ::. .. .: : : ::::... : : ::. . .. . CCDS47 FTVGRKVQPEVTVYPERTPL-LHQHNLLHCSVTGFYPGDIKIKWFLNGQ---EERAGVMS 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 AQP--NGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGLSPMQTLKVSVSA . : :::::.::. : .:: : .:::.:.: . :. .: . : .. :. . CCDS47 TGPIRNGDWTFQTVVMLEMTPELGHVYTCLVDHSSLLSPVSVEWR-AQSEYSWRKMLSGI 180 190 200 210 220 230 240 250 260 pF1KE2 VTLGLGLIIFSLGVISWRRAGHSSY--TPLPGSNYSEGWHIS ... ::::.. .:.. :: ...: : . :.. : CCDS47 AAFLLGLIFLLVGIVIQLRA-QKGYVRTQMSGNEVSRAVLLPQSC 230 240 250 260 270 >>CCDS4765.1 DPB1 gene_id:3115|Hs108|chr6 (258 aa) initn: 238 init1: 136 opt: 281 Z-score: 325.8 bits: 67.9 E(32554): 8e-12 Smith-Waterman score: 281; 30.9% identity (56.4% similar) in 181 aa overlap (65-243:73-250) 40 50 60 70 80 90 pF1KE2 GTPKDFTYCISFNKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLN-QKDTLMQRLRNG :: ... :. ... : ::: : .. CCDS47 ECYAFNGTQRFLERYIYNREEFARFDSDVGEFRAVTELGRPAAEYWNSQKDILEEKRAVP 50 60 70 80 90 100 100 110 120 130 140 150 pF1KE2 LQNCATHTQPFWGSLTNRTR-PPSVQVAKTTPFNTREPVMLACYVWGFYPAEVTITWRKN . : :. . : .: . : : :.:. . .. .:.:.: :::. . . : : CCDS47 DRMC-RHNYELGGPMTLQRRVQPRVNVSPSKKGPLQHHNLLVCHVTDFYPGSIQVRWFLN 110 120 130 140 150 160 160 170 180 190 200 210 pF1KE2 GKLVMPHSSAHKTAQPNGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGLS :. . . . :::::.: : : .::. ::.::: ::: . :. .: . : CCDS47 GQEETAGVVSTNLIR-NGDWTFQILVMLEMTPQQGDVYTCQVEHTSLDSPVTVEWK-AQS 170 180 190 200 210 220 230 240 250 260 pF1KE2 PMQTLKVSVSAVTLGLGLIIFSLGVISWRRAGHSSYTPLPGSNYSEGWHIS :. ..: . ::::: ..:.. ::. CCDS47 DSARSKTLTGAGGFVLGLIICGVGIFMHRRSKKVQRGSA 220 230 240 250 >>CCDS59006.1 DQB1 gene_id:3119|Hs108|chr6 (269 aa) initn: 228 init1: 156 opt: 276 Z-score: 319.8 bits: 66.9 E(32554): 1.7e-11 Smith-Waterman score: 276; 29.7% identity (58.2% similar) in 182 aa overlap (77-252:90-264) 50 60 70 80 90 100 pF1KE2 NKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGLQNCATHTQP--F ... :.. ... : :.. :. : CCDS59 TRYIYNREEYARFDSDVGVYRAVTPQGRPDAEYWNSQKEVLEGTRAELDTVCRHNYEVAF 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 WGSLTNRTRPP-SVQVAKTTPFNTREPVMLACYVWGFYPAEVTITWRKNGKLVMPHSSAH : : :..: ... ..: .: .. .:.: : :::... . : .: . . .: CCDS59 RGILQRRVEPTVTISPSRTEALNHHN--LLVCSVTDFYPGQIKVRWFRNDQ----EETAG 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 KTAQP---NGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGLSPMQTLKVS .. : :::::.: : : .::. ::.::: ::: . :: .: :. :. CCDS59 VVSTPLIRNGDWTFQILVMLEMTPQRGDVYTCHVEHPSLQSPITVEWRAQSESAQS-KML 180 190 200 210 220 230 230 240 250 260 pF1KE2 VSAVTLGLGLIIFSLGVISWRRAGHSSYTPLPGSNYSEGWHIS .. . ::::...::.: .:. .. : : CCDS59 SGVGGFVLGLIFLGLGLIIRQRSQKGPQGPPPAGLLH 240 250 260 >>CCDS78128.1 DQB2 gene_id:3120|Hs108|chr6 (264 aa) initn: 253 init1: 155 opt: 275 Z-score: 318.8 bits: 66.7 E(32554): 1.9e-11 Smith-Waterman score: 275; 28.7% identity (52.1% similar) in 188 aa overlap (65-252:74-259) 40 50 60 70 80 90 pF1KE2 GTPKDFTYCISFNKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGL :: ... :. . . : :: : :. CCDS78 YFTNGTERVRGVARYIYNREEYGRFDSDVGEFQAVTELGRSIEDWNNYKDFLEQERAAVD 50 60 70 80 90 100 100 110 120 130 140 150 pF1KE2 QNCATHTQPFWGSLTNRTRPPSVQVAKTTPFNTREPVMLACYVWGFYPAEVTITWRKNGK . : . . . .: :.: .. . . .:.: : ::::.. . : .: . CCDS78 KVCRHNYEAELRTTLQRQVEPTVTISPSRTEALNHHNLLVCSVTDFYPAQIKVRWFRNDQ 110 120 130 140 150 160 160 170 180 190 200 210 pF1KE2 LVMPHSSAHKTAQPNGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGLSPM . . . :::::.: : : .::. :: ::: ::: . :: .: CCDS78 EETAGVVSTSLIR-NGDWTFQILVMLEITPQRGDIYTCQVEHPSLQSPITVEWRAQSESA 170 180 190 200 210 220 220 230 240 250 260 pF1KE2 QTLKVSVSAVTLGLGLIIFSLGVISWRRAGHSSYTPLPGSNYSEGWHIS :. :. . . ::::...::.: .:. .. : : CCDS78 QS-KMLSGIGGFVLGLIFLGLGLIIRHRGQKGPRGPPPAGLLH 230 240 250 260 >>CCDS47409.1 DRB1 gene_id:3123|Hs108|chr6 (266 aa) initn: 215 init1: 125 opt: 273 Z-score: 316.5 bits: 66.2 E(32554): 2.6e-11 Smith-Waterman score: 273; 29.5% identity (56.8% similar) in 190 aa overlap (65-250:75-261) 40 50 60 70 80 90 pF1KE2 GTPKDFTYCISFNKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGL :: ... :. ... :.. .... : .. CCDS47 HFFNGTERVRFLDRYFYNQEESVRFDSDVGEFRAVTELGRPDAEYWNSQKDILEQARAAV 50 60 70 80 90 100 100 110 120 130 140 150 pF1KE2 QNCATHTQPFWGSLTNRTR-PPSVQV--AKTTPFNTREPVMLACYVWGFYPAEVTITWRK .. :. :.: . : :.: : .:: :.. .. .:.: : ::::. . . : CCDS47 DTYCRHNYGVVESFTVQRRVQPKVTVYPSKTQPLQHHN--LLVCSVSGFYPGSIEVRWFL 110 120 130 140 150 160 160 170 180 190 200 210 pF1KE2 NGKLVMPHSSAHKTAQPNGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGL ::. . : :::::.::: : .: :..::: ::: .. :. .: CCDS47 NGQEEKAGMVSTGLIQ-NGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTSPLTVEWRARS 170 180 190 200 210 220 220 230 240 250 260 pF1KE2 SPMQTLKVS-VSAVTLGLGLIIFSLGVISWRRAGHSSYTPLPGSNYSEGWHIS :. .: :.. .::: .. .: . . :::. : CCDS47 ESAQSKMLSGVGGFVLGLLFLGAGLFIYFRNQKGHSGLQPTGFLS 230 240 250 260 >>CCDS43451.1 DQB1 gene_id:3119|Hs108|chr6 (261 aa) initn: 228 init1: 156 opt: 266 Z-score: 308.7 bits: 64.8 E(32554): 7.1e-11 Smith-Waterman score: 266; 30.1% identity (59.0% similar) in 173 aa overlap (77-243:90-255) 50 60 70 80 90 100 pF1KE2 NKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHLNQKDTLMQRLRNGLQNCATHTQP--F ... :.. ... : :.. :. : CCDS43 TRYIYNREEYARFDSDVGVYRAVTPQGRPDAEYWNSQKEVLEGTRAELDTVCRHNYEVAF 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE2 WGSLTNRTRPP-SVQVAKTTPFNTREPVMLACYVWGFYPAEVTITWRKNGKLVMPHSSAH : : :..: ... ..: .: .. .:.: : :::... . : .: . . .: CCDS43 RGILQRRVEPTVTISPSRTEALNHHN--LLVCSVTDFYPGQIKVRWFRNDQ----EETAG 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE2 KTAQP---NGDWTYQTLSHLALTPSYGDTYTCVVEHIGAPEPILRDWTPGLSPMQTLKVS .. : :::::.: : : .::. ::.::: ::: . :: .: :. :. CCDS43 VVSTPLIRNGDWTFQILVMLEMTPQRGDVYTCHVEHPSLQSPITVEWRAQSESAQS-KML 180 190 200 210 220 230 230 240 250 260 pF1KE2 VSAVTLGLGLIIFSLGVISWRRAGHSSYTPLPGSNYSEGWHIS .. . ::::...::.: .:. CCDS43 SGVGGFVLGLIFLGLGLIIRQRSQKGLLH 240 250 260 263 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 00:27:13 2016 done: Mon Nov 7 00:27:13 2016 Total Scan time: 2.000 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]