FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3035, 621 aa 1>>>pF1KB3035 621 - 621 aa - 621 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.0824+/-0.000388; mu= -1.7103+/- 0.024 mean_var=272.3768+/-58.276, 0's: 0 Z-trim(121.5): 25 B-trim: 1391 in 2/58 Lambda= 0.077712 statistics sampled from 38181 (38219) to 38181 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.448), width: 16 Scan time: 12.740 The best scores are: opt bits E(85289) NP_006523 (OMIM: 600284) RNA polymerase II elongat ( 621) 4178 481.9 2.9e-135 XP_016882824 (OMIM: 600284) PREDICTED: RNA polymer ( 488) 3280 381.1 4.8e-105 XP_016882825 (OMIM: 600284) PREDICTED: RNA polymer ( 370) 2536 297.6 5e-80 XP_016882826 (OMIM: 600284) PREDICTED: RNA polymer ( 337) 2266 267.3 6e-71 XP_011526632 (OMIM: 600284) PREDICTED: RNA polymer ( 357) 1920 228.6 3e-59 XP_016864728 (OMIM: 601874) PREDICTED: RNA polymer ( 589) 1298 159.0 4.3e-38 NP_036213 (OMIM: 601874) RNA polymerase II elongat ( 640) 1298 159.0 4.6e-38 XP_006714638 (OMIM: 601874) PREDICTED: RNA polymer ( 585) 1102 137.0 1.8e-31 XP_016864732 (OMIM: 601874) PREDICTED: RNA polymer ( 455) 635 84.6 8.4e-16 XP_016864729 (OMIM: 601874) PREDICTED: RNA polymer ( 508) 635 84.6 9.2e-16 XP_016864730 (OMIM: 601874) PREDICTED: RNA polymer ( 508) 635 84.6 9.2e-16 XP_016864731 (OMIM: 601874) PREDICTED: RNA polymer ( 508) 635 84.6 9.2e-16 NP_079441 (OMIM: 609885) RNA polymerase II elongat ( 397) 394 57.5 1e-07 NP_001231663 (OMIM: 610153,610572) MARVEL domain-c ( 546) 286 45.5 0.00058 XP_005248504 (OMIM: 610153,610572) PREDICTED: MARV ( 546) 286 45.5 0.00058 XP_005248503 (OMIM: 610153,610572) PREDICTED: MARV ( 558) 286 45.5 0.00059 NP_001033692 (OMIM: 610153,610572) MARVEL domain-c ( 558) 286 45.5 0.00059 XP_005248502 (OMIM: 610153,610572) PREDICTED: MARV ( 558) 286 45.5 0.00059 NP_001192184 (OMIM: 251290,602876) occludin isofor ( 271) 247 40.9 0.007 >>NP_006523 (OMIM: 600284) RNA polymerase II elongation (621 aa) initn: 4178 init1: 4178 opt: 4178 Z-score: 2549.5 bits: 481.9 E(85289): 2.9e-135 Smith-Waterman score: 4178; 100.0% identity (100.0% similar) in 621 aa overlap (1-621:1-621) 10 20 30 40 50 60 pF1KB3 MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRARQDSVSLRPSIRFQGSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRARQDSVSLRPSIRFQGSQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 GHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSSHGEVHLDCLGSIQDKITV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 GHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSSHGEVHLDCLGSIQDKITV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 CATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFRKPAPGATDAVPSRKRAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 CATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFRKPAPGATDAVPSRKRAT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 PINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKAELLLRLQKDGLTQADKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 PINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKAELLLRLQKDGLTQADKD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 ALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQLLKRVLVRKLCQPQSTGS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 ALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQLLKRVLVRKLCQPQSTGS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 LLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGKLGVPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 LLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGKLGVPN 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 GREALLPTPGPPASTDTLSSSTHLPPRLEPPRAHDPLADVSNDLGHSGRDCEHGEAAAPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 GREALLPTPGPPASTDTLSSSTHLPPRLEPPRAHDPLADVSNDLGHSGRDCEHGEAAAPA 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 PTVRLGLPLLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERAAEDKPRAQLPDCAPATHAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 PTVRLGLPLLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERAAEDKPRAQLPDCAPATHAT 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB3 PGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 PGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDLH 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB3 ARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_006 ARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHS 550 560 570 580 590 600 610 620 pF1KB3 KLAHIKRLIAEYDQRQLQAWP ::::::::::::::::::::: NP_006 KLAHIKRLIAEYDQRQLQAWP 610 620 >>XP_016882824 (OMIM: 600284) PREDICTED: RNA polymerase (488 aa) initn: 3280 init1: 3280 opt: 3280 Z-score: 2006.9 bits: 381.1 E(85289): 4.8e-105 Smith-Waterman score: 3280; 100.0% identity (100.0% similar) in 488 aa overlap (134-621:1-488) 110 120 130 140 150 160 pF1KB3 GEVHLDCLGSIQDKITVCATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFR :::::::::::::::::::::::::::::: XP_016 MAQAEEETRSRSAIVIKAGGRYLGKKVQFR 10 20 30 170 180 190 200 210 220 pF1KB3 KPAPGATDAVPSRKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 KPAPGATDAVPSRKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKA 40 50 60 70 80 90 230 240 250 260 270 280 pF1KB3 ELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 ELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQL 100 110 120 130 140 150 290 300 310 320 330 340 pF1KB3 LKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISH 160 170 180 190 200 210 350 360 370 380 390 400 pF1KB3 FTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLSSSTHLPPRLEPPRAHDPLADVSND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 FTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLSSSTHLPPRLEPPRAHDPLADVSND 220 230 240 250 260 270 410 420 430 440 450 460 pF1KB3 LGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERAAE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERAAE 280 290 300 310 320 330 470 480 490 500 510 520 pF1KB3 DKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQRQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 DKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQRQ 340 350 360 370 380 390 530 540 550 560 570 580 pF1KB3 SYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKIKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 SYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKIKK 400 410 420 430 440 450 590 600 610 620 pF1KB3 TNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP :::::::::::::::::::::::::::::::::::::: XP_016 TNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP 460 470 480 >>XP_016882825 (OMIM: 600284) PREDICTED: RNA polymerase (370 aa) initn: 2536 init1: 2536 opt: 2536 Z-score: 1557.8 bits: 297.6 E(85289): 5e-80 Smith-Waterman score: 2536; 100.0% identity (100.0% similar) in 370 aa overlap (252-621:1-370) 230 240 250 260 270 280 pF1KB3 KAELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQ :::::::::::::::::::::::::::::: XP_016 MSAKDGTCTLQDCMYKDVQKDWPGYSEGDQ 10 20 30 290 300 310 320 330 340 pF1KB3 QLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 QLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRI 40 50 60 70 80 90 350 360 370 380 390 400 pF1KB3 SHFTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLSSSTHLPPRLEPPRAHDPLADVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 SHFTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLSSSTHLPPRLEPPRAHDPLADVS 100 110 120 130 140 150 410 420 430 440 450 460 pF1KB3 NDLGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 NDLGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERA 160 170 180 190 200 210 470 480 490 500 510 520 pF1KB3 AEDKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 AEDKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQ 220 230 240 250 260 270 530 540 550 560 570 580 pF1KB3 RQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 RQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKI 280 290 300 310 320 330 590 600 610 620 pF1KB3 KKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP :::::::::::::::::::::::::::::::::::::::: XP_016 KKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP 340 350 360 370 >>XP_016882826 (OMIM: 600284) PREDICTED: RNA polymerase (337 aa) initn: 2266 init1: 2266 opt: 2266 Z-score: 1394.8 bits: 267.3 E(85289): 6e-71 Smith-Waterman score: 2266; 100.0% identity (100.0% similar) in 332 aa overlap (290-621:6-337) 260 270 280 290 300 310 pF1KB3 TLQDCMYKDVQKDWPGYSEGDQQLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSAS :::::::::::::::::::::::::::::: XP_016 MASSSRKLCQPQSTGSLLGDPAASSPPGERGRSAS 10 20 30 320 330 340 350 360 370 pF1KB3 PPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGKLGVPNGREALLPTPGPPASTDTLS 40 50 60 70 80 90 380 390 400 410 420 430 pF1KB3 SSTHLPPRLEPPRAHDPLADVSNDLGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 SSTHLPPRLEPPRAHDPLADVSNDLGHSGRDCEHGEAAAPAPTVRLGLPLLTDCAQPSRP 100 110 120 130 140 150 440 450 460 470 480 490 pF1KB3 HGSPSRSKPKKKSKKHKDKERAAEDKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HGSPSRSKPKKKSKKHKDKERAAEDKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSV 160 170 180 190 200 210 500 510 520 530 540 550 pF1KB3 PTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQ 220 230 240 250 260 270 560 570 580 590 600 610 pF1KB3 LSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 LSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQA 280 290 300 310 320 330 620 pF1KB3 WP :: XP_016 WP >>XP_011526632 (OMIM: 600284) PREDICTED: RNA polymerase (357 aa) initn: 1946 init1: 1920 opt: 1920 Z-score: 1184.8 bits: 228.6 E(85289): 3e-59 Smith-Waterman score: 1920; 99.7% identity (100.0% similar) in 291 aa overlap (1-291:1-291) 10 20 30 40 50 60 pF1KB3 MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRARQDSVSLRPSIRFQGSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRARQDSVSLRPSIRFQGSQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 GHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSSHGEVHLDCLGSIQDKITV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 GHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSSHGEVHLDCLGSIQDKITV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 CATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFRKPAPGATDAVPSRKRAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 CATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFRKPAPGATDAVPSRKRAT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 PINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKAELLLRLQKDGLTQADKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKAELLLRLQKDGLTQADKD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 ALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQLLKRVLVRKLCQPQSTGS ::::::::::::::::::::::::::::::::::::::::::::::::::. XP_011 ALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQLLKRVLVREHSSWSTSLR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 LLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGKLGVPN XP_011 TGLQEDCWPHLHAQTMVLLSRRLITSRLVESKMLSLHVEAASPSQAGGHSAALQILL 310 320 330 340 350 >>XP_016864728 (OMIM: 601874) PREDICTED: RNA polymerase (589 aa) initn: 1253 init1: 688 opt: 1298 Z-score: 804.8 bits: 159.0 E(85289): 4.3e-38 Smith-Waterman score: 1649; 47.0% identity (72.1% similar) in 591 aa overlap (4-568:9-587) 10 20 30 40 50 pF1KB3 MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRARQDSVSLRPSIR :.:.. :::::::... ....:.:::::..:.::.:.:..... . .::::. XP_016 MAAGGTGGLREEQRYGLSCGRLGQ-DNITVLHVKLTETAIRALETYQSHKNLIPFRPSIQ 10 20 30 40 50 60 70 80 90 100 110 pF1KB3 FQGSQGHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSSHGEVHLDCLGSIQ ::: .: ..::. : :...:.:::::.:.:::::::::::: :: : .:.::: :: XP_016 FQGLHGLVKIPKNDPLNEVHNFNFYLSNVGKDNPQGSFDCIQQTFSSSGASQLNCLGFIQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB3 DKITVCATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFRKPAPGATDAVPS ::::::::.:::: .:. :.:::::.:.::. ::: :: :.::.::.:: ...:.:: XP_016 DKITVCATNDSYQMTRERMTQAEEESRNRSTKVIKPGGPYVGKRVQIRKAPQAVSDTVPE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB3 RKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKAELLLRLQKDGLT :::.::.: :..:::. .: : .::::.::::.:::::. :.: ::: ::::::.. XP_016 RKRSTPMNPANTIRKTHSS-----STISQRPYRDRVIHLLALKAYKKPELLARLQKDGVN 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB3 QADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQLLKRVLVRKLCQP : ::..: ..::::::...:: . ::.: ..:..:.::::::: :.. :. :: ::: XP_016 QKDKNSLGAILQQVANLNSKDLSYTLKDYVFKELQRDWPGYSEIDRRSLESVLSRKLNPS 240 250 260 270 280 290 300 310 320 330 340 350 pF1KB3 QSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGK :... : . :: ..: ::::: .::::: ::: ::::.:.:. :..::. XP_016 QNAA---GTSRSESPVCSSRDAVSSPQKRLLDSEFIDPLMNKKARISHLTNRVPPTLNGH 300 310 320 330 340 350 360 370 380 390 400 pF1KB3 LGVPNGREALLPTPGPPAS----TDTLSSSTHLPPRLEPPRAHD--------PLADVSND :. :..... : :::. : ::.:: .::. . : . ..: XP_016 LN-PTSEKSAAGLPLPPAAAAIPTPPPLPSTYLPIS-HPPQIVNSNSNSPSTPEGRGTQD 360 370 380 390 400 410 420 430 440 450 pF1KB3 L---GHSGRDC--EHGEAAAPAPTVRLGLP---LLTDCAQPSRPHGSPSRSKPKKKSKKH : . : : : . . : :: .: : .: . . : :..: ::::::: XP_016 LPVDSFSQNDSIYEDQQDKYTSRTSLETLPPGSVLLKCPKPMEENHSMSHKKSKKKSKKH 410 420 430 440 450 460 460 470 480 490 500 pF1KB3 KDKERAAE------DKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPDY :.:.. . .. . .: .. . ..: .. :.. :..: : :. : ::: XP_016 KEKDQIKKHDIETIEEKEEDLKREEEIAKLNNSSPNSSGGVKEDCTASMEP-SAIELPDY 470 480 490 500 510 520 510 520 530 540 550 560 pF1KB3 LLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEEYET :.:: :: : ::::.::.::::::.::: ::::.: ..::: .:::: ..:: ::.::. XP_016 LIKYIAIVSYEQRQNYKDDFNAEYDEYRALHARMETVARRFIKLDAQRKRLSPGSKEYQV 530 540 550 560 570 580 570 580 590 600 610 620 pF1KB3 TRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP XP_016 V >>NP_036213 (OMIM: 601874) RNA polymerase II elongation (640 aa) initn: 1487 init1: 688 opt: 1298 Z-score: 804.3 bits: 159.0 E(85289): 4.6e-38 Smith-Waterman score: 1883; 47.9% identity (73.7% similar) in 643 aa overlap (4-620:9-639) 10 20 30 40 50 pF1KB3 MAALKEDRSYGLSCGRVSDGSKVSVFHVKLTDSALRAFESYRARQDSVSLRPSIR :.:.. :::::::... ....:.:::::..:.::.:.:..... . .::::. NP_036 MAAGGTGGLREEQRYGLSCGRLGQ-DNITVLHVKLTETAIRALETYQSHKNLIPFRPSIQ 10 20 30 40 50 60 70 80 90 100 110 pF1KB3 FQGSQGHISIPQPDCPAEARTFSFYLSNIGRDNPQGSFDCIQQYVSSHGEVHLDCLGSIQ ::: .: ..::. : :...:.:::::.:.:::::::::::: :: : .:.::: :: NP_036 FQGLHGLVKIPKNDPLNEVHNFNFYLSNVGKDNPQGSFDCIQQTFSSSGASQLNCLGFIQ 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB3 DKITVCATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKVQFRKPAPGATDAVPS ::::::::.:::: .:. :.:::::.:.::. ::: :: :.::.::.:: ...:.:: NP_036 DKITVCATNDSYQMTRERMTQAEEESRNRSTKVIKPGGPYVGKRVQIRKAPQAVSDTVPE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB3 RKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPYRKAELLLRLQKDGLT :::.::.: :..:::. .: : .::::.::::.:::::. :.: ::: ::::::.. NP_036 RKRSTPMNPANTIRKTHSS-----STISQRPYRDRVIHLLALKAYKKPELLARLQKDGVN 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB3 QADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGDQQLLKRVLVRKLCQP : ::..: ..::::::...:: . ::.: ..:..:.::::::: :.. :. :: ::: NP_036 QKDKNSLGAILQQVANLNSKDLSYTLKDYVFKELQRDWPGYSEIDRRSLESVLSRKLNPS 240 250 260 270 280 290 300 310 320 330 340 350 pF1KB3 QSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPRISHFTQRAQPAVNGK :... : . :: ..: ::::: .::::: ::: ::::.:.:. :..::. NP_036 QNAA---GTSRSESPVCSSRDAVSSPQKRLLDSEFIDPLMNKKARISHLTNRVPPTLNGH 300 310 320 330 340 350 360 370 380 390 400 pF1KB3 LGVPNGREALLPTPGPPAS----TDTLSSSTHLPPRLEPPRAHD--------PLADVSND :. :..... : :::. : ::.:: .::. . : . ..: NP_036 LN-PTSEKSAAGLPLPPAAAAIPTPPPLPSTYLPIS-HPPQIVNSNSNSPSTPEGRGTQD 360 370 380 390 400 410 420 430 440 450 pF1KB3 L---GHSGRDC--EHGEAAAPAPTVRLGLP---LLTDCAQPSRPHGSPSRSKPKKKSKKH : . : : : . . : :: .: : .: . . : :..: ::::::: NP_036 LPVDSFSQNDSIYEDQQDKYTSRTSLETLPPGSVLLKCPKPMEENHSMSHKKSKKKSKKH 410 420 430 440 450 460 460 470 480 490 500 pF1KB3 KDKERAAE------DKPRAQLPDCAPATHATPGAPADTPGLNGTCSVSSVPTSTSETPDY :.:.. . .. . .: .. . ..: .. :.. :..: :.. : ::: NP_036 KEKDQIKKHDIETIEEKEEDLKREEEIAKLNNSSPNSSGGVKEDCTASMEPSAI-ELPDY 470 480 490 500 510 520 510 520 530 540 550 560 pF1KB3 LLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERITRRFTQLDAQLRQLSQGSEEYET :.:: :: : ::::.::.::::::.::: ::::.: ..::: .:::: ..:: ::.::.. NP_036 LIKYIAIVSYEQRQNYKDDFNAEYDEYRALHARMETVARRFIKLDAQRKRLSPGSKEYQN 530 540 550 560 570 580 570 580 590 600 610 620 pF1KB3 TRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQRQLQAWP .. ..::::.:::... :: .::.::::::.:::::::::.:.::.: ..: NP_036 VHEEVLQEYQKIKQSSPNYHEEKYRCEYLHNKLAHIKRLIGEFDQQQAESWS 590 600 610 620 630 640 >>XP_006714638 (OMIM: 601874) PREDICTED: RNA polymerase (585 aa) initn: 1293 init1: 551 opt: 1102 Z-score: 686.1 bits: 137.0 E(85289): 1.8e-31 Smith-Waterman score: 1687; 47.9% identity (72.1% similar) in 591 aa overlap (56-620:5-584) 30 40 50 60 70 80 pF1KB3 FHVKLTDSALRAFESYRARQDSVSLRPSIRFQGSQGHISIPQPDCPAEARTFSFYLSNIG :. :.. ..::. : :...:.:::::.: XP_006 MDLFFSWSNALVKIPKNDPLNEVHNFNFYLSNVG 10 20 30 90 100 110 120 130 140 pF1KB3 RDNPQGSFDCIQQYVSSHGEVHLDCLGSIQDKITVCATDDSYQKARQSMAQAEEETRSRS .:::::::::::: :: : .:.::: ::::::::::.:::: .:. :.:::::.:.:: XP_006 KDNPQGSFDCIQQTFSSSGASQLNCLGFIQDKITVCATNDSYQMTRERMTQAEEESRNRS 40 50 60 70 80 90 150 160 170 180 190 200 pF1KB3 AIVIKAGGRYLGKKVQFRKPAPGATDAVPSRKRATPINLASAIRKSGASAVSGGSGVSQR . ::: :: :.::.::.:: ...:.:: :::.::.: :..:::. .: : .::: XP_006 TKVIKPGGPYVGKRVQIRKAPQAVSDTVPERKRSTPMNPANTIRKTHSS-----STISQR 100 110 120 130 140 210 220 230 240 250 260 pF1KB3 PFRDRVLHLLALRPYRKAELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCM :.::::.:::::. :.: ::: ::::::..: ::..: ..::::::...:: . ::.: . XP_006 PYRDRVIHLLALKAYKKPELLARLQKDGVNQKDKNSLGAILQQVANLNSKDLSYTLKDYV 150 160 170 180 190 200 270 280 290 300 310 320 pF1KB3 YKDVQKDWPGYSEGDQQLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRL .:..:.::::::: :.. :. :: ::: :... : . :: ..: ::::: XP_006 FKELQRDWPGYSEIDRRSLESVLSRKLNPSQNAA---GTSRSESPVCSSRDAVSSPQKRL 210 220 230 240 250 260 330 340 350 360 370 380 pF1KB3 QPPDFIDPLANKKPRISHFTQRAQPAVNGKLGVPNGREALLPTPGPPAS----TDTLSSS .::::: ::: ::::.:.:. :..::.:. :..... : :::. : : XP_006 LDSEFIDPLMNKKARISHLTNRVPPTLNGHLN-PTSEKSAAGLPLPPAAAAIPTPPPLPS 270 280 290 300 310 320 390 400 410 420 pF1KB3 THLPPRLEPPRAHD--------PLADVSNDL---GHSGRDC--EHGEAAAPAPTVRLGLP :.:: .::. . : . ..:: . : : : . . : :: XP_006 TYLPIS-HPPQIVNSNSNSPSTPEGRGTQDLPVDSFSQNDSIYEDQQDKYTSRTSLETLP 330 340 350 360 370 380 430 440 450 460 470 pF1KB3 ---LLTDCAQPSRPHGSPSRSKPKKKSKKHKDKERAAE------DKPRAQLPDCAPATHA .: : .: . . : :..: ::::::::.:.. . .. . .: .. XP_006 PGSVLLKCPKPMEENHSMSHKKSKKKSKKHKEKDQIKKHDIETIEEKEEDLKREEEIAKL 390 400 410 420 430 440 480 490 500 510 520 530 pF1KB3 TPGAPADTPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDL . ..: .. :.. :..: : :. : ::::.:: :: : ::::.::.::::::.::: : XP_006 NNSSPNSSGGVKEDCTASMEP-SAIELPDYLIKYIAIVSYEQRQNYKDDFNAEYDEYRAL 450 460 470 480 490 500 540 550 560 570 580 590 pF1KB3 HARIERITRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLH :::.: ..::: .:::: ..:: ::.::.... ..::::.:::... :: .::.:::::: XP_006 HARMETVARRFIKLDAQRKRLSPGSKEYQNVHEEVLQEYQKIKQSSPNYHEEKYRCEYLH 510 520 530 540 550 560 600 610 620 pF1KB3 SKLAHIKRLIAEYDQRQLQAWP .:::::::::.:.::.: ..: XP_006 NKLAHIKRLIGEFDQQQAESWS 570 580 >>XP_016864732 (OMIM: 601874) PREDICTED: RNA polymerase (455 aa) initn: 1133 init1: 522 opt: 635 Z-score: 404.7 bits: 84.6 E(85289): 8.4e-16 Smith-Waterman score: 1172; 44.2% identity (69.4% similar) in 464 aa overlap (182-620:1-454) 160 170 180 190 200 210 pF1KB3 GGRYLGKKVQFRKPAPGATDAVPSRKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRV .: :..:::. .: : .::::.:::: XP_016 MNPANTIRKTHSS-----STISQRPYRDRV 10 20 220 230 240 250 260 270 pF1KB3 LHLLALRPYRKAELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQK .:::::. :.: ::: ::::::..: ::..: ..::::::...:: . ::.: ..:..:. XP_016 IHLLALKAYKKPELLARLQKDGVNQKDKNSLGAILQQVANLNSKDLSYTLKDYVFKELQR 30 40 50 60 70 80 280 290 300 310 320 330 pF1KB3 DWPGYSEGDQQLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFI ::::::: :.. :. :: ::: :... : . :: ..: ::::: .:: XP_016 DWPGYSEIDRRSLESVLSRKLNPSQNAA---GTSRSESPVCSSRDAVSSPQKRLLDSEFI 90 100 110 120 130 140 340 350 360 370 380 pF1KB3 DPLANKKPRISHFTQRAQPAVNGKLGVPNGREALLPTPGPPAS----TDTLSSSTHLP-- ::: ::: ::::.:.:. :..::.:. :..... : :::. : ::.:: XP_016 DPLMNKKARISHLTNRVPPTLNGHLN-PTSEKSAAGLPLPPAAAAIPTPPPLPSTYLPIS 150 160 170 180 190 200 390 400 410 420 430 pF1KB3 --PRLEPPRAHDPLADV---SNDL---GHSGRDC--EHGEAAAPAPTVRLGLP---LLTD :.. ...: . ..:: . : : : . . : :: .: XP_016 HPPQIVNSNSNSPSTPEGRGTQDLPVDSFSQNDSIYEDQQDKYTSRTSLETLPPGSVLLK 210 220 230 240 250 260 440 450 460 470 480 pF1KB3 CAQPSRPHGSPSRSKPKKKSKKHKDKERAAE------DKPRAQLPDCAPATHATPGAPAD : .: . . : :..: ::::::::.:.. . .. . .: .. . ..: . XP_016 CPKPMEENHSMSHKKSKKKSKKHKEKDQIKKHDIETIEEKEEDLKREEEIAKLNNSSPNS 270 280 290 300 310 320 490 500 510 520 530 540 pF1KB3 TPGLNGTCSVSSVPTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERI . :.. :..: :.. : ::::.:: :: : ::::.::.::::::.::: ::::.: . XP_016 SGGVKEDCTASMEPSAI-ELPDYLIKYIAIVSYEQRQNYKDDFNAEYDEYRALHARMETV 330 340 350 360 370 380 550 560 570 580 590 600 pF1KB3 TRRFTQLDAQLRQLSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIK .::: .:::: ..:: ::.::.... ..::::.:::... :: .::.::::::.:::::: XP_016 ARRFIKLDAQRKRLSPGSKEYQNVHEEVLQEYQKIKQSSPNYHEEKYRCEYLHNKLAHIK 390 400 410 420 430 440 610 620 pF1KB3 RLIAEYDQRQLQAWP :::.:.::.: ..: XP_016 RLIGEFDQQQAESWS 450 >>XP_016864729 (OMIM: 601874) PREDICTED: RNA polymerase (508 aa) initn: 1133 init1: 522 opt: 635 Z-score: 404.0 bits: 84.6 E(85289): 9.2e-16 Smith-Waterman score: 1372; 45.9% identity (70.5% similar) in 516 aa overlap (131-620:3-507) 110 120 130 140 150 160 pF1KB3 SSHGEVHLDCLGSIQDKITVCATDDSYQKARQSMAQAEEETRSRSAIVIKAGGRYLGKKV :. :.:::::.:.::. ::: :: :.::.: XP_016 MTRERMTQAEEESRNRSTKVIKPGGPYVGKRV 10 20 30 170 180 190 200 210 220 pF1KB3 QFRKPAPGATDAVPSRKRATPINLASAIRKSGASAVSGGSGVSQRPFRDRVLHLLALRPY :.:: ...:.:: :::.::.: :..:::. .: : .::::.::::.:::::. : XP_016 QIRKAPQAVSDTVPERKRSTPMNPANTIRKTHSS-----STISQRPYRDRVIHLLALKAY 40 50 60 70 80 230 240 250 260 270 280 pF1KB3 RKAELLLRLQKDGLTQADKDALDGLLQQVANMSAKDGTCTLQDCMYKDVQKDWPGYSEGD .: ::: ::::::..: ::..: ..::::::...:: . ::.: ..:..:.::::::: : XP_016 KKPELLARLQKDGVNQKDKNSLGAILQQVANLNSKDLSYTLKDYVFKELQRDWPGYSEID 90 100 110 120 130 140 290 300 310 320 330 340 pF1KB3 QQLLKRVLVRKLCQPQSTGSLLGDPAASSPPGERGRSASPPQKRLQPPDFIDPLANKKPR .. :. :: ::: :... : . :: ..: ::::: .::::: ::: : XP_016 RRSLESVLSRKLNPSQNAA---GTSRSESPVCSSRDAVSSPQKRLLDSEFIDPLMNKKAR 150 160 170 180 190 200 350 360 370 380 390 pF1KB3 ISHFTQRAQPAVNGKLGVPNGREALLPTPGPPAS----TDTLSSSTHLPPRLEPPRAHD- :::.:.:. :..::.:. :..... : :::. : ::.:: .::. . XP_016 ISHLTNRVPPTLNGHLN-PTSEKSAAGLPLPPAAAAIPTPPPLPSTYLPIS-HPPQIVNS 210 220 230 240 250 260 400 410 420 430 440 pF1KB3 -------PLADVSNDL---GHSGRDC--EHGEAAAPAPTVRLGLP---LLTDCAQPSRPH : . ..:: . : : : . . : :: .: : .: . . XP_016 NSNSPSTPEGRGTQDLPVDSFSQNDSIYEDQQDKYTSRTSLETLPPGSVLLKCPKPMEEN 270 280 290 300 310 320 450 460 470 480 490 pF1KB3 GSPSRSKPKKKSKKHKDKERAAE------DKPRAQLPDCAPATHATPGAPADTPGLNGTC : :..: ::::::::.:.. . .. . .: .. . ..: .. :.. : XP_016 HSMSHKKSKKKSKKHKEKDQIKKHDIETIEEKEEDLKREEEIAKLNNSSPNSSGGVKEDC 330 340 350 360 370 380 500 510 520 530 540 550 pF1KB3 SVSSVPTSTSETPDYLLKYAAISSSEQRQSYKNDFNAEYSEYRDLHARIERITRRFTQLD ..: : :. : ::::.:: :: : ::::.::.::::::.::: ::::.: ..::: .:: XP_016 TASMEP-SAIELPDYLIKYIAIVSYEQRQNYKDDFNAEYDEYRALHARMETVARRFIKLD 390 400 410 420 430 440 560 570 580 590 600 610 pF1KB3 AQLRQLSQGSEEYETTRGQILQEYRKIKKTNTNYSQEKHRCEYLHSKLAHIKRLIAEYDQ :: ..:: ::.::.... ..::::.:::... :: .::.::::::.:::::::::.:.:: XP_016 AQRKRLSPGSKEYQNVHEEVLQEYQKIKQSSPNYHEEKYRCEYLHNKLAHIKRLIGEFDQ 450 460 470 480 490 500 620 pF1KB3 RQLQAWP .: ..: XP_016 QQAESWS 621 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 03:16:35 2016 done: Tue Nov 8 03:16:36 2016 Total Scan time: 12.740 Total Display time: 0.100 Function used was FASTA [36.3.4 Apr, 2011]