FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3917, 1720 aa
  1>>>pF1KB3917 1720 - 1720 aa - 1720 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.4235+/-0.00156; mu= 4.4467+/- 0.093
 mean_var=237.8483+/-48.864, 0's: 0 Z-trim(105.7): 34 B-trim: 154 in 2/49
 Lambda= 0.083162
 statistics sampled from 8562 (8570) to 8562 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.6), E-opt: 0.2 (0.263), width: 16
 Scan time: 4.170

The best scores are:                                  opt   bits E(32554)
CCDS42706.1 POLR1A gene_id:25885|Hs108|chr2  (1720) 11455 1389.6       0
CCDS7354.1 POLR3A gene_id:11128|Hs108|chr10  (1390)   485   73.4 6.2e-12

>>CCDS42706.1 POLR1A gene_id:25885|Hs108|chr2 (1720 aa)
 initn: 11455 init1: 11455 opt: 11455 Z-score: 7439.5 bits: 1389.6 E(32554): 0
Smith-Waterman score: 11455; 99.9% identity (99.9% similar) in 1720 aa overlap (1-1720:1-1720)

               10        20        30        40        50        60
pF1KB3 MLISKNMPWRRLQGISFGMYSAEELKKLSVKSITNPRYLDSLGNPSANGLYDLALGPADS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MLISKNMPWRRLQGISFGMYSAEELKKLSVKSITNPRYLDSLGNPSANGLYDLALGPADS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 KEVCSTCVQDFSNCSGHLGHIELPLTVYNPLLFDKLYLLLRGSCLNCHMLTCPRAVIHLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 KEVCSTCVQDFSNCSGHLGHIELPLTVYNPLLFDKLYLLLRGSCLNCHMLTCPRAVIHLL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB3 LCQLRVLEVGALQAVYELERILNRFLEENADPSASEIREELEQYTTEIVQNNLLGSQGAH
       ::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::
CCDS42 LCQLRVLEVGALQAVYELERILNRFLEENPDPSASEIREELEQYTTEIVQNNLLGSQGAH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB3 VKNVCESKSKLIALFWKAHMNAKRCPHCKTGRSVVRKEHNSKLTITFPAMVHRTAGQKDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VKNVCESKSKLIALFWKAHMNAKRCPHCKTGRSVVRKEHNSKLTITFPAMVHRTAGQKDS
              190       200
       210       220       230       240

              250       260       270       280       290       300
pF1KB3 EPLGIEEAQIGKRGYLTPTSAREHLSALWKNEGFFLNYLFSGMDDDGMESRFNPSVFFLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EPLGIEEAQIGKRGYLTPTSAREHLSALWKNEGFFLNYLFSGMDDDGMESRFNPSVFFLD
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB3 FLVVPPSRYRPVSRLGDQMFTNGQTVNLQAVMKDVVLIRKLLALMAQEQKLPEEVATPTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 FLVVPPSRYRPVSRLGDQMFTNGQTVNLQAVMKDVVLIRKLLALMAQEQKLPEEVATPTT
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB3 DEEKDSLIAIDRSFLSTLPGQSLIDKLYNIWIRLQSHVNIVFDSEMDKLMMDKYPGIRQI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DEEKDSLIAIDRSFLSTLPGQSLIDKLYNIWIRLQSHVNIVFDSEMDKLMMDKYPGIRQI
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB3 LEKKEGLFRKHMMGKRVDYAARSVICPDMYINTNEIGIPMVFATKLTYPQPVTPWNVQEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LEKKEGLFRKHMMGKRVDYAARSVICPDMYINTNEIGIPMVFATKLTYPQPVTPWNVQEL
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB3 RQAVINGPNVHPGASMVINEDGSRTALSAVDMTQREAVAKQLLTPATGAPKPQGTKIVCR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 RQAVINGPNVHPGASMVINEDGSRTALSAVDMTQREAVAKQLLTPATGAPKPQGTKIVCR
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB3 HVKNGDILLLNRQPTLHRPSIQAHRARILPEEKVLRLHYANCKAYNADFDGDEMNAHFPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 HVKNGDILLLNRQPTLHRPSIQAHRARILPEEKVLRLHYANCKAYNADFDGDEMNAHFPQ
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB3 SELGRAEAYVLACTDQQYLVPKDGQPLAGLIQDHMVSGASMTTRGCFFTREHYMELVYRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 SELGRAEAYVLACTDQQYLVPKDGQPLAGLIQDHMVSGASMTTRGCFFTREHYMELVYRG
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KB3 LTDKVGRVKLLSPSILKPFPLWTGKQVVSTLLINIIPEDHIPLNLSGKAKITGKAWVKET
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LTDKVGRVKLLSPSILKPFPLWTGKQVVSTLLINIIPEDHIPLNLSGKAKITGKAWVKET
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KB3 PRSVPGFNPDSMCESQVIIREGELLCGVLDKAHYGSSAYGLVHCCYEIYGGETSGKVLTC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PRSVPGFNPDSMCESQVIIREGELLCGVLDKAHYGSSAYGLVHCCYEIYGGETSGKVLTC
              730       740       750       760       770       780

              790       800       810       820       830       840
pF1KB3 LARLFTAYLQLYRGFTLGVEDILVKPKADVKRQRIIEESTHCGPQAVRAALNLPEAASYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LARLFTAYLQLYRGFTLGVEDILVKPKADVKRQRIIEESTHCGPQAVRAALNLPEAASYD
              790       800       810       820       830       840

              850       860       870       880       890       900
pF1KB3 EVRGKWQDAHLGKDQRDFNMIDLKFKEEVNHYSNEINKACMPFGLHRQFPENSLQMMVQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EVRGKWQDAHLGKDQRDFNMIDLKFKEEVNHYSNEINKACMPFGLHRQFPENSLQMMVQS
              850       860       870       880       890       900

              910       920       930       940       950       960
pF1KB3 GAKGSTVNTMQISCLLGQIELEGRRPPLMASGKSLPCFEPYEFTPRAGGFVTGRFLTGIK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 GAKGSTVNTMQISCLLGQIELEGRRPPLMASGKSLPCFEPYEFTPRAGGFVTGRFLTGIK
              910       920       930       940       950       960

              970       980       990      1000      1010      1020
pF1KB3 PPEFFFHCMAGREGLVDTAVKTSRSGYLQRCIIKHLEGLVVQYDLTVRDSDGSVVQFLYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PPEFFFHCMAGREGLVDTAVKTSRSGYLQRCIIKHLEGLVVQYDLTVRDSDGSVVQFLYG
              970       980       990      1000      1010      1020

             1030      1040      1050      1060      1070      1080
pF1KB3 EDGLDIPKTQFLQPKQFPFLASNYEVIMKSQHLHEVLSRADPKKALHHFRAIKKWQSKHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 EDGLDIPKTQFLQPKQFPFLASNYEVIMKSQHLHEVLSRADPKKALHHFRAIKKWQSKHP
             1030      1040      1050      1060      1070      1080

             1090      1100      1110      1120      1130      1140
pF1KB3 NTLLRRGAFLSYSQKIQEAVKALKLESENRNGRSPGTQEMLRMWYELDEESRRKYQKKAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 NTLLRRGAFLSYSQKIQEAVKALKLESENRNGRSPGTQEMLRMWYELDEESRRKYQKKAA
             1090      1100      1110      1120      1130      1140

             1150      1160      1170      1180      1190      1200
pF1KB3 ACPDPSLSVWRPDIYFASVSETFETKVDDYSQEWAAQTEKSYEKSELSLDRLRTLLQLKW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ACPDPSLSVWRPDIYFASVSETFETKVDDYSQEWAAQTEKSYEKSELSLDRLRTLLQLKW
             1150      1160      1170      1180      1190      1200

             1210      1220      1230      1240
      1250      1260
pF1KB3 QRSLCEPGEAVGLLAAQSIGEPSTQMTLNTFHFAGRGEMNVTLGIPRLREILMVASANIK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 QRSLCEPGEAVGLLAAQSIGEPSTQMTLNTFHFAGRGEMNVTLGIPRLREILMVASANIK
             1210      1220      1230      1240      1250      1260

             1270      1280      1290      1300      1310      1320
pF1KB3 TPMMSVPVLNTKKALKRVKSLKKQLTRVCLGEVLQKIDVQESFCMEEKQNKFQVYQLRFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 TPMMSVPVLNTKKALKRVKSLKKQLTRVCLGEVLQKIDVQESFCMEEKQNKFQVYQLRFQ
             1270      1280      1290      1300      1310      1320

             1330      1340      1350      1360      1370      1380
pF1KB3 FLPHAYYQQEKCLRPEDILRFMETRFFKLLMESIKKKNNKASAFRNVNTRRATQRDLDNA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 FLPHAYYQQEKCLRPEDILRFMETRFFKLLMESIKKKNNKASAFRNVNTRRATQRDLDNA
             1330      1340      1350      1360      1370      1380

             1390      1400      1410      1420      1430      1440
pF1KB3 GELGRSRGEQEGDEEEEGHIVDAEAEEGDADASDAKRKEKQEEEVDYESEEEEEREGEEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 GELGRSRGEQEGDEEEEGHIVDAEAEEGDADASDAKRKEKQEEEVDYESEEEEEREGEEN
             1390      1400      1410      1420      1430      1440

             1450      1460      1470      1480      1490      1500
pF1KB3 DDEDMQEERNPHREGARKTQEQDEEVGLGTEEDPSLPALLTQPRKPTHSQEPQGPEAMER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DDEDMQEERNPHREGARKTQEQDEEVGLGTEEDPSLPALLTQPRKPTHSQEPQGPEAMER
             1450      1460      1470      1480      1490      1500

             1510      1520      1530      1540      1550      1560
pF1KB3 RVQAVREIHPFIDDYQYDTEESLWCQVTVKLPLMKINFDMSSLVVSLAHGAVIYATKGIT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 RVQAVREIHPFIDDYQYDTEESLWCQVTVKLPLMKINFDMSSLVVSLAHGAVIYATKGIT
             1510      1520      1530      1540      1550      1560

             1570      1580      1590      1600      1610      1620
pF1KB3 RCLLNETTNNKNEKELVLNTEGINLPELFKYAEVLDLRRLYSNDIHAIANTYGIEAALRV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 RCLLNETTNNKNEKELVLNTEGINLPELFKYAEVLDLRRLYSNDIHAIANTYGIEAALRV
             1570      1580      1590      1600      1610      1620

             1630      1640      1650      1660      1670      1680
pF1KB3 IEKEIKDVFAVYGIAVDPRHLSLVADYMCFEGVYKPLNRFGIRSNSSPLQQMTFETSFQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42
IEKEIKDVFAVYGIAVDPRHLSLVADYMCFEGVYKPLNRFGIRSNSSPLQQMTFETSFQF 1630 1640 1650 1660 1670 1680 1690 1700 1710 1720 pF1KB3 LKQATMLGSHDELRSPSACLVVGKVVRGGTGLFELKQPLR :::::::::::::::::::::::::::::::::::::::: CCDS42 LKQATMLGSHDELRSPSACLVVGKVVRGGTGLFELKQPLR 1690 1700 1710 1720 >>CCDS7354.1 POLR3A gene_id:11128|Hs108|chr10 (1390 aa) initn: 1492 init1: 459 opt: 485 Z-score: 327.8 bits: 73.4 E(32554): 6.2e-12 Smith-Waterman score: 1498; 29.3% identity (55.0% similar) in 1311 aa overlap (10-1298:13-1118) 10 20 30 40 50 pF1KB3 MLISKNMPWRRLQGISFGMYSAEELKKLS-VKSITNPRY-LDSLGNPSANGLYDLAL .... : ::: : ::... . .. ... : :. : :. : . CCDS73 MVKEQFRETDVAKKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNQHAPLLYGVLDHRM 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB3 GPADSKEVCSTCVQDFSNCSGHLGHIELPLTVYNPLLFDKLYLLLRGSCLNCHMLTCPRA : ... . : :: .....: :: :.:.: : .. : . .:. : .: : CCDS73 GTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVIGILQMICKTC----C--- 70 80 90 100 110 120 130 140 150 160 170 pF1KB3 VIHLLLCQLRVLEVGALQAVYELERILNRFLEENADPSASEIREE-LEQYTTEIVQNNLL :..: : : .. .::. :. . .... :.. .. .. CCDS73 --HIMLSQ-------------EEKK---QFLDYLKRPGLTYLQKRGLKKKISDKCRK--- 120 130 140 150 180 190 200 210 220 230 pF1KB3 GSQGAHVKNVCESKSKLIALFWKAHMNAKRCPHCKTGRSVVRKEHNSKLTITFPAMVHRT ::.:. . . . : . . ::...:: .. : :. : CCDS73 -------KNICHHCGAFNGTVKKCGLLKIIHEKYKTNKKVVDPIVSNFLQSFETAIEH-- 160 170 180 190 200 240 250 260 270 280 290 pF1KB3 AGQKDSEPLGIEEAQIGKRGYLTPTSAREHLSALWKNEGFFLNYLFSGMDDDGMESRF-- .:. ::: .:. :.:.:. : :: ::. . . . . CCDS73 --NKEVEPL------LGR--------AQENLNPL-----VVLN-LFKRIPAEDVPLLLMN 210 220 230 240 300 310 320 330 340 pF1KB3 ----NPSVFFLDFLVVPPSRYRPVSRLGDQMFTNGQTVNLQAVMKDVVLIRKLLALMAQE .:: ..: :.::: :: : ..: . .. . .: . ..... .. .. CCDS73 PEAGKPSDLILTRLLVPPLCIRP-SVVSD-LKSGTNEDDLTMKLTEIIFLNDVI----KK 250 260 270 280 290 350 360 370 380 390 400 pF1KB3 QKLPEEVATPTTDEEKDSLIAIDRSFLSTLPGQSLIDKLYNIWIRLQSHVNIVFDSEMDK ... . : ..: : : :: . . ..::.. 
CCDS73 HRI---------SGAKTQMIMED-------------------WDFLQLQCALYINSELSG 300 310 320 410 420 430 440 450 460 pF1KB3 LMMDKYP-----GIRQILEKKEGLFRKHMMGKRVDYAARSVICPDMYINTNEIGIPMVFA . .. : :. : :. :.: :: .. :::::...:.:: :: . .:...:. : CCDS73 IPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVA 330 340 350 360 370 380 470 480 490 500 510 520 pF1KB3 TKLTYPQPVTPWNVQELRQAVINGPNVHPGASMVINEDGSRTALSAVDMTQREAVAKQLL ::.:. :. :.. ::. : :::.:::::... .. . . . . .:: .:..: CCDS73 KILTFPEKVNKANINFLRKLVQNGPEVHPGANFIQQRHTQMKRF--LKYGNREKMAQEL- 390 400 410 420 430 440 530 540 550 560 570 580 pF1KB3 TPATGAPKPQGTKIVCRHVKNGDILLLNRQPTLHRPSIQAHRARILPEEKVLRLHYANCK . :: ::. .::..:.::::.::. ::.:: ::. :. ...:.. : CCDS73 ---------KYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPH-RTFRFNECVCT 450 460 470 480 490 590 600 610 620 630 640 pF1KB3 AYNADFDGDEMNAHFPQSELGRAEAYVLACTDQQYLVPKDGQPLAGLIQDHMVSGASMTT ::::::::::: :.::.: ..::: :: : . ..:..:.:: . ::: .... .: CCDS73 PYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTL 500 510 520 530 540 550 650 660 670 680 690 700 pF1KB3 RGCFFTREHYMELVYRGLT--DKVGRVKLLSPSILKPFPLWTGKQVVSTLLINIIPEDHI . :: : . ... :. :. .:.: :.:::: ::::::. :..: : : CCDS73 KDTFFDRAKACQIIASILVGKDEKIKVRLPPPTILKPVTLWTGKQIFSVILR---PSDDN 560 570 580 590 600 610 710 720 730 740 750 pF1KB3 PLNLSGKAKITGKAWVKETPRSVPGFNPDSMC--ESQVIIREGELLCGVLDKAHYGS-SA :. . ..: :: . : . : .: .: : :...::. : .::. :: : CCDS73 PVRANLRTK--GKQYC--------GKGED-LCANDSYVTIQNSELMSGSMDKGTLGSGSK 620 630 640 650 660 760 770 780 790 800 810 pF1KB3 YGLVHCCYEIYGGETSGKVLTCLARLFTAYLQLYRGFTLGVEDILVKPKADVKRQRIIEE .. . . .: . .. ... :::: .::. :::..:. : : : . . . : CCDS73 NNIFYILLRDWGQNEAADAMSRLARLAPVYLS-NRGFSIGIGD--VTPGQGLLKAKY--E 670 680 690 700 710 820 830 840 850 860 870 pF1KB3 STHCGPQAVRAALNLPEAASYDEVRGKWQDAHLGKDQRDFNMIDLKFKEEVNHYSNEINK . : . . :: . :: :. .. .. . :: :.. .. .. 
CCDS73 LLNAGYKKCDEYI---EALN----TGKLQQQPGCTAEETLEALILK---ELSVIRDHAGS 720 730 740 750 760 880 890 900 910 920 930 pF1KB3 ACMPFGLHRQFPE-NSLQMMVQSGAKGSTVNTMQISCLLGQIELEGRRPPLMASGKSLPC ::. :.. . :: :. :.::: .: :. .:: . : : : ..::: CCDS73 ACL-----RELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPH 770 780 790 800 810 820 940 950 960 970 980 990 pF1KB3 FEPYEFTPRAGGFVTGRFLTGIKPPEFFFHCMAGREGLVDTAVKTSRSGYLQRCIIKHLE :: . : : :::.. : .:. : ::::: ::::::::::::::...::.:: ..: :: CCDS73 FEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLE 830 840 850 860 870 880 1000 1010 1020 1030 1040 1050 pF1KB3 GLVVQYDLTVRDSDGSVVQFLYGEDGLDIPKTQFL-QPKQFPFLASNYEVIMKSQHLHEV : :::::::.: :...::.:: :::: . .: .: . .: .... CCDS73 DLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKDEPLEFKRVLDNIKAVFPCP----- 890 900 910 920 930 1060 1070 1080 1090 1100 1110 pF1KB3 LSRADPKKALHHFRAIKKWQSKHPNTLLRRGAFLSYSQK-IQEAVKALKLESENRNGRSP ..: :: . . : .: ..... :: ... .:: : .: CCDS73 ---SEP--ALSKNELILTTES-----IMKKSEFLCCQDSFLQEIKKFIK----------- 940 950 960 970 1120 1130 1140 1150 1160 1170 pF1KB3 GTQEMLRMWYELDEESRRKYQKKAAACPDPSLSVWRPDIYFASVSETFETKVDDYSQEWA :..: .. ..: :: . . .: . .: ..: . CCDS73 GVSEKIK-------KTRDKYGINDNGTTEPRV------LY----------QLDRITP--- 980 990 1000 1180 1190 1200 1210 1220 1230 pF1KB3 AQTEKSYEKSELSLDRLRTLLQLKWQRSLCEPGEAVGLLAAQSIGEPSTQMTLNTFHFAG .:.:: : . : :..:. ::: ::: : :::::::.:::::.:::::: CCDS73 TQVEKFLETCR---D--------KYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAG 1010 1020 1030 1040 1050 1240 1250 1260 1270 1280 1290 pF1KB3 RGEMNVTLGIPRLREILMVASANIKTPMMSVPVLNTKKALKRVKSLKKQLTRVCLGEVLQ . ::.:::.::..::. :: :.::.... :. .. .: .. .. :::. . CCDS73 VASMNITLGVPRIKEIIN-ASKAISTPIITAQ-LDKDDDADYARLVKGRIEKTLLGEISE 1060 1070 1080 1090 1100 1110 1300 1310 1320 1330 1340 1350 pF1KB3 KIDVQESFCMEEKQNKFQVYQLRFQFLPHAYYQQEKCLRPEDILRFMETRFFKLLMESIK :. 
CCDS73 YIEEVFLPDDCFILVKLSLERIRLLRLEVNAETVRYSICTSKLRVKPGDVAVHGEAVVCV
1120 1130 1140 1150 1160 1170

1720 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 11:01:11 2016 done: Tue Nov 8 11:01:12 2016
 Total Scan time: 4.170 Total Display time: 0.150

Function used was FASTA [36.3.4 Apr, 2011]