GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:39:02 Sequence gi568815595r:127981253_128223724 : 242472 bp : 44.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2845 3026 182 2 2 99 45 218 0.997 16.37 1.02 PlyA + 4037 4042 6 1.05 2.12 PlyA - 5239 5234 6 -0.45 2.11 Term - 7143 6956 188 1 2 59 53 74 0.798 -1.35 2.10 Intr - 7685 7605 81 0 0 79 100 44 0.922 4.31 2.09 Intr - 8774 8682 93 0 0 44 111 60 0.902 3.84 2.08 Intr - 10601 10431 171 0 0 87 58 66 0.897 3.41 2.07 Intr - 11737 11661 77 0 2 38 105 67 0.963 2.56 2.06 Intr - 13528 13331 198 0 0 81 105 15 0.475 0.97 2.05 Intr - 16962 16861 102 2 0 126 32 81 0.707 5.59 2.04 Intr - 23277 23204 74 0 2 35 96 55 0.216 -0.80 2.03 Intr - 41776 41660 117 1 0 87 62 49 0.256 2.86 2.02 Intr - 52550 52374 177 2 0 79 42 123 0.217 6.92 2.01 Init - 54718 54689 30 1 0 62 80 34 0.478 -0.21 2.00 Prom - 64274 64235 40 -2.66 3.00 Prom + 67823 67862 40 -4.06 3.01 Init + 70091 70094 4 1 1 62 72 0 0.216 -4.04 3.02 Intr + 70454 70658 205 0 1 95 111 78 0.351 9.06 3.03 Intr + 71509 71650 142 1 1 83 115 46 0.397 7.16 3.04 Intr + 74264 74329 66 1 0 98 115 23 0.938 5.10 3.05 Intr + 74421 74499 79 2 1 99 75 2 0.965 -0.88 3.06 Intr + 75457 75588 132 2 0 79 47 105 0.955 6.12 3.07 Intr + 78850 78959 110 2 2 58 91 113 0.984 8.60 3.08 Intr + 79256 79409 154 1 1 57 94 121 0.989 9.25 3.09 Intr + 83625 83785 161 1 2 99 89 249 0.999 25.81 3.10 Intr + 85702 85899 198 0 0 60 123 345 0.995 34.75 3.11 Intr + 86169 86406 238 2 1 95 35 87 0.388 1.29 3.12 Intr + 86693 86807 115 0 1 83 117 157 0.714 17.61 3.13 Term + 88224 88410 187 0 1 97 52 257 0.902 19.96 3.14 PlyA + 90407 90412 6 1.05 4.18 PlyA - 90469 90464 6 1.05 4.17 Term - 95937 95672 266 1 2 71 38 197 0.680 8.67 4.16 Intr - 97877 97780 98 1 2 49 94 70 0.375 3.35 4.15 Intr - 100157 100002 156 1 0 115 44 264 0.446 23.83 4.14 Intr - 101322 101231 92 0 2 67 83 124 0.996 8.59 4.13 Intr - 106556 106454 103 1 1 63 99 123 0.856 11.08 4.12 Intr - 116246 116048 199 0 1 90 91 368 0.998 35.71 4.11 Intr - 117693 117630 64 0 1 80 91 60 0.942 3.79 4.10 Intr - 119492 119343 150 2 0 96 84 112 0.993 11.96 4.09 Intr - 120396 120307 90 0 0 91 90 71 0.995 7.79 4.08 Intr - 123672 123521 152 1 2 97 87 97 0.718 10.38 4.07 Intr - 131768 131636 133 2 1 93 111 105 0.990 13.52 4.06 Intr - 133501 133304 198 1 0 97 42 65 0.763 2.35 4.05 Intr - 142489 142158 332 2 2 52 68 272 0.252 16.95 4.04 Intr - 142782 142639 144 1 0 35 52 115 0.238 2.85 4.03 Intr - 151594 151384 211 1 1 85 61 34 0.019 -1.11 4.02 Intr - 163756 163626 131 2 2 74 64 53 0.322 1.91 4.01 Init - 168876 168723 154 0 1 49 59 99 0.389 3.25 4.00 Prom - 169330 169291 40 -6.66 5.00 Prom + 170026 170065 40 -4.16 5.01 Init + 172256 172571 316 1 1 92 89 616 0.157 59.60 5.02 Intr + 189450 189521 72 1 0 138 65 -3 0.103 1.78 5.03 Intr + 211015 211066 52 2 1 103 69 26 0.149 0.17 5.04 Term + 214184 214409 226 2 1 62 33 156 0.474 3.75 5.05 PlyA + 214884 214889 6 1.05 6.03 PlyA - 215092 215087 6 1.05 6.02 Term - 217457 217206 252 2 0 95 49 107 0.697 3.14 6.01 Intr - 233656 233606 51 1 0 122 87 24 0.196 4.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 172256 172744 489 1 0 92 53 643 0.834 57.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:127981253_128223724|GENSCAN_predicted_peptide_1|60_aa XRHEVISKEILELDPWENQWNVVAINVLMHDSYDVCLVARMNPRDLIPPPSDLVEEGNEH >gi568815595r:127981253_128223724|GENSCAN_predicted_CDS_1|183_bp natcgccatgaggttatctccaaagaaatattggaactggacccatgggaaaaccagtgg aatgttgtagccatcaacgtcctcatgcatgacagctatgatgtctgcctagtagccagg atgaatccccgagacctcatccccccgccttcagatttggtggaagaaggcaatgagcac taa >gi568815595r:127981253_128223724|GENSCAN_predicted_peptide_2|435_aa MPPGKGFLRWEFSSGGQNQITNMNSGTISEPIQGLNKLAIKSCITGLKIEDQVENQRKRK DIRKLMTFKCWDYRHEHLAPVSDLTCKDFKAAIVNMVKELKDTMKQRKDPLSPHNNTRRQ LYVTDEEVEAQRGSSVAGEQKLTCTDVSVASITSIDSVECKHSKPMGELVSLGLPKTVLV GSTVSWSMSSFSLVYYLILLEHILKEPLKNAWMVDVAVLKHGCTFIASSVIERAFTSSGS SEISHKEPPVGYCTLVVRVGLVPSATDTAQPTTAQLQSLNASFSAWSVALGILHPAPHIL PITSYVLHHPQDILGLSPSASKWLEATEHNGSLPGLSLVLLSTHRKRRWLQFQGAGRLAK QKIAHLFLNITFKANTDILYPRLHPTHTVVLIPLLRFAQAPPNLAPNMRLAGLPGCSTLL SGSGLWMMFLEDFLA >gi568815595r:127981253_128223724|GENSCAN_predicted_CDS_2|1308_bp atgcccccgggcaaaggctttttgcgctgggagttcagctcaggtgggcaaaaccagata actaacatgaacagtggaacaatcagtgaaccaatacaagggttaaataagctagcaatt aaaagctgtatcactggtctaaagatagaagatcaagtagaaaatcagcgcaagaggaaa gatatacgaaaactaatgaccttcaagtgctgggattacaggcatgagcacctggcccca gtgtcagatttaacttgcaaagatttcaaagcagccattgtaaatatggtcaaagaacta aaggacaccatgaagcaaaggaaggaccctttgagtcctcacaacaacactagaaggcag ctctatgtcacagatgaggaagttgaggcacagagaggatcctcagtggctggtgagcag aagctgacatgcacggatgtgtcggttgcctctattacgagtattgactcagttgagtgc aaacacagcaagcccatgggtgagttggtcagcctgggacttcccaagactgtcctagta ggatccactgtttcctggtccatgtcttccttttccttggtttactaccttattctgctt gagcacatcctcaaggagcctcttaagaatgcatggatggtagatgtggcagttttgaaa catggctgcacatttattgcctcttctgtcattgagagggctttcacttcatctggaagt tctgaaatttcccataaggagcctcctgtgggctactgcacacttgttgtgagagtgggc ctcgtcccaagtgcaactgacacagctcagcccacgacagcacagttgcagagcctgaac gccagcttctcagcctggagcgtggccttgggcatcctccaccctgctcctcacatcctg cctataacatcctacgttttgcaccatcctcaagatatcttgggattaagcccaagtgcc tccaagtggcttgaagccactgagcacaacgggagcctcccaggattgtctctggtgctc ctgagcacccacagaaaacggcggtggctgcagttccagggggcaggccgacttgctaag cagaaaatagcccatctcttcctcaacatcacttttaaagcaaacacagacattctctat cctcggctgcacccaacccacactgttgttctgatcccactgctcaggttcgcccaggct ccccccaacttggcccccaacatgcgtcttgctggactgccaggctgctccaccctgctt agtggctctgggctgtggatgatgttcttagaggacttccttgcctga >gi568815595r:127981253_128223724|GENSCAN_predicted_peptide_3|596_aa MESIKARHKGTLPGGLPEFFCEADYFPSCPGPSPAKSLQKLIPFVLRWKGTHTHDKPPGL LNDSDSEGARRRPWVARKVTSLCPNSSCFVSPIKVKFLEVIKPFCVILPEIQKPERKIQF KEKVLWTAITLFIFLVCCQIPLFGIMSSDSADPFYWMRVILASNRGTLMELGISPIVTSG LIMQLLAGAKIIEVGDTPKDRALFNGAQKLFGMIITIGQSIVYVMTGMYGDPSEMGAGIC LLITIQLFVAGLIVLLLDELLQKGYGLGSGISLFIATNICETIVWKAFSPTTVNTGRGME FEGAIIALFHLLATRTDKVRALREAFYRQNLPNLMNLIATIFVFAVVIYFQGFRVDLPIK SARYRGQYNTYPIKLFYTSNIPIILQSALVSNLYVISQMLSARFSGNLLVSLLGTWSDTS SGGPARAYPVGGLCYYLSPPESFGSVLEDPVHAVVYIVFMLGSCAFFSKTWIEVSGSSAK DVSRSKTFWKGDESVTASNSNFSPVGTLQVAKQLKEQQMVMRGHRETSMVHELNRYIPTA AAFGGLCIGALSVLADFLGAIGSGTGILLAVTIIYQYFEIFVKEQSEVGSMGALLF >gi568815595r:127981253_128223724|GENSCAN_predicted_CDS_3|1791_bp atggaatctatcaaggctcggcacaaaggtactctcccaggaggccttcctgaattcttc tgtgaggcagactacttcccgagctgccccggcccttctccggccaagtctctccagaaa ctcatcccattcgttctcagatggaaggggacccacacccacgacaagcctccgggtttg cttaatgactcagacagcgagggtgctcgaaggcgtccctgggtagcgcggaaggttact tctctgtgtcccaactcttcctgttttgtttctcccatcaaagtcaaatttctggaagtc atcaagcccttctgtgtcatcctgccggaaattcagaagccagagaggaagattcagttt aaggagaaagtgctgtggaccgctatcaccctctttatcttcttagtgtgctgccagatt cccctgtttgggatcatgtcttcagattcagctgaccctttctattggatgagagtgatt ctagcctctaacagaggcacattgatggagctagggatctctcctattgtcacgtctggc cttataatgcaactcttggctggcgccaagataattgaagttggtgacaccccaaaagac cgagctctcttcaacggagcccaaaagttatttggcatgatcattactatcggccagtct atcgtgtatgtgatgaccgggatgtatggggacccttctgaaatgggtgctggaatttgc ctgctaatcaccattcagctctttgttgctggcttaattgtcctacttttggatgaactc ctgcaaaaaggatatggccttggctctggtatttctctcttcattgcaactaacatctgt gaaaccatcgtatggaaggcattcagccccactactgtcaacactggccgaggaatggaa tttgaaggtgctatcatcgcacttttccatctgctggccacacgcacagacaaggtccga gcccttcgggaggcgttctaccgccagaatcttcccaacctcatgaatctcatcgccacc atctttgtctttgcagtggtcatctatttccagggcttccgagtggacctgccaatcaag tcggcccgctaccgtggccagtacaacacctatcccatcaagctcttctatacgtccaac atccccatcatcctgcagtctgccctggtgtccaacctttatgtcatctcccaaatgctc tcagctcgcttcagtggcaacttgctggtcagcctgctgggcacctggtcggacacgtct tctgggggcccagcacgtgcttatccagttggtggcctttgctattacctgtcccctcca gaatcttttggctccgtgttagaagacccggtccatgcagttgtatacatagtgttcatg ctgggctcctgtgcattcttctccaaaacgtggattgaggtctcaggttcctctgccaaa gatgtaagtagaagcaaaactttctggaagggtgatgaaagtgtgactgcctccaattcc aacttctcccctgtgggcaccctgcaggttgcaaagcagctgaaggagcagcagatggtg atgagaggccaccgagagacctccatggtccatgaactcaaccggtacatccccacagcc gcggcctttggtgggctgtgcatcggggccctctcggtcctggctgacttcctaggcgcc attgggtctggaaccgggatcctgctcgcagtcacaatcatctaccagtactttgagatc ttcgttaaggagcaaagcgaggttggcagcatgggggccctgctcttctga >gi568815595r:127981253_128223724|GENSCAN_predicted_peptide_4|890_aa MLIKGEAFELTKMQEKGQAAQNRGKLMKERAAQRKQQAQRQQQPKPCGTWRDRLPPHNDK VAVSSSTKSPNQVSLALMKSCAPAPREKSSALPSQKGFLSVAITTGNVLVHLKPAWHWVS PKARGKYCLATTDVYSRPMHSTQQMMNPAKSGFSSFKTLHSLLSQALDQIRGVVPLCSRP QPRQEAPATGEANPETQTEVGDRSSSSSRDGRSPPGVCKMKIEEVKSTTKTQRIASHSHV KGLGLDESGLAKQAASGLVGQENAREVWPVDQGVGGCKQGCCGERAAEVGGSGRDARARG LPPLFPQGSENGPGARGGAPPWSWRFFSFALRRIPVFKVSALSSQVQMPNEAGNGDEGSG WVHCGVSALSPRCSALLKVTMRLQAKQNQATALALAIAQELGSKVPFCPMVGSEVYSTEI KKTEVLMENFRRAIGLRIKETKEVYEGEVTELTPCETENPMGGYGKTISHVIIGLKTAKG TKQLKLDPSIFESLQKERVEAGDVIYIEANSGAVKRQGRCDTYATEFDLEAEEYVPLPKG DVHKKKEIIQDVTLHDLDVANARPQGGQDILSMMGQLMKPKKTEITDKLRGEINKVVNKY IDQGIAELVPGVLFVDEVHMLDIECFTYLHRALESSIAPIVIFASNRGNCVIRGTEDITS PHGIPLDLLDRVMIIRTMLYTPQEMKQIIKIRAQTEGINISEEALNHLGEIGTKTTLRYS VQLLTPANLLAKINGKDSIEKEHVEEISELFYDAKSSAKILADQQDKYMKSKMEVALTAD LTAAGGDRVFFGCPRYQYPDQPGTVINGQAVRAGRPSARVSATDAAVRPARPALTPGQGP GAINGRRAAGGGWAAAIRWADGRLPIARSVTRRHRHRHAASTQISAVSAA >gi568815595r:127981253_128223724|GENSCAN_predicted_CDS_4|2673_bp atgcttattaagggagaggcatttgagctgacaaagatgcaagagaagggccaggctgca cagaaccgagggaaattgatgaaagaacgtgctgcacagagaaaacagcaggctcagaga cagcagcaacctaagccatgtggcacgtggcgagacaggctcccccctcacaatgacaaa gtggctgtcagcagctccacaaagtccccaaaccaagtctccttggccctgatgaagtca tgtgcccctgcaccaagagagaaatccagtgctctgcctagccagaaagggtttctttct gtggccatcacaactgggaatgtgctggttcacctgaagccagcatggcactgggtctca cccaaggctcgtggtaaatactgtctggctaccactgatgtttattcaaggcccatgcac tctactcagcagatgatgaatcctgccaagtctgggttttcctccttcaagacgctgcat tctcttttgtcccaggccctagatcaaatccggggcgtggtcccactgtgctcccgaccc cagcctcggcaggaagcgccggctacgggggaagccaacccggagacacagacggaagtg ggtgaccggagctctagcagcagccgcgatgggcgcagccctcccggcgtctgcaaaatg aagattgaggaggtgaagagcactacgaagacgcagcgcatcgcctcccacagccacgtg aaagggctggggctggacgagagcggcttggccaagcaggcggcctcagggcttgtgggc caggagaacgcgcgagaggtgtggccagtggaccagggagttgggggctgcaagcagggc tgctgcggcgagagagctgctgaagtcggtggctcggggcgggatgcgcgcgccaggggt ctcccgccattatttcctcagggaagtgaaaatgggccaggggctcggggaggggcgccg ccctggagctggagatttttttcctttgctcttagaagaataccagttttcaaggtgtca gccttgagctcacaagtgcagatgcctaatgaggcaggcaacggggatgagggaagtggc tgggtgcactgtggcgtgtctgccttgagccccaggtgctcggccttgctgaaggtcacc atgagactgcaggccaaacagaaccaggctacagctctggctctggctattgctcaggag ctgggtagtaaggtccccttctgcccaatggtggggagtgaagtttactcaactgagatc aagaagacagaggtgctgatggagaacttccgcagggccattgggctgcgaataaaggag accaaggaagtttatgaaggtgaagtcacagagctaactccgtgtgagacagagaatccc atgggaggatatggcaaaaccattagccatgtgatcataggactcaaaacagccaaagga accaaacagttgaaactggaccccagcatttttgaaagtttgcagaaagagcgagtagaa gctggagatgtgatttacattgaagccaacagtggggccgtgaagaggcagggcaggtgt gatacctatgccacagaattcgaccttgaagctgaagagtatgtccccttgccaaaaggg gatgtgcacaaaaagaaagaaatcatccaagatgtgaccttgcatgacttggatgtggct aatgcgcggccccaggggggacaagatatcctgtccatgatgggccagctaatgaagcca aagaagacagaaatcacagacaaacttcgaggggagattaataaggtggtgaacaagtac atcgaccagggcattgctgagctggtcccgggtgtgctgtttgttgatgaggtccacatg ctggacattgagtgcttcacctacctgcaccgcgccctggagtcttctatcgctcccatc gtcatctttgcatccaaccgaggcaactgtgtcatcagaggcactgaggacatcacatcc cctcacggcatccctcttgaccttctggaccgagtgatgataatccggaccatgctgtat actccacaggaaatgaaacagatcattaaaatccgtgcccagacggaaggaatcaacatc agtgaggaggcactgaaccacctgggggagattggcaccaagaccacactgaggtactca gtgcagctgctgaccccggccaacttgcttgctaaaatcaacgggaaggacagcattgag aaagagcatgtcgaagagatcagtgaacttttctatgatgccaagtcctccgccaaaatc ctggctgaccagcaggataagtacatgaagagcaagatggaagtggcgctgacagcagat ctcacagccgcgggtggggacagagtcttcttcggctgcccaaggtaccaatatccagat cagccaggcacggtgattaatggccaggccgtgcgggctgggcggccatcagcgcgcgtc tcggcgacggatgcggctgtcaggccggcccggcccgcgctgacccccgggcaggggccc ggcgcgatcaatgggcggcgcgcagcgggcggcggctgggccgccgcgataagatgggcc gatgggcgcctgccgattgcccgctccgtcacccgccgtcaccgtcaccgtcacgccgcc agcacgcagatcagcgccgtcagcgccgcgtga >gi568815595r:127981253_128223724|GENSCAN_predicted_peptide_5|221_aa MAGRRVNVNVGVLGHIDSGKTALARALSTTASTAAFDKQPQSRERGITLDLGFSCFSVPL PARLRSSLPEFQAAPEAEPEPGEPLLQVTLVDCPGHASLIRTIIGESNRISFPGKDFLDK KRYYLVIFVGCHVNLTRLGVRSLLVLRKPGLCLQLSTVFGDQQIKGHSKAKLGATVVYQM HFIWRNVWDLTDTGESRLCEPLLSLVSSLHNGDEGASHSQV >gi568815595r:127981253_128223724|GENSCAN_predicted_CDS_5|666_bp atggcagggcggcgggtgaacgtgaacgtgggcgtgctgggccacatcgacagcggcaag acggcgctggcgcgggcgctaagcaccacagcctccaccgccgcctttgacaagcagccg cagagccgcgagcgcggcatcacgctcgatctgggcttctcgtgcttctcggtgccgctg cccgcgcgcctgcggtcgtctttgcccgagttccaggcagcgcccgaggccgagcccgag cccggcgagccactgcttcaggtcacgctggtcgactgccccgggcacgcctccctcatc cggaccatcatcggcgaatcaaatagaatctcttttcctggcaaagatttcctagataaa aagaggtactacttagtcatttttgtaggctgccatgtgaacctgacacggttgggtgtg agaagcctgctggttctcagaaaaccaggcctgtgtctgcagctgagcacagtatttggt gatcagcagataaaaggacactccaaagccaaacttggagccactgtcgtctaccagatg cattttatctggcggaatgtgtgggacctgacagacacaggggaatccaggctctgcgaa cctctgctgagcctcgtctcctctctccataatggagatgaaggcgcctcccattcacaa gtgtag >gi568815595r:127981253_128223724|GENSCAN_predicted_peptide_6|100_aa ENHSLVSVHPEEHQIAAKGASNTLKDHSFCMQGAFFRKECPQCLCKREDGFLMLPVSQAH GPVTGFYIQHSLAGLVQEDTENHMCCRKSALSEMEDVVVF >gi568815595r:127981253_128223724|GENSCAN_predicted_CDS_6|303_bp gaaaatcattcactggtatcagtgcacccagaggaacaccagatcgcagctaaaggtgcc agcaacactcttaaagaccattctttttgtatgcaaggagccttcttcagaaaggaatgt ccccagtgtctctgcaaaagagaggatggtttcctaatgctccctgtcagccaagctcat ggccctgtcactggtttttacatccagcattctctggctggccttgtgcaagaagacact gagaaccacatgtgctgccggaagagtgccctttctgaaatggaagatgttgttgtgttc tga