GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:12:22 Sequence gi568815578f:58289460_58544232 : 254773 bp : 44.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5125 5164 40 -2.26 1.01 Init + 8252 8380 129 1 0 63 68 102 0.690 5.85 1.02 Term + 11584 11760 177 0 0 87 47 114 0.876 4.89 1.03 PlyA + 12592 12597 6 1.05 2.00 Prom + 16228 16267 40 -4.26 2.01 Init + 20518 20553 36 0 0 102 84 39 0.549 3.86 2.02 Intr + 21584 21663 80 1 2 112 90 84 0.513 9.35 2.03 Intr + 50519 50626 108 2 0 67 -10 146 0.000 2.10 2.04 Intr + 54259 54340 82 1 1 85 91 48 0.001 4.44 2.05 Intr + 63814 63885 72 0 0 99 63 21 0.317 0.30 2.06 Intr + 63973 64079 107 0 2 99 115 48 0.608 7.61 2.07 Intr + 99964 100058 95 1 2 73 99 102 0.056 9.41 2.08 Intr + 128752 128904 153 2 0 108 119 94 0.953 14.54 2.09 Intr + 145143 145246 104 1 2 82 86 103 0.891 9.39 2.10 Intr + 151448 151624 177 1 0 44 85 161 0.608 11.52 2.11 Term + 154618 154776 159 0 0 95 44 86 0.702 2.84 2.12 PlyA + 155386 155391 6 1.05 3.16 PlyA - 155647 155642 6 1.05 3.15 Term - 158569 158313 257 2 2 113 44 67 0.558 0.55 3.14 Intr - 168967 168886 82 1 1 54 85 104 0.530 5.91 3.13 Intr - 169293 169161 133 2 1 106 28 70 0.311 3.45 3.12 Intr - 171128 171079 50 2 2 42 83 65 0.297 -1.12 3.11 Intr - 172095 171481 615 0 0 71 7 757 0.350 58.45 3.10 Intr - 178199 177647 553 1 1 108 105 995 0.983 96.06 3.09 Intr - 181288 181150 139 2 1 80 90 27 0.743 1.62 3.08 Intr - 182115 182000 116 2 2 56 -1 115 0.410 -0.51 3.07 Intr - 184839 184717 123 2 0 53 40 131 0.418 4.50 3.06 Intr - 191711 191611 101 2 2 16 75 66 0.060 -2.99 3.05 Intr - 192404 192186 219 2 0 -9 109 133 0.066 4.20 3.04 Intr - 213954 213845 110 1 2 75 81 52 0.534 3.20 3.03 Intr - 223495 223286 210 2 0 71 64 55 0.137 0.28 3.02 Intr - 225051 224935 117 1 0 50 45 82 0.290 0.54 3.01 Init - 225248 225200 49 2 1 73 92 90 0.345 8.01 3.00 Prom - 226787 226748 40 -2.26 4.05 PlyA - 228817 228812 6 1.05 4.04 Term - 231856 231834 23 2 2 156 48 16 0.566 2.87 4.03 Intr - 235244 235148 97 2 1 80 57 59 0.402 1.58 4.02 Intr - 235648 235398 251 2 2 52 101 63 0.428 1.16 4.01 Init - 242314 242299 16 1 1 81 106 -4 0.251 1.09 4.00 Prom - 248615 248576 40 -2.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 43191 43140 52 2 1 124 42 48 0.887 0.70 S.002 Term - 70286 70133 154 1 1 75 54 174 0.965 10.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:58289460_58544232|GENSCAN_predicted_peptide_1|101_aa MYITTHNAMVASTMGHKHRQTGGTQLHLEESEEMADTFPRTQVLPLTTNDFHISSPVLTV LQNPTRGLSGVLNNATRICVIASHVLSSAIHYVRPQGPKEG >gi568815578f:58289460_58544232|GENSCAN_predicted_CDS_1|306_bp atgtacatcaccacccacaatgccatggtggcaagcaccatgggacacaagcacagacaa acaggagggacacagctccacttggaagagagtgaagagatggcagacaccttccctaga acacaggtgctaccactcaccactaatgactttcacatctcctctccagtcctaactgtg ctccagaaccctacaagggggctttctggtgtcctcaacaatgccaccaggatctgtgtc attgcttctcatgtgctgtccagtgccattcactatgttagaccacagggacccaaagaa ggataa >gi568815578f:58289460_58544232|GENSCAN_predicted_peptide_2|390_aa MALRELKVCLLGDTGVGKSSIVWRFVEDSFDPNINPTIGCYHYGKESNPNAEEEVVLTDV IGQDFNLSEINRNALASFMTKTVQYQNELHKFLIWDTAGQERFRALAPMYYRGSAAAIIV YDITKEETFSTLKNWVKELRQHGPPNIVVAIAGNKCDLIDVRGLPAKGAPPLRNMAKVEQ VLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIIDAGAS INVSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAVHDVEINKIISTTASKTETPIV SKSLSSSLDDTEVKKVMEECKRLQGEVQRLREENKQFKEEDGLRMRKTVQSNSPISALAP TGKEEGLSTRLLALVVLFFIVGVIIGKIAL >gi568815578f:58289460_58544232|GENSCAN_predicted_CDS_2|1173_bp atggcgctgagggagctcaaagtgtgtctgctcggggatacaggtgtaggtaaatcgagt attgtgtggcggtttgtggaagacagttttgatccaaacatcaacccaacaatagggtgt taccactacggaaaagaatccaatcctaatgctgaggaggaagtggttcttacagatgtg ataggccaagattttaacctctcagaaatcaacagaaatgcattggcatcttttatgacc aagactgtccagtaccaaaatgagctacataaattcctaatctgggatacagctggacaa gaacgatttcgtgccttagcaccaatgtactatcgagggtcggctgcagctataatcgtt tatgatatcacaaaagaagagacattttcaacattaaagaattgggtgaaagagcttcga cagcatggcccacctaatattgtagttgccattgcaggaaataaatgtgatcttatcgat gtaaggggtctccccgccaaaggtgctccgccgctaaggaacatggcgaaggtggagcag gtcctgagcctcgagccgcagcacgagctcaaattccgaggtcccttcaccgatgttgtc accaccaacctaaagcttggcaacccgacagaccgaaatgtgtgttttaaggtgaagact acagcaccacgtaggtactgtgtgaggcccaacagcggaatcatcgatgcaggggcctca attaatgtatctgtgatgttacagcctttcgattatgatcccaatgagaaaagtaaacac aagtttatggttcagtctatgtttgctccaactgacacttcagatatggaagcagtacat gatgtagaaataaataaaattatatccacaactgcatcaaagacagaaacaccaatagtg tctaagtctctgagttcttctttggatgacaccgaagttaagaaggttatggaagaatgt aagaggctgcaaggtgaagttcagaggctacgggaggagaacaagcagttcaaggaagaa gatggactgcggatgaggaagacagtgcagagcaacagccccatttcagcattagcccca actgggaaggaagaaggccttagcacccggctcttggctctggtggttttgttctttatc gttggtgtaattattgggaagattgccttgtag >gi568815578f:58289460_58544232|GENSCAN_predicted_peptide_3|957_aa MPAAMLPYACVLVLLGVSPGRHLRASRSFLGPAGGSRVTSQYESRCTLCASRGLASNQTD STAEKPRVPGGLTREGGSLPSIQTQPPERGRVPFSLPLQLPALHRALFSTERFFEKLFSP LATSPDSLKLKSTVWPISFNQILMDSNFLNSKPQTEIKTNLPWKGRSPTWKKVWAPELAL VGSFFRKDWVERGRLGDPQLEAEAGAGGVRVRHWLPIAPTKAQEGPLVEAPKNSSATGHS VRSGHIQNPFYAVMLQAVIIPSPAPPSTSFLISKMGALTEPMSRVVVSIIPVNPLGSAKV LTRCRCEDSRPSQAHLGVTLEDLELVLQRLLPVSIFQSPSALSPIPPPAHTAPAAGEAGG SCLRWEPHCQQPLPDRVPSTAILPPRLNGPWISTGCEVRPGPEFLTRAYTFYPSRLFRAH QFYYEDPFCGEPAHSLLVKGKVRLRRASWVTRGATEADYHLHKVGIVFHSRRALVDVTGR LNQTRAGRDCARRLPPARAWLPGALYELRSARAQGDCLEALGLTMHELSLVRVQRRLQPQ PRASPRLVEELYLGDIHTDPAERRHYRPTGYQRPLQSALHHVQPCPACGLIARSDVHHPP VLPPPLALPLHLGGWWVSSGCEVRPAVLFLTRLFTFHGHSRSWEGYYHHFSDPACRQPTF TVYAAGRYTRGTPSTRVRGGTELVFEVTRAHVTPMDQVTTAMLNFSEPSSCGGAGAWSMG TERDVTATNGCLPLGIRLPHVEYELFKMEQDPLGQSLLFIGQRPTDGSSPDTPEKRPTSY QAPLPGSPVKHLDAAVMFIATPSEVEHFLFNQVSDQMLSPEDVTIRIPKSSPISLVLLSV HCITVPHTETRERITSIYQIQRPSRGPAESVRGQTAGGRESHERPFNEMPSPAGGSSGLY TEKVSGVKCYSGVQSDTGKPKTPISSYMYFSNYNEKWAIPLEKELKMTYNIVFWHGF >gi568815578f:58289460_58544232|GENSCAN_predicted_CDS_3|2874_bp atgcccgcagccatgctcccctacgcttgcgtcctggtgcttttgggagtctcacctggc cggcacctgcgcgcctccaggagcttcctggggcctgcgggagggagtcgcgtcacctcg caatatgagagccgctgcactctctgcgccagtcggggcctggcctcaaatcagacagat tcaacagcagagaagccaagagtgcccgggggcctgacgagagaggggggcagcctgccc tcaattcagacacagccaccagaaagggggagggtgccgttcagcctgccactgcagctc ccagctctacaccgggcactattttccactgaacgcttctttgaaaagctcttttcaccg ctggctacatcaccagatagtttgaagctgaagtctactgtttggcccatcagctttaac caaattctgatggactccaatttccttaacagcaaaccacagactgaaattaagaccaac ttgccatggaaaggaagatctcccacgtggaagaaagtttgggcaccagagctggccttg gtgggctccttcttccgcaaggactgggtggaacgagggaggctgggtgaccctcagctg gaggccgaagctggagccggcggggtgagggttcgccactggctcccaatcgccccaacc aaagctcaagagggcccccttgtggaagctcccaagaacagctcggcaacaggacattca gtgagatctgggcacattcagaaccccttctacgcagtgatgctacaagctgtcatcatc ccctcaccggctcctccaagcaccagtttcctcatcagcaaaatgggtgcattgacagaa cccatgtcacgggtggttgtgtcgattatacctgtcaatcctttgggctctgccaaggta ctgacacgctgccgctgtgaggactcaaggcccagccaagctcatctaggggtcactcta gaggacctggagcttgtgctgcagcgcctgcttcccgtgtccatcttccagtcaccttct gccctgagcccaattcctccccctgcccacactgcaccggcggctggggaggccgggggc agctgcctgcgctgggaaccccactgccagcagcccttgccagatagagtgcccagcact gcgatcctgcctccacgccttaatggaccttggatctccacaggctgcgaggtgcgccca ggaccggagttcctgacccgcgcctacaccttctaccccagccggctctttcgagcccac cagttctactacgaggaccccttctgcggggaacctgcccactcgctgctcgtcaagggc aaagtccgcctgcgccgggcctcctgggtcacccggggagccaccgaggccgactaccac ctgcacaaggtgggcatcgtcttccacagccgccgggccctggtcgacgtcaccgggcgc ctcaaccagacccgcgccggccgggactgcgcgcggcggctgcctccggcccgggcctgg ctgcctggggcgctgtacgagctgcggagcgcccgggctcagggggactgcctggaggcg ctgggcctcaccatgcacgagctcagcctggtccgcgtgcagcgccgcctgcagccgcag ccccgggcgtcgccccggctggtggaggagctgtacctgggggacatccacaccgacccg gcggagaggcggcactaccggcccacgggctaccagcgcccgctgcagagcgcactgcac cacgtgcagccgtgcccagcctgtggcctcattgcccgctccgatgtgcaccacccgccc gtgctgccgccccctctggccctgcccctgcacctgggcggctggtgggtcagctcgggg tgcgaggtgcgcccagcagtcctgttcctcacccggctcttcactttccacgggcacagc cgctcctgggaagggtattaccaccacttctcagacccagcctgccggcagcccaccttc accgtgtatgccgccggccgctacaccaggggcacgccatccaccagggtccgcggcggc accgagctggtgtttgaggtcacacgggcccatgtgacccccatggaccaggtcaccacg gccatgctcaacttctctgagccaagcagctgtgggggtgcgggggcctggtccatgggc actgagcgggatgtcacagccaccaacggctgcctaccgctgggcatccggctcccgcat gtggagtacgagcttttcaagatggaacaagaccccctcgggcaaagcctgctcttcatc ggacaaaggcccaccgatggctcaagtcccgataccccagagaaacgtcccacctcctac caagcacccctgcccgggtccccagtcaagcacttggatgcggcagtgatgttcatcgct acaccttcagaagttgagcactttcttttcaatcaagtctcagaccagatgttgtctcct gaagatgtcaccattcgcatcccaaagagcagccccatttccctggtccttctgtcagtc cactgcatcacagttcctcacactgagacccgtgaacggatcacctccatctaccagatt cagcgaccctcgagaggcccagctgagtcagtgagaggccagacagccgggggcagggaa agccacgagaggcccttcaatgaaatgccatctccagcagggggaagcagtggcctctac accgagaaggtctcaggagttaaatgttattcaggtgttcaaagtgacactgggaagccc aagacccctatctcttcatacatgtatttctcaaactacaatgagaaatgggccattcct ttagagaaggagctgaaaatgacatataacattgtattttggcatggattctga >gi568815578f:58289460_58544232|GENSCAN_predicted_peptide_4|128_aa MNYQMGGERLAEIILVDGGSAALEPGLLTHGILPPPSPHHHESALSSTALCQALVLGALP TSSHLLRTAVQDWFMWYNKCTNIKITYMTNIDIEIHHTLSFSISHAALETQLDPVDLARC RSGSQEAI >gi568815578f:58289460_58544232|GENSCAN_predicted_CDS_4|387_bp atgaattatcaaatgggtggagagaggcttgctgagatcatactggtggatggtggttca gcagcactggagcctggactcctcacccatggcatcctgcctcctccttcccctcatcac catgagtcagcattatccagcacggccctgtgccaggctctggtcttgggggctttaccc acatcatcccatttactcagaacagccgtccaagactggttcatgtggtacaacaaatgt acaaacatcaaaatcacatacatgacgaatatagatattgaaatccaccatacactctct ttttcaataagccacgcagccctggagacacagctggaccctgtggatttggcaagatgc aggagtggctcccaagaggccatttga