GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:02:47 Sequence gi568815595r:128473283_128673729 : 200447 bp : 48.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 1668 1663 6 1.05 1.12 Term - 8036 7737 300 2 0 74 50 286 0.983 18.62 1.11 Intr - 8662 8537 126 1 0 77 82 165 0.999 15.68 1.10 Intr - 10723 10578 146 2 2 112 62 210 0.997 20.80 1.09 Intr - 12735 12445 291 1 0 49 103 335 0.487 28.21 1.08 Intr - 13086 12880 207 1 0 138 27 60 0.523 3.95 1.07 Intr - 13794 13521 274 0 1 118 101 378 0.998 39.21 1.06 Intr - 15586 15493 94 0 1 96 62 66 0.099 4.77 1.05 Intr - 17523 17445 79 1 1 52 81 36 0.124 -2.09 1.04 Intr - 18739 18503 237 2 0 -1 97 142 0.261 3.79 1.03 Intr - 22288 22209 80 0 2 96 66 61 0.079 3.89 1.02 Intr - 31453 31278 176 1 2 93 90 73 0.403 6.74 1.01 Init - 31658 31629 30 2 0 83 44 24 0.085 -2.74 1.00 Prom - 32121 32082 40 -5.56 2.00 Prom + 32407 32446 40 -11.92 2.01 Init + 34144 34351 208 0 1 90 75 143 0.317 12.23 2.02 Intr + 35476 35687 212 2 2 38 -20 222 0.203 5.03 2.03 Intr + 49836 50006 171 2 0 99 69 41 0.595 3.44 2.04 Term + 50081 50221 141 1 0 130 55 41 0.597 2.93 2.05 PlyA + 51693 51698 6 1.05 3.00 Prom + 57774 57813 40 -7.16 3.01 Init + 60836 60994 159 1 0 86 88 64 0.432 6.02 3.02 Term + 64819 65016 198 0 0 23 34 290 0.971 14.50 3.03 PlyA + 66005 66010 6 1.05 4.04 PlyA - 66327 66322 6 1.05 4.03 Term - 67933 67760 174 1 0 76 44 69 0.220 -0.94 4.02 Intr - 73551 73397 155 1 2 31 96 82 0.258 3.09 4.01 Init - 74385 74313 73 0 1 104 17 130 0.316 6.73 4.00 Prom - 75939 75900 40 -5.96 5.15 PlyA - 79854 79849 6 1.05 5.14 Term - 81987 81919 69 0 0 92 52 59 0.747 0.64 5.13 Intr - 82512 82451 62 1 2 110 90 58 0.801 6.65 5.12 Intr - 82767 82631 137 2 2 31 80 77 0.523 1.41 5.11 Intr - 85231 85127 105 0 0 18 91 77 0.241 0.33 5.10 Intr - 90037 89936 102 0 0 74 66 46 0.051 0.29 5.09 Intr - 94959 94811 149 0 2 85 19 92 0.195 1.03 5.08 Intr - 97316 97239 78 2 0 87 56 90 0.507 5.45 5.07 Intr - 100357 100214 144 1 0 79 81 23 0.295 1.18 5.06 Intr - 102570 102533 38 1 2 72 99 26 0.270 0.08 5.05 Intr - 103706 103618 89 1 2 109 47 63 0.337 3.91 5.04 Intr - 110973 110820 154 1 1 87 66 57 0.143 2.53 5.03 Intr - 131000 130977 24 0 0 143 75 13 0.559 3.50 5.02 Intr - 135025 134720 306 2 0 -8 36 218 0.177 3.42 5.01 Init - 139803 139683 121 0 1 79 80 90 0.315 5.68 5.00 Prom - 140514 140475 40 -5.86 6.03 PlyA - 141132 141127 6 1.05 6.02 Term - 144777 144302 476 1 2 81 48 176 0.087 7.95 6.01 Init - 145870 145735 136 1 1 83 100 11 0.116 2.12 6.00 Prom - 146819 146780 40 -6.76 7.11 PlyA - 146914 146909 6 -0.45 7.10 Term - 147311 147129 183 2 0 77 41 425 0.999 34.24 7.09 Intr - 149127 148882 246 0 0 116 89 313 0.999 31.86 7.08 Intr - 152371 152252 120 1 0 112 81 313 0.999 33.69 7.07 Intr - 152730 152592 139 2 1 92 89 203 0.994 21.27 7.06 Intr - 153550 153451 100 2 1 81 89 152 0.999 13.77 7.05 Intr - 156837 156669 169 0 1 65 52 169 0.847 10.52 7.04 Intr - 158875 158666 210 1 0 121 86 194 0.999 21.61 7.03 Intr - 164823 164517 307 2 1 104 115 310 0.999 31.75 7.02 Intr - 171701 171637 65 2 2 111 121 79 0.998 11.02 7.01 Init - 177518 177258 261 2 0 89 109 480 0.999 45.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_1|679_aa MPVTTRTLSVALLGGSKDGHRSSQCHMTPAGGQSPRILAKFPGWTLHGSSRVMCPESISQ DGSDWLGQRQTESARANGTLELHWRPALLVKEGLDGRIREPGRARAAAEASGERFPAGLT NCRQIVPNETDLPVVRPPAPAAPNKPWLFVEEGEARPDCLRVSAGFRGEPARLEGLEQRL KTPADSGGCVDPPREFVWGPSLYNGREEDLCARPTRTIEPISGLEVSGEHFQGRCRLHPD PEPPPPAMEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDEVDVFFNHLD SQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGKAALSAAAAHHH NPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGASSSAGGSAARGEDKDGVKYQVSLTESM KMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPASSFTPK QRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSA ARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEGIQTRNRKMS NKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSGHILPTPTPIHP SSSLSFGHPHPSSMVTAMG >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_1|2040_bp atgcctgtgaccaccaggactctttcagttgctctcctgggtggcagtaaagatggccac cgcagctcccagtgccacatgaccccagcaggagggcagtccccgaggattcttgccaag tttccaggttggactctccatggatcctccagagtcatgtgcccagaatcaatcagccag gatggctctgactggctgggccagagacagacagaaagtgcccgagcaaatggtacccta gagctccactggcggcctgccctgctggttaaggaaggtctggatggcaggattcgagaa ccaggtagggcccgtgcggcagccgaggcgtctggggagcgtttcccagccgggctgaca aattgcagacaaattgtgcctaacgaaacggatttaccagttgtcaggccgccggccccg gccgctccaaataaaccgtggctgttcgtggaggagggagaagcacggcccgattgtctc cgggtctcagcagggttccgcggggagcctgccaggcttgaaggcctggaacagcgcctc aagaccccagcagattctgggggctgcgttgaccctccccgggagtttgtttggggccca agtctgtacaatgggagggaggaagacttgtgcgcccggcccacacgaaccatagagccg atctccgggctagaagtgagtggggagcacttccagggccgttgccgtctgcacccagac cctgagccgccgccgccggccatggaggtggcgcccgagcagccgcgctggatggcgcac ccggccgtgctgaatgcgcagcaccccgactcacaccacccgggcctggcgcacaactac atggaacccgcgcagctgctgcctccagacgaggtggacgtcttcttcaatcacctcgac tcgcagggcaacccctactatgccaaccccgctcacgcgcgggcgcgcgtctcctacagc cccgcgcacgcccgcctgaccggaggccagatgtgccgcccacacttgttgcacagcccg ggtttgccctggctggacgggggcaaagcagccctctctgccgctgcggcccaccaccac aacccctggaccgtgagccccttctccaagacgccactgcacccctcagctgctggaggc cctggaggcccactctctgtgtacccaggggctggggcctcatcttccgcggggggtagt gcagcccgaggagaggacaaggacggcgtcaagtaccaggtgtcactgacggagagcatg aagatggaaagtggcagtcccctgcgcccaggcctagctactatgggcacccagcctgct acacaccaccccatccccacctacccctcctatgtgccggcggctgcccacgactacagc agcggactcttccaccccggaggcttcctggggggaccggcctccagcttcacccctaag cagcgcagcaaggctcgttcctgttcagaaggccgggagtgtgtcaactgtggggccaca gccacccctctctggcggcgggacggcaccggccactacctgtgcaatgcctgtggcctc taccacaagatgaatgggcagaaccgaccactcatcaagcccaagcgaagactgtcggcc gccagaagagccggcacctgttgtgcaaattgtcagacgacaaccaccaccttatggcgc cgaaacgccaacggggaccctgtctgcaacgcctgtggcctctactacaagctgcacaat gttaacaggccactgaccatgaagaaggaagggatccagactcggaaccggaagatgtcc aacaagtccaagaagagcaagaaaggggcggagtgcttcgaggagctgtcaaagtgcatg caggagaagtcatcccccttcagtgcagctgccctggctggacacatggcacctgtgggc cacctcccgcccttcagccactccggacacatcctgcccactccgacgcccatccacccc tcctccagcctctccttcggccacccccacccgtccagcatggtgaccgccatgggctag >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_2|243_aa MQPGLGLCLPLPSMHSNCGKTCAKYMDATNHWQRNRKHIPVARQMRGDGFIHILQEEIKL IDRSKCEWTGGLNRGADNIQTSTQASLPATAVPFSNGLCRRGCKMVAAAPTPAPFCGQVQ WEECEVFFQKLLQILISFSWRFLTSGWEHHKAMWITLIPGTMGPKCLWTNVWTNAPRALM LSGDSQALSTGHSQLSVVAMGPPVLRTSSGAPPPMPPGTMLSFPRVSAGPFQACGLCGIQ PVD >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_2|732_bp atgcagccagggctggggctctgcttgccattgccctctatgcactctaactgtggcaaa acctgtgcaaagtacatggatgctaccaaccactggcaaaggaataggaagcatattcct gtggcccggcaaatgcgtggggatggcttcatccacatcctgcaggaagaaattaaatta attgacaggtccaaatgtgagtggacaggtggccttaatcggggtgctgacaacatccag acgagcacgcaggcttctctccctgccacggcggtgccattctccaacggactctgccgc cgtggctgcaagatggtggcagcagctccaacccctgcacccttttgtggccaagtccag tgggaagaatgtgaggtcttcttccagaagcttctgcagattctcattagcttcagctgg agattcctcacttctggttgggaacaccacaaagccatgtggatcaccctcatccctgga acgatggggcccaaatgcctatggacaaatgtgtggacaaatgccccccgggctctgatg ctctctggagacagccaagccctaagcactggccattcacagctctctgtggtggctatg ggtccacctgtcctccggactagctctggagccccccctcccatgcctcctggaacaatg ctgagcttcccccgtgtgtctgcaggtcctttccaggcttgtggcctgtgtgggatccag cctgtggactga >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_3|118_aa MMRRRDHEFNSGYSKFEMPVWHYTVQEMMQYRGLKYRRGAYTSDQDLKIINKEFLHEEIH KDLLVMGAYEISNKSGGAGSLHSHLKVTDSAGHILYSKEDATKGKFAFILEDYDMFEV >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_3|357_bp atgatgagaaggagagatcatgagttcaactctgggtattccaaatttgagatgcctgtg tggcattacactgttcaggaaatgatgcagtatcgaggtctgaagtacaggagaggagcc tacacttcagaccaagatttgaaaattatcaacaaagagttcctccatgaggagatccac aaggacctgctagtgatgggtgcgtacgagatctccaacaagtctgggggtgctggcagc ctgcacagccacctcaaggtcacagattctgctggccacattctctactccaaagaggat gcaaccaaagggaaatttgcctttatcctggaagattatgacatgtttgaagtgtga >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_4|133_aa MGAALLLLRLIWTLVMAGLSQHVGASQLPDMRVRLSWLVQPPAVYRHMKELSRNQSRLPR TQEPFSCSTELRAKQKEPFRGASLVLGPGPSSGATVLPDMSLLSTSGGESQPVLESPSMA PHGCIAKKKQKCV >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_4|402_bp atgggggccgccttgctcctgttgagactgatttggaccttagtcatggctggcctgagt cagcacgtgggagccagccaattgccagatatgcgagtgaggttatcctggcttgttcag ccaccagctgtctataggcacatgaaagagctcagcaggaatcagtccaggttacccaga acacaagaaccgttcagctgctccacagaactgcgagcaaaacaaaaggagccattcagg ggtgcatccttagttctaggccctggaccaagctctggggctacagtattgccagatatg tcccttctttctacaagtggaggtgaaagtcagccagtcctggaaagccccagcatggct ccacatggatgcattgccaagaagaagcagaagtgtgtgtga >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_5|525_aa MSLALGIQSGVGRVPATGTLCSPQRCLQSGNDKEPVHALPAGGDGNPRPHALAAPGCPAG RQLPVQETQPREGRRKVPRLRGAQRGLAPDDRLRNGLLPRVSRSLQRPKSPADLEHLQAL GRGPGAIPAALAWELRLLVVAPGVQSPAGAGPFPSTGREPSRTINNDYKTNSLVTNDGSL QVFTLRRSTPLRKKLNNEASLRKHSACCLGGPEDTGWVPVVEVPGQHLHIADVLPVEGDP QMTQGPPDSQPLGTLGQGGWKLLGIVGSLAPETLGGLGTEFGPCTHPLPFDMYPVAIKVA PNGSQAMTKLNIGTGILQQALSWGHAAPRPQQPAHIAALILQNGPSSCNGLAGQQSAIAK GKDSGAWSTIFTLMTSKWIIFSPNFSPDSRLINPTADSSIVRSLGQQQDGSSHGEAEEST PAQEASGSPHQGSEMDRQPARPQEMGGFREAPQLELTQPVQLDLAQFPGGSRGGGRWGDC GAGPMALAALPSENKASCRNSGRMRLFPMPLLRLALRGKHPESGP >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_5|1578_bp atgtccctggccctggggatacagtcgggagtgggcagagtccccgccacagggaccttg tgcagcccacagagatgcctgcagagtggaaatgataaggagcccgttcacgctctgcca gccgggggtgacgggaacccacgtcctcacgccctggctgctcccggctgccccgcgggc cggcagcttcccgttcaggaaactcagcctcgggaagggcggagaaaggtcccgcggctg cgaggagcccagcggggcttggcgcctgatgaccggctccggaacgggctgttaccccgc gtatctcggtctctgcagcggccaaagtcgcccgctgacctggagcacctgcaggccctg ggcagaggacccggggccattcccgcagccctggcctgggagctccggcttctagtggtg gctccgggtgtccagagcccggctggcgcagggcccttcccctccacggggagagagcct tctagaaccataaacaatgactacaagacaaacagccttgtgaccaatgatgggagcttg caagtgtttacactgcgtagatcaacacctttaagaaaaaaacttaacaatgaagcaagt cttaggaagcacagtgcctgctgcctgggggggcctgaagacacaggatgggtccccgtt gtggaggtgcccgggcagcacctgcatatagcagatgtgcttcctgtggaaggagacccc cagatgacacagggacccccagactctcagcccctaggcaccctgggccagggtggttgg aagcttctaggcattgtggggtctctggcaccagagacactcgggggtctggggaccgag tttgggccctgtacccacccactaccatttgacatgtatcctgtggccattaaagtcgca ccgaatggctctcaggccatgaccaaactgaacattgggactggaattttgcagcaagcc ctcagctggggccatgctgccccccgcccccaacagccagcacacatcgctgctttgatt ctgcaaaatggaccaagcagctgcaatggacttgccggacagcagagcgcaattgccaaa ggcaaggactctggagcgtggagcaccatcttcacactgatgacttccaaatggattatc ttcagccccaatttctcccctgactccaggctcataaatccaactgccgactccagcatt gtcagaagcctggggcaacagcaagacggcagcagccatggcgaggcagaagagtcaacg ccagcccaggaagcatcaggaagtccacatcaggggagtgagatggacagacagcctgca cggccgcaggaaatgggtggcttccgcgaagccccccagctggagttgacacagcctgtc cagctggacctggcccagtttccaggcggctccaggggcggggggcgctggggggactgt ggcgccggtccgatggctctagcagcgctgccatctgagaacaaagcgtcctgcaggaat tccggccggatgaggctctttcccatgccccttctgcggctggctctgcgcggaaagcat cctgaatccggcccctga >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_6|203_aa MAKWRCSGHPSGSVPGQVDIQAWNYKVWCASGKCGAAGPRHGYGKGNPRAAVPHGGALLY RGCIPPRAIWSAASPALFILLLSLNLFLRFLEQSESLAEIKITRWARVQIGGRPLLGRRA SRRPEERAAPAPWCPFVTLRHVTRRAPGSMGEIAGADGGLGLEKSRMRGANLKAARLQQN QSQFSGCGATYGCGLVNGLLWPD >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_6|612_bp atggcaaaatggaggtgctcgggacatccaagtgggagtgtccctgggcaggtggatatt caggcctggaattataaggtctggtgtgcaagtggtaaatgtggggctgccgggcctaga catggttatggaaaagggaacccccgggcggctgtgcctcacggaggggccctgctttac cgcggctgcataccacctcgggccatttggtcagctgcttcaccagctctcttcatcctc ttactctccctgaatctttttcttcgttttttggagcagtcagaaagcttggccgaaatc aagataactcgctgggcccgcgtgcagattggtgggcgccctctgctgggccggcgggcc tctcgcaggcctgaggagcgagctgcgcctgcgccctggtgtcccttcgtaacactgcgg cacgtcacgaggcgggcaccggggagtatgggcgaaatcgcaggcgcagacggcgggctc gggctagaaaagtcgcgcatgcgtggggctaatttaaaggcggcgcggctccaacagaac caaagccaattctccggctgtggcgccacctacgggtgtgggctcgtaaacggcctcctc tggccggactag >gi568815595r:128473283_128673729|GENSCAN_predicted_peptide_7|599_aa MEAPAAGLFLLLLLGTWAPAPGSASSEAPPLINEDVKRTVDLSSHLAKVTAEVVLAHLGG GSTSRATSFLLALEPELEARLAHLGVQVKGEDEEENNLEVRETKIKGKSGRFFTVKLPVA LDPGAKISVIVETVYTHVLHPYPTQITQSEKQFVVFEGNHYFYSPYPTKTQTMRVKLASR NVESYTKLGNPTRSEDLLDYGPFRDVPAYSQDTFKVHYENNSPFLTITSMTRVIEVSHWG NIAVEENVDLKHTGAVLKGPFSRYDYQRQPDSGISSIRSFKDVYYRDEIGNVSTSHLLIL DDSVEMEIRPRFPLFGGWKTHYIVGYNLPSYEYLYNLGDQYALKMRFVDHVFDEQVIDSL TVKIILPEGAKNIEIDSPYEISRAPDELHYTYLDTFGRPVIVAYKKNLVEQHIQDIVVHY TFNKVLMLQEPLLVVAAFYILFFTVIIYVRLDFSITKDPAAEARMKVACITEQVLTLVNK RIGLYRHFDETVNRYKQSRDISTLNSGKKSLETEHKALTSEIALLQSRLKTEGSDLCDRV SEMQKLDAQVKELVLKSAVEAERLVAGKLKKDTYIENEKLISGKRQELVTKIDHILDAL >gi568815595r:128473283_128673729|GENSCAN_predicted_CDS_7|1800_bp atggaggcgccagccgccggcttgtttctgctcctgttgcttgggacttgggccccggcg ccgggcagcgcctcctccgaggcaccgccgctgatcaatgaggacgtgaagcgcacagtg gacctaagcagccacctggctaaggtgacggccgaggtggtcctggcgcacctgggcggc ggctccacgtcccgagctacctctttcctgctggctttggagcctgagctcgaggcccgg ctggcgcacctgggcgtgcaggtaaagggagaagatgaggaagagaacaatttggaagta cgtgaaaccaaaattaagggtaaaagtgggagattcttcacagtcaagctcccagttgct cttgatcctggggccaagatttcagtcattgtggaaacagtctacacccatgtgcttcat ccgtatccaacccagatcacccagtcagagaaacagtttgtggtgtttgaggggaaccat tatttctactctccctatccaacgaagacacaaaccatgcgtgtgaagcttgcctctcga aatgtggagagctacaccaagctggggaaccccacgcgctctgaggacctactggattat gggcctttcagagatgtgcctgcctatagtcaggatacttttaaagtacattatgagaac aacagccctttcctgaccatcaccagcatgacccgagtcattgaagtctctcactggggt aatattgctgtggaagaaaatgtggacttaaagcacacaggagctgtgcttaaggggcct ttctcacgctatgattaccagagacagccagatagtggaatatcctccatccgttctttt aaggatgtttattaccgggatgagattggcaatgtttctaccagccacctccttattttg gatgactctgtagagatggaaatccggcctcgcttccctctctttggcgggtggaagacc cattacatcgttggctacaacctcccaagctatgagtacctctataatttgggtgaccag tatgcactgaagatgaggtttgtggaccatgtgtttgatgaacaagtgatagattctctg actgtgaagatcatcctgcctgaaggagccaagaacattgaaattgatagtccctatgaa atcagccgtgccccagatgagctgcactacacctatctggacacatttggccgccctgtg attgttgcctacaagaaaaatctggtagaacagcacattcaggacattgtggtccactac acgttcaacaaggtgctcatgctgcaggagcccctgctggtggtggcggccttctacatc ctgttcttcaccgttatcatctatgttcggctggacttctccatcaccaaggatccagcc gcagaagccaggatgaaggtagcctgcatcacagagcaggtcttgaccctggtcaacaag agaataggcctttaccgtcactttgacgagaccgtcaataggtacaagcaatcccgggac atctccaccctcaacagtggcaagaagagcctggagactgaacacaaggccttgaccagt gagattgcactgctgcagtccaggctgaagacagagggctctgatctgtgcgacagagtg agcgaaatgcagaagctggatgcacaggtcaaggagctggtgctgaagtcggcggtggag gctgagcgcctggtggctggcaagctcaagaaagacacgtacattgagaatgagaagctc atctcaggaaagcgccaggagctggtcaccaagatcgaccacatcctggatgccctgtag