GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:16:57 Sequence gi568815580r:48821375_49050424 : 229050 bp : 50.79% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12707 12727 21 1 0 138 64 16 0.082 1.94 1.02 Intr + 13399 13492 94 0 1 35 73 64 0.081 -0.86 1.03 Intr + 15130 15304 175 2 1 64 110 32 0.573 2.00 1.04 Intr + 21944 22037 94 2 1 131 55 33 0.598 4.27 1.05 Intr + 22298 22373 76 1 1 88 67 66 0.824 3.59 1.06 Intr + 27368 27710 343 0 1 93 80 83 0.277 2.49 1.07 Intr + 33583 33676 94 1 1 84 30 27 0.115 -3.53 1.08 Intr + 36214 36267 54 0 0 125 99 17 0.548 5.68 1.09 Term + 37970 38185 216 1 0 90 48 329 0.665 26.24 1.10 PlyA + 41818 41823 6 1.05 2.19 PlyA - 43131 43126 6 1.05 2.18 Term - 53263 53123 141 1 0 113 49 67 0.444 3.23 2.17 Intr - 64388 64354 35 0 2 136 110 -24 0.088 2.54 2.16 Intr - 66182 66096 87 0 0 92 60 40 0.071 1.54 2.15 Intr - 67805 67718 88 2 1 85 100 -12 0.042 -0.76 2.14 Intr - 72914 72801 114 2 0 30 88 63 0.006 1.14 2.13 Intr - 76086 75957 130 2 1 80 75 38 0.022 2.40 2.12 Intr - 83712 83558 155 0 2 84 100 14 0.008 1.07 2.11 Intr - 87801 87682 120 0 0 97 4 72 0.018 0.39 2.10 Intr - 89757 89630 128 1 2 15 101 84 0.325 2.80 2.09 Intr - 93536 93489 48 0 0 94 105 40 0.941 4.95 2.08 Intr - 96254 96220 35 1 2 55 76 64 0.031 -0.23 2.07 Intr - 100536 100002 535 1 1 119 25 833 0.026 72.08 2.06 Intr - 119002 118939 64 1 1 115 -8 74 0.001 -1.21 2.05 Intr - 120939 120856 84 0 0 40 62 78 0.002 0.42 2.04 Intr - 121181 121107 75 2 0 73 110 26 0.003 3.01 2.03 Intr - 128176 128094 83 2 2 67 59 80 0.118 2.36 2.02 Intr - 129087 128438 650 2 2 36 66 423 0.122 26.49 2.01 Init - 129220 129114 107 1 2 30 -7 234 0.828 5.79 2.00 Prom - 134208 134169 40 -4.16 3.00 Prom + 152497 152536 40 -4.06 3.01 Init + 154640 154708 69 1 0 43 80 151 0.970 8.85 3.02 Intr + 156681 156800 120 2 0 101 39 72 0.515 4.29 3.03 Intr + 156825 156880 56 2 2 83 38 30 0.054 -4.72 3.04 Term + 163823 163940 118 2 1 110 48 95 0.719 5.71 3.05 PlyA + 164289 164294 6 1.05 4.03 PlyA - 166717 166712 6 1.05 4.02 Term - 172138 172037 102 1 0 130 41 108 0.985 8.58 4.01 Init - 172749 172705 45 0 0 82 98 30 0.807 4.08 4.00 Prom - 172815 172776 40 -4.86 5.03 PlyA - 172869 172864 6 1.05 5.02 Term - 174419 174279 141 2 0 62 42 152 0.562 5.93 5.01 Init - 193171 193025 147 1 0 70 92 83 0.645 5.09 5.00 Prom - 194347 194308 40 -6.66 6.11 PlyA - 194356 194351 6 1.05 6.10 Term - 197508 197372 137 1 2 115 43 66 0.818 2.98 6.09 Intr - 197923 197860 64 1 1 122 89 14 0.981 3.19 6.08 Intr - 200135 200030 106 1 1 108 84 73 0.944 9.12 6.07 Intr - 201750 201591 160 1 1 39 89 22 0.233 -3.55 6.06 Intr - 204746 204596 151 2 1 95 41 111 0.780 6.84 6.05 Intr - 205611 205415 197 1 2 59 45 100 0.512 1.93 6.04 Intr - 208012 207881 132 2 0 75 32 74 0.543 1.12 6.03 Intr - 208578 208463 116 2 2 103 72 77 0.877 7.69 6.02 Intr - 209614 209497 118 2 1 -26 31 173 0.850 -0.28 6.01 Init - 213436 213304 133 1 1 78 47 45 0.414 -0.30 6.00 Prom - 213560 213521 40 -2.46 7.03 PlyA - 213927 213922 6 1.05 7.02 Term - 222830 222681 150 2 0 81 55 222 0.996 16.11 7.01 Init - 226577 226485 93 2 0 54 52 85 0.463 0.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 57909 57804 106 0 1 81 94 78 0.828 8.11 S.002 Term - 100536 99998 539 1 2 119 42 849 0.969 77.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_1|388_aa MRVGKCLRSALPAHTAQGSSRDTGNVDVSPGAREQQGCSLGDYEAPVSEIPNQPHHFSPP GPNIPPGPPGLPAPAIPLEGQGSRVGSPRRLPLVMPRALRSSRELCPVFPVTITPSCSLR SGEGLQAQVTVYDPHQGSLESGKDAFTNDFEMTGTPGDSHQAAFLPGCCGASHSHSLTGA PWFLPEGGYDGRAHNKSPPETIGQECIRFRTNTVWARTIALDSCPRNTFCRSRAREACLP QPAHIPVSARDSDEWAAPPPTPGPLTPSAYRHLIEMFIGLFSAFPPPRSCISTALEQGLL LQSQDVKEDAVLCCSMELQSTGRLLEEQLPEMMTELLASARDKMLCPSESMLTRSLLLEV IELHANSWNPLTPPITQYYNRTIQKLTA >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_1|1167_bp atgagggtcggcaaatgtctgcgatctgccctgccagctcacactgcccaaggctctagc cgagatacaggaaatgtggatgtgtcccccggagcccgagagcagcagggctgcagcctg ggcgattatgaggcaccagtgagtgagatccccaaccaaccccatcacttcagccctcca ggtcccaacatcccaccaggtccccctggactgccagcacctgccattcccctagagggg caaggaagcagggtgggaagccccaggagactgccgttggtcatgcccagggccctgcgc tccagcagggagctgtgcccagtgttcccagtgaccataactccatcctgttccctgcgg agtggggagggcttgcaggctcaggtgaccgtatatgatccacatcagggcagtttggag agtggaaaggatgcttttactaatgactttgagatgacagggactccaggagactcccac caggctgcctttctccctggctgctgtggggcctcccacagccacagcctgactggagca ccttggtttctccctgagggaggatatgatgggcgtgcccacaacaagtctccgccagag acaataggacaggaatgcattcgcttccgaacaaacactgtgtgggctcgcacaattgcc ttggattcttgtccaagaaacacattttgccgttcgagggcacgggaggcttgcctgccc cagcccgcccacatccccgtcagtgccagagactctgatgagtgggctgctcctcccccg accccagggcctctgacccccagtgcttaccgccatctgatagagatgtttattggtttg ttttctgcatttccaccgccccggagctgcatttccaccgccctggagcagggcctgctc ttgcaatctcaggatgtgaaggaagatgctgtcctttgctgctctatggagctgcagagt acaggccggctgctggaggaacagctgcctgagatgatgacagagctcctggccagcgca cgggacaagatgctgtgcccctcggagtccatgctgacccggtcgctgctcctagaggtc atcgagctccacgctaacagctggaaccctctgacgccccccatcacgcagtactacaac agaaccatccagaaactgacagcctga >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_2|892_aa MPRSAPRAAAAPARAPAAAAVACACCPNSAPDFFMVRQTTFLLASSPRMFRTKRSALVRR LWRSRAPGGEDEEEGAGGGGGGGELRGEGATDSRAHGAGGGGPGRAGCCLGKAVRGAKGH HHPHPPAAGAGAAGGAEADLKALTHSVLKKLKERQLELLLQAVESRGGTRTACLLLPGRL DCRLGPGAPAGAQPAQPPSSYSLPLLLCKVFRWPDLRHSSEVKRLCCCESYGKINPELVC CNPHHLSRLCELGVYGHCSCSLRLKIRRLHFRINKGVQREQTVQMLCLPPLKQGERIIWP LGGFQLLILQSFTLWTLDPFRGRGDKSELGIALVIVDTDINKELPGRSSDSLKGNSQLLL EPGDRSHWCVVAYWEEKTRVGRLYCVQEPSLDIFYDLPQGNGFCLGQLNSDNKSQLVQKV RSKIGCGIQLTREVDGVWVYNRSSYPIFIKSATLDNPDSRTLLVHKVFPGFSIKAFDYEK AYSLQRPNDHEFMQQPWTGFTVQISFVKGWGQCYTRQFISSCPCWLEVIFNSRGHGGNVD LDGASAINLPNTKWPLEEDSAREITVPGRLVPEALPEVIPEQERMLGNNKNDRVWLPALD NGMVVSEQWGRENVSGRGWEEEEKKEEEEEEEEEEEEEEKEGKHNQGLHSKRQGKKDPLA ASASGLLSQQFHSGPGLALSKTKALQMTTPTLFTSTQELGSRKSWDPGLCKTLPIGLSWE RERCLLLATNPLETLGQQTLPPIALTACGSQAAQQEESMAAPLLVKQTPGLTVRCKRCIY PTAVALGKDQAICTWLGQRTQVVGKAGCAQWLAWHSEDLPINPGTSVCSASAQGGTMRGG KGRWWQVGMLWLMLKHQDLQLGSPECQIEKETTEARKMWPRTQMEGAKDKAS >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_2|2679_bp atgccacggagcgcccctcgggccgccgccgctcctgcccgggcccctgctgctgctgct gtcgcctgcgcctgctgccccaactcggcgcccgacttcttcatggtcaggcaaacgact tttctcctcgcctcctcgccccgcatgttcaggaccaaacgatctgcgctcgtccggcgt ctctggaggagccgtgcgcccggcggcgaggacgaggaggagggcgcagggggaggtgga ggaggaggcgagctgcggggagaaggggcgacggacagccgagcgcatggggccggtggc ggcggcccgggcagggctggatgctgcctgggcaaggcggtgcgaggtgccaaaggtcac caccatccccacccgccagccgcgggcgccggcgcggccgggggcgccgaggcggatctg aaggcgctcacgcactcggtgctcaagaaactgaaggagcggcagctggagctgctgctc caggccgtggagtcccgcggcgggacgcgcaccgcgtgcctcctgctgcccggccgcctg gactgcaggctgggcccgggggcgcccgccggcgcgcagcctgcgcagccgccctcgtcc tactcgctccccctcctgctgtgcaaagtgttcaggtggccggatctcaggcattcctcg gaagtcaagaggctgtgttgctgtgaatcttacgggaagatcaaccccgagctggtgtgc tgcaacccccatcaccttagccgactctgcgaactaggggtctacgggcactgctcctgc agtctgcggttaaaaattagacgccttcatttccggatcaacaagggggtgcagagggag cagactgtccagatgctgtgccttcctccgctgaaacagggggaacgaattatctggccc ctggggggctttcagctcctcatcctgcagtcctttacgctctggacactggatccgttc cgtggacggggagacaagagcgagctggggattgctttggtaattgtagacacagatatt aataaggagctgccaggaagaagcagtgactctctgaagggcaattcccaacttcttctg gagcctggggatcggtcacactggtgcgtggtggcatactgggaggagaagacgagagtg gggaggctctactgtgtccaggagccctctctggatatcttctatgatctacctcagggg aatggcttttgcctcggacagctcaattcggacaacaagagtcagctggtgcagaaggtg cggagcaaaatcggctgcggcatccagctgacgcgggaggtggatggtgtgtgggtgtac aaccgcagcagttaccccatcttcatcaagtccgccacactggacaacccggactccagg acgctgttggtacacaaggtgttccccggtttctccatcaaggctttcgactacgagaag gcgtacagcctgcagcggcccaatgaccacgagtttatgcagcagccgtggacgggcttt accgtgcagatcagctttgtgaagggctggggccagtgctacacccgccagttcatcagc agctgcccgtgctggctagaggtcatcttcaacagccgcgggcacggaggtaacgtggac ctggacggtgcctctgcaattaatctacccaacacaaagtggccgctggaggaggactct gccagggaaattacagtgcccgggagacttgtcccagaggccctgcccgaggtaattcca gagcaggagcgcatgctgggaaataacaagaatgacagagtctggctgccagccttagat aatggaatggtagtcagtgagcaatggggtagagagaatgtcagcgggagggggtgggag gaggaggaaaaaaaggaggaagaagaggaggaagaagaggaggaagaggaagaggagaaa gagggaaagcataaccaagggcttcacagcaagagacaagggaagaaggacccactggca gcttcagcctctggtctgttgtctcagcaattccactctggccctggacttgcacttagc aaaactaaagccctgcaaatgactacacccaccctgttcacaagcacccaggagctgggc agcaggaagtcttgggatcctggattgtgcaagaccctgcccatcggattaagttgggag agggagcggtgcctcctccttgctaccaaccccctagaaacactgggccagcagacgctg cctccaattgctttaactgcctgcggcagtcaggcagctcaacaggaggaaagcatggcg gcccctttgctagtgaagcaaacaccaggcctgactgtcagatgcaagaggtgcatttat ccgacggctgtggccttgggaaaagaccaagccatttgtacctggctaggtcagagaacc caggttgtggggaaagcaggctgtgcccaatggctggcttggcattccgaggacttgccc atcaatccagggacttctgtgtgctctgccagtgctcagggagggacgatgagaggtggc aagggaaggtggtggcaggtgggaatgctttggctcatgctgaagcatcaggaccttcag cttgggagccctgagtgccagatagagaaagaaacaactgaagccaggaagatgtggccc aggacacagatggaaggtgccaaagacaaagctagctga >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_3|120_aa MPSSRRRAPRARAASRGAARVAQMVLRGSVFLSQWTRQIHRCVGDTECERQGQQQGLGGR GDPCCDLGHIGFSQPQAKSSPVASGELRNLLDMELGTAKSPPPAQMEEKPGAPIKECLSL >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_3|363_bp atgccgtcttcccggcgccgcgctcccagagctcgcgccgcctcccggggcgccgcgcgg gtggcgcagatggtactgagaggatctgtcttcctctctcagtggacacgacagattcat cgctgtgtgggggacaccgagtgtgagcgtcagggtcagcagcagggccttggaggcaga ggggacccttgctgtgacctgggccatattggcttctctcaaccccaggccaagtcctcc cctgttgcttcaggagagctgaggaacttgttggatatggagctgggcacagcaaaatct ccacccccagctcagatggaggagaaacctggtgctccaataaaggagtgtttatccctt taa >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_4|48_aa MGERGHRRPFQVKEKGCEFPMVTATEKVVALKAQAETVFCTVKHQDPF >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_4|147_bp atgggggagcggggccacaggcgacccttccaagtcaaggagaagggctgtgaattcccc atggtcacggcaactgaaaaagtcgttgccttgaaagctcaagcagaaactgtgttctgc actgtgaaacaccaagacccgttctaa >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_5|95_aa MDVARWPNLLPQSWDPLEAPRALGILHHQCKGKVPPTPSFKAPQRDGLWDTKLLNNQEMK DGSITNQTIAWDSEDVLSAVLNAEEPMWMRGLRLG >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_5|288_bp atggacgttgcccggtggcccaacctcctgccacagagctgggacccactggaggctccc agagccctcgggatccttcatcatcagtgtaagggcaaggttcccccaacccccagcttc aaggctccacaaagagatggtttgtgggacacgaagctactgaataatcaagaaatgaaa gatgggtccatcacaaaccagacaattgcttgggactccgaggacgtcctgtcggcagtt ctcaatgcagaggaacccatgtggatgcgtggcttgcgcctaggctga >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_6|437_aa MEYYAAIKKDEFMSFVGTWMKLETIILSKLSQGQKTKHHMFSLIGAFIIIIIIIIIIIII IIMLFAFSLSFLHECSVEFSRSCIHTGYPNLHPLTRKPRKTVTLLRLYFLGPGAPEGPWG NAGLRGNVAAALPNTRHPAMQLETSRETLPTMTPEKPRATTLPACPRQTQQLSLQPTQLV RCPSLGPQCPGRDPDSTMLGSPSGQRHAMRFTQEPHRHVGICKVPRNIRTGEEAVLDQPS SSILPSRAPKAQPQSCHSRYLPSAQDKQNTSISDEELALPVPARTGCGTSQGLHFLIYKM EVIIDVHAKLLRRLSPCCKVAPSEQSLMHPWSMPHRPELSRWCCTSIRGYMSPPPFSPNT GRQGSPVREEGDGTARPGTLVPQSPGLEGPLAGRGHALSRPHSVSARFNRILLRLLLWRL DVNMRGEDLVALSVVEA >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_6|1314_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgtagggacatggatg aagctagaaaccatcattctgagcaaactatcccaaggacagaaaaccaaacaccacatg ttctcactcataggtgctttcatcatcatcatcatcatcatcatcatcatcatcatcatc atcatcatgttatttgccttttctctttcattccttcatgaatgttcggtggagttttcc agaagctgcatccacactggctaccccaacctgcaccctctgactcgcaagcccaggaag actgtgactttgcttcggctctatttcctgggtcctggtgcaccggaagggccgtgggga aatgcaggactgagaggcaacgtggctgctgccctgcccaacactcggcatcctgccatg cagctagagaccagtcgtgagactctgcccacaatgaccccggagaagccaagggccact actttgcctgcatgcccacgccagactcagcagctgtccttgcagccaacccagctggtc agatgcccctccctgggcccacagtgtcccggtcgggaccctgactccacgatgttagga agcccctcggggcagcgtcatgccatgcgcttcacacaggagccccatcgccatgttggc atctgtaaggtccccaggaacatccggacaggagaggaagctgtcctggatcagcccagc tccagtatccttccttcccgggctcccaaagctcaaccacagtcctgccattcccgttac ctgccaagcgcacaggacaaacaaaacaccagcatcagcgatgaggagctggcccttcct gtgccagcacgtactggctgtgggacctctcagggcctccatttccttatctataaaatg gaggtaataattgatgttcacgcaaagctgttacgaagattaagcccgtgctgcaaggtg gcgccctcagagcagagcctgatgcatccatggtccatgcctcacaggccggagctctct cgttggtgctgcacctccatccggggctacatgtcccctccacctttctcacccaacact ggacggcagggaagccctgtgagggaagagggggatggcaccgccaggcctgggacactg gtgccccagagcccaggcctggaggggccccttgcaggacgaggacacgccctctctaga cctcactctgtttcggcgcggtttaatagaatcctcctcaggctgctgctgtggaggtta gatgtgaacatgcgtggagaggacttggtggcgctctcggtagtggaggcttag >gi568815580r:48821375_49050424|GENSCAN_predicted_peptide_7|80_aa MGLALASAGHLVTDQDAVGASDGMMYGERGSKFPELKFKYVEEEQPEEFFIPYVWSLVYN SAVGLYWNPQDIQLFTMDSD >gi568815580r:48821375_49050424|GENSCAN_predicted_CDS_7|243_bp atggggttggcactggcctcagcgggacatttggtgactgaccaggatgccgtgggagct tcagacggcatgatgtatggggagcgcgggtcgaaatttccagaattgaaattcaaatat gtggaagaggagcagcccgaggagttttttatcccctatgtctggtctcttgtctacaac tcagcagtcggcctgtactggaatccacaggacatccagctgttcaccatggattccgac tga