GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:46:49 Sequence gi568815581r:15878050_16099594 : 221545 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4191 4339 149 2 2 100 86 16 0.149 2.16 1.02 Intr + 4788 4927 140 0 2 63 58 66 0.142 1.21 1.03 Term + 5347 5477 131 2 2 95 45 50 0.119 -0.26 1.04 PlyA + 7325 7330 6 1.05 2.04 PlyA - 7735 7730 6 1.05 2.03 Term - 16385 16146 240 2 0 68 44 104 0.879 0.03 2.02 Intr - 18726 18640 87 0 0 141 89 43 0.925 9.87 2.01 Init - 35077 34964 114 1 0 77 96 51 0.726 5.01 2.00 Prom - 36328 36289 40 -5.36 3.00 Prom + 37714 37753 40 -8.16 3.01 Init + 38064 38130 67 2 1 74 91 26 0.292 2.73 3.02 Intr + 49648 49826 179 2 2 5 113 119 0.465 5.74 3.03 Intr + 50460 50518 59 2 2 72 80 68 0.074 2.08 3.04 Intr + 64936 65030 95 1 2 78 60 84 0.104 4.21 3.05 Intr + 67171 67534 364 2 1 39 94 794 0.142 69.34 3.06 Intr + 68953 69083 131 1 2 59 64 50 0.429 0.04 3.07 Term + 88852 88973 122 2 2 80 55 134 0.554 7.94 3.08 PlyA + 91406 91411 6 1.05 4.00 Prom + 92424 92463 40 -6.86 4.01 Init + 94144 94249 106 0 1 48 103 16 0.332 -0.51 4.02 Intr + 95526 95684 159 1 0 55 53 108 0.436 3.96 4.03 Term + 96587 97293 707 0 2 15 39 468 0.405 28.28 4.04 PlyA + 97670 97675 6 1.05 5.00 Prom + 117612 117651 40 -3.96 5.01 Init + 121800 121983 184 2 1 62 94 297 0.970 24.98 5.02 Intr + 122069 122313 245 0 2 68 65 242 0.665 17.12 5.03 Intr + 123866 123976 111 1 0 96 21 105 0.774 5.18 5.04 Intr + 125782 125838 57 0 0 72 110 15 0.555 1.18 5.05 Intr + 126152 126213 62 1 2 123 111 49 0.997 8.23 5.06 Intr + 128425 128519 95 1 2 75 91 83 0.846 6.91 5.07 Intr + 146968 147122 155 2 2 72 110 112 0.884 11.59 5.08 Intr + 148491 148653 163 2 1 53 92 80 0.982 4.45 5.09 Term + 149325 149473 149 1 2 75 35 139 0.956 5.36 5.10 PlyA + 150080 150085 6 1.05 6.14 PlyA - 151128 151123 6 1.05 6.13 Term - 154434 154247 188 1 2 74 43 165 0.996 8.25 6.12 Intr - 161605 161426 180 2 0 68 69 107 0.038 6.64 6.11 Intr - 166834 166616 219 2 0 85 42 152 0.026 8.57 6.10 Intr - 169044 168902 143 2 2 78 106 25 0.093 3.30 6.09 Intr - 170869 170796 74 1 2 94 107 25 0.129 3.20 6.08 Intr - 180015 179858 158 1 2 89 75 60 0.180 4.53 6.07 Intr - 183845 183352 494 1 2 82 34 292 0.917 15.84 6.06 Intr - 184221 184056 166 1 1 112 29 31 0.635 -1.38 6.05 Intr - 186138 186019 120 1 0 74 82 211 0.770 19.57 6.04 Intr - 187622 187436 187 2 1 60 85 33 0.428 -0.54 6.03 Intr - 192476 192276 201 2 0 129 91 190 0.749 22.88 6.02 Intr - 193616 193360 257 0 2 62 93 183 0.997 13.36 6.01 Init - 194102 194096 7 2 1 68 59 0 0.290 -3.76 6.00 Prom - 195203 195164 40 -3.56 7.00 Prom + 195213 195252 40 -8.16 7.01 Sngl + 198243 198653 411 2 0 67 33 378 0.985 26.49 7.02 PlyA + 199813 199818 6 1.05 8.08 PlyA - 201418 201413 6 1.05 8.07 Term - 202104 202085 20 1 2 92 50 15 0.638 -3.42 8.06 Intr - 202460 202359 102 0 0 102 83 25 0.779 3.55 8.05 Intr - 202678 202558 121 1 1 112 82 59 0.899 7.77 8.04 Intr - 208147 208048 100 0 1 29 70 54 0.346 -2.19 8.03 Intr - 209296 209215 82 2 1 73 82 80 0.584 4.60 8.02 Intr - 214009 213814 196 1 1 96 100 72 0.681 8.19 8.01 Intr - 220447 220318 130 0 1 53 115 35 0.305 3.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 166762 166908 147 0 0 11 42 198 0.843 5.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_1|139_aa MPPGGGGRWLRGASRKTGNSATLQAKGPHCASSDLPNSCEMWEKVRGLTRFDSVGQICQQ LVLQHDTTQHQSHHVTIQKKEALIFAAVPSPSPRTACQEYGSNSTGYRVAKGFQAMVAGG LSARTVVLEERRKPTTPEL >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_1|420_bp atgccccctggaggtggtggccggtggctgagaggagcatctaggaagaccggaaactca gccaccctgcaggctaagggcccacattgtgcctcttctgacctgccaaattcctgtgaa atgtgggagaaagtgaggggactcacaaggtttgactcagtaggtcaaatatgccagcag ctggtgctgcagcatgacaccacccaacaccaaagccatcatgtcaccattcagaaaaag gaggccttgatatttgctgctgttccctctccgtcccccaggacagcatgccaggaatat gggtctaactccactggctacagagttgcaaagggttttcaagccatggtggcaggtggt ctctcagccagaactgtggttctggaggagaggaggaaacccacaactcctgaattatga >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_2|146_aa MGFWAAETIATDSATTDASGGTLGASHLTPHLIRPLSRVEVMSKVWQLKGTSHTGTSMGT KANRLKMLLLRPGSALLPPPPSDTDFDITATVSPHLNVIPVTDSHSTEFLSGLFDDLSLF INNFTCYKVYRLLVCVVQEGMTVQSS >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_2|441_bp atgggcttctgggcagcagagaccattgccacagacagtgcaacaacagatgcctcaggg ggcacgttgggagcctctcacctgacacctcatttaatacgaccactttctagggtggaa gtgatgtctaaagtttggcagctaaaaggaaccagccacacaggaacaagtatggggaca aaagccaacagattaaagatgctactccttcgccctggctctgcactgctgcctcctcca ccttctgacaccgattttgacatcactgccactgtctcccctcatctaaatgtgatccct gtcactgactctcacagcactgagtttctctctgggctctttgacgacttatccttgttc attaataactttacttgttataaagtctaccgcttgcttgtctgtgtagttcaggagggc atgacagttcagagttcatga >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_3|338_aa MAACLRTGEDSCSPELSPSKGSVVQVRDEGTQPSKGRADGAAVENAGRRGIRDAPQCLLL LTWRRLGSRVRRKVAFNIQVEKPVQLYKGMGLDVSQTRLGDRKCYWRSCLISLAFCVAAE VCVNCCGNVIGGAGGARGPAGPAMLLETQDALYVALELVIAALSVAGNVLVCAAVGTANT LQTPTNYFLVSLAAADVAVGLFAIPFAITISLGFCTDFYGCLFLACFVLVLTQSSIFSLL AVAVDRYLAICVPLRLFRGSQQEQEPPAWEGCEPYVGVWPLHVLLPVLLDQEALCLPSVL SDLCSRAVTATGWRTEGRKDVAHALQECFGEMDKRMQT >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_3|1017_bp atggctgcctgcttacgaacaggcgaagactcttgcagcccagagctgtctccctccaaa ggcagtgtagttcaggtgagagatgaggggactcagcccagcaagggcagagcagatgga gcggctgtggagaatgcaggaaggagaggaatccgggacgcccctcagtgcctgctgctg ctgacatggagaagactcggatcaagagtcaggcgcaaggtggctttcaacatccaagtg gagaagccggtgcagctgtacaagggcatgggactggatgtcagccagacaaggctgggg gacaggaagtgctattggaggagctgcctgatctctctggccttctgtgtggcagcagaa gtttgtgtcaactgctgtgggaacgtgattggtggagcagggggcgcccggggcccagct ggcccggccatgctgctggagacacaggacgcgctgtacgtggcgctggagctggtcatc gccgcgctttcggtggcgggcaacgtgctggtgtgcgccgcggtgggcacggcgaacact ctgcagacgcccaccaactacttcctggtgtccctggctgcggccgacgtggccgtgggg ctcttcgccatcccctttgccatcaccatcagcctgggcttctgcactgacttctacggc tgcctcttcctcgcctgcttcgtgctggtgctcacgcagagctccatcttcagccttctg gccgtggcagtcgacagatacctggccatctgtgtcccgctcaggttgtttcgtgggtcc cagcaggaacaggagcctcctgcttgggagggctgtgagccttatgtgggtgtctggcca ctccatgtgctgctgcccgtgctgttggaccaggaggccttatgtctcccatcagtgctc agtgacctctgcagcagagcagtgactgccacgggctggcgaacggaaggacggaaggac gtggcccatgcccttcaggaatgttttggggagatggacaaacgcatgcagacctga >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_4|323_aa MSQREPLVKNGRHWTCGAEAGGFCGHAGKYTSYTADQQHSILIGANPIVNCARRGSTLCA PYENLMHHPPPLVCGNIVFHEAGPWCQKERYKLTVTDGILLNRYKSLVTGTRARGVIAVL WVLAFGIGLTPFLGWNSKDSATNNCTEPWDGTTNESCCLVKCLFENVVPMSYMVYFNFFG CVLPPLLIMLVIYIKIFLVACRQLQRTELMDHSRTTLQREIHAAKSLAMIVGIFALCWLP VHAVNCVTLFQPAQGKNKPKWAMNMAILLSHANSVVNPIVYAYRNRDFRYTFHKIISRYL LCQADVKSGNGQAGVQPALGVGL >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_4|972_bp atgagccagagggagccattggttaagaatggcaggcactggacgtgtggagccgaggct ggggggttctgcggccatgctgggaagtatacgtcttacactgcagatcagcagcattcg attctcataggagctaaccctattgtgaactgtgcacgcaggggatctacgctgtgcgct ccttatgagaatctaatgcatcaccccccgcccctggtctgtggaaacattgtcttccat gaagctggtccctggtgccagaaagagcgctataaactgaccgtaacagatggtattctt ttaaacaggtataaaagtttggtcacggggacccgagcaagaggggtcattgctgtcctc tgggtccttgcctttggcatcggattgactccattcctggggtggaacagtaaagacagt gccaccaacaactgcacagaaccctgggatggaaccacgaatgaaagctgctgccttgtg aagtgtctctttgagaatgtggtccccatgagctacatggtatatttcaatttctttggg tgtgttctgcccccactgcttataatgctggtgatctacattaagatcttcctggtggcc tgcaggcagcttcagcgcactgagctgatggaccactcgaggaccaccctccagcgggag atccatgcagccaagtcactggccatgattgtggggatttttgccctgtgctggttacct gtgcatgctgttaactgtgtcactcttttccagccagctcagggtaaaaataagcccaag tgggcaatgaatatggccattcttctgtcacatgccaattcagttgtcaatcccattgtc tatgcttaccggaaccgagacttccgctacacttttcacaaaattatctccaggtatctt ctctgccaagcagatgtcaagagtgggaatggtcaggctggggtacagcctgctctcggt gtgggcctatga >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_5|406_aa MFRLLSWSLGRGFLRAAGRRCRGCSARLLPGLAGGPGPEVQVPPSRVAPHGRGPGLLPLL AALAWFSRPAAAEEEEQQGADGAAAEDGADEAEAEIIQLLKRAKVRRLRALRPAERGSLC ERGLGVAAHYSLRLAEEWMEAPRLSIMKDEPEEAELILHDALRLAYQTDNKKAITYTYDL AEQLFKATMSYLLGGGMKQEDNAIIEISLKLASIYAAQNRQEFAVAGYEFCISTLEEKIE REKELAEDIMSVEEKANTHLLLGMCLDACARYLLFSKQPSQAQRMYEKALQISEEIQGER HPQTIVLMSDLATTLDAQGRFDEAYIYMQRASDLARQINHPELHMVLSNLAAVLMHRERY TQAKEIYQEALKQAKLKKDEISVQHIREELAELSKKSRPLTNSVKL >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_5|1221_bp atgttccggctcctgagctggagcctgggccgaggcttcctgcgggccgcggggcggcgg tgccggggctgctccgcgcgcctgctcccggggctggcaggaggtccggggcccgaggtg caggtgccgccatcccgagtcgcgccgcacggccggggcccaggcctgctgccgctgctg gcagcgctcgcctggttctcgaggcccgctgcggcagaggaggaggagcagcagggagcc gacggggccgctgccgaggacggggcggacgaggccgaggcagagatcatccagctgctg aagcgagccaaggtgaggcggctccgggccctgcgcccggccgagcgcggtagcctttgt gagcggggtcttggcgttgctgcgcattactctctccgcctcgccgaagagtggatggag gctccgcggttgagcattatgaaagatgagccagaagaggctgagttaattttgcatgac gctcttcgtctcgcctatcagactgataacaagaaggccatcacttacacttatgatttg gctgaacaactttttaaagcaacaatgagttacctccttggagggggcatgaagcaggag gacaatgcaataattgaaatttccctaaagctggccagtatctatgctgcgcagaacaga caggaatttgctgttgctggctatgaattctgcatttcaactctagaggaaaaaattgaa agagaaaaggaattagcagaagacattatgtcagtggaagagaaagccaatacccacctc ctcttgggcatgtgcttagacgcctgtgctcgctaccttctgttctccaagcagccgtca caggcacaaaggatgtatgaaaaagctctgcagatttctgaagaaatacaaggagaaaga cacccacagaccattgtgctgatgagtgacctggctactaccctggatgcacagggccgc tttgatgaggcctatatttatatgcaaagggcatcagatctggcaagacagataaatcat cctgagctacacatggtactcagtaatctagctgcagttttgatgcacagagaacgatat acacaagcaaaagagatctaccaggaagcactgaagcaagcaaagctgaaaaaagatgaa atttctgtacaacacatcagggaagagttggctgagctgtcaaagaaaagtagacctttg acaaattctgtcaagctctaa >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_6|797_aa MQGTPRATTESFEDGLKYPKQIKRESPPIRAFEGAITKGKPYDGITTIKEMGRSIHEIPR QDILTQESRKTPEVVQSTRPIIEGSISQGTPIKFDNNSGQSAIKHNVKSLITGPSKLSRG MPPLEIVPENIKVVERGKYEDVKAGETVRSRHTSVRQLSPTPGYPSQYQLYAMENTRQTI LNDYITSQQMQVNLRPDVARGLSPREQPLGLPYPATRGHPTHLAAAASAEREREREREKE RERERIAAASSDLYLRPGSEQPGRPGSHGYVRSPSPSVRTQETMLQQRPSVFQGTNGTSV ITPLDPTAQLRIMPLPAGGPSISQGLPASRYNTAADALAALVDAAASAPQMDVSKTKESK HEAARLEENLRSRSAAVSEQQQLEQKTLEVEKRSVQCLYTSSAFPSGKPQPHSSVVYSEA GKDKGPPPKSRYEEELRTRGKTTITAANFIDVIITRQIASDKDARERGSQSSDSSSSYDP TRQYEGPLHHYRPQQESPSPQQQLPPSSQAEGMGQVPRTHRLITLADHICVPVVHEKQDS LLLLSQRGAEPAEQRNDARSPGSISYLPSFFTKLENTSPMVKSKKQEIFRKLNSSGGGDS DMAGAAAVEWAGSHWCSTSCWSRSTSPYIVDEAPNVDIGQGHCKQAGPKRFNIYTSCFNE GIDLILRDGHLIVMQSSVSSRGHSFADPASNLGLEDIIRKALMGSFDDKVEDHGVVMSQP MGVVPGTANTSVVTSGSTQFPYNPLTMRMLSSTPPTPIACAPSAVNQAAPHQQNRIWERE PAPLLSAQYETLSDSDD >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_6|2394_bp atgcaggggacaccaagagcaacaactgaaagctttgaagatggccttaaatatcccaaa caaattaaaagggaaagtcctcccatacgagcatttgaaggtgccattaccaaaggaaaa ccatatgatggcatcaccaccatcaaagaaatggggcgttccattcatgagattccaagg caagatattttaactcaggaaagtcggaaaactccagaagtggtccagagcacacggccg ataattgagggttccatttcccagggcacaccaataaagtttgacaacaactcaggtcaa tctgccatcaaacacaatgtcaaatccttaatcacggggcctagcaaactatcccgtgga atgcctccgctggaaattgtgccagagaacataaaagtggtagaacggggaaaatatgag gatgtgaaagcaggcgagaccgtgcgttcccggcacacgtcagtgagacagctttcacca actccaggttacccaagtcagtatcagctttacgcaatggagaacacaagacagacaatc ttaaatgattacattacctcacaacagatgcaagtgaacttgcgtccagatgtggccaga ggactctccccaagagagcagccactgggtctcccatacccagcaacgagaggacaccca acacaccttgcagctgctgcaagtgctgagagggaacgggaacgggagcgggagaaggag cgggagcgggaacggattgctgcagcttcctccgacctctacctgcggccaggctcagaa cagcctggccgacctggcagtcatggatatgttcgctccccttccccttcagtaagaact caggagaccatgttgcaacagagacccagtgttttccaaggaaccaatggaaccagtgta atcacacctttggatccaactgctcagctacgaatcatgccactgcctgctgggggccct tcaataagccaaggcctgccagcctcccgttacaacactgctgcggatgccctggctgct cttgtggatgctgcagcttctgcaccccagatggatgtgtccaaaacaaaagagagtaag catgaagctgccaggttagaagaaaatttgagaagcaggtcagcagcagttagtgaacag cagcagctagagcagaaaaccctggaggtggagaagagatctgttcagtgtttatacact tcttcagcctttccaagtggcaagccccagcctcattcttcagtagtttattctgaggct gggaaagataaagggcctcctccaaaatccagatatgaggaagagctaaggaccagaggg aagactaccattactgcagctaacttcatagacgtgatcatcacccggcaaattgcctcg gacaaggatgcgagggaacgtggctctcaaagttcagactcttctagtagctatgatcct accagacaatatgaaggaccattacatcactatcgaccacagcaggaatcaccatctccc caacaacagctgcccccttcttcacaggcagagggaatggggcaagtgcccaggacccat cggctgatcacacttgctgatcacatctgtgttccggttgtgcatgagaaacaggacagc ttgctgctcttgtctcagaggggcgcagagcctgcagagcagaggaatgatgcccgctca ccagggagtataagctacttgccttcattcttcaccaagcttgaaaatacatcacccatg gttaaatcaaagaagcaggagatttttcgtaagttgaactcctctggtggaggtgactct gatatggctggagcagcagcagtggagtgggcaggatcccactggtgcagcaccagctgc tggagcaggtccaccagcccctacattgtagatgaggctcccaatgttgacattggccag ggccattgcaaacaagctggaccaaaaaggttcaacatttacaccagctgctttaacgag ggcattgatcttatcctccgtgatggtcacctcattgtcatgcagagctcagttagctct agaggccattcttttgctgatcctgccagtaatcttgggctggaagacattatcaggaag gctctcatgggaagctttgatgacaaagttgaggatcatggagttgtcatgtcccagcct atgggagtagtgcctggtactgccaacacctcagttgtgaccagtggctcaactcagttt ccttataaccctctgactatgcggatgctcagcagtactccaccaacaccgattgcatgt gctccctctgcggtgaaccaagcagctcctcaccaacagaacaggatctgggagcgagag cctgccccactgctctcagcacagtacgagaccctgtcggatagtgatgactga >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_7|136_aa MGCKAAETIHNINNAFGPGTVNKHTVQWWFKKFYKGNKSLEDEEHSGRPSEVDNNQLRAI IDADPLTTTREVAKELNVNHSTVVWHLKQTGKVKKLSKWVPHELSENFKNCLSEVSSLIL GNNKPFLNWIMTSDEK >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_7|411_bp atgggttgtaaagcagcagagacaattcataacatcaacaacgcatttggcccaggaact gttaacaagcatacagtgcagtggtggttcaagaagttttacaaaggaaacaagagcctt gaagatgaggagcatagtggccggccatcggaagttgacaacaaccaattgagagcaatc attgatgctgatcctcttacaactacacgagaagttgccaaagaactcaacgtcaaccat tctacggttgtttggcatttgaagcaaactggaaaggtgaaaaagctcagtaaatgggtg ccccatgagctgagcgaaaattttaaaaactgtctttctgaagtgtcttctcttattcta ggcaacaacaaaccatttctcaactggatcatgacatccgatgaaaagtag >gi568815581r:15878050_16099594|GENSCAN_predicted_peptide_8|250_aa XMFPMDSKPSLLNPTGSILVSSPLKPNPLDLPQLQHRAAVIPPMVSCTPCNIPIGTPVSG YALYQRHIKAMHESALLEEQRQRQEQIDLECRSSTSPCGTSKSPNREWEGKSVAYMPYAE VKRALEQEAQMHNTAARVVINNNLSEYEVSRDNDQKMQQSLYSCHKSELSGTPGTYLTSH NQASYTQETPKPSVGSISLGLPRQQESAKSATLPYIKQEEFSPRSQNSQPEGLLVRAQHE GVVRVEYSPK >gi568815581r:15878050_16099594|GENSCAN_predicted_CDS_8|753_bp nnaatgtttcctatggactcaaagccttcactgttaaaccccactggatctatactcgtc tcatctccgttaaaaccaaatccactggatctgccacagcttcagcatcgagctgctgtt atcccaccaatggtatcctgcaccccatgtaacataccaattggaaccccagtgagcggc tatgctctctaccagcgacacattaaagcaatgcatgagtcagcactcctggaggagcag cggcagagacaagaacagatagatttggaatgtagaagttctacaagtccatgtggcaca tccaagagtccaaacagagagtgggaaggaaaatcagtcgcctatatgccttatgctgag gtcaaacgtgctctagaacaggaagcacagatgcacaatactgcagcaagagttgtaata aataataatctgtctgaatatgaagtttcaagagataatgaccagaagatgcagcagtca ctgtactcttgccacaaatcagaactttcaggaacaccaggcacttatttgacttctcat aatcaggcttcctacactcaagaaacacccaagccgtcagtgggatctatctctcttgga ctgccacggcaacaggaatctgccaaatcagctactttgccctacatcaagcaggaagaa ttttctccccgaagccaaaactcacaacctgagggtctgttggtcagggcccaacatgaa ggtgtagtcagagttgagtactctcccaagtag