GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:23:28

Sequence gi568815596f:28293721_28512445 : 218725 bp : 48.48% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3155   3203   49  1  1   77   89    25 0.436   0.61
 1.02 Intr +   4618   4771  154  2  1  122  106    -5 0.505   3.83
 1.03 Intr +  27724  27905  182  1  2   45  103   102 0.047   6.91
 1.04 Intr +  33554  33727  174  0  0   78  111    27 0.419   3.91
 1.05 Intr +  38879  39030  152  0  2   84   82    73 0.659   6.18
 1.06 Term +  44730  45032  303  2  0  104   33   102 0.082   1.47
 1.07 PlyA +  45041  45046    6                               1.05

 2.04 PlyA -  47414  47409    6                               1.05
 2.03 Term -  50945  50636  310  1  1   99   34   139 0.702   4.13
 2.02 Intr -  53306  53118  189  1  0   85   58    55 0.499   0.90
 2.01 Init -  57420  57248  173  0  2   30   64   167 0.150   7.43
 2.00 Prom -  59965  59926   40                              -2.66

 3.00 Prom +  60502  60541   40                              -5.46
 3.01 Init +  64463  64489   27  1  0   73  101    29 0.296   2.58
 3.02 Intr +  66308  66362   55  1  1   82   69    23 0.158  -1.65
 3.03 Intr +  67890  67935   46  1  1  107   87    38 0.639   3.07
 3.04 Intr +  70368  70488  121  0  1   78   48   129 0.811   8.40
 3.05 Term +  75168  75239   72  2  0  106   55    63 0.905   2.71
 3.06 PlyA +  76324  76329    6                              -0.45

 4.00 Prom +  78097  78136   40                              -1.76
 4.01 Init +  80564  80646   83  1  2   60   92    48 0.511   2.84
 4.02 Intr +  80744  80819   76  2  1   94   55    28 0.843  -0.38
 4.03 Intr +  80948  81073  126  1  0  112   36    99 0.190   7.98
 4.04 Intr +  89187  89262   76  2  1   84   47    62 0.105   0.79
 4.05 Intr +  93106  93324  219  2  0  106   42    87 0.274   4.17
 4.06 Intr +  94873  95043  171  2  0   64   95    29 0.193   1.11
 4.07 Intr +  97625  97758  134  0  2   67   32    66 0.246  -0.74
 4.08 Intr + 100010 100102   93  1  0   44   96   153 0.275  11.86
 4.09 Intr + 110387 110638  252  1  0   96   92   257 0.964  24.53
 4.10 Intr + 115039 115146  108  0  0   97   77   263 0.999  26.58
 4.11 Term + 118210 118728  519  0  0  110   40   622 0.742  53.90
 4.12 PlyA + 123571 123576    6                               1.05

 5.00 Prom + 125457 125496   40                              -1.46
 5.01 Init + 131012 131083   72  1  0   79   23    84 0.106   2.16
 5.02 Intr + 134247 134414  168  2  0   75   32   111 0.145   4.34
 5.03 Intr + 160535 160595   61  1  1   95   70    45 0.006   1.71
 5.04 Term + 176182 176312  131  2  2   23   49   219 0.323   9.84
 5.05 PlyA + 177263 177268    6                              -1.75

 6.04 PlyA - 178414 178409    6                               1.05
 6.03 Term - 178959 178831  129  0  0   93   55    22 0.123  -2.42
 6.02 Intr - 180108 179984  125  1  2   39  113    95 0.140   7.40
 6.01 Init - 184455 184299  157  0  1   79   38    75 0.185   1.67
 6.00 Prom - 199933 199894   40                               1.34

 7.00 Prom + 201621 201660   40                              -4.46
 7.01 Init + 202395 202449   55  2  1   67  121   134 0.989  14.09
 7.02 Intr + 213324 213451  128  1  2   43   52    98 0.087   2.10
 7.03 Term + 215831 215869   39  1  0   91   39    67 0.126  -0.71
 7.04 PlyA + 216822 216827    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_1|337_aa
MRFHHVGQAGLELLTSVDLPLFFPRDQPTLTFQSVYHFTNSGQLYSQAQKNYPYSPRWDG
NEMAKRAKSMTQAKPKPKMITPDSRPENRNRVKSKQNEAVSPLAKPLVHTAQVLGKQQPL
MHARNSFHGDAKGAEMPAARGSKSWPLQLQKLAASCSSPRGTGQAPEGLLGGPQRERELL
SNFDHFEGELFEDQANQHPALGSRSHVIPKYPSRKSCSIIPGQALTKTAHHTQGVHLGLF
QNLCPSVPGGSICQWKALGNTSLERWPARLPVHMRVSTYSRFLEAAWNVFTAAFCSHSSF
CTPQAAPTAEIQLELAGGPWILEPFTSGYSLFLASIS

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_1|1014_bp
atgaggtttcaccatgttggccaggctggtcttgaactcctgacctcagttgacctgcct
ctgtttttccctcgagaccagccaactctcacatttcagtccgtttatcactttaccaac
agtggacagctttactcccaggcccaaaaaaattatccgtacagccccagatgggatgga
aatgaaatggccaaaagagcaaaatctatgacccaggccaaacccaagccaaaaatgatc
acccctgattctaggccagaaaataggaatcgtgtgaaatctaagcagaatgaggctgtg
tcaccactagccaaacctcttgttcatactgcgcaggttttaggaaaacaacagccttta
atgcatgctcgaaattccttccatggggatgccaagggagcagagatgcctgcagcccgt
gggagcaagtcctggcctttgcagttgcaaaaactggctgcaagctgctccagccccaga
ggaactggccaagctccagagggcctccttggagggcctcagagggagagagaactgctc
agtaattttgatcactttgaaggagagctgtttgaggaccaggccaaccaacaccctgct
cttggcagccggagccatgtgattcctaaatacccttctaggaagagctgcagcatcatc
cctggtcaagccctgaccaaaactgcacaccacacacaaggagtacacctgggcttattt
caaaacctttgtccctcagttccaggaggcagcatttgccaatggaaagctctaggaaac
accagtcttgagaggtggccagccagactgcctgtccacatgcgtgtcagcacatacagc
cgcttcctggaagccgcctggaatgtcttcacggcagcgttttgctcacacagcagcttt
tgcacgccccaggcagccccgactgctgaaatccaacttgagctggctggtggtccctgg
atcctagagcccttcacttcgggttactccctctttcttgcctctatttcttag

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_2|223_aa
METLYATKGIPSSGQAVPHRQTTQPNRAALQADGPWWADITLTGVDTLKAHLRLSGHGPI
LVKAFWLQDDPTPLPAASAGLATPPWVWDTPQGLWTTEDNTLSSPGNPASGFGLPVRAAH
MICPDTLLRLPAGVGSPSSLKGQRIEKAPCGNHLEQGLDCHAWCLRRERAAVTFNTHPPP
YFQMAPSTLLRTALLAPAQTAKSHRELQRSVLEQKRDHGVERD

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_2|672_bp
atggaaaccttgtatgcaacaaaaggcatcccttcctctgggcaggcagtaccacatcgg
caaaccacacagcctaacagggcggccctgcaggccgacgggccctggtgggctgacatc
accctcaccggtgtggacactctgaaggcccacctgcgtctgagcggtcacgggcccatc
ttggtcaaggccttttggctccaggatgacccaacacctcttcctgcagcctctgcaggc
ctggccaccccaccctgggtgtgggacactcctcagggcctctggaccacagaagacaat
accctctccagccctggaaaccctgcttctggctttgggctgcctgtcagggcggctcac
atgatttgccctgatacattgcttcggctgcctgcaggggtgggctctcctagttctctg
aagggtcagaggatcgagaaggcgccgtgtgggaaccacctggaacagggcctggactgt
cacgcatggtgcttgagaagggagagggcagctgtcacctttaacacccacccaccacct
tatttccagatggcacccagcacccttctacgtactgctctactggcacctgcgcagaca
gccaaaagtcaccgggagctgcagcgttccgttttggagcaaaagcgagatcatggagtt
gaaagggactga

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_3|106_aa
MVAVVIVPMPSVSSPAKLPAFEVRDYVSPAGAQLPNSDQKEELPYHRSPDCGHSLLPVSV
SPPNQRFTPLQLLIPGGPLTPSTLCDEAFLGQAPKFTPAFPAGSIP

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_3|321_bp
atggtggcagtggtcatcgttcctatgccctcagtttcttcacctgccaaactccctgcc
ttcgaggtgagagattacgtaagtcccgctggggctcagctgcccaactctgatcagaaa
gaagagctgccttatcaccgctcacctgactgtggccacagtctcctccctgtcagcgtc
tccccgcccaatcaacgcttcacacctttgcagctgctgattcccggaggaccacttaca
ccaagcactctctgtgatgaggcgtttttaggacaagcccccaagttcaccccagctttt
cctgccggctccatcccttga

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_4|618_aa
MAGRHRTNHKHQGILIVPRKQMHVFLKSCSRKDSLKGWAALGSPKARRDPTYKLPSFQCL
QCQLLLPNQTEDGLPGAHFLVQHEAEGRIRKAGHEGRGECGPQAAAHVLSDPQQLIHGVW
DPQNRKVAESCFKVGLEDAWQETRPDGGAVGKSFIAFRTSVSPLPYNVPNIQSSVLVVAD
VEAESGDVKGLGPGLWTKTATLAYSIVIDYKIHLAFRNVTISSKSMAYYKEMMAGSFRHD
GYRGQCIIKRYKETEDRSCVTVHSELSAMDSQHLLTLGPHKDPVGYRWLALLKNWDYPGN
FDTSSRGSSGSPAHAESYSSGGGGQQKFRVDMPGSGSAFIPTINAITTSQDLQWMVQPTV
ITSMSNPYPRSHPYSPLPGLASVPGHMALPRPGVIKTIGTTVGRRRRDEQLSPEEEEKRR
IRRERNKLAAAKCRNRRRELTEKLQAETEELEEEKSGLQKEIAELQKEKEKLEFMLVAHG
PVCKISPEERRSPPAPGLQPMRSGGGSVGAVVVKQEPLEEDSPSSSSAGLDKAQRSVIKP
ISIAGGFYGEEPLHTPIVVTSTPAVTPGTSNLVFTYPSVLEQESPASPSESCSKAHRRSS
SSGDQSSDSLNSPTLLAL

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_4|1857_bp
atggcagggcgccatagaaccaaccacaagcaccaggggatcctgattgtgcccaggaag
caaatgcatgtcttcctgaaaagctgctcccgtaaagatagcttgaaaggctgggctgcc
cttggttcacccaaggctcgacgtgatcccacatacaagctgccctctttccagtgcctt
caatgccagcttctgctgcccaaccagactgaagatgggctgcccggtgcacacttcctg
gtgcagcacgaggcggagggaagaatcaggaaagcaggtcatgaggggcgtggggagtgt
gggccccaggcagcagcgcacgtgctcagtgacccacagcagctgatccatggggtgtgg
gatcctcagaacaggaaggtggcagaatcgtgtttcaaagtgggacttgaagatgcttgg
caggagactcgtccagatggaggggcagtggggaaaagctttattgcctttaggacttct
gtgtctcccttgccttacaacgtacccaacattcaatccagtgtcctggtggtggcagat
gtggaagccgagtctggagatgtcaaggggcttggcccaggtctttggacaaaaacggca
acacttgcatattcaatagtcatcgattataagatacatctggctttcagaaatgttaca
atatcatcaaaaagtatggcttactataaagaaatgatggcaggttcattcaggcatgat
ggatatagaggacagtgtatcattaagagatataaggaaactgaggacagaagttgtgtg
acggtgcacagcgagttgtcagcaatggactctcaacacctcttaacattgggtcctcat
aaggaccctgtggggtacagatggctggcgctacttaagaattgggattatcccgggaac
tttgacacctcgtcccggggcagcagcggctctcctgcgcacgccgagtcctactccagc
ggcggcggcggccagcagaaattccgggtagatatgcctggctcaggcagtgcattcatc
cccaccatcaacgccatcacgaccagccaggacctgcagtggatggtgcagcccacagtg
atcacctccatgtccaacccataccctcgctcgcacccctacagccccctgccgggcctg
gcctctgtccctggacacatggccctcccaagacctggcgtgatcaagaccattggcacc
accgtgggccgcaggaggagagatgagcagctgtctcctgaagaggaggagaagcgtcgc
atccggcgggagaggaacaagctggctgcagccaagtgccggaaccgacgccgggagctg
acagagaagctgcaggcggagacagaggagctggaggaggagaagtcaggcctgcagaag
gagattgctgagctgcagaaggagaaggagaagctggagttcatgttggtggctcacggc
ccagtgtgcaagattagccccgaggagcgccgatcgcccccagcccctgggctgcagccc
atgcgcagtgggggtggctcggtgggcgctgtagtggtgaaacaggagcccctggaagag
gacagcccctcgtcctcgtcggcggggctggacaaggcccagcgctctgtcatcaagccc
atcagcattgctgggggcttctacggtgaggagcccctgcacacccccatcgtggtgacc
tccacacctgctgtcactccgggcacctcgaacctcgtcttcacctatcctagcgtcctg
gagcaggagtcacccgcatctccctccgaatcctgctccaaggctcaccgcagaagcagt
agcagcggggaccaatcatcagactccttgaactcccccactctgctggctctgtaa

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_5|143_aa
MATLDMGIGEWAPRVFTAYFLLRQVSLDNWTLAPAHARALQEHRKLLRHRTWMDDSLSFM
EQKKDASGSFCTKNEASKARPLLLRALAASVCTAGLGSVTEWEMYNFLKNGAKAGKSKKK
KRKKKKEEEEEKKEEEEGEGEVG

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_5|432_bp
atggccaccctcgatatgggcattggtgagtgggctccccgggtattcacagcctacttc
ttgctgcggcaggtaagcttggacaactggacattggcccctgcgcatgccagggcactg
caagagcatcgcaagcttctcagacacaggacctggatggatgacagtctttccttcatg
gagcagaagaaggatgcctcaggaagcttctgcaccaagaatgaggcctcaaaagcaagg
cccctgctgctgagagctctggcagcctccgtgtgcacagcaggcctgggctcagtcaca
gaatgggaaatgtataatttcctgaaaaatggagctaaagcaggaaaatcgaagaagaag
aagaggaagaagaagaaggaggaggaggaggagaagaaagaggaggaggagggagaaggg
gaagtgggatga

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_6|136_aa
MGHMVRISLKGSDQWAVGDIKECVRHPLKFEEHRNGLTDPPLKKPLCEREWLGVKLQTFA
VSVTALKAACLELFLPPSGFVVLLASGVKLQTFTAEGASSGLGHPRKGLSQCSGRLKGSS
SVARVGTKAKEAPRMS

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_6|411_bp
atggggcacatggtgaggatttctctgaagggaagtgatcaatgggctgtaggggatatc
aaagaatgtgtcaggcatcccctgaaatttgaggaacacaggaacggattgaccgaccca
cccttgaaaaagcccctgtgtgagagggagtggctgggagtgaagctgcagacctttgcg
gtgagtgttacagctcttaaggcagcatgtctggagttgttccttcctcccagtggcttc
gtcgtcttgctggcttcaggagtgaagctgcagaccttcacggctgagggagccagctcc
ggcctcggccatcccaggaaggggctctcacagtgcagcggcaggctgaagggctcctca
agcgtggccagagtgggcaccaaggccaaggaggcaccaagaatgagctga

>gi568815596f:28293721_28512445|GENSCAN_predicted_peptide_7|73_aa
MGLRPGIFLLELLLLLGQALRNPRSRLKFWPMVAEVKLELLLIVEDISIGGVRALEPAIV
KTPVVTALSNLDD

>gi568815596f:28293721_28512445|GENSCAN_predicted_CDS_7|222_bp
atggggctgcggccaggcattttcctcctggagctgctgctgcttctggggcaagctttg
aggaatcccaggtcccgtttaaagttctggccaatggtggctgaggtgaagctagagctg
ctgctgattgtggaagatattagtataggaggagtcagggctctggaacctgccatcgtc
aagaccccagtggtcacagccctgtccaacttggatgactga