GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:38:36 Sequence gi568815597r:235452112_235683838 : 231727 bp : 44.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 Intr - 1035 979 57 0 0 24 111 70 0.706 1.98 1.11 Intr - 2198 2045 154 1 1 36 91 112 0.691 6.37 1.10 Intr - 3573 3448 126 2 0 104 106 44 0.935 7.59 1.09 Intr - 6675 6492 184 1 1 72 78 130 0.790 9.25 1.08 Intr - 13603 13525 79 1 1 97 110 -16 0.590 0.62 1.07 Intr - 18849 18739 111 0 0 48 98 53 0.649 2.88 1.06 Intr - 28038 27943 96 0 0 64 77 131 0.895 9.81 1.05 Intr - 32404 32211 194 2 2 47 69 251 0.991 18.31 1.04 Intr - 37157 37057 101 1 2 40 111 55 0.861 2.75 1.03 Intr - 42717 42570 148 1 1 112 90 57 0.897 7.59 1.02 Intr - 51750 51577 174 1 0 73 18 87 0.459 0.11 1.01 Init - 52141 52030 112 1 1 26 94 265 0.986 19.28 1.00 Prom - 66016 65977 40 -5.46 2.00 Prom + 72415 72454 40 -3.96 2.01 Init + 79311 79331 21 2 0 99 50 17 0.525 -2.27 2.02 Term + 81825 81893 69 2 0 119 39 111 0.919 7.24 2.03 PlyA + 82267 82272 6 1.05 3.06 PlyA - 82279 82274 6 1.05 3.05 Term - 86706 86516 191 1 2 41 52 137 0.411 3.01 3.04 Intr - 92339 92308 32 1 2 116 74 16 0.621 0.77 3.03 Intr - 98149 97973 177 0 0 58 75 171 0.880 11.83 3.02 Intr - 100126 100020 107 1 2 144 -57 115 0.360 1.61 3.01 Init - 100804 100802 3 1 0 71 101 0 0.249 -0.40 3.00 Prom - 105044 105005 40 -1.96 4.00 Prom + 111968 112007 40 -1.86 4.01 Init + 114572 114625 54 1 0 84 59 75 0.328 5.48 4.02 Intr + 123121 123189 69 0 0 62 97 32 0.065 0.88 4.03 Intr + 139495 139706 212 0 2 47 74 133 0.390 5.51 4.04 Intr + 141664 141712 49 1 1 66 89 52 0.336 1.78 4.05 Term + 151923 152027 105 2 0 116 42 63 0.423 2.91 4.06 PlyA + 152623 152628 6 1.05 5.04 PlyA - 153827 153822 6 1.05 5.03 Term - 163480 163313 168 1 0 23 44 180 0.583 4.98 5.02 Intr - 164161 163932 230 2 2 88 55 185 0.790 12.79 5.01 Init - 165604 165556 49 1 1 86 58 30 0.353 -1.08 5.00 Prom - 171411 171372 40 -3.16 6.00 Prom + 171616 171655 40 -7.66 6.01 Init + 176455 176498 44 0 2 62 75 49 0.229 0.89 6.02 Intr + 181697 181846 150 2 0 85 45 135 0.825 8.18 6.03 Intr + 184089 184288 200 0 2 66 77 98 0.872 5.49 6.04 Term + 185818 185972 155 2 2 73 46 140 0.953 6.38 6.05 PlyA + 186846 186851 6 1.05 7.03 PlyA - 189771 189766 6 1.05 7.02 Term - 191611 191490 122 2 2 93 46 117 0.894 6.64 7.01 Init - 194506 194419 88 1 1 80 92 61 0.346 6.51 7.00 Prom - 199348 199309 40 -4.96 8.05 PlyA - 199752 199747 6 1.05 8.04 Term - 201903 201815 89 1 2 38 44 98 0.408 -1.78 8.03 Intr - 205209 205067 143 2 2 59 69 83 0.541 3.50 8.02 Intr - 211944 211873 72 2 0 108 110 29 0.947 5.62 8.01 Init - 212568 212354 215 0 2 44 116 47 0.948 1.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 174601 174484 118 0 1 78 42 112 0.800 3.61 S.002 Term + 221224 221249 26 2 2 99 39 77 0.924 2.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_1|512_aa MRNWLVLLCPCVLGAALHLWLRLRSPPPACASGAGPAGLVSDVRRRPFARERNLVCPSAA SAFCFGSSGSEVVFFPPKLLVFLGQDWGFSAVDNVYQLALFPQWKSTHYDVVVGVLSARN NHELRNVIRSTWMRHLLQHPTLSQRVLVKFIIGAHGCEVPVEDREDPYSCKLLNITNPVL NQEIEAFSLSEDTSSGLPEDRVVSVSFRVLYPIVITSLGVFYDANDVGFQRNITVKLYQA EQEEALFIARFSPPSCGVQVNKLWYKPVEQFILPESFEGTIVWESQDLHGLVSRNLHKVT VNDGGGVLRVITAGEGALPHEFLEGVEGVAGGFIYTIQEGDALLHNLHSRPQRLIDHIRN LHEEDALLKEESSIYDDIVFVDVVDTYRNVPAKLLNFYRWTVETTSFNLLLKTDDDCYID LEAVFNRIVQKNLDGPNFWWGKLNWAVDRTGKWQELEYPSPAYPAFACGSGYVISKDIVK WLASNSGRLKTYQGEDVSMGIWMAAIGPKRYQ >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_1|1536_bp atgcgaaactggctggtgctgctgtgcccgtgtgtgctcggggccgcgctgcacctctgg ctgcggctgcgctccccgccgcccgcctgcgcctccggggccggccctgcaggcctggtt agtgatgtccgacgccgcccgttcgcccgggagcggaacctcgtgtgcccttcagccgcc agcgctttctgctttgggagctctggctctgaagttgttttcttcccccccaagttgttg gtctttctcggtcaagactggggttttagtgccgtggataatgtgtatcagttggcctta tttcctcagtggaaatctactcactatgatgtggtagttggcgtgttgtcagctcgcaat aaccatgaacttcgaaacgtgataagaagcacctggatgagacatttgctacagcatccc acattaagtcaacgtgtgcttgtgaagttcataataggtgctcatggctgtgaagtgcct gtggaagacagggaggatccttattcctgtaaactactcaacatcacaaatccagttttg aatcaggaaattgaagcgttcagtctgtccgaagacacttcatcggggctgcctgaggat cgagttgtcagcgtgagtttccgagttctctaccccatcgttattaccagtcttggagtg ttctacgatgccaatgatgtgggtttccagaggaacatcactgtcaaactttatcaggca gaacaagaggaggccctcttcattgctcgcttcagtcctccaagctgtggtgtgcaggtg aacaagctgtggtacaagcccgtggaacaattcatcttaccagagagctttgaaggtaca atcgtgtgggagagccaagacctccacggccttgtgtcaagaaatctccacaaagtgaca gtgaatgatggagggggagttctcagagtcattacagctggggagggtgcattgcctcat gaattcttggaaggtgtggagggagttgcaggtggttttatatatactattcaggaaggt gatgctctcttacacaaccttcattctcgccctcaaagacttattgatcatataaggaat ctccatgaggaagatgccttactgaaggaggaaagcagcatctatgatgatattgttttt gtggatgttgtcgacacttatcgtaatgttcctgcaaaattattgaacttctatagatgg actgtggaaacaacgagcttcaatttgttgctgaagacagatgatgactgttacatagac ctcgaagctgtatttaataggattgtccaaaagaatctggatgggcctaatttttggtgg ggaaaactgaattgggcagttgaccgaaccggaaagtggcaggagttggagtacccgagc cccgcttaccctgcctttgcatgtgggtcaggatatgtgatctccaaggacatcgtcaag tggctggcaagcaactcggggaggttaaagacctatcagggtgaagatgtaagcatgggc atctggatggctgccataggacctaaaagataccag >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_2|29_aa MGLGPGPPTQCEDEDEDEDFYDDPLSLNE >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_2|90_bp atggggctggggcctggccctcctactcaatgtgaagatgaagatgaggatgaagacttt tatgatgatccactttcacttaatgaatag >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_3|169_aa MVSQAAADLLAYCEAHVREDPLIIPVPASENPFREKKRKPYSGSLRKVIHGCPKERLDID ELALGLSKINLKVLPKPNSYDPDERGEYTVGAELFVPQTHTGPRPGASLIHNLSEEQDIR KIGGLFKTLPLTSSSHFISSLTTYRYAFLTGFYSKDLIIEPANTSYTNA >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_3|510_bp atggtctcccaggcagctgcggacctcctggcctactgtgaagctcacgtgcgggaagat cctctcatcattccagtgcctgcatcagaaaacccctttcgcgagaagaagaggaagccg tattctggtagtttaaggaaggtgattcatggatgccccaaggaaaggctagacatcgat gaattagctcttggattgtctaagatcaacctgaaggtcctacctaaaccaaactcatat gatcccgatgaaagaggagaatacacagtgggagcagagctctttgtgccacagacccat actggtccgcggcctggggcgtccctcatccataacctcagtgaagaacaagacatccga aaaataggagggctattcaagactctacccctcacttcctcctcccattttatcagcagc ctcacaacttacaggtatgccttcctcacaggcttttactccaaagacctcattattgaa cctgcaaacacatcatacaccaatgcctga >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_4|162_aa MAVLSNENGDATSAEDLGWQKAIRIQYAPPLPMGSHPHKPITFMRLHIPPGPKKKNGSRN REELQQISTRTLATGNTDWALGFPEDEARGKRDHLLRGGTWGHVLTQLFLPSGFVVSLAS GVKPQTFVVEIKTLNQQEKMLEVYSWPEALTSALTDCDVNSF >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_4|489_bp atggcggtgctatctaatgagaacggggatgctaccagtgctgaagatctgggttggcag aaagcaatacgcattcagtatgctcctccacttcccatggggtcacatccgcataaaccc atcacattcatgcgtctgcacatcccacctggacccaagaagaaaaatggaagtaggaac agagaggagctgcaacaaatctccacacgcacactggctaccggcaacactgactgggct ctcggctttccagaagatgaggcaagggggaaaagggaccatttgcttaggggtggcacc tggggccacgtgctcacacagctctttcttcccagtgggtttgtggtctcgctggcttca ggagtgaagccgcagaccttcgtggtagaaatcaaaactcttaatcaacaggagaagatg ttagaagtttactcgtggcctgaggctcttacatcagcccttactgactgtgatgttaat tctttctga >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_5|148_aa MGFHYVGQAGLELLASDLIRLREHINHSNIQEHSSNQHKNSVGREVAAGQDAKGQTHIAA GGRGLKKTVLNGPPSAKQDDKVMWELLMEDGHQRVEIKSPRNELGEHIGLEDSNRQSYDV EQQLPSVKGLLEGTTMVPMREQREMTVT >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_5|447_bp atggggtttcactatgttggtcaggctggtcttgaactcctggcctcagacctcattcgt ctcagggaacatatcaaccacagcaatattcaagaacattcctccaaccagcacaaaaat tcagttggcagagaagtggctgctggccaggatgccaaaggccagacccacatagcagca ggcggaagggggttgaagaagacagtgctgaatggtcctcctagcgctaagcaggatgac aaagtcatgtgggaactcctcatggaagacggccaccagagggttgaaatcaaaagcccc aggaatgagctaggagagcacattggactagaggattccaatcgccagagctatgacgtg gagcagcagcttccttctgtaaaaggccttcttgaaggcaccacgatggtgcccatgagg gagcagagggagatgacagtcacatag >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_6|182_aa MVSYGSPSGQIPGPVSHLVVGEEAAGWEKSNQDKAIKSKKNVHMAQLMDSLYKQIGKDKK TRIQIQSYPHCLKENEDDYTEVQRGILLFTSKAGSESLLQEQVLDAKAFDLFEEGSLGVS EKGNPKGTRSCVRHRLEETTEKEPRPMACKATSLYLTKKSFPDPGSCLRIWPASSSNAPG LR >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_6|549_bp atggtgtcctacgggtcacccagtggacagatacctggtcctgtgagtcacctggtggtg ggtgaagaagcagctggatgggagaagtccaaccaggataaggccataaaaagcaagaaa aatgtacacatggcccagctcatggacagtctttataagcaaatagggaaggataaaaag acgagaatccaaatccagagttatccgcactgcctcaaggagaatgaggacgattacaca gaagtgcaaagaggaattttgctttttacttccaaagcgggttcagaaagccttcttcag gaacaagttttagacgctaaagcctttgatttatttgaggaaggctctttgggggtctca gaaaaggggaaccccaaaggcaccaggtcctgcgtccggcacagacttgaggagaccacc gaaaaggagcccaggccaatggcctgcaaagccacttccctctacttgaccaagaagagc ttcccggaccctggctcctgcctgcgcatctggccagcttcatcttccaatgctcctggt ctccgataa >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_7|69_aa MEELFVSMAVRFVGHQRPSLRGSDRARSPVLLSSTAPRHIPFHYWVLWKDFSNTDILLDA IDVFIEDCM >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_7|210_bp atggaggaactgttcgtctccatggccgtgaggtttgtgggtcaccaaaggcccagtctc cgaggatcggatagggctaggagcccagttttactgtccagtactgctccccgccacatt ccttttcattactgggtgctatggaaggacttttccaatacagacatactgctggatgcg atcgatgtgttcatcgaagactgcatgtga >gi568815597r:235452112_235683838|GENSCAN_predicted_peptide_8|172_aa MWALRQPHSGNPPDDLGVLAGGGSDLRLWTVNGDLVGHVHCREIICSVAFSNQPEGVSIN VIAGGLENGIVRLWSTWDLKPVREITFPKSNKPIISYYNQLFGIYFVLDNLSLVNIYEFR KQKSSELNLRNRTVFVTNKLQLRDPRNPMNTKQHKHKENHIKAHHNHTVENQ >gi568815597r:235452112_235683838|GENSCAN_predicted_CDS_8|519_bp atgtgggctctgagacagccacattctggtaacccgccggatgatttgggtgttttagct ggcggaggcagtgacctcagactctggacggtgaacggggatctcgttggacatgtccac tgcagggagatcatctgttccgtggctttctccaaccagcctgagggagtatctatcaat gtaatcgctgggggattagaaaatggaattgtaaggttatggagcacatgggacttaaag cctgtgagagaaattacatttcccaaatcaaataagcccatcatcagttattacaaccaa ttattcggtatatactttgtgttggacaatctttctcttgtgaacatctatgagtttcga aaacaaaaatcatcagagctgaatctgagaaatagaactgttttcgtcacaaacaagtta cagttacgagatccaagaaatccaatgaacaccaagcagcataaacacaaagaaaaccac atcaaggcacatcataatcacactgttgaaaaccagtga