GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:35:46 Sequence gi568815596f:47269506_47486610 : 217105 bp : 44.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2200 2219 20 0 2 95 111 -10 0.160 1.68 1.02 Intr + 7791 7847 57 0 0 95 83 39 0.073 2.10 1.03 Intr + 36972 37122 151 0 1 67 99 55 0.064 4.66 1.04 Intr + 56226 56319 94 2 1 80 93 69 0.293 6.14 1.05 Term + 59676 59698 23 1 2 85 44 37 0.318 -2.53 1.06 PlyA + 62589 62594 6 1.05 2.00 Prom + 65719 65758 40 -4.66 2.01 Init + 100001 100076 76 1 1 100 100 180 0.702 19.65 2.02 Intr + 103958 104065 108 0 0 63 75 77 0.808 4.16 2.03 Intr + 104303 104543 241 0 1 46 75 180 0.441 9.11 2.04 Intr + 105729 105794 66 0 0 95 105 69 0.936 7.32 2.05 Intr + 107509 107572 64 1 1 52 56 12 0.857 -6.88 2.06 Intr + 109448 109549 102 1 0 62 76 48 0.698 1.37 2.07 Intr + 110264 110464 201 1 0 110 105 176 0.925 20.98 2.08 Intr + 115661 115705 45 1 0 90 84 24 0.635 0.81 2.09 Term + 118820 118921 102 1 0 81 49 115 0.982 5.18 2.10 PlyA + 122002 122007 6 1.05 3.00 Prom + 130386 130425 40 -6.56 3.01 Init + 133687 133897 211 0 1 96 94 375 0.913 37.85 3.02 Intr + 138896 139050 155 0 2 88 86 59 0.986 5.49 3.03 Intr + 140589 140867 279 2 0 53 115 173 0.882 14.17 3.04 Intr + 142909 143055 147 0 0 65 88 121 0.999 10.23 3.05 Intr + 144764 144913 150 1 0 49 98 52 0.766 2.66 3.06 Intr + 146791 146924 134 0 2 97 87 112 0.999 11.44 3.07 Intr + 160237 160436 200 1 2 26 87 154 0.879 8.09 3.08 Intr + 176043 176152 110 1 2 88 85 67 0.731 6.40 3.09 Intr + 182401 182421 21 0 0 137 72 13 0.501 2.34 3.10 Intr + 193526 193649 124 1 1 98 91 64 0.923 7.86 3.11 Intr + 197153 197303 151 0 1 34 41 147 0.995 3.82 3.12 Intr + 201460 201557 98 1 2 75 97 76 0.976 6.85 3.13 Intr + 205520 205765 246 0 0 75 58 175 0.981 10.63 3.14 Intr + 206862 207066 205 1 1 77 111 70 0.981 6.46 3.15 Intr + 208767 209014 248 0 2 52 95 82 0.958 2.40 3.16 Intr + 211191 211366 176 1 2 104 68 75 0.953 6.76 3.17 Term + 213274 213444 171 0 0 56 48 107 0.913 1.33 3.18 PlyA + 213695 213700 6 1.05 4.00 Prom + 214181 214220 40 -3.86 4.01 Init + 214987 215071 85 0 1 76 115 43 0.134 6.78 4.02 Term + 215945 216036 92 0 2 38 48 77 0.110 -3.42 4.03 PlyA + 217044 217049 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:47269506_47486610|GENSCAN_predicted_peptide_1|114_aa MEGEGSRFGISGSLQLPEEKPYRFLRAPVLVALGQRPAAQEKPSSCWTSGPNCEWFLTHH PLLQHAWKMPLGEKEQRVYGDRMKYHSGSWMAVLKQVLHMRTFSPAHDVDIEDG >gi568815596f:47269506_47486610|GENSCAN_predicted_CDS_1|345_bp atggagggagagggctccagatttggcatttcaggatctctgcagttacctgaagagaaa ccatacagatttctaagggctcctgtgctcgtggccctaggacagcgacctgcagcccaa gagaaaccatcctcctgttggacttctgggcccaattgcgaatggttccttactcaccac ccactacttcaacatgcctggaaaatgccactaggggagaaagaacagagggtttatggg gatcgtatgaagtaccacagtggttcctggatggcagtccttaaacaggtgctacacatg aggacgttttcaccagcccacgatgtggacatagaagatggatag >gi568815596f:47269506_47486610|GENSCAN_predicted_peptide_2|334_aa MAPPQVLAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGAQNTVICS KLAAKCLVMKAEMNGSKLGRRAKPEGALQNNDGLYDPDCDESGLFKAKQCNGTSMCWCVN TAGVRRTDKDTEITCSERVRTYWIIIELKHKAREKPYDSKSLRTALQKEITTRYQLDPKF ITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKKMDLTVNGEQL DLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVIVVVVIAVVAGIVVLVISRKKRMAKYEKA EAQKRPGEATPCCKSQIMEQPWMLQEPVKPGHQS >gi568815596f:47269506_47486610|GENSCAN_predicted_CDS_2|1005_bp atggcgcccccgcaggtcctcgcgttcgggcttctgcttgccgcggcgacggcgactttt gccgcagctcaggaagaatgtgtctgtgaaaactacaagctggccgtaaactgctttgtg aataataatcgtcaatgccagtgtacttcagttggtgcacaaaatactgtcatttgctca aagctggctgccaaatgtttggtgatgaaggcagaaatgaatggctcaaaacttgggaga agagcaaaacctgaaggggccctccagaacaatgatgggctttatgatcctgactgcgat gagagcgggctctttaaggccaagcagtgcaacggcacctccatgtgctggtgtgtgaac actgctggggtcagaagaacagacaaggacactgaaataacctgctctgagcgagtgaga acctactggatcatcattgaactaaaacacaaagcaagagaaaaaccttatgatagtaaa agtttgcggactgcacttcagaaggagatcacaacgcgttatcaactggatccaaaattt atcacgagtattttgtatgagaataatgttatcactattgatctggttcaaaattcttct caaaaaactcagaatgatgtggacatagctgatgtggcttattattttgaaaaagatgtt aaaggtgaatccttgtttcattctaagaaaatggacctgacagtaaatggggaacaactg gatctggatcctggtcaaactttaatttattatgttgatgaaaaagcacctgaattctca atgcagggtctaaaagctggtgttattgctgttattgtggttgtggtgatagcagttgtt gctggaattgttgtgctggttatttccagaaagaagagaatggcaaagtatgagaaggct gaggcccagaagcgtcctggagaagccaccccatgctgcaagagccagatcatggagcag ccctggatgctgcaggagcctgtcaagccaggacaccagagctag >gi568815596f:47269506_47486610|GENSCAN_predicted_peptide_3|941_aa MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYTAHGEDALLAAREVFKT QGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEVYKNRAGNKASKENDWYLA YKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQVGVGYVDSIQRKLGLCEFPD NDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQRGGILITERKKADFSTKDIYQD LNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKFLELLSDDSNFGQFELTTFDFSQYM KLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKQPLMDKNRIEERL NLVEAFVEDAELRQTLQEDLLRRFPDLNRLAKKFQRQAANLQDCYRLYQGINQLPNVIQA LEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIETTLDMDQVNSVNILVENHEFLVKPS FDPNLSELREIMNDLEKKMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRVTCKEEKVL RNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKNKTEYEEAQDAIVKEIVNISSGYVEPMQ TLNDVLAQLDAVVSFAHVSNGAPVPYVRPAILEKGQGRIILKASRHACVEVQDEIAFIPN DVYFEKDKQMFHIITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESAEVSIVDCILARVG AGDSQLKGVSTFMAEMLETASILRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYIATK IGAFCMFATHFHELTALANQIPTVNNLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAEL ANFPKHVIECAKQKALELEEFQYIGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQ MPFTEMSEENITIKLKQLKAEVIAKNNSFVNEIISRIKVTT >gi568815596f:47269506_47486610|GENSCAN_predicted_CDS_3|2826_bp atggcggtgcagccgaaggagacgctgcagttggagagcgcggccgaggtcggcttcgtg cgcttctttcagggcatgccggagaagccgaccaccacagtgcgccttttcgaccggggc gacttctatacggcgcacggcgaggacgcgctgctggccgcccgggaggtgttcaagacc cagggggtgatcaagtacatggggccggcaggagcaaagaatctgcagagtgttgtgctt agtaaaatgaattttgaatcttttgtaaaagatcttcttctggttcgtcagtatagagtt gaagtttataagaatagagctggaaataaggcatccaaggagaatgattggtatttggca tataaggcttctcctggcaatctctctcagtttgaagacattctctttggtaacaatgat atgtcagcttccattggtgttgtgggtgttaaaatgtccgcagttgatggccagagacag gttggagttgggtatgtggattccatacagaggaaactaggactgtgtgaattccctgat aatgatcagttctccaatcttgaggctctcctcatccagattggaccaaaggaatgtgtt ttacccggaggagagactgctggagacatggggaaactgagacagataattcaaagagga ggaattctgatcacagaaagaaaaaaagctgacttttccacaaaagacatttatcaggac ctcaaccggttgttgaaaggcaaaaagggagagcagatgaatagtgctgtattgccagaa atggagaatcaggttgcagtttcatcactgtctgcggtaatcaagtttttagaactctta tcagatgattccaactttggacagtttgaactgactacttttgacttcagccagtatatg aaattggatattgcagcagtcagagcccttaacctttttcagggttctgttgaagatacc actggctctcagtctctggctgccttgctgaataagtgtaaaacccctcaaggacaaaga cttgttaaccagtggattaagcagcctctcatggataagaacagaatagaggagagattg aatttagtggaagcttttgtagaagatgcagaattgaggcagactttacaagaagattta cttcgtcgattcccagatcttaaccgacttgccaagaagtttcaaagacaagcagcaaac ttacaagattgttaccgactctatcagggtataaatcaactacctaatgttatacaggct ctggaaaaacatgaaggaaaacaccagaaattattgttggcagtttttgtgactcctctt actgatcttcgttctgacttctccaagtttcaggaaatgatagaaacaactttagatatg gatcaggtaaacagtgttaacatcctggtggaaaaccatgaattccttgtaaaaccttca tttgatcctaatctcagtgaattaagagaaataatgaatgacttggaaaagaagatgcag tcaacattaataagtgcagccagagatcttggcttggaccctggcaaacagattaaactg gattccagtgcacagtttggatattactttcgtgtaacctgtaaggaagaaaaagtcctt cgtaacaataaaaactttagtactgtagatatccagaagaatggtgttaaatttaccaac agcaaattgacttctttaaatgaagagtataccaaaaataaaacagaatatgaagaagcc caggatgccattgttaaagaaattgtcaatatttcttcaggctatgtagaaccaatgcag acactcaatgatgtgttagctcagctagatgctgttgtcagctttgctcacgtgtcaaat ggagcacctgttccatatgtacgaccagccattttggagaaaggacaaggaagaattata ttaaaagcatccaggcatgcttgtgttgaagttcaagatgaaattgcatttattcctaat gacgtatactttgaaaaagataaacagatgttccacatcattactggccccaatatggga ggtaaatcaacatatattcgacaaactggggtgatagtactcatggcccaaattgggtgt tttgtgccatgtgagtcagcagaagtgtccattgtggactgcatcttagcccgagtaggg gctggtgacagtcaattgaaaggagtctccacgttcatggctgaaatgttggaaactgct tctatcctcaggtctgcaaccaaagattcattaataatcatagatgaattgggaagagga acttctacctacgatggatttgggttagcatgggctatatcagaatacattgcaacaaag attggtgctttttgcatgtttgcaacccattttcatgaacttactgccttggccaatcag ataccaactgttaataatctacatgtcacagcactcaccactgaagagaccttaactatg ctttatcaggtgaagaaaggtgtctgtgatcaaagttttgggattcatgttgcagagctt gctaatttccctaagcatgtaatagagtgtgctaaacagaaagccctggaacttgaggag tttcagtatattggagaatcgcaaggatatgatatcatggaaccagcagcaaagaagtgc tatctggaaagagagcaaggtgaaaaaattattcaggagttcctgtccaaggtgaaacaa atgccctttactgaaatgtcagaagaaaacatcacaataaagttaaaacagctaaaagct gaagtaatagcaaagaataatagctttgtaaatgaaatcatttcacgaataaaagttact acgtga >gi568815596f:47269506_47486610|GENSCAN_predicted_peptide_4|58_aa MQIREMQVKTTMSYNFTLITIVKIKKSDVHTLEMLLPIVLRDVYKNVPNTSTLETTHQ >gi568815596f:47269506_47486610|GENSCAN_predicted_CDS_4|177_bp atgcagatcagggaaatgcaagtcaaaaccacaatgagctacaacttcacactgattacg atagttaaaatcaaaaagtcagatgtgcatactctggagatgcttttgcccattgtgctg cgagatgtatacaagaatgttcctaatacctccacactggaaacaactcatcagtga