GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:16:35 Sequence gi568815577r:14273620_14481545 : 207926 bp : 36.44% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 419 538 120 1 0 36 103 101 0.354 6.17 1.02 Intr + 601 734 134 0 2 1 73 124 0.415 0.72 1.03 Intr + 13931 14078 148 2 1 99 86 64 0.844 6.62 1.04 Intr + 15446 15564 119 1 2 107 53 88 0.878 5.54 1.05 Intr + 17951 18128 178 2 1 59 93 52 0.095 1.80 1.06 Intr + 22610 22762 153 1 0 67 101 65 0.151 5.05 1.07 Intr + 25819 25976 158 0 2 118 -23 110 0.135 0.79 1.08 Intr + 30570 30741 172 0 1 50 97 48 0.230 0.92 1.09 Intr + 34752 34836 85 2 1 56 60 78 0.504 0.27 1.10 Intr + 34934 35000 67 0 1 98 89 11 0.506 -0.76 1.11 Intr + 35987 36053 67 2 1 68 117 35 0.698 2.39 1.12 Intr + 39325 39501 177 0 0 98 61 190 0.967 16.49 1.13 Intr + 42582 42749 168 2 0 104 35 159 0.859 11.42 1.14 Intr + 43854 44016 163 2 1 58 116 63 0.836 4.73 1.15 Intr + 46362 46571 210 1 0 85 40 86 0.281 1.46 1.16 Intr + 49893 49968 76 1 1 99 95 39 0.417 3.35 1.17 Intr + 63561 63621 61 0 1 60 86 49 0.113 -0.28 1.18 Intr + 64162 64251 90 0 0 58 85 69 0.220 2.87 1.19 Intr + 71644 71691 48 0 0 65 121 55 0.451 4.46 1.20 Intr + 79010 79209 200 1 2 55 95 120 0.965 6.63 1.21 Intr + 79935 80061 127 0 1 82 101 63 0.995 6.76 1.22 Intr + 81422 81523 102 1 0 93 74 92 0.992 7.75 1.23 Intr + 84112 84255 144 0 0 58 64 137 0.896 7.86 1.24 Term + 96728 96790 63 1 0 80 40 68 0.039 -1.89 1.25 PlyA + 97854 97859 6 1.05 2.09 PlyA - 97933 97928 6 1.05 2.08 Term - 100665 99998 668 1 2 79 38 771 0.998 64.20 2.07 Intr - 102200 102033 168 0 0 76 99 158 0.995 14.70 2.06 Intr - 104793 104580 214 0 1 75 92 227 0.997 19.17 2.05 Intr - 107924 107584 341 0 2 93 115 255 0.827 22.97 2.04 Intr - 109605 109476 130 0 1 27 80 62 0.421 -1.35 2.03 Intr - 109812 109637 176 1 2 16 37 156 0.114 2.04 2.02 Intr - 133501 133420 82 1 1 39 77 65 0.085 -1.21 2.01 Init - 133904 133764 141 2 0 93 37 68 0.661 2.38 2.00 Prom - 134496 134457 40 -3.45 3.00 Prom + 136433 136472 40 -4.35 3.01 Sngl + 140825 141046 222 1 0 31 44 224 0.775 7.20 3.02 PlyA + 141449 141454 6 1.05 4.05 PlyA - 141862 141857 6 1.05 4.04 Term - 143684 143387 298 1 1 -2 49 245 0.073 5.35 4.03 Intr - 147107 146945 163 0 1 70 30 132 0.053 3.71 4.02 Intr - 163975 163958 18 2 0 121 80 19 0.029 0.06 4.01 Init - 171715 171676 40 1 1 76 116 25 0.391 4.50 4.00 Prom - 180558 180519 40 -5.45 5.04 PlyA - 181017 181012 6 1.05 5.03 Term - 193034 192907 128 0 2 40 44 129 0.803 1.16 5.02 Intr - 196067 195888 180 0 0 58 72 99 0.407 4.22 5.01 Intr - 199602 199502 101 2 2 75 38 145 0.314 7.03 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:14273620_14481545|GENSCAN_predicted_peptide_1|1009_aa GSDSRSRALSQRPGAPRTMLSSTQNAGGSYQRVRGALDTQEWKWGEVSPRTLNVDGRALV SVANTHGTDRPAYTLNPQSRDQRSGVITLGYKRPLEREDLFELKESDSFCTACPIFEKQW RKEVLRNQERQKVKALNKLDEALCPGIILTQSTDSNANLFQKQPHRHTQTSGRWQIIIFC EHSSDFGWNGYGYAVALLVVVFLQTLILQQYQRFNMLTSAKVKTAVNGLIYKKLGWSGKV SWLILHDVGHGIMEGYIAWGKGSDVRITWEKKSTEMRTRPAQKMALLLSNVSRQKFSTGE IINLMSATHGLDSKPQSPLVCPFSNPNGRISPLARAGLADHYRVTHLQILKLYAWEPSYK NKIIKIRDQELEFQKSARYLTVFSMLTLTCIPFLTKISLGRLEDFLNTEELLPQSIETNY TGDHAIGFTDASFSWDKTGMPVLKESIRIRIEQVLNQLSLFETVDYPGSVAYVSQQAWIQ NCILQENILFGSIMKKEFYEQVLEACALLPDLEQLPKGDQTEIGERAVNISGGQQHRVSL ARAVYSGADVYLLDDPLSAIDVHVGKQLFEKVIGSLGLLKNRTHILVTHNLTLLPQMNLI VVMKSGRIAQMGIYQELLCKTKNLTNLHQVISEQEKGGRVGRVWTWQLRYLQMYRASESA KWSLGGGEEQQKPWLRLTHQGLPCPPEEVSGIQTFSSDPCLPGGLEAHALKQVSAINSRT RPKDKILEQKHRPSLDQGKQLSMKKEKIPVGGLKFSIILQYLQAFGWLWVWLTVVTYLGQ NLFFETNSTGQIISRFTKRYYVASSRQIRRLTGGSRSPVISHFSETLSGVSTIRAFGHEQ RFIQQYKEMVNEDLVCFYNNVIFNLWLSVRLEFLGNLMVLFAALLAVLAGNSIDSAIVGL SVSYALNITHSLNFWVKKACEIETNAVAVERVCEYENMDKESPWIMSRRPPLQWPNKGIV EFINYQARYQDDLGLALQDITFQTHGEEKLGSKRESQQEGASKKEADLF >gi568815577r:14273620_14481545|GENSCAN_predicted_CDS_1|3030_bp ggaagtgacagccgcagccgggctctcagccagcggccaggcgccccgcggaccatgctc tccagtacgcagaacgcgggcggctcctatcagcgggtccgcggggcgcttgatacacag gagtggaagtggggagaggtgagtcctcgaaccctgaacgtggacggacgagcccttgtt tcagtagccaacacacacgggacagacagaccggcgtacacactcaatcctcagagcaga gatcagagatcaggagtaattactttaggctataagagacctttggaaagagaggatctt tttgaactaaaggaaagtgattccttctgcactgcgtgtcccatctttgaaaaacaatgg agaaaggaagttttaaggaatcaagagaggcaaaaagtaaaggccctcaacaaattggat gaggcactatgccctgggatcatccttactcagtctactgattcaaatgctaatctcttc cagaaacagcctcacagacacacccagacatctggtagatggcaaattatcattttctgt gaacacagctcagattttggctggaatggctatggctatgcagtggcacttcttgttgta gtctttttgcaaactctgattcttcagcaatatcaacgttttaacatgctcacctcagca aaagttaagacagctgtaaatggactgatctacaaaaagctaggatggtcaggaaaagtg tcatggttaatattgcatgatgtgggtcatggaataatggaaggatacattgcatggggt aaaggctctgatgttagaataacttgggagaaaaaaagtacagaaatgcgtacaaggcct gctcagaaaatggccttacttttatcaaatgtttctcgacaaaagttttccactggggaa attattaacttgatgtcagcaactcatggacttgacagcaaacctcaatctcctctggtc tgccccttttcaaatcctaatggccgtatatctcctttggcaagagctggattggcagat cattatcgtgtgactcatttacagatcctcaaactttatgcatgggaaccctcctataaa aataagattatcaaaattcgagatcaggaattggaatttcaaaaatcagctaggtatctt actgtattttccatgttgacattaacctgcattccattcttgacaaagatatccctgggc cgtttggaagactttctcaacactgaggagcttcttcctcaaagtattgaaacgaactat acaggagatcatgctattgggtttacagatgcttctttctcctgggataaaacaggaatg ccagttctaaaagagtcaataagaataagaattgaacaagttcttaatcagctttctctt tttgaaacagtagactacccgggctctgtggcttatgtttctcagcaggcctggattcag aattgcattttgcaagaaaacattctctttggctccatcatgaagaaagagttttatgag caagtattggaagcctgtgctctccttccagatttggagcagttgccaaagggagatcaa actgagattggagaaagagctgtgaatataagtgggggccagcagcatcgagtaagcctg gccagagctgtctacagtggggccgacgtctacctcctggatgatcccttgtctgctatt gatgttcatgttggaaagcagctttttgaaaaagtgatagggtccttgggccttttgaaa aaccggactcatattttagtgacacacaatctcacacttctgccacaaatgaatcttatt gttgtaatgaaaagtggcagaatagctcaaatgggaatataccaggagctgctgtgtaaa accaaaaatctcactaatttgcaccaagtcatcagtgaacaagaaaaaggtgggagagtg ggaagagtctggacgtggcagctgaggtatctgcagatgtacagagcttctgagagtgca aagtggagcttgggagggggagaagagcagcaaaagccttggctgagactgactcatcaa ggactgccatgtcccccagaggaagtctcaggaatccagacattctcatcagatccttgc cttccaggtggcttggaagctcatgctctaaagcaagtcagtgcaatcaactccagaact agaccaaaagacaaaatcctggagcaaaaacataggccctcattggatcaaggaaaacag ctctcaatgaaaaaagaaaagatccctgttggtgggttgaagttctccatcattctgcag tacctccaagcctttggctggctctgggtgtggctgactgtggtcacttacttagggcag aatttgttttttgaaacaaattccacaggccagatcatcagtcgtttcaccaagagatac tatgtggcaagctctcggcaaatccggaggttgacaggagggtcccgttctcctgtcatt tctcactttagtgagactttatcaggagtgtccaccatcagagcatttggacatgaacag aggtttatccagcaatacaaagagatggtgaatgaagacttggtctgtttctacaataat gtaattttcaacctgtggctgtctgttagacttgaatttcttggcaacttaatggtgctt tttgctgcactgcttgctgtgctggctggcaattctatagattcagcaatagttggtttg tctgtatcctatgccctaaatataactcattctctgaatttttgggtgaagaaagcatgt gaaattgaaaccaatgcagttgctgttgaaagagtttgtgaatatgaaaatatggataaa gaatcaccttggataatgtctagaagacctccattacaatggcccaataaaggtatagtg gaatttattaattatcaggctcgataccaagatgatcttggtttagcgttgcaagatatc actttccagactcatggggaagagaagctgggatccaaaagagaaagccagcaagagggt gcatccaagaaggaagctgatcttttttaa >gi568815577r:14273620_14481545|GENSCAN_predicted_peptide_2|639_aa MEEGTEGAGKDFSSSSPQAQTTKAKMHKWEFKSKLKSFCTAKETINKLKWLSPKGQAITN AGEDMEKREPSYTVGSRDHRFDYKRQSLATALLPLSLTGPAGRCVEGGLSRHWQRLNGGS LPRALTVDEVLALSKRTTFPRGLRDCREEWGRCLTSGTVITSLFGGTVMAREMTILGSAV LTLLLAGYLAQQYLPLPTPKVIGIDLGTTYCSVGVFFPGTGKVKVIPDENGHISIPSMVS FTDNDVYVGYESVELADSNPQNTIYDAKRFIGKIFTAEELEAEIGRYPFKVLNKNGMVEF SVTSNETITVSPEYVGSRLLLKLKEMAEAYLGMPVANAVISVPAEFDLKQRNSTIEAANL AGLKILRVINEPTAAAMAYGLHKADVFHVLVIDLGGGTLDVSLLNKQGGMFLTRAMSGNN KLGGQDFNQRLLQYLYKQIYQTYGFVPSRKEEIHRLRQAVEMVKLNLTLHQSAQLSVLLT VEEQDRKEPHSSDTELPKDKLSSADDHRVNSGFGRGLSDKKSGESQVLFETEISRKLFDT LNEDLFQKILVPIQQVLKEGHLEKTEIDEVVLVGGSTRIPRIRQVIQEFFGKDPNTSVDP DLAVVTGVAIQAGIDGGSWPLQVSALEIPNKHLQKTNFN >gi568815577r:14273620_14481545|GENSCAN_predicted_CDS_2|1920_bp atggaggaaggcacagagggagcaggcaaagatttttcgagtagtagtccacaagcacag acaaccaaagcaaaaatgcacaaatgggagtttaaaagcaagttaaaaagcttctgcaca gcaaaagaaacaatcaacaaattaaaatggctttcacctaaaggacaggcaataacaaat gctggtgaggacatggagaaaagggaaccctcatacactgttggatcgcgggaccacagg tttgattataaacgccagtctctcgccacggcgctcctccccctcagtctcactggtcct gccggccgttgcgtggaagggggactcagtcgtcactggcagcggttgaatggaggctct ttgccccgcgccctcacggtagatgaagtccttgctttgagcaagcggactacatttccc agggggctgcgggactgccgggaggagtgggggcggtgcctcacgtctggtacagtcatc acaagcctgttcggcgggactgtgatggccagagagatgacgatcttaggatcggctgtt ttgactctcctgttggccggctatttggcacaacagtatttaccattgcctactcctaaa gtgattggtattgatcttggcaccacctattgttctgttggggtgttttttcctggcaca ggaaaagtaaaggtgattccagatgaaaatgggcatatcagcatacccagcatggtgtct tttactgacaatgatgtatatgtgggatatgaaagcgtagagctggcagattcaaatcct caaaacacaatatatgatgccaaaagattcataggcaagatttttaccgcagaagagttg gaggctgaaattggcagatacccatttaaggttttaaacaaaaatggaatggttgagttt tctgtgacaagtaatgagaccatcacagtgtccccagaatatgttggctctcgactattg ttgaagttaaaggaaatggcagaggcatatcttggaatgccagttgccaatgctgtcatt tctgtaccagcagaatttgatctaaaacagagaaattcaacaattgaagctgctaacctt gcaggactgaagattttgagggtaataaatgaacccacagcagcagctatggcctatggt ctccacaaggctgacgtcttccacgtcttggtgatagacttgggcggaggaactctagat gtgtctttactgaataaacaaggagggatgtttctaacccgagcaatgtctggaaacaat aaacttggaggacaggacttcaatcagagattgcttcagtacttatataaacagatctat caaacatatggcttcgtgccctctaggaaagaggaaatccacagattgagacaagctgtg gaaatggtcaaattaaatctgactcttcatcaatctgctcagttgtcagtattactaacg gtggaggagcaggacaggaaggaacctcacagtagtgacactgaactgccaaaagacaaa ctttcctcagcagatgaccatcgcgtgaacagtgggtttggacgtggcctttctgataag aaaagtggagaaagtcaggttttatttgaaacagaaatatcacggaaactctttgatacc cttaatgaagacctctttcagaaaatactggtacccattcagcaagtattgaaagaaggc cacctggaaaagactgagattgatgaggtggttttagttgggggctccactcgtattcct cggatccgtcaagtcattcaagagttctttggaaaagatcccaacacatctgtagaccct gacctagcagtagtaacgggagtggctatccaagcagggattgatggaggctcttggcct ctccaagtcagtgctttagaaattcccaataagcatttacaaaaaaccaacttcaactga >gi568815577r:14273620_14481545|GENSCAN_predicted_peptide_3|73_aa MFSPGFTLIQYDCCLHKAGSLDTGMHREDDVKTQKEEPSPTKEKGLKHILPSQLSERNNP VDILISDFQPPEP >gi568815577r:14273620_14481545|GENSCAN_predicted_CDS_3|222_bp atgttttcaccagggttcaccctcatccagtatgactgctgccttcataaagcaggatct ttggacacaggcatgcacagggaagatgatgtgaagacacagaaagaagagccatctcca accaaggagaaaggcctgaaacacatccttccctcacagctctcagaaagaaacaatcct gttgacatcttgatttcagacttccagcctccagaaccatga >gi568815577r:14273620_14481545|GENSCAN_predicted_peptide_4|172_aa MSLEWGEQEEAVKGLSKPAGRGVSLCRHYHPKSQGVLPDYCQCSHKAQGLLSQLVVNAAW TLFGEVGSPLVQGSDLGFRVGMAPQQLCPKGQGTVAVRPMELWGTAVAQALRAKCSTVVT PVSVKAEYCHRKGYEGWSSVVLSPLVCQDTVGTLALRGMVQQQHDPRMDTTL >gi568815577r:14273620_14481545|GENSCAN_predicted_CDS_4|519_bp atgagcctggaatggggtgagcaagaagaggctgtaaaaggtttatctaaacccgcaggc agaggagtttcactctgtaggcactaccaccccaagtcacagggagtactgccagactac tgccaatgttcccataaggcccaaggtctcttaagtcagcttgttgtaaatgctgcctgg actctcttcggggaagtgggctcccctctggtccagggcagtgacttaggcttcagggtt ggtatggctccacagcagctctgccccaaaggccagggcacagtggcagtaaggcccatg gaattgtggggtacagctgtagctcaggccctgagagccaagtgcagcacggtggtgact ccagtctctgtgaaggcagagtattgccacagaaagggctacgaaggatggagctcagtt gtgctaagccccctggtatgtcaggacacagtgggaactctggctctgaggggcatggta cagcagcaacatgaccccaggatggacacgaccctgtag >gi568815577r:14273620_14481545|GENSCAN_predicted_peptide_5|136_aa XMSAFMGKSRSSTYWICLIKHNRRYVTGSYWNVLWCRRDLQGSSGACPLTKPPLINSTIQ KVHLDKGSSKDMPMTLSCQMVYMSKVSLRECEASRMCTSAEKTWEGLNLSPVDKSEALHK QETKAKEEFSNCLLEC >gi568815577r:14273620_14481545|GENSCAN_predicted_CDS_5|411_bp nnaatgtcagcatttatgggaaaaagtcgatcaagtacttactggatctgcttaatcaag cataatcgtcgctatgtgactggcagttattggaacgtcctgtggtgcagaagagatctt cagggcagcagtggggcctgcccgcttacaaagcccccattgatcaacagtaccatccaa aaagtccacttagacaaggggtcatccaaggacatgccaatgactctttcttgccaaatg gtgtacatgagcaaagtgtctctgagagaatgcgaagcctccaggatgtgcacaagcgca gaaaagacctgggaaggccttaatctctcacctgtggataagtctgaggctttgcacaag caggaaacgaaggctaaagaagagttttcaaactgcctgctggagtgttga