GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:00:21 Sequence gi568815596r:18455194_18660572 : 205379 bp : 39.28% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 231 226 6 1.05 1.02 Term - 7815 7125 691 2 1 -86 43 378 0.685 7.97 1.01 Init - 8494 8169 326 1 2 70 100 164 0.228 12.65 1.00 Prom - 10690 10651 40 -7.35 2.00 Prom + 15224 15263 40 -3.45 2.01 Init + 17289 17338 50 2 2 87 49 32 0.116 -0.33 2.02 Intr + 20477 20739 263 2 2 67 94 102 0.207 4.91 2.03 Intr + 23377 23455 79 2 1 69 89 65 0.215 2.39 2.04 Intr + 27010 27162 153 1 0 47 115 153 0.538 12.17 2.05 Term + 29168 29321 154 2 1 77 38 105 0.773 0.81 2.06 PlyA + 32123 32128 6 1.05 3.05 PlyA - 33114 33109 6 1.05 3.04 Term - 36678 36426 253 2 1 57 38 221 0.319 8.13 3.03 Intr - 50405 50338 68 2 2 98 106 41 0.044 3.68 3.02 Intr - 59112 58990 123 0 0 57 92 68 0.044 3.96 3.01 Init - 64119 63991 129 0 0 45 42 125 0.198 3.80 3.00 Prom - 65249 65210 40 -2.85 4.06 PlyA - 65442 65437 6 -1.95 4.05 Term - 65755 65559 197 2 2 88 43 142 0.673 6.29 4.04 Intr - 69431 69257 175 2 1 104 28 77 0.066 1.89 4.03 Intr - 70513 70366 148 0 1 81 53 72 0.063 2.32 4.02 Intr - 72964 72849 116 1 2 71 24 101 0.035 0.33 4.01 Init - 87067 87062 6 1 0 110 100 10 0.117 4.76 4.00 Prom - 91404 91365 40 -7.15 5.00 Prom + 94307 94346 40 -4.75 5.01 Init + 95231 95372 142 1 1 73 34 110 0.466 4.44 5.02 Term + 95461 95588 128 2 2 82 53 65 0.897 -0.14 5.03 PlyA + 96172 96177 6 1.05 6.03 PlyA - 97899 97894 6 1.05 6.02 Term - 100615 99998 618 1 0 58 49 331 0.913 19.65 6.01 Init - 105379 105017 363 1 0 75 70 402 0.615 32.20 6.00 Prom - 106191 106152 40 -9.85 7.11 PlyA - 106923 106918 6 1.05 7.10 Term - 108926 108603 324 2 0 94 50 280 0.983 18.58 7.09 Intr - 121175 120991 185 0 2 28 96 248 0.995 18.19 7.08 Intr - 121702 121580 123 2 0 57 62 127 0.988 6.64 7.07 Intr - 127804 127675 130 1 1 82 127 76 0.993 10.25 7.06 Intr - 129062 128895 168 2 0 102 61 132 0.996 11.12 7.05 Intr - 129785 129321 465 2 0 74 105 598 0.920 52.50 7.04 Intr - 131198 131055 144 2 0 88 88 156 0.572 15.16 7.03 Intr - 131929 131750 180 1 0 90 68 104 0.574 7.74 7.02 Intr - 132399 132310 90 0 0 72 99 71 0.977 5.87 7.01 Init - 133470 133351 120 0 0 79 113 58 0.933 7.64 7.00 Prom - 136356 136317 40 -3.95 8.04 PlyA - 136475 136470 6 1.05 8.03 Term - 142049 141863 187 1 1 64 53 94 0.403 -0.52 8.02 Intr - 145206 145055 152 0 2 0 92 132 0.069 2.84 8.01 Init - 153930 153904 27 0 0 89 107 5 0.147 0.92 8.00 Prom - 161302 161263 40 -3.65 9.03 PlyA - 162065 162060 6 1.05 9.02 Term - 162757 162516 242 2 2 65 49 139 0.463 2.90 9.01 Init - 183263 182612 652 2 1 85 57 283 0.193 20.40 9.00 Prom - 183501 183462 40 -3.05 10.02 PlyA - 183718 183713 6 -0.45 10.01 Sngl - 185585 185220 366 2 0 45 49 179 0.341 5.54 10.00 Prom - 195146 195107 40 -3.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 177187 177535 349 1 1 1 39 277 0.886 7.07 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_1|338_aa MPPDWETPPSRGQQAPHTGELQLASGGCPSGMKLPEEGTVSNLCCSAASTGNTQANRVWS GPPANSSRPASEGTVRRNTNKQKRIVSTSIKRTSTPKPHPKVTNIKDQSQIDQAEERISE IEDQLNEIKREGKIREKRMKRNQQSLQEIWDYTERPNLCLIGVPESDGENGTKLENTFQD IIQEDFPNVARQANIQIQKIQKTPQRYSSRRATPRQIIIRFTKVEMKEKMLRAAREKGWI THKVNPIRLTAGLSAETLQARREWGPIFNILKKKNFQPRISYPAKLSIISEGEIKPFTDK QMLRDFVTTRPALQELLKKALNMERNNWYQQLQKHTKL >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_1|1017_bp atgcctcctgactgggagacacctcccagcaggggtcaacaggcacctcatacaggagag ctccaactggcatctggtgggtgcccctctgggatgaagcttccagaggaaggaacagtc agcaatctctgctgttctgcagcctctactggtaatacccaggcaaacagggtctggagt ggacctccagcaaactccagcagacctgcatcagaggggactgttagaaggaacactaac aaacagaaaagaatagtatcaacatcaataaaacggacctccacaccaaaaccccatccg aaggtcaccaacatcaaagaccaaagccaaatcgaccaagcggaagaaaggatatcagag attgaagatcaacttaatgaaataaagcgtgaaggcaagattagagaaaaaagaatgaaa aggaatcaacaaagcctccaagaaatatgggactatacggaaagaccaaacctatgtttg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacacttttcaggat attatccaggaggacttccccaatgtagcaagacaggccaatattcaaattcagaaaata cagaaaacaccacaaagatactcctcgagaagagcaaccccaagacagataatcatcaga ttcaccaaggttgaaatgaaggagaaaatgttaagggcagccagagagaaaggttggatt acccacaaagtgaatcccatcagactaacagcaggtctctctgcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaaaaaaagaattttcaacccagaatt tcatatccagccaaactaagcatcataagtgaaggagaaataaaaccctttacagacaag caaatgctgagagattttgtcaccaccaggcctgccttacaagagctcctgaagaaagca ctaaacatggaaaggaacaactggtaccagcaactgcaaaaacataccaaattgtaa >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_2|232_aa MTAIRCGVNSSEVEGASLYDDDDGGGGGGGDDGGGERERERKREERRGEERRKEKRGEGR GEERRREEGREEKKRRREGEKRRGGEGKRERRRKERRGHGERRGGMDGAESHYPSQTNAG TEKPNTTFSHLQMTLFAKLRAQHWFGDQNRARNYENGDRKQEDPHCPQIHRRVLGLNAAG ARHCARSQGLRDTKAVKGPYASLIRETPLLLTDYKIRWNLEDPESDLAGSSI >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_2|699_bp atgactgcaatcaggtgtggtgtcaactcatcagaggtggaaggtgccagcttgtatgat gatgatgatggtggtggtggcggtggtggtgatgatggtgggggagagagagagagagag agaaagagagaggaaaggagaggagaagagaggagaaaagagaagagaggagaggggagg ggagaggagaggagaagagaagagggaagagaagagaagaagagaagaagagaaggggag aagaggaggggaggggaagggaagagggagaggaggagaaaggagaggaggggacatggg gagagaaggggaggcatggatggagctgaaagccattatccttcacaaactaatgcagga acagaaaaaccaaacactacattttctcacttacagatgaccctctttgccaaacttaga gcccagcactggtttggagatcagaatagggctagaaattatgaaaatggagacaggaaa caagaagatcctcattgccctcagatccacaggagagtcctggggcttaatgctgctggt gcacggcactgtgctagaagccaaggactcagggacacaaaggctgtaaagggcccctat gcttcactaataagggagacacccttgctgttaactgactataaaatcaggtggaattta gaggacccagagtcagatttggctggaagttctatttga >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_3|190_aa MKQQMMLRKMRKETVGTEDRTWALPGNDIMPLLGPKSLGFGRLSTFSRSSLILTTTPSNH NDKCDEFPTVHRTRWEEWSRIIPQGTTISEPTTHMELSIGHVNRVSGSAAAGRRNGNSLH CHQVSSSAKEQLIPATSSRGKRLSKDSRRKRQTNPNDQHPSGQGVVQKRAVTGGREMRAV EMQEEQNVLG >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_3|573_bp atgaagcagcaaatgatgttgcgcaaaatgcgaaaagagaccgtgggtacggaagacaga acttgggctttgcctggaaatgatataatgccgctgctcggtcccaagtccctaggtttt ggaagattgagcacgttctcaagatcgagtctgattctcacaacgaccccaagtaaccac aatgacaagtgtgatgagtttcccacagttcacagaacaagatgggaagaatggtccaga atcatcccacaggggacaacaattagtgaaccaaccactcatatggaactctcaattggc cacgtgaatagagtgagtgggtcagcagctgcaggaagaagaaatggaaactctcttcat tgtcatcaagtctcttcctctgccaaagaacaattgatccctgcaacttctagcagaggg aagaggctttccaaggacagcagaaggaagagacaaactaacccaaatgatcagcatccc tcgggacagggagttgtccagaagcgagcagtaacaggagggagagagatgagggcagtg gagatgcaggaggagcagaatgtcctgggctag >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_4|213_aa MKGPGKPGRGQLLRNLKALSRETLESATAKAPRTPGATMPGTSIPVFGNTVNILANHLTF FPSKAVSESAFKEQTFLKLEFGPAALQQFEEGCHESPRPWGFALHGILSLLIARRNYFGI YHLLPYVATDLFHGLLLSLQRDCKSHRESCMGRLAQREVLTEDTGGERVPRERVWGFSLP HLALIHRYAIQGLKAQEGKDWIKELAGEAKPLI >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_4|642_bp atgaaggggcctgggaagcctggcagaggacagttgctgagaaatcttaaggcactgagc agagaaacattggagtctgcaactgccaaggcacctaggactccaggggccacaatgcca ggtacttctattcctgtctttgggaacactgtcaatatcctggcaaaccatttgacattt ttccccagcaaagctgtttctgagtctgcattcaaagagcagactttcttaaaattagaa tttggtcctgctgcccttcagcaatttgaggaaggctgccatgaatctccacggccatgg ggattcgcccttcatggaatccttagtttactcatagccaggaggaattattttggcatt tatcatttattgccttatgttgctacagatctgtttcatgggttactcttatctctgcag agggactgcaagagtcacagagagtcgtgcatgggcaggctggcccagagggaagttctc actgaagacactggaggagaaagggtccctcgtgaaagggtctgggggttctccctgccc cacctggctttgatccataggtatgccatccagggcctgaaagctcaggaaggaaaggac tggataaaggagcttgcaggggaggcaaaacccctgatatag >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_5|89_aa MDSTPGEEVVNISEMKAKDLEYYINLVEKAAIGFDRIDFNFEGSSTVDIAPANPAFSNHH PEVSSHRGETLHQKKYNDSLMAQIIVRGF >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_5|270_bp atggactctactcctggtgaagaggttgtgaacatttctgaaatgaaagcaaaggatttg gaatattacataaacttagttgagaaggcagcaatagggtttgataggattgacttcaat tttgaaggaagttctactgtggacattgctccagccaacccagccttcagcaaccaccac cctgaagtcagcagccatcgaggtgagaccctccaccagaaaaaatataatgactcactg atggctcagataattgttagaggtttttga >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_6|326_aa MAVATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLMHGKTVLITGANSGLGRATA AELLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEPGVSGVGELIVRELDLASLR SEEPRLDVLINNAGIFQCPYMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKSSAPSRIVVV SSKLYKYGDINFDDLNSEQSYNKSFCYSRSKLANILFTRELARRLEGTNVTVNVLHPGIV RTNLGRHIHIPLLVKPLFNLVSWAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEEE LLPKAMDESVARKLWDISEVMVGLLK >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_6|981_bp atggcagtggccactgcggcggcagtactggccgctctgggcggggcgctgtggctggcg gcccgccggttcgtggggcccagggtccagcggctgcgcagaggcggggaccccggcctc atgcacgggaagactgtgctgatcaccggggcgaacagcggcctgggccgcgccacggcc gccgagctactgcgcctgggagcgcgggtgatcatgggctgccgggaccgcgcgcgcgcc gaggaggcggcgggtcagctccgccgcgagctccgccaggccgcggagtgcggcccagag cctggcgtcagcggggtgggcgagctcatagtccgggagctggacctcgcctcgctgcgc tcggaagagcctaggctggatgtcttgatcaataacgcagggatcttccagtgcccttac atgaagactgaagatgggtttgagatgcagttcggagtgaaccatctggggcactttcta ctcaccaatcttctccttggactcctcaaaagttcagctcccagcaggattgtggtagtt tcttccaaactttataaatacggagacatcaattttgatgacttgaacagtgaacaaagc tataataaaagcttttgttatagccggagcaaactggctaacattctttttaccagggaa ctagcccgccgcttagaaggcacaaatgtcaccgtcaatgtgttgcatcctggtattgta cggacaaatctggggaggcacatacacattccactgttggtcaaaccactcttcaatttg gtgtcatgggcttttttcaaaactccagtagaaggtgcccagacttccatttatttggcc tcttcacctgaggtagaaggagtgtcaggaagatactttggggattgtaaagaggaagaa ctgttgcccaaagctatggatgaatctgttgcaagaaaactctgggatatcagtgaagtg atggttggcctgctaaaatag >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_7|642_aa MVRDLEQGNDNRDRKKELDSKYHLELEVRGIMVTAMYVYKNEPGMRSSKESLEAEKRKES DKTGVRLSNQGHSSCRRCLCAAEGTALGPCHTIRIYIHMCLLWEQGQQITMMRVSRDHTT DSGGEPQTTWGSQESSLRKTDSRGYLVRSQWSRISRSPSTKAPSIDEPRSRNTSAKVELP SSSTSSRTPSTSPSLHDSSPPPLSGQPSLQPPASPQLPRSLDSRPPTPPEPDPGSRRSTK MQENPEAWAQGIVREIRQTRDSQPLEYSRTSPTEWKSSSQRRGIYPASTQLDRNSLSEQQ QQQREDEDDYEAAYWASMRSFYEKNPSCSRPWPPKPKNAITIALSSCALFNMVDGRKIYE QEGLEKYMEYQLTNENVILTPGPAFRFVKALQYVNARLRDLYPDEQDLFDIVLMTNNHAQ VGVRLINSVNHYGLLIDRFCLTGGKDPIGYLKAYLTNLYIAADSEKVQEAIQEGIASATM FDGAKDMAYCDTQLRVAFDGDAVLFSDESEHFTKEHGLDKFFQYDTLCESKPLAQGPLKG FLEDLGRLQKKFYAKNERLLCPIRTYLVTARSAASSGARVLKTLRRWGLEIDEALFLAGA PKSPILVKIRPHIFFDDHMFHIEGAQRLGSIAAYGFNKKFSS >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_7|1929_bp atggtcagagacctggaacaaggcaatgataatagagatagaaaaaaggaactggattca aaatatcatttggaattagaagtaagaggtattatggtgactgctatgtatgtgtataag aatgagcctggaatgaggtcctcaaaagagagtctagaagcagaaaaaagaaaggaatct gacaaaacaggagttcgtctgagcaatcagggtcactcgtcgtgtagacgctgcctttgt gcagctgagggaacagcccttggcccctgccacacaatacgtatttatattcacatgtgc ctgttgtgggagcagggccagcagatcaccatgatgagggtgagcagagatcacactacc gattctggaggggaaccacagacaacctggggatcacaagaatcatcactgcggaagaca gactctcgagggtaccttgtgcgcagtcaatggtctagaatatcccggagcccatccacc aaggctccatccatagatgagcctagaagcaggaacaccagtgctaaggtagagctcccc agcagctccacgagctcccggactccatccacctccccaagcctgcatgactcctcaccg ccgccgctgtccgggcagccctcgctccagccacccgcgtcgccccagctgccccggtcg ctggactcgcggcctcccacgcccccagagcccgatcctggctcccggcgcagcaccaaa atgcaagagaatccggaggcctgggcccaaggcatcgtgcgggaaatccgccagacccgg gactcgcagccgctggaatattcgcgcacgtcccccaccgagtggaagtcctccagccag cgcagggggatctaccccgcctccacccagctggaccgcaactctctgtccgagcagcag cagcagcagcgggaggacgaggacgactacgaggctgcctactgggcatccatgaggtcg ttctacgagaagaacccgagctgctcgcgcccctggccgcccaaacccaagaacgccatc accattgctctctcatcctgcgcgctcttcaacatggtggacggcaggaaaatctacgag caagagggtctggaaaagtacatggagtatcagctcaccaatgagaacgtcatcctgacc ccgggcccggcgttccgcttcgtcaaggcactacagtatgtcaatgctagactccgtgat ctatatcctgatgaacaggacttatttgatattgtactgatgactaataaccatgcccaa gtgggagtgcggcttataaacagcgtcaatcactacggcttactgattgaccgcttctgt ctgaccgggggaaaagaccccattggctatttgaaggcatatcttaccaacttgtatatt gctgcagattctgaaaaagtgcaagaggcaatacaagaaggtattgcctctgcgacaatg tttgatggagccaaagacatggcttactgtgacactcagctccgtgtagcctttgatggg gatgctgtcctcttctctgatgagtctgaacattttaccaaggagcatgggctcgacaaa ttcttccagtatgatacattatgtgaaagtaagcctcttgctcagggtcccctaaaaggc tttctggaagatttaggcagactgcaaaagaagttctatgccaaaaatgaacggttactt tgtcctatcaggacctacctggttacagctaggagtgcagccagttcaggcgcccgtgtg ctgaagacccttcgacgctggggtctagagatagacgaagctcttttccttgctggagcc cccaaaagtcccatcttggtgaagatccggccccacatcttctttgatgaccacatgttc cacattgaaggggcacagaggttaggttccatcgcagcttatggctttaataaaaaattc agtagttag >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_8|121_aa MPSQAESTLKREGQELDSEQDEGADTRNITEDKSAKLSDQMNSFMVEPGEALHASPVLLS FDLHEWNVDVTAIWEEASRYTLTSSRYFGQGIFTYQHLARVFCICTMSRAKLMRSCSFIM I >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_8|366_bp atgcccagccaggcagagtctaccctgaagcgagaagggcaggaactagacagtgagcag gatgagggtgcagatacaagaaacattacagaagataaatcagcaaaactcagtgaccaa atgaacagtttcatggtggagccaggagaggctttgcatgcctctccagtcctgctcagc tttgacctacatgaatggaatgttgatgtaactgccatctgggaagaggccagtagatac acactgacctctagtcgatattttggacaagggattttcacttaccagcatttggcaaga gtattctgcatatgcaccatgagcagggccaagctcatgagaagctgctcctttatcatg atctga >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_9|297_aa MEARTFSQSSLCTFFCLLFIRAMLAADLIVPTHIEGGCAFPSPLTQMLIFFGNTLTDTPR KDTLHPSITSSWHSILTIIHTKLNKHVVTQCKEAKNHDKTMQQLTAKIANIERNITVLIE LKNPLRELHNAITSINSRIDQLEERISELEDYLFEIRQSDKNRGKKRMKMNEQNLQEIWD YVKRLNLKLIGVPEREGEGNQRGKRTSGYHPGKLYQPNDMTVYLENTIVPAQNLLKLISN FSKVSGYKINVQKSQAFIYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVNDLF >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_9|894_bp atggaggccagaacattcagccagtctagtctttgcacgtttttctgcctgctttttatt cgggccatgctggcagctgatttgattgtgcccacccatattgagggtgggtgtgccttt cccagtccactgactcaaatgttaatcttctttggcaacaccctcacagacacacccagg aaggatactttgcatccttcaatcacatcaagttggcactcaatattaaccatcatacac accaagctaaacaagcatgttgtaacccaatgcaaggaagctaaaaatcatgataaaaca atgcagcagctgacagcaaaaatagccaatatagaaaggaacataactgttttgatagag ctgaaaaacccactacgagaacttcacaatgcaatcacaagtattaatagcagaatagac caactggaggaaagaatctcagagcttgaagactatctttttgaaataagacagtcagac aagaatagaggaaaaaaaagaatgaaaatgaatgaacaaaacctccaagaaatatgggat tatgtaaagagactgaatctaaaactgattggggtacctgaaagagagggagaagggaac caacgtggaaaacgtacttccggatatcatccaggaaaactataccaacctaatgacatg actgtatacctagaaaacaccatcgtcccagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcatatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatgtgaacgacctcttctag >gi568815596r:18455194_18660572|GENSCAN_predicted_peptide_10|121_aa MRETTRTAENEEKQGGAMAYPGATGSQGTPHPQPREVVSECATPGKPRFSHRIFATHRSG DPLVSPYDQSLGSDTQSCGVECWQSSCTGTHRDPGALHTPVPGFPTKVSATQARHEIHTY P >gi568815596r:18455194_18660572|GENSCAN_predicted_CDS_10|366_bp atgagagaaacaactcgaactgcagaaaatgaagaaaaacagggtggggcgatggcctac ccaggagcaacagggagccaagggaccccccacccccagccaagggaagtggtgagtgaa tgcgcgacccccgggaaaccacgcttctcccacaggatctttgcaactcatagatcagga gatccccttgtgagtccatacgaccagagccttgggtctgacacacagagctgtggtgtg gagtgttggcagagcagctgtacaggcacacacagagacccaggagctttacatactcca gttccaggattcccaacaaaggtgtctgcaactcaggcaaggcatgagatacatacatac ccctag