GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:56:09 Sequence gi568815576r:29689016_29932325 : 243310 bp : 48.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3047 3165 119 1 2 59 68 106 0.060 5.47 1.02 Intr + 11466 11618 153 0 0 36 67 86 0.010 0.49 1.03 Intr + 13813 13948 136 1 1 41 49 127 0.015 4.67 1.04 Intr + 18011 18144 134 1 2 76 80 77 0.734 5.14 1.05 Intr + 21317 21524 208 2 1 87 40 82 0.220 2.38 1.06 Intr + 25774 25955 182 0 2 46 67 115 0.526 3.87 1.07 Intr + 31310 31571 262 2 1 88 72 296 0.162 25.49 1.08 Intr + 31637 32116 480 1 0 8 -15 259 0.299 0.83 1.09 Intr + 33726 33834 109 2 1 62 60 73 0.634 1.76 1.10 Intr + 38647 38790 144 2 0 79 89 342 0.969 33.65 1.11 Intr + 39615 39727 113 1 2 94 63 132 0.998 11.40 1.12 Intr + 40040 40193 154 1 1 40 70 406 0.937 33.65 1.13 Term + 40427 40554 128 0 2 30 42 314 0.934 19.44 1.14 PlyA + 42792 42797 6 -0.45 2.06 PlyA - 42866 42861 6 -0.45 2.05 Term - 45010 44848 163 0 1 82 53 83 0.747 1.61 2.04 Intr - 49426 49315 112 2 1 123 92 76 0.960 10.94 2.03 Intr - 51715 51635 81 2 0 90 83 132 0.994 12.51 2.02 Intr - 53465 53403 63 0 0 111 103 74 0.987 9.89 2.01 Init - 59529 59403 127 0 1 99 90 335 0.986 35.12 2.00 Prom - 60803 60764 40 -4.26 3.00 Prom + 68700 68739 40 -2.26 3.01 Init + 69920 69968 49 1 1 99 103 28 0.973 6.51 3.02 Intr + 73321 73392 72 2 0 10 105 80 0.470 1.28 3.03 Term + 73436 73509 74 0 2 53 49 82 0.807 -1.13 3.04 PlyA + 73718 73723 6 1.05 4.00 Prom + 74759 74798 40 -7.26 4.01 Init + 78384 78533 150 2 0 101 60 399 0.952 38.44 4.02 Term + 84488 84643 156 1 0 108 43 114 0.873 6.83 4.03 PlyA + 85497 85502 6 1.05 5.22 PlyA - 85843 85838 6 1.05 5.21 Term - 100169 99998 172 1 1 96 53 203 0.920 14.90 5.20 Intr - 101533 101454 80 1 2 109 84 24 0.933 2.45 5.19 Intr - 103520 103418 103 1 1 88 77 144 0.878 13.48 5.18 Intr - 104475 104345 131 0 2 102 94 318 0.977 33.29 5.17 Intr - 104661 104562 100 2 1 93 80 215 0.925 21.31 5.16 Intr - 112095 111976 120 2 0 98 105 185 0.999 20.81 5.15 Intr - 113193 112979 215 0 2 75 75 380 0.999 32.91 5.14 Intr - 115815 115623 193 2 1 -12 87 215 0.166 10.89 5.13 Intr - 117538 117470 69 0 0 98 105 121 0.999 13.10 5.12 Intr - 117889 117782 108 0 0 102 115 56 0.969 9.10 5.11 Intr - 119170 119096 75 0 0 65 61 68 0.572 0.43 5.10 Intr - 124527 124415 113 0 2 46 110 50 0.957 2.18 5.09 Intr - 125752 125642 111 1 0 77 95 42 0.946 4.38 5.08 Intr - 127058 126991 68 0 2 125 81 49 0.999 6.52 5.07 Intr - 133449 133320 130 0 1 84 75 151 0.992 13.67 5.06 Intr - 136242 136072 171 0 0 99 80 234 0.999 23.84 5.05 Intr - 136765 136607 159 1 0 52 92 155 0.910 12.48 5.04 Intr - 139173 139103 71 1 2 38 48 21 0.084 -8.00 5.03 Intr - 143327 143230 98 1 2 101 93 121 0.615 13.55 5.02 Intr - 149292 149163 130 1 1 -12 105 210 0.361 12.45 5.01 Init - 168831 168795 37 0 1 71 53 57 0.061 -1.12 5.00 Prom - 181143 181104 40 0.44 6.00 Prom + 187150 187189 40 -7.36 6.01 Init + 194553 194715 163 2 1 54 90 167 0.935 11.39 6.02 Term + 196813 196922 110 2 2 52 40 121 0.734 2.37 6.03 PlyA + 202226 202231 6 1.05 7.04 PlyA - 204120 204115 6 1.05 7.03 Term - 207048 206738 311 1 2 72 42 117 0.490 0.72 7.02 Intr - 207643 207326 318 2 0 120 62 355 0.881 32.03 7.01 Init - 210326 210287 40 2 1 100 81 8 0.554 -0.27 7.00 Prom - 217641 217602 40 -0.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 115784 115623 162 2 0 53 87 184 0.813 14.97 S.002 Term - 117275 117155 121 1 1 103 42 157 0.905 10.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_1|773_aa MGRQQPGLSKGRAAVAEGSQGPRAAALADQCSQVVAASSCTSFFAFGPRLSGEFLTAVPQ GCLPQKPSCGREDKEPPTYNEELAVKLWGERLVPRGPPRPCQASFTFQGAKMGVTGPGYR RSGYGYSHGREPEWNQPFSRPQFPLCTSEQQFSKGSTPRSRLAPSQHRRDSMMDEDPAGG SWQGHTVRPPSVLVSLDADRAHGSVWRILTWPQQLHQHPQPSSLCPALSCLSFPTCWVAE AAATRYHVPMGVPGKATKLTWQHLDILMTPLSRGGNQSGDGVSTKFKVILGLLITALPEP RVLLGEKEEGSRAPRPLQPPPGPPPAHEPRPQSLRRAGGRGASKMPFHPVTAALMYRGIY TVPNLLSEQRPVDIPEDELEGECPPGSPPRRPSYLCAQVGAPASSCAPRQDLSAPRGAAV GVAQAESGPAPARVVPSAAETCRAPQRGHGAAQSAVTVRQRGGLGRGSALSPQRRPGRST RILIPSAAPARGSLLALTFRSCRFKNSICSGSSRRGEGDCPALARCGAAPVPTCGRSPFP AALRGPPARRAGKAAREAGGLGETQALCREDAIRGGATCARKRELGSPGWNAADKIREAF KVFDRDGNGFISKQELGTAMRSLGYMPNEVELEVIIQRLDMDGDGQVDFEEFVTLLGPKL STSGIPEKFHGTDFDTVFWKCDMQKLTVDELKRLLYDTFCEHLSMKDIENIIMTEEESHL GTAEECPVDVETCSNQQIRQTCVRKSLICAFAIAFIISVMLIAANQVLRSGMK >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_1|2322_bp atggggcggcagcagccaggtctgtctaaggggcgagcagcagtggccgaaggctcccag ggccctcgggcagcagcccttgctgaccagtgcagccaggtcgtggctgcctccagctgc acttcattctttgcctttggtcctcgcctgagtggggagttcctgacagctgttccccag ggctgcctaccccagaagccttcgtgtggcagagaggataaggagccacccacatataat gaagagttggcagtgaaactctggggggaaagactggtcccccgaggtcctccaaggcca tgtcaagccagcttcaccttccaaggagccaaaatgggagtgactggcccaggataccgc aggagtggctatggctacagccatggccgggagccggagtggaaccagcccttctccagg cctcagtttcccttgtgcaccagcgagcagcagttttctaaaggttccaccccgaggagc cggctggcccccagccagcatcgcagggacagcatgatggatgaggatcccgcggggggc agctggcaaggacacaccgtccgtccgcccagtgtcctggtctccttggacgcagacaga gcccacggaagcgtctggcgcatcctgacttggccgcagcagctgcaccagcacccccag ccctcttccctgtgcccagcgctgtcctgcctgtccttccccacctgctgggtggccgaa gcagccgccacacgctaccatgtccccatgggtgtgccgggcaaggccacaaagctgacc tggcagcacctggatattttaatgaccccattgtccagaggaggaaatcaaagtggagac ggtgtaagcaccaagttcaaggtcatcctgggactgttgatcacagccctacctgagccg cgggtcctgctgggagagaaggaggaggggagccgcgcgccccgcccgctccagccgccc ccggggccgccaccggcccatgagccccggcctcaaagtttgcggcgggcgggcgggcgc ggagcctccaagatgccgttccacccggtgacggcggcgttgatgtaccggggcatctac accgtgcccaacctgctgtcggagcagcgcccggtggacatcccggaggacgagctggag ggtgagtgtccgccgggatccccgccccggcggccctcctacctgtgcgcccaggtgggc gccccagctagcagctgtgccccgcggcaagacctgtccgcaccccggggcgccgcggtg ggggtcgctcaggcggagagcggcccagcccctgcccgcgtggtccccagcgctgcggaa acttgccgggccccgcagcggggtcacggggccgcgcagtcggcggtgacggtgcggcaa cgcggcggactggggcgggggtccgcgctgagcccccagcgccggcccggccggagcacc cgcatcctgatcccctccgcggcgcccgcccgcggctctctgctcgcattgacattccgc tcgtgtcgctttaaaaattcaatctgctcgggcagcagcagaaggggagagggcgactgc cctgctcttgcccgctgcggggccgcccccgtccccacctgcggccgtagccccttccct gcagccctgcggggacccccagcccggcgcgccgggaaggcggcccgggaggcgggcggt ctgggcgagacccaggccctctgccgggaggacgccattcgcggaggagccacatgtgcc aggaagagggagctgggcagcccgggatggaatgctgcagacaagatccgagaggccttc aaggtgtttgaccgtgacggcaatggcttcatctccaagcaggagctgggcacagccatg cgctcactgggttacatgcccaacgaggtggagctggaggtcatcatccagcggctggac atggatggtgatggtcaagtggactttgaggagtttgtgacccttctgggacccaaactc tccacctcagggatcccagagaagttccatggcaccgactttgatactgtcttctggaag tgcgacatgcagaagctgacggtggatgagctgaagcggctgctctacgacaccttctgc gagcacctgtccatgaaggacatagagaacatcatcatgacggaggaggagagccacctg ggcacagccgaggagtgtcccgtggatgtggagacctgctccaaccagcagatccgccag acttgcgtgcgcaagagtctcatctgcgccttcgccatcgccttcatcatcagtgtcatg ctcattgcggccaaccaggtgctgcgcagtggcatgaagtag >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_2|181_aa MGKRYFCDYCDRSFQDNLHNRKKHLNGLQHLKAKKVWYDMFRDAAAILLDEQNKRPCRKF LLTGQCDFGSNCRFSHMSERDLQELSIQVEEERRAREWLLDAPELPEGHLEDWLEKRAKR LSSAPSSRTAAGKMQESGSCPHAQDKQVTREATSRVVPDTHLNVPKALAHVSGALAVCQC V >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_2|546_bp atggggaagcgatacttctgtgactactgcgaccgctccttccaggacaacctccacaac cgcaagaagcacctgaacgggctgcagcacctcaaggccaagaaggtctggtacgacatg ttccgagatgcagctgccatcttgctggatgagcagaacaagcggccctgcaggaagttt ctactgacaggccagtgcgactttggctccaactgcagattttcccacatgtcagagcga gacctgcaggagctgagcatccaggtggaggaggagaggcgagccagggagtggctacta gatgctcctgagctccccgagggccatctggaggactggctggagaagagagccaagcgg ctgagctcagccccaagtagcaggaccgcggctgggaaaatgcaagagagtggctcctgt cctcacgcacaggacaaacaggtcaccagagaagctacgagcagagttgtgccagacacg cacctcaatgtccccaaggctttggctcacgtgtctggggccctggcagtgtgccagtgt gtgtga >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_3|64_aa MEGSRDLVHSDEPARGGDPNPQAPERYGSMACQELGRTAGATPHCSHYRLSFVSCQIIGS IRFS >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_3|195_bp atggagggttccagagacctggttcactcggatgaaccagccaggggtggggaccccaac ccccaggccccggaacggtacgggtccatggcctgtcaggagctgggccgcacagcagga gccactccccattgctcgcattaccgcctgagcttcgtctcctgtcagatcatcggcagc attagattctcatag >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_4|101_aa MAAATLTSKLYSLLFRRTSTFALTIIVGVMFFERAFDQGADAIYDHINEGPDVIQPELDY PESSLMEEDSGQETHGEVGETLEIVQFFLPEVYLCEDEVEA >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_4|306_bp atggcggccgcgacgttgacttcgaaattgtactccctgctgttccgcaggacctccacc ttcgccctcaccatcatcgtgggcgtcatgttcttcgagcgcgccttcgatcaaggcgcg gacgctatctacgaccacatcaacgaggggcctgatgtgattcagccagagctggattat ccagaatcttctctgatggaggaagattcaggtcaggaaacccatggagaggtaggagag accctagaaattgtccagtttttcctacctgaggtgtatctttgtgaagacgaagttgaa gcctag >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_5|817_aa MRGASARPPLLGSRDFLSAPAESQFDPGSAGAQGAVTVVGGGGGGGGTEPVVEPPRRVTQ HNASSAPGPTPDHPQGPEDRKAEDFTSAVLILECLEPWRKSACSGKPSLILQHPEQKADR YFVLYKPPPKDNIPALVEEYLERATFVANDLDWLLALPHDKFWCQVIFDETLQKCLDSYL RYVPRKFDEGVASAPEVVDMQKRLHRSVFLTFLRMSTHKESKDHFISPSAFGEILYNNFL FDIPKILDLCVLFGKGNSPLLQKMIGNIFTQQPSYYSDLDETLPTILQVFSNILQHCGLQ GDGANTTPQKLEERGRLTPSDMPLLELKDIVLYLCDTCTTLWAFLDIFPLACQTFQKHDF CYRLASFYEAAIPEMESAIKKRRLEDSKLLGDLWQRLSHSRKKLMEIFHIILNQICLLPI LESSCDNIQGFIEEFLQIFSSLLQEKRDETRTAYILQAVESAWEGVDRRKATDAKDPSVI EEPNGEPNGVTVTAEAVSQASSHPENSEEEECMGAAAAVGPAMCGVELDSLISQVKDLLP DLGEGFILACLEYYHYDPEQVINNILEERLAPTLSQLDRNLDREMKPDPTPLLTSRHNVF QNDEFDVFSRDSVDLSRVHKGKSTRKEENTRSLLNDKRAVAAQRQRYEQYSVVVEEVPLQ PGESLPYHSVYYEDEYDDTYDGNQVGANDADSDDELISRRPFTIPQVLRTKVPREGQEED DDDEEDDADEEAPKPDHFVQDPAVLREKAEARRMAFLAKKGYRHDSSTAVAGSPRGHGQS RETTQERRKKEANKATRANHNRRTMADRKRSKGMIPS >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_5|2454_bp atgaggggcgcctctgcccggccgcccctactgggaagccgagacttcctgtctgctcct gcggaatcgcagtttgaccccggaagtgcgggcgctcagggagctgtcaccgtggtcggc ggcggcggcggcggcggcggcacagagccggtggtggagccgccgaggagggtcacgcag cacaatgccagctctgcccctggaccaactccagatcacccacaaggacccgaagacagg aaagctgaggacttcaccagcgctgtccttatcctcgaatgcttggagccttggaggaag tctgcttgctctgggaagccgtcccttatcctccagcaccccgagcagaaggcagaccgg tattttgtgttatacaaaccgccccctaaagacaacattcccgccctagtggaggagtac ctggaacgcgccaccttcgtagccaatgacctcgactggctcctggccttgcctcacgat aaattctggtgccaggtgatctttgacgagactctacagaagtgcctggactcctacctg cgctatgtcccccgcaaattcgacgagggggtggcctcagcccctgaggttgttgacatg cagaagcgcctccatcgaagtgtttttctcaccttcctccgcatgtccactcacaaggaa tccaaagatcacttcatttccccttctgcgtttggagaaatcctctacaataacttcctc tttgacattccaaagatcctggacctctgcgtgctctttggaaaaggcaactcaccactg ctccagaagatgataggaaacatctttacacagcagccaagttactacagtgacctggat gaaaccctgcctaccatccttcaggtcttcagcaatatcctccagcactgtggtttgcaa ggggacggggccaataccacaccccagaagcttgaggagaggggccgattgacccccagt gacatgcctctcctggaattaaaggacattgttctctacctttgtgatacctgcaccaca ctttgggcctttctggatatcttccctttggcttgccagaccttccagaagcacgacttt tgttacagactagcttccttctacgaagcagcaattcccgaaatggagtctgcaattaag aagaggaggcttgaagatagcaagcttcttggtgacctgtggcagaggctctcccattcc aggaagaagctaatggagattttccacatcatcctgaaccagatctgcctccttcccatc ctagaaagcagctgtgacaacattcagggcttcatcgaagagttccttcagatcttcagc tccttgctgcaggagaagagggacgagacgcggactgcctacatcctccaggcagtcgag agtgcatgggaaggggtggacagacggaaagccacagatgctaaagacccatcggtgatt gaggagcctaatggggagcctaacggggtcacggtgacagcagaggcagtcagtcaagca tcatcacatccggagaactcggaggaagaggagtgcatgggagcagccgcggctgtgggc cctgccatgtgtggggtggaactggactctctcatctcccaagtgaaggacctgctgcca gaccttggtgagggcttcatcctggcctgcctggagtactaccactacgacccagagcag gtgatcaacaatatcctggaggagcggctggcccccaccctcagccagctggaccgcaac ctagacagagaaatgaaaccagaccctacacccctgctgacgtctcgccacaacgtcttc cagaatgacgagtttgatgtgttcagcagggactcagtagacctgagccgggtgcacaag ggcaagagcaccaggaaggaggaaaacacgcggagtttgctgaacgacaagcgtgcagtg gcggcacagcggcagcgctacgagcagtacagcgtggtggtggaggaggtgccactgcag ccaggcgagagcctgccctaccacagtgtctactacgaggatgagtacgatgacacatac gatggcaaccaggtgggcgccaatgatgcagactctgatgacgagctcatcagccgcagg ccattcaccatccctcaggtgctgagaaccaaagtgcctagagaagggcaggaggaggat gacgacgatgaggaagacgatgctgacgaggaggctcccaagcccgaccattttgttcag gaccctgcagtgctgagagagaaggcagaagccaggcgcatggcctttctcgccaagaaa gggtaccggcatgacagctcaacagcagtggccggcagcccccgaggccatgggcagagc cgcgagacaacccaggaacgcaggaagaaggaagccaacaaggcgacaagagccaaccac aaccggagaaccatggccgaccgcaagaggagcaaaggcatgatcccatcctga >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_6|90_aa MGARRPSSARTLDAGEAGSRARAAFPDLAGMGPCGSGCGTYPRSARSYSHELPGVKWNAV TKGSIRVLEGGVVKVLTSLEEVEENVVTIV >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_6|273_bp atgggagctcggcggccgagctcggcccggaccctagatgcgggggaggcggggtcccgg gctcgggctgccttcccagacctggcggggatgggcccgtgcggctctgggtgtgggacg taccctcggagcgcccggagttattcccacgaactcccgggagtaaagtggaatgctgtt accaaaggaagtattcgtgttttagaggggggtgtggtgaaggttcttacaagtttggaa gaggtggaagaaaatgttgttacaattgtgtag >gi568815576r:29689016_29932325|GENSCAN_predicted_peptide_7|222_aa MAHVYNPSTLGGQDYGNYVRQKANSSDFLEFKMGREAVETTHNINYTSGPETVQWWFKKC CKGDESLEDEECSGRPEVGNDQLRAIIEADPLTTTREIAEELNVDHSTLVWQAIEANWKV LGIQAWATTPSQKVKKLDKWVPHELTENFKNCRFEMLSSLILRNDDEPFLGWIVMCDKKW ILYNNSDDQLSGWTEKTLQSTSQSRTCTKKWSWSLFGGLLLV >gi568815576r:29689016_29932325|GENSCAN_predicted_CDS_7|669_bp atggctcacgtctataatcccagcactctgggaggccaagactatggaaattatgttaga caaaaagcaaattcaagtgattttcttgagttcaaaatgggtcgtgaagcagtggagaca actcacaacatcaactacacatctggcccagaaactgtgcagtggtggttcaagaagtgt tgcaaaggagacgagagccttgaagatgaggagtgtagtggccggccagaagttggcaat gaccaactgagagcaatcatcgaagctgatcctcttacaactacacgagaaattgccgaa gaactcaatgtcgaccattctacccttgtttggcaggcaattgaagccaactggaaagtg ctgggaatacaggcgtgggctaccacgcccagccaaaaggtgaaaaagcttgataagtgg gtgcctcatgagctgaccgaaaattttaaaaattgtcgatttgaaatgttgtcttctctt attctacgtaacgacgacgaaccatttcttggttggattgtgatgtgcgacaaaaagtgg attttatacaacaacagtgatgaccagctaagtggctggaccgagaagacactccaaagc acttcccaaagccgaacttgcaccaaaaagtggtcatggtcactgtttggcggtctgctc ctggtctga