GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:18:38 Sequence gi568815596f:28794699_29046776 : 252078 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 72 67 6 1.05 1.01 Sngl - 16803 16129 675 0 0 56 45 194 0.961 8.09 1.00 Prom - 21255 21216 40 -3.16 2.07 PlyA - 21291 21286 6 1.05 2.06 Term - 45610 45566 45 1 0 105 43 41 0.087 -1.49 2.05 Intr - 55658 55630 29 0 2 43 109 36 0.176 -0.97 2.04 Intr - 56574 56374 201 1 0 23 89 120 0.297 4.86 2.03 Intr - 70421 70319 103 2 1 74 110 18 0.246 2.35 2.02 Intr - 75712 74881 832 0 1 73 90 208 0.295 10.90 2.01 Init - 80757 80612 146 0 2 35 80 143 0.835 7.79 2.00 Prom - 90034 89995 40 -5.96 3.00 Prom + 95129 95168 40 -8.06 3.01 Init + 100001 100225 225 1 0 102 98 276 0.823 26.47 3.02 Intr + 107289 107426 138 2 0 127 92 2 0.854 5.16 3.03 Intr + 111762 111883 122 2 2 106 89 3 0.031 1.49 3.04 Intr + 117892 118012 121 1 1 32 84 52 0.026 -0.30 3.05 Intr + 119371 119510 140 0 2 78 84 60 0.938 3.86 3.06 Intr + 130284 130455 172 0 1 117 76 30 0.788 4.65 3.07 Intr + 132871 133002 132 0 0 80 80 65 0.941 5.74 3.08 Intr + 134881 135012 132 0 0 100 101 31 0.969 6.44 3.09 Intr + 140823 140909 87 2 0 111 78 7 0.708 2.17 3.10 Intr + 147614 147683 70 1 1 93 98 62 0.944 6.45 3.11 Intr + 151752 151801 50 1 2 103 81 96 0.949 8.80 3.12 Term + 151902 152081 180 2 0 60 49 391 0.999 30.01 3.13 PlyA + 152273 152278 6 1.05 4.00 Prom + 153444 153483 40 -0.36 4.01 Init + 161842 161999 158 0 2 57 94 77 0.337 4.04 4.02 Intr + 164574 164683 110 0 2 58 -5 92 0.016 -3.17 4.03 Intr + 173749 173877 129 2 0 129 98 -44 0.018 1.37 4.04 Intr + 186558 186707 150 1 0 83 44 111 0.028 6.33 4.05 Intr + 192903 193114 212 1 2 26 63 99 0.008 -0.17 4.06 Intr + 200134 200164 31 0 1 73 116 59 0.197 4.90 4.07 Intr + 203445 203555 111 1 0 101 100 101 0.768 12.95 4.08 Intr + 204483 204770 288 1 0 112 83 204 0.988 19.52 4.09 Intr + 207838 208049 212 2 2 85 94 102 0.950 9.13 4.10 Intr + 208794 208984 191 2 2 91 95 60 0.469 5.38 4.11 Intr + 212330 212463 134 2 2 114 92 -29 0.247 0.39 4.12 Intr + 219697 219863 167 2 2 91 77 166 0.048 15.48 4.13 Intr + 222456 222606 151 2 1 21 79 189 0.987 11.04 4.14 Intr + 223094 223258 165 0 0 100 80 88 0.990 9.13 4.15 Intr + 226555 226646 92 2 2 70 30 57 0.390 -2.19 4.16 Intr + 228368 228493 126 1 0 85 78 114 0.721 10.98 4.17 Intr + 229441 229676 236 0 2 49 41 377 0.911 25.59 4.18 Intr + 232155 232313 159 0 0 87 91 255 0.988 24.80 4.19 Intr + 238236 238353 118 0 1 94 99 146 0.995 16.77 4.20 Intr + 238771 238865 95 0 2 66 39 163 0.993 7.96 4.21 Intr + 240766 240958 193 1 1 73 99 277 0.988 26.79 4.22 Intr + 241843 242059 217 0 1 103 92 290 0.787 28.98 4.23 Intr + 250626 250712 87 1 0 106 99 -7 0.098 2.14 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 117950 118012 63 1 0 67 84 55 0.882 4.15 S.002 Init - 215198 215091 108 2 0 82 90 135 0.917 11.35 S.003 Init + 218281 218333 53 0 2 96 94 69 0.940 7.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:28794699_29046776|GENSCAN_predicted_peptide_1|224_aa MSYGPGTETQQLRSQNSGADDLGDKKRCLMGHKEVGFIKKTPQISIPPTIKAAGTRGDGS ACPGPSLRADGRASGAASGIGPARPSPGTFTRYAAGRRAAKAPTCAATASLARSLSPEPA AGSACVVAAKAAEGAHGRRREDEAGALPSHLRRAVGSPASEPRDRAHSKLEIRLRGPFPG ASTGTCGSPGLRGRGPGNGGQGSVAALLASDGCSSVASQQRYPL >gi568815596f:28794699_29046776|GENSCAN_predicted_CDS_1|675_bp atgagctacgggcctggtactgagacacagcagctcagatctcaaaattctggggcagac gatcttggggacaaaaagaggtgtttaatgggacataaggaggtaggatttatcaaaaag accccccaaatctccattcctcccacaataaaggcggcaggcacacgtggagacgggagc gcctgcccagggccctccctccgagcagacggccgagcttcgggagcagcctccggtatc ggccctgcccgtccttcccctggaaccttcacccgctacgccgccgggcggagggcggcc aaagccccaacctgcgcggccactgcctccctcgccaggtccctcagcccagagcccgct gcggggagcgcgtgtgtcgtcgccgcgaaggcagctgagggcgcccacgggaggcggcgt gaggacgaggctggagcgctgccttctcatctaaggcgggcggtggggtcgccggcgagc gaacccagggaccgggcacactcgaaactggagattcgcctgcgaggccccttcccgggg gcgagcacaggtacctgcggaagcccggggctgcgcgggagagggccgggcaacggcggt caaggctccgtcgcagcgctcctggcctcagacggttgctcgtcggtcgctagccagcag cggtacccgctctaa >gi568815596f:28794699_29046776|GENSCAN_predicted_peptide_2|451_aa MFEKVLNFSEFGFAVSSNTAVNPRGEVLQNPDSSLAATGDKVKKQEKSRRSRGAVEPHAA AEPSGCCAMRATGKEGVALGLRHSSATAPSRNTMLMAWCRGPVLLCLRQGLGTNSFLHGL GQEPFEGARSLCCRSSPRDLRDGEREHEAAQRKAPGAESCPSLPLSISDIGTGCLSSLEN LRLPTLREESSPRELEDSSGDQGRCGPTHQGSEDPSMLSQAQSATEVEERHVSPSCSTSR ERPFQAGELILAETGEGETKFKKLFRLNNFGLLNSNWGAVPFGKIVGKFPGQILRSSFGK QYMLRRPALEDYVVLMKRGTAITFPKDINMILSMMDINPGDTVLEAGSGSGGMSLFLSKA DGIRTCELALSCEKISEVIVRDWLVCLAKQKNGILAQKVESKINTDVQLDSQEKIGVKGE LFQEDDHDQYTGNLVIQSEQSTGVALGPRGS >gi568815596f:28794699_29046776|GENSCAN_predicted_CDS_2|1356_bp atgtttgagaaggttttaaacttctctgaatttggatttgcagtttctagtaacacagca gtaaatcctagaggagaagttctacagaatccagattcaagtttagcagcaactggggat aaagtgaaaaagcaggagaaaagcaggcgcagtcgcggagctgtagagccccacgcagct gcagagccatcgggctgctgcgccatgcgcgcgactgggaaagaaggggtcgcgctaggc ttgcgtcactcgtctgcgacggcgccttcgcgaaacactatgctaatggcatggtgccgc ggtcctgtcttgctgtgcctgcggcaggggctcggaaccaattcattcctgcacggcctg gggcaggagcccttcgagggagctcggtcactgtgttgcaggtcctcgcctagagacctg cgagatggagaaagagagcacgaggcggcacaaaggaaagccccaggagcagagtcttgc ccatctctccctctgagcatctcggacattgggactggatgtctttcgtcactggaaaac ctcagactgccgacgctgcgggaagagtcatcccctcgagagctcgaggactcgagcgga gaccagggccggtgcggtcccacacaccagggatccgaggatccttcgatgctctcgcag gcccagtccgctaccgaggtcgaagagcgtcacgtctccccttcttgttcaacttccaga gagagaccctttcaggctggggaactgattttagctgagactggggagggagaaacaaaa tttaagaaattatttaggttgaacaacttcggactcttaaatagtaactggggggcagtc ccgttcggcaagatcgtggggaagttccccggccagatactgaggagttccttcggtaag cagtacatgctgaggaggccagccttggaagactatgtagtattgatgaaaagagggact gccataacattcccaaaggatattaatatgattctctcaatgatggatatcaacccaggt gatactgttttggaagctggctcaggctctggtggaatgagcttatttttatccaaagca gatggaattcgcacctgtgaacttgctctttcatgtgaaaagataagcgaggtcattgtc agagattggttggtttgccttgcaaaacagaaaaatggaattttagctcaaaaagtagaa tctaaaatcaacacagatgtacaactagattctcaagagaaaattggagttaaaggtgag ctgtttcaagaggatgaccatgaccagtacactggcaacctggtcatacagagtgaacaa tctactggtgtggcactaggtccccggggcagctga >gi568815596f:28794699_29046776|GENSCAN_predicted_peptide_3|522_aa MAAGGGGSCDPLAPAGVPCAFSPHSQAYFALASTDGHLRVWETANNRLHQEYVPSAHLSG TCTCLAWAPARLQAKESPQRKKRKSEAVGMSNQTDLLALGTAVGSILLYSTVKGELHSKL ISGGHDNRVNCIQWHQDSGCLYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVSSLCISPDG KMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGA VHDRLLNVWYCKKPLTSNCTIQIATPGKGKKSTPKPIPILAAGFCSDKMSLLLVYGSWFQ PTIERVVRTPVMNSEAKVLVPGIPGHHAAIKPAPPQTEQVESKRKSGGNEVSIEERLGAM DIDTHKKGKEDLQTNSFPVLLTQGLESNDFEMLNKVLQTRNVNLIKKTVLRMPLHTIIPL LQEVTASEKTKGATSPGQKAKLVYEEESSEEESDDEIADKDSEDNWDEDEEESESEKDED VEEEDEDAEGKDEENGEDRDTASEKELNGDSDLDPENESEEE >gi568815596f:28794699_29046776|GENSCAN_predicted_CDS_3|1569_bp atggcggcgggcggcggcggtagctgcgaccccctggcccctgctggggtcccttgcgcc ttctccccgcacagccaggcctacttcgctttggcctctaccgacggtcacttacgagta tgggagacggccaacaaccggctgcaccaggagtacgtgccttccgcgcacctcagtggt acctgcacctgtctggcctgggcgccagcgcggctgcaggccaaggaaagtccccagagg aaaaaaaggaaatcagaagctgtaggaatgagtaaccagactgacttattggctcttggc acagcagttggtagcattttattatacagcacagtaaaaggagagttacacagtaaatta ataagtggtggacatgacaacagagtcaactgcatacagtggcatcaagacagtggctgt ttatatagttgttcagatgataaacatattgtggaatggaacgtacagacatgcaaagta aagtgcaaatggaaaggcgacaatagcagtgtcagttccctatgtatcagcccagatgga aagatgttgctttcagctggtcgaacaatcaaactatgggttttggagaccaaagaagtc tacaggcatttcacaggacatgcaacgccagtttcgtcactgatgttcactaccatcaga cctcctaatgagagccagccctttgatggaattacaggtctttatttcttatctggagca gtacatgaccggttacttaatgtctggtactgcaaaaagcctttgacttcaaactgcaca attcagatagcaacacctgggaaaggcaagaagtcaacaccaaaacccatccctattcta gctgctggtttttgctcagacaaaatgtcattgttgcttgtatatggcagttggtttcag cctactattgagcgagtggtgaggacaccagtgatgaattctgaagcaaaagttctggtg cctgggattcctggtcatcatgcagctatcaagcccgctcctccacaaaccgagcaagta gagagcaagaggaagtcagggggaaatgaggttagcattgaagaacgtctgggagcaatg gatatagacacacacaaaaaaggaaaggaagacctccagacgaatagctttccagttctt cttacccagggcttagaaagtaacgattttgaaatgctaaataaagtacttcaaactagg aatgtaaaccttataaagaagactgtattaaggatgcccctgcatactattattccgttg ttacaagaggtaacagcatcagagaagacaaagggagcaacttcccctggacagaaggca aagttggtgtatgaagaagagtcttctgaagaggagtctgatgatgaaatagcagataag gattctgaagataattgggatgaagatgaggaggagagtgaaagtgaaaaagatgaggac gttgaagaggaagatgaggatgccgaaggaaaagatgaagaaaatggcgaggacagagat acagcaagtgaaaaagaattaaatggagattctgatttagatcctgaaaatgaaagtgaa gaagaatga >gi568815596f:28794699_29046776|GENSCAN_predicted_peptide_4|1178_aa MLLLLLCYGCASENHCCHSLIDRCLGCFLFLSKVKLRRPRACLMCTCASIFLSEVIAKIR KKLTTSYTNCINFPIDFVHEIQEFTYVNDTEVVHFLSVSVGCNEAILVLYTFVLFLAPIC TLGKVCMTLGGQGSYGLLQRVVAMQPCLFKESFAESWSTPAVELVATQPLLRQEAAAQDC VAVLNSFIQSHYSYCIIAMNLALFVPLSAFCGIAGWLHPILSENRVATGVLTPPDLTMGE ETPSAAVVVDPELDMGTRDDVPEAKVLVPVAVYCGSIPRTSAGPRVLPPGSINSSLPHGE GSLQPEPRALLNNEEPSQLLRGLGQLGGLKLDTPSKGWQARNGHPRNLRALSLGDQPLVL LPSPESEANSVARDTIQIKDKLKKRRLSEGLAASSRASLDPGGGPQGVPLHSTIPRATSQ RLLRVPRPMPLIQSIPTTPEASGVKEKGLDLPGSIPGPHELRPGAQEAQISWQYLHCNDE KMQKSLGAIVIPPIPKARTVAATPSRVPGSLPSPLPPGQGVLTGLRAPRTRQLPVCPLPF LSSAIPKIVPGLFAPISCFLLGPLCVPKSSRWWNVEPKPLASPIRDRPAAAKKPALPFSQ SAPTLTAFSFDCAREACPPLKEEDQKEIGTKIQVTISKSAREKMQLKQMKEMELLRRLEE PRTGQELTSQCLGSQRAFMKEGLLPLRGSGTLSVPTRLSGPCRNDVSIILRKWASRASLP SIPISRQEPRFARHASDLPRTSRCTVVKIILVEVKLGRKLARKEDVVGLALLLRQMKEKG LVSIQRLAACHSEVLTGKLHDVCLVVTGEVTNLRSKVSHLAISTLGDLFQALKKNMDQEA EEIARCLLQKMADTNEFIQRAAGQSLRAMVENVTLARSLVVLTSAGVYHRNPLIRKYAAE HLSAVLEQIGAEKLLSGTRDSTDMLVHNLVRLAQDSNQDTRFYGRKMVNILMANTKFDAF LKQSLPSYDLQKVMAAIKQQGIEDNDELPSAKGRKVLRSLVVCENGLPIKEGLSCNGPRL VGLRSTLQGRGEMVEQLRELTRLLEAKDFRSRMEGVGQLLELCKAKTELVTAHLVQVFDA FTPRLQDSNKKVNQWALESFAKMIPLLRESLHPMLLSIIITVADNLNSKNSGIYAAAVAV LDAMVESLDNLCLLPALAGRVRFLSGRAVLDVTDRLAX >gi568815596f:28794699_29046776|GENSCAN_predicted_CDS_4|3534_bp atgctgctgctgctgctgtgttacggctgcgcttcggagaaccactgctgccattccctg atcgatagatgcttggggtgcttcctgtttctcagtaaagtgaagttgcggcgacctcgt gcctgccttatgtgcacctgtgccagcatctttctcagtgaagtaatagccaaaataaga aagaaattaaccacatcatacaccaactgcatcaatttccccatagactttgtccatgaa atccaggagtttacatatgtcaatgatacggaggttgtccatttcctcagtgtgtctgtt ggttgcaatgaggccatcttggtcctttacacctttgtccttttcctcgcacccatatgc accctaggaaaagtgtgcatgactttgggaggccaagggagctatgggctccttcagcga gtggttgctatgcagccttgtttattcaaggagagctttgctgagagctggagcacgcca gcagtggagctcgtggccacccagcccctgctcagacaggaggctgcggcccaggactgt gtggctgttttgaattcattcattcagtcacactacagttactgcattattgccatgaac ctggcactgtttgtccctttgtctgcattctgtggaattgctggctggctgcaccctata ttgtctgagaacagagtggctacaggagtattaaccccacctgatctcacgatgggagag gagacgccatctgcagcagtggtggtagacccagagctggacatgggcacccgtgacgat gtccccgaagccaaggtcctggtccccgtggccgtgtactgcgggagcatccctcggacc agtgctgggccccgggtgctcccgcctggaagcatcaactccagtctgcctcatggagaa ggttctctccagcctgagccaagagccctgctgaacaacgaggaaccgtcacagctcctg cgtggactcggacagctgggtggcctcaagctggacaccccttccaagggctggcaggca aggaatggtcaccccaggaacctcagggccttgtctttgggggaccagcccctggtgctc ctcccttctccggagtcagaggccaacagcgtggccagggacaccatccagattaaggac aagctcaagaaaaggaggctctcagagggcttggcagcgtcttcccgagcctctctggat ccagggggaggcccccaaggagttcccctgcacagcaccatcccccgagccacctctcag aggctgctgagggtgcccaggccgatgcctctcatccagagcatccctaccacccctgag gccagcggagtcaaagagaagggcctggacctaccggggagcattccgggtcctcacgag ttgagacccggtgctcaggaggcgcagatctcctggcaatacctgcactgcaatgatgag aagatgcagaagtccctgggcgccatcgtgatcccacccatcccaaaggccaggacggtt gcagcgaccccctcccgtgtgcctggctcccttcccagcccgttacctccaggccaggga gtcctcacaggcctgagggccccacgcacgcggcagcttcctgtgtgccctcttccattt ctaagttctgccattcccaaaattgttccaggactgtttgcccccatctcctgttttctc ctggggcccttatgtgtgcccaagtcttccagatggtggaatgtggagccaaaacctttg gcctcacccatcagagacaggcctgccgctgccaagaagcctgccctgcctttttctcag tctgctcccacgctgacagccttctcctttgactgtgccagagaagcctgccctccgctg aaagaagaggaccagaaggagatcggcaccaagatccaagtcaccatctccaagtctgcc cgggagaagatgcagctgaagcagatgaaggagatggagctgcttcggaggctggaggag cccaggacagggcaggagctcacttcccagtgcctgggctcccagagagccttcatgaag gaaggcctccttcccctccggggcagcgggacactgtctgtgcccactaggctgagcggc ccatgcagaaacgacgtcagcatcatcctgaggaagtgggccagccgggcctccctgccc agcatccccatcagccggcaggagccccgctttgcccgccacgcctcagacttgcctagg acctccagatgtactgtggtgaaaatcatcctggtggaggtgaagctcgggagaaaactc gccaggaaggaagatgtggtaggcctggccttgcttctcaggcagatgaaggagaagggt ctggtgagcatccagcgcttggcagcctgtcactcagaggtcctcaccgggaagctgcac gacgtgtgcttggtggtgactggggaggtcaccaacctgcggtccaaggtgtctcacctg gccatcagcaccttgggagacctcttccaggccttgaagaagaatatggaccaggaggcc gaggagatcgcccgctgcttgctgcagaagatggcggacaccaacgagttcatccagaga gcagccggccagtctctgagggctatggtggagaatgtgacccttgcccgctccctggtg gtcctcacctcggcgggtgtctaccaccggaaccccttgatccggaaatacgcggctgag cacctctcagctgtgctggagcagatcggcgctgagaagcttctctcgggcaccagagac agcacagacatgttggtgcacaacctggtgaggctggcacaggactccaaccaggacacc agattttatggccggaagatggtgaatatcttgatggcgaacactaagtttgatgcattt ctgaagcaatctctcccatcttacgacttgcagaaggtcatggcggccattaaacagcag ggaatagaagataatgatgaacttccctctgccaaaggccgcaaggtgttgaggagtctg gtggtgtgtgagaacgggctgcccatcaaggaggggctcagctgcaatggcccaaggctg gtggggctgcgctccacactgcagggccgcggggagatggtggagcagctacgggagctg acacggctgctggaggccaaggacttccggtcccggatggaaggcgtggggcagctcctg gagctctgcaaggccaagacggagcttgtcactgcccacctggtccaggtctttgatgct ttcaccccaaggcttcaggattccaacaagaaagtgaaccagtgggcgctggagtccttc gccaagatgatccccctcctcagagagagcttacaccccatgctgctctccatcatcatc actgttgcagacaacctcaactccaagaactcagggatttacgctgctgccgtggctgtg ctggatgcgatggttgagagcctggacaacctttgccttctaccagcgcttgctgggcga gtgcgtttcctgagtggccgtgcggtgctggatgtcacagatcgcctggcagnn